Rakuten AI
Counterfactual Model Selection in Contextual Bandits
July 18, 2025
by ar-mami.a.chappey@rakuten.com in Uncategorized
counterfactual estimation
Meta Learning
multi-armed bandit algorithm
off-policy
Recommendation
Recommender systems
Reinforcement Learning