Topic: Advanced Bandit Algorithms Research

This cluster of papers covers multi-armed bandit problems and related areas, including Bayesian optimization, contextual bandits, online learning, convex optimization, Thompson sampling, regret analysis, Gaussian process optimization, hyperparameter optimization, and adversarial multi-armed bandits.
Latest Publications
Cost-Aware Bayesian Optimization for Interactive Devices

article

An adaptive dropout approach for high-dimensional Bayesian optimization

article

A No-Go Theorem for Introspective Prediction in Computational Machines

article

Expectations in Expectation Propagation

article

Adapting LLMs for High-Dimensional Bayesian Optimization

article

Preference Robustness for DPO with Applications to Public Health

article

Unified Minimax Optimization Framework for Propensity Score Estimation in Debiased Recommendation

article

DAO-GP Drift Aware Online Non-Linear Regression Gaussian-Process

article

Memory Instance Gated Transformer Reinforcement Learning for Portfolio Management

article

Dual peer effects and cross-stock predictability

article

Highly Cited Publications (Last 5 Years)
Bias and Debias in Recommender System: A Survey and Future Directions

article, OpenAlex citations: 598, FWCI: 165.5779

Contrastive Learning for Sequential Recommendation

article, OpenAlex citations: 583, FWCI: 78.0104

Reinforcement learning algorithms: A brief survey

article, OpenAlex citations: 520, FWCI: 87.767

Bayesian Inverse Reinforcement Learning

book-chapter, OpenAlex citations: 471, FWCI: 32.556

Reinforcement Learning based Recommender Systems: A Survey

review, OpenAlex citations: 450, FWCI: 130.4893

Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation

article, OpenAlex citations: 389, FWCI: 53.0762

Recent Advances in Bayesian Optimization

review, OpenAlex citations: 377, FWCI: 63.972

Intent Contrastive Learning for Sequential Recommendation

article, OpenAlex citations: 360, FWCI: 49.5633

Off-Policy Deep Reinforcement Learning without Exploration

article, OpenAlex citations: 300, FWCI: 0

Evaluating Recommender Systems: Survey and Framework

review, OpenAlex citations: 240, FWCI: 66.9969