Topic: Advanced Bandit Algorithms Research

This cluster of papers focuses on optimization in multi-armed bandit problems. Topics include Bayesian optimization, contextual bandits, online learning, convex optimization, Thompson sampling, regret analysis, Gaussian process optimization, hyperparameter optimization, and adversarial multi-armed bandits.
Latest Publications
A Survey on Inference Optimization Techniques for Mixture of Experts Models

article

MTS: A deep reinforcement learning portfolio management framework with time-awareness and short-selling

article

Compression Efficiency and Structural Learning as a Computational Model of DLN Cognitive Stages

article

Convergence and Inference of Stream Stochastic Gradient Descent, with Applications to Queueing Systems and Inventory Control

article

On Hadamard well-posedness and convergence in set optimization

article

Real-Time Fair-Exposure Ad Allocation for SMBs and Underserved Creators via Contextual Bandits-with-Knapsacks

article

Reputation-Filtered Reward Reshaping: Encouraging Cooperation in High Dimensional Semi-Cooperative Multi-agent Settings

article

Attention-enhanced reinforcement learning for dynamic portfolio optimization

article

Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial Bandits

article

Fast UCB-type Algorithms for Stochastic Bandits with Heavy and Super Heavy Symmetric Noise

article

Highly Cited Publications (Last 5 Years)
Bias and Debias in Recommender System: A Survey and Future Directions

article, cited by 582, FWCI 164.6188

Contrastive Learning for Sequential Recommendation

article, cited by 561, FWCI 78.1763

Reinforcement learning algorithms: A brief survey

article, cited by 485, FWCI 85.654

Bayesian Inverse Reinforcement Learning

book-chapter, cited by 469, FWCI 36.0017

Reinforcement Learning based Recommender Systems: A Survey

review, cited by 444, FWCI 130.242

Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation

article, cited by 373, FWCI 53.3401

Recent Advances in Bayesian Optimization

review, cited by 355, FWCI 62.7099

Intent Contrastive Learning for Sequential Recommendation

article, cited by 348, FWCI 49.6766

Off-Policy Deep Reinforcement Learning without Exploration

article, cited by 300, FWCI 0

Evaluating Recommender Systems: Survey and Framework

review, cited by 225, FWCI 66.5532