专题:Advanced Bandit Algorithms Research

This cluster of papers focuses on the optimization of multi-armed bandit problems, including topics such as Bayesian optimization, contextual bandits, online learning, convex optimization, Thompson sampling, regret analysis, Gaussian process optimization, hyperparameter optimization, and adversarial multi-armed bandits.
最新文献
Multi-parameter Control for the (1+(λ, λ))-GA on OneMax via Deep Reinforcement Learning

article Full Text OpenAlex

Bayesian safe policy learning with chance constrained optimization: application to military security assessment during the Vietnam War

article Full Text OpenAlex

Algorithmic Collusion and the Minimum Price Markov Game

preprint Full Text OpenAlex

Online Learning from data streams via decentralized and asynchronous SGD

article Full Text OpenAlex

Forgetting-Factor Regrets for Projection-Free Distributed Online Optimization

article Full Text OpenAlex

Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback

article Full Text OpenAlex

Differentially Private Distributed Online Optimization via Signs of Relative States

article Full Text OpenAlex

Distributed Online Projection Tracking for Constrained Aggregative Optimization Under Directed Graph

article Full Text OpenAlex

MbExplainer: Multilevel bandit-based explanations for downstream models with augmented graph embeddings

article Full Text OpenAlex

Addressing Incremental Backstepping Control Limitations with Direct Online Gaussian Process Adaptation

article Full Text OpenAlex

近5年高被引文献
Bayesian Inverse Reinforcement Learning

book-chapter Full Text OpenAlex 458 FWCI25.045

Smart “Predict, then Optimize”

article Full Text OpenAlex 447 FWCI50.23

Bias and Debias in Recommender System: A Survey and Future Directions

article Full Text OpenAlex 434 FWCI124.97

Online learning: A comprehensive survey

article Full Text OpenAlex 428 FWCI39.177

Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex 421 FWCI80.018

A Survey on Session-based Recommender Systems

review Full Text OpenAlex 393 FWCI21.209

Reinforcement Learning based Recommender Systems: A Survey

review Full Text OpenAlex 325 FWCI17.952

Sequential Recommendation with Graph Neural Networks

article Full Text OpenAlex 324 FWCI47.38

Causal Intervention for Leveraging Popularity Bias in Recommendation

article Full Text OpenAlex 316 FWCI48.636

Intent Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex 275 FWCI54.199