专题:Advanced Bandit Algorithms Research

This cluster of papers focuses on the optimization of multi-armed bandit problems, including topics such as Bayesian optimization, contextual bandits, online learning, convex optimization, Thompson sampling, regret analysis, Gaussian process optimization, hyperparameter optimization, and adversarial multi-armed bandits.
最新文献
Prompt-Enabled Large AI Models for CSI Feedback

article Full Text OpenAlex

Policy Constraint by Only Support Constraint for Offline Reinforcement Learning

book-chapter Full Text OpenAlex

Explainable zero-shot trading using multi-agent LLM architecture: A backtested approach for Bitcoin price

article Full Text OpenAlex

LEGO: A Lightweight and Efficient Multiple-Attribute Unlearning Framework for Recommender Systems

article Full Text OpenAlex

The Safety-Privacy Tradeoff in Linear Bandits

article Full Text OpenAlex

Sequential Change Detection for Learning in Piecewise Stationary Bandit Environments

article Full Text OpenAlex

Online Clustering With Bandit Information

article Full Text OpenAlex

Exploiting large language model with reinforcement learning for generative job recommendations

article Full Text OpenAlex

Time-aware hybrid recommender for sparse data with nonlinear interactions and dynamic shifts

article Full Text OpenAlex

On the influence of dependent features in classification problems: A game-theoretic perspective

article Full Text OpenAlex

近5年高被引文献
Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

preprint Full Text OpenAlex 11188 FWCI1752.80833879

Bias and Debias in Recommender System: A Survey and Future Directions

article Full Text OpenAlex 557 FWCI191.51551972

Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex 535 FWCI86.19713755

Bayesian Inverse Reinforcement Learning

book-chapter Full Text OpenAlex 469 FWCI63.4526394

Reinforcement Learning based Recommender Systems: A Survey

review Full Text OpenAlex 420 FWCI151.236462

Reinforcement learning algorithms: A brief survey

article Full Text OpenAlex 411 FWCI104.98702682

Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation

article Full Text OpenAlex 357 FWCI58.73317434

Intent Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex 337 FWCI55.58970867

Recent Advances in Bayesian Optimization

review Full Text OpenAlex 320 FWCI81.74172405

Off-Policy Deep Reinforcement Learning without Exploration

article Full Text OpenAlex 299 FWCI0