专题:Advanced Bandit Algorithms Research

This cluster of papers focuses on the optimization of multi-armed bandit problems, including topics such as Bayesian optimization, contextual bandits, online learning, convex optimization, Thompson sampling, regret analysis, Gaussian process optimization, hyperparameter optimization, and adversarial multi-armed bandits.
最新文献
One-pass online learning from data streams with unpredictable feature evolution

article Full Text OpenAlex

Do Calibrated Recommendations Affect Explanations? A Study on Post-Hoc Adjustments

article Full Text OpenAlex

High-Dimensional Learning in Finance

preprint Full Text OpenAlex

A Multi-Agent Consensus Equilibrium Perspective for Multi-feature Enhancement Sparse SAR Imaging

article Full Text OpenAlex

Online Modeling and Monitoring for Dependent Dynamic Processes Under Resource Constraints

article Full Text OpenAlex

Distributed Online Optimization With Communication And Feedback Delays

article Full Text OpenAlex

Demystifying Inference After Adaptive Experiments

article Full Text OpenAlex

Faster Convergence for Unknown-Game Bandits

article Full Text OpenAlex

Development of a contextual bandits-based thermal mass preconditioning algorithm for dynamic electricity pricing

article Full Text OpenAlex

Revisiting Ranking for Online Bipartite Matching with Random Arrivals: the Primal-Dual Analysis

article Full Text OpenAlex

近5年高被引文献
Bayesian Inverse Reinforcement Learning

book-chapter Full Text OpenAlex 457 FWCI29.147

Smart “Predict, then Optimize”

article Full Text OpenAlex 417 FWCI49.976

Bias and Debias in Recommender System: A Survey and Future Directions

article Full Text OpenAlex 416 FWCI120.322

Online learning: A comprehensive survey

article Full Text OpenAlex 394 FWCI39.013

Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex 383 FWCI78.098

A Survey on Session-based Recommender Systems

review Full Text OpenAlex 382 FWCI21.142

Reinforcement Learning based Recommender Systems: A Survey

review Full Text OpenAlex 314 FWCI17.851

Sequential Recommendation with Graph Neural Networks

article Full Text OpenAlex 309 FWCI47.391

Causal Intervention for Leveraging Popularity Bias in Recommendation

article Full Text OpenAlex 306 FWCI48.648

Intent Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex 244 FWCI51.503