Topic: Advanced Bandit Algorithms Research

This cluster of papers focuses on the optimization of multi-armed bandit problems, including topics such as Bayesian optimization, contextual bandits, online learning, convex optimization, Thompson sampling, regret analysis, Gaussian process optimization, hyperparameter optimization, and adversarial multi-armed bandits.
Latest Publications
Architecture Optimization using Surrogate-based Incremental Learning for Quality-attribute Analyses

article Full Text OpenAlex

Communicating Scientific Uncertainty via Approximate Posteriors

article Full Text OpenAlex

Resolving Feynman’s restaurant problem reveals optimal solutions and human strategies

article Full Text OpenAlex

Conservative Risk-Sensitive Reinforcement Learning for Reliable Decision-Making under Uncertainty

article Full Text OpenAlex

Scheduling Training-Inference Co-Location in Demand Response for Sustainable Edge AI

article Full Text OpenAlex

Robust Reinforcement Learning: Methods, Benchmarks and Challenges

article Full Text OpenAlex

Cost-Aware Bayesian Optimization for Interactive Devices

article Full Text OpenAlex

A quantum gradient descent algorithm for optimizing Gaussian process models

article Full Text OpenAlex

Curiosity-driven decision causal-convolutional transformer with adaptive training for offline-to-online multi-agent reinforcement learning

article Full Text OpenAlex

An adaptive dropout approach for high-dimensional Bayesian optimization

article Full Text OpenAlex

Highly Cited Publications (Last 5 Years)
Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex, 607 citations, FWCI 77.8868

Bias and Debias in Recommender System: A Survey and Future Directions

article Full Text OpenAlex, 607 citations, FWCI 165.7051

Reinforcement learning algorithms: A brief survey

article Full Text OpenAlex, 553 citations, FWCI 89.3338

Bayesian Inverse Reinforcement Learning

book-chapter Full Text OpenAlex, 474 citations, FWCI 26.4431

Reinforcement Learning based Recommender Systems: A Survey

review Full Text OpenAlex, 468 citations, FWCI 130.9877

Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation

article Full Text OpenAlex, 407 citations, FWCI 52.9899

Recent Advances in Bayesian Optimization

review Full Text OpenAlex, 399 citations, FWCI 64.456

Intent Contrastive Learning for Sequential Recommendation

article Full Text OpenAlex, 374 citations, FWCI 49.6417

Off-Policy Deep Reinforcement Learning without Exploration

article Full Text OpenAlex, 300 citations, FWCI 0

Evaluating Recommender Systems: Survey and Framework

review Full Text OpenAlex, 257 citations, FWCI 66.9284