Q-learning
PulseAugur coverage of Q-learning — every cluster mentioning Q-learning across labs, papers, and developer communities, ranked by signal.
-
New ME-AM framework enhances offline RL with entropy maximization
Researchers have introduced Maximum Entropy Adjoint Matching (ME-AM), a new framework designed to improve offline reinforcement learning. This method addresses limitations in existing approaches, such as popularity bias…
-
New Long-Horizon Q-Learning method improves reinforcement learning accuracy
Researchers have introduced Long-Horizon Q-Learning (LQL), a novel method designed to improve the stability of value-based reinforcement learning. LQL addresses the issue of compounding estimation errors in traditional …
-
New Q-learning theory offers tighter convergence rate analysis
Researchers have developed a novel theoretical framework for analyzing Q-learning, a fundamental algorithm in reinforcement learning. This new approach views Q-learning through the lens of switching systems, deriving a …
-
Researchers develop MDP and POMDP for error mitigation in digital twins
Researchers have developed a new framework for mitigating error propagation in modular digital twins by treating it as a sequential decision-making problem. They formulated this using a Markov Decision Process (MDP) and…