PulseAugur
LIVE 11:04:25
research · [2 sources] ·
0
research

ELVIS: Ensemble-Calibrated Latent Imagination for Long-Horizon Visual MPC

Researchers have developed ELVIS, a novel approach to long-horizon visual planning in reinforcement learning that uses a Gaussian-mixture model predictive controller to maintain multiple hypotheses over extended rollouts. This method, detailed in a new paper, also incorporates an uncertainty-aware return mechanism to stabilize imagination and limit compounding errors. ELVIS demonstrates state-of-the-art performance on visual control tasks and shows promise for real-world applications with occlusions. Separately, another paper introduces TRAP, a backdoor attack targeting world models by manipulating the ranking of imagined trajectories, which has shown to degrade performance on agents like DreamerV3 and TD-MPC2. AI

Summary written by None from 2 sources. How we write summaries →

IMPACT New methods for long-horizon planning and security evaluations for world models could advance agent capabilities and safety.

RANK_REASON Two new arXiv papers detail advancements in reinforcement learning planning and introduce a novel attack vector against world models.

Read on arXiv cs.LG →

COVERAGE [2]

  1. arXiv cs.LG TIER_1 · Yurui Du, Pinhao Song, Yutong Hu, Renaud Detry ·

    ELVIS: Ensemble-Calibrated Latent Imagination for Long-Horizon Visual MPC

    arXiv:2605.04709v1 Announce Type: new Abstract: A central challenge of visual control with model-based reinforcement learning (RL) is reliable long-horizon planning: long rollouts with learned latent dynamics exhibit branching futures and multi-modal action-value distributions. I…

  2. arXiv cs.LG TIER_1 · Siyuan Duan, Ke Zhang, Xizhao Luo ·

    TRAP: Tail-aware Ranking Attack for World-Model Planning

    arXiv:2605.01950v1 Announce Type: new Abstract: World models enable long-horizon planning by internally generating and evaluating imagined trajectories, making them a promising foundation for generalist agents. However, this imagination-driven decision process also introduces new…