PulseAugur

D4RL

PulseAugur coverage of D4RL: every cluster mentioning D4RL across labs, papers, and developer communities, ranked by signal.

Total · 30d: 3 (3 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 3 (3 over 90d)
SENTIMENT · 30D: 2 days with sentiment data

RECENT · PAGE 1/1 · 3 TOTAL
  1. RESEARCH · CL_65476

    New research explores Q-learning stability and offline RL methods

    Two new papers explore advances in reinforcement learning. One introduces Drift Q-Learning, a method that combines a drift-based behavioral regularizer with critic-driven policy improvement…

  2. TOOL · CL_38233

    New COOPO framework boosts reinforcement learning efficiency

    Researchers have developed a new framework called COOPO (Cyclic Offline-Online Policy Optimization) to address limitations in offline and online reinforcement learning. This method repeatedly cycles between offline trai…

  3. TOOL · CL_21965

    SlimDT paper proposes injecting RTG outside sequential modeling

    Researchers have developed SlimDT, a modification of the Decision Transformer (DT) model for offline reinforcement learning. SlimDT removes the Return-to-Go (RTG) token from the autoregressive sequence, instead injectin…