ENTITY D4RL

D4RL

PulseAugur coverage of D4RL — every cluster mentioning D4RL across labs, papers, and developer communities, ranked by signal.

Total · 30d

3

3 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

3

3 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 3 TOTAL

RESEARCH · CL_65476 · May 31 · 15:46

New research explores Q-learning stability and offline RL methods

Two new research papers explore advancements in reinforcement learning techniques. One paper introduces Drift Q-Learning, a method that combines a drift-based behavioral regularizer with critic-driven policy improvement…
TOOL · CL_38233 · May 18 · 17:15

New COOPO framework boosts reinforcement learning efficiency

Researchers have developed a new framework called COOPO (Cyclic Offline-Online Policy Optimization) to address limitations in offline and online reinforcement learning. This method repeatedly cycles between offline trai…
TOOL · CL_21965 · May 8 · 04:00

SlimDT paper proposes injecting RTG outside sequential modeling

Researchers have developed SlimDT, a modification of the Decision Transformer (DT) model for offline reinforcement learning. SlimDT removes the Return-to-Go (RTG) token from the autoregressive sequence, instead injectin…