PulseAugur
LIVE 06:58:51
ENTITY TerminalBench 2.0

TerminalBench 2.0

PulseAugur coverage of TerminalBench 2.0 — every cluster mentioning TerminalBench 2.0 across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_05561 ·

    Open-source AI agent surpasses Gemini and GPT-4 on TerminalBench 2.0

    An open-source AI agent, developed in Turkey and named OSS Agent I, has achieved a 65.2% success rate on the TerminalBench 2.0 benchmark. This performance surpasses that of established models like Google's Gemini-3-flas…