ENTITY
WildChat
WildChat
PulseAugur coverage of WildChat — every cluster mentioning WildChat across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New research tackles AI agent training with realistic user personas
Two new research papers explore the limitations of current user simulators for training AI agents. The first paper introduces Persona Policies (PPol), a method to generate more realistic and varied user personas for sim…
-
New benchmark 'Prosa' evaluates LLMs on Brazilian Portuguese chats
Researchers have introduced Prosa, a new benchmark designed to evaluate Large Language Models (LLMs) using real user conversations in Brazilian Portuguese. This benchmark utilizes a rubric-based scoring system with mult…