ENTITY
Nexus Labs
Nexus Labs
PulseAugur coverage of Nexus Labs — every cluster mentioning Nexus Labs across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
LLM evaluation harness updated with production data and adversarial testing
A new approach to evaluating Large Language Models (LLMs) has been proposed to address the issue of static evaluation harnesses failing to detect model regressions. This method involves refreshing evaluation datasets we…
-
Measuring AI Gateway Failover: 30 Days of Production Data
Anthropic has released an update on Claude's sycophancy, noting that Opus 4.7 shows a 50% reduction in sycophantic responses compared to Opus 4.6, particularly in relationship guidance conversations. The company also de…