ENTITY
GLM-4.6
GLM-4.6
PulseAugur coverage of GLM-4.6 — every cluster mentioning GLM-4.6 across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New benchmark reveals AI sycophancy, humans outperform models
A new benchmark has been developed to test large language models for sycophancy, or their tendency to provide agreeable rather than accurate responses. The benchmark, compiled from viral social media posts, found that e…
-
New benchmark evaluates multimodal LLMs for dental practice capabilities
Researchers have developed OralMLLM-Bench, a new benchmark designed to evaluate the cognitive abilities of multimodal large language models (MLLMs) specifically within the field of dental radiography. This benchmark cov…