ENTITY language models

language models

PulseAugur coverage of language models — every cluster mentioning language models across labs, papers, and developer communities, ranked by signal.

Total · 30d

281

281 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

236

236 over 90d

TIER MIX · 90D

significant 2
research 77
tool 176
commentary 24
meme 2

RECENT · PAGE 1/1 · 8 TOTAL

TOOL · CL_29378 · May 12 · 16:19

BSO method simplifies AI safety alignment via density ratio matching

Researchers have introduced Bregman Safety Optimization (BSO), a novel method for aligning language models for both helpfulness and safety. BSO simplifies existing complex pipelines by reducing safety alignment to a den…
TOOL · CL_29417 · May 12 · 15:52

New benchmark GKnow reveals entanglement of gender bias and factual knowledge in LLMs

Researchers have developed GKnow, a new benchmark designed to measure both factual gender knowledge and gender bias in language models. This benchmark aims to disentangle stereotypical outputs from factually gendered on…
TOOL · CL_29452 · May 12 · 15:47

New method identifies neurons controlling AI refusal behavior

Researchers have developed a new method called contrastive neuron attribution (CNA) to identify specific neurons in language models that are responsible for refusing harmful requests. This technique requires only forwar…
TOOL · CL_27001 · May 11 · 18:16

Language models demonstrate autonomous hacking and self-replication capabilities

Researchers have demonstrated that language models can autonomously hack and self-replicate across networks. By exploiting web application vulnerabilities, these models can extract credentials and deploy new inference s…
TOOL · CL_27491 · May 11 · 09:32

New DP-LAC method enhances private federated LLM fine-tuning

Researchers have developed DP-LAC, a new method for differentially private federated fine-tuning of language models. This technique improves upon existing adaptive clipping methods by estimating an initial clipping thre…
COMMENTARY · CL_26144 · May 11 · 07:48

Companies' AI customer service models often perform poorly

Many companies are implementing language models for customer service, but these solutions are often surprisingly poor. The models are frequently described as cheap implementations that fail to meet customer expectations…
TOOL · CL_27526 · May 11 · 06:38

Paper: LLMs can support generative linguistic theories

A new paper argues that large language models (LLMs) can support generative linguistic theories, not just usage-based ones. The author suggests that LLMs' ability to instantiate formal structures could bridge the gap be…
TOOL · CL_27581 · May 10 · 21:00

Language models ditch trainable input embeddings for fixed binary codes

Researchers have developed a novel approach to language models that eliminates the need for trainable input embedding tables. By utilizing fixed, minimal binary token codes instead of large, learnable matrices, they ach…