Researchers have introduced Causal2Vec, a method for adapting decoder-only large language models (LLMs) to embedding tasks without altering their core architecture. The approach pre-encodes the input text into a single 'Contextual token', which is prepended to the LLM's input sequence. Causal2Vec then forms the final embedding by combining the hidden states of the Contextual and EOS tokens, mitigating recency bias, and achieves state-of-the-art results on the MTEB benchmark's retrieval datasets.
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT Introduces a technique that improves LLM embedding performance without architectural changes, potentially reducing computational costs for embedding tasks.
RANK_REASON Academic paper introducing a new method for LLM embedding models.
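The embedding flow the summary describes can be sketched as follows. This is a minimal stand-in, not the authors' implementation: the encoder, the decoder, and the hidden size are all hypothetical placeholders, and only the data flow (pre-encode, prepend, combine Contextual and EOS hidden states) follows the summary.

```python
import numpy as np

HIDDEN = 8  # hypothetical hidden size for the sketch


def pre_encode(text: str) -> np.ndarray:
    """Stand-in for the lightweight pre-encoder that compresses the
    whole input into a single 'Contextual token' embedding."""
    rng = np.random.default_rng(sum(map(ord, text)) % 2**32)
    return rng.standard_normal(HIDDEN)


def decoder_llm(input_embeddings: np.ndarray) -> np.ndarray:
    """Stand-in for a decoder-only LLM: returns one hidden state per
    input position (here, just a fixed random projection)."""
    rng = np.random.default_rng(0)
    proj = rng.standard_normal((HIDDEN, HIDDEN))
    return input_embeddings @ proj


def embed(text: str, token_embeddings: np.ndarray) -> np.ndarray:
    # 1. Pre-encode the text into one Contextual token ...
    ctx = pre_encode(text)
    # 2. ... and prepend it to the LLM's input sequence, so every
    #    position can condition on it despite causal attention.
    seq = np.vstack([ctx[None, :], token_embeddings])
    hidden = decoder_llm(seq)
    # 3. Combine the hidden states of the Contextual token (first
    #    position) and the EOS token (last position) to reduce the
    #    recency bias of plain last-token pooling.
    return np.concatenate([hidden[0], hidden[-1]])


tokens = np.zeros((5, HIDDEN))  # pretend token embeddings for 5 tokens
vec = embed("example query", tokens)
print(vec.shape)  # (16,) = 2 * HIDDEN
```

The concatenation in step 3 is why the final embedding is twice the hidden size; any downstream retrieval index would be built over these combined vectors.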