ENTITY BM25

BM25

PulseAugur coverage of BM25 — every cluster mentioning BM25 across labs, papers, and developer communities, ranked by signal.

Total · 30d

9

9 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

8

8 over 90d

TIER MIX · 90D

research 6
tool 2
commentary 1

RELATIONSHIPS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

RESEARCH · CL_27949 · May 12 · 07:41

Qwen 2.5 powers multi-turn retrieval system to top SemEval ranks

Researchers have developed a three-stage retrieval system for multi-turn conversations, enhancing accuracy in information retrieval tasks. The system first refines context-dependent queries using a fine-tuned Qwen 2.5 7…
TOOL · CL_25291 · May 10 · 17:51

RAG approaches evolve from basic to agentic for enhanced LLM accuracy

Retrieval-Augmented Generation (RAG) is not a single architecture but a family of approaches designed for varying accuracy and complexity needs. Basic RAG involves chunking documents, creating embeddings, and retrieving…
RESEARCH · CL_22168 · May 8 · 04:00

New bilingual dataset and RAG system improve geospatial question answering

Researchers have developed a new bilingual dataset and a hybrid retrieval-augmented generation (RAG) system for answering geospatial questions about Tatarstan. The system integrates semantic search with geospatial filte…
RESEARCH · CL_18245 · May 5 · 14:51

New benchmark LIMIT+ reveals neural retrievers struggle with complex set-compositional queries

A new study published on arXiv investigates the performance of information retrieval systems when faced with complex, set-compositional queries. Researchers found that while neural retrieval methods significantly outper…
RESEARCH · CL_09820 · Apr 29 · 07:48

New framework benchmarks enterprise AI document processing pipelines

Researchers have developed EnterpriseDocBench, a new framework for evaluating the end-to-end performance of enterprise AI document processing pipelines. The framework assesses parsing fidelity, indexing efficiency, retr…
RESEARCH · CL_05135 · Apr 27 · 04:00

New RAG research tackles tabular data, cost, and cross-lingual knowledge

Several recent research papers explore advancements in Retrieval-Augmented Generation (RAG) systems. One paper introduces Orthogonal Subspace Decomposition (OSD) to separate task-specific behavior from document knowledg…
TOOL · CL_17427 · Mar 31 · 16:29

PostgreSQL extension adds BM25 relevance-ranked full-text search

A new open-source PostgreSQL extension, pg_textsearch, has been released, offering advanced BM25 relevance-ranked full-text search capabilities. This extension integrates seamlessly with PostgreSQL's existing text searc…
SIGNIFICANT · CL_00225 · Jun 25 · 20:45

Apple Intelligence debuts, enhancing devices with generative AI and RAG

Apple has unveiled "Apple Intelligence," a new personal intelligence system integrating generative models into its devices like the iPhone, iPad, and Mac. This announcement was a key topic at WWDC 2024, highlighting App…
COMMENTARY · CL_04715 · Jul 20 · 00:00

Eugene Yan explains how to bootstrap labels for search relevance

Eugene Yan's blog post addresses a reader's question about bootstrapping labels for semantic search systems without relying on expensive human annotators. Yan suggests starting with traditional lexical search methods li…