BM25
PulseAugur coverage of BM25 — every cluster mentioning BM25 across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
Qwen 2.5 powers multi-turn retrieval system to top SemEval ranks
Researchers have developed a three-stage retrieval system for multi-turn conversations, enhancing accuracy in information retrieval tasks. The system first refines context-dependent queries using a fine-tuned Qwen 2.5 7…
-
RAG approaches evolve from basic to agentic for enhanced LLM accuracy
Retrieval-Augmented Generation (RAG) is not a single architecture but a family of approaches designed for varying accuracy and complexity needs. Basic RAG involves chunking documents, creating embeddings, and retrieving…
-
New bilingual dataset and RAG system improve geospatial question answering
Researchers have developed a new bilingual dataset and a hybrid retrieval-augmented generation (RAG) system for answering geospatial questions about Tatarstan. The system integrates semantic search with geospatial filte…
-
New benchmark LIMIT+ reveals neural retrievers struggle with complex set-compositional queries
A new study published on arXiv investigates the performance of information retrieval systems when faced with complex, set-compositional queries. Researchers found that while neural retrieval methods significantly outper…
-
New framework benchmarks enterprise AI document processing pipelines
Researchers have developed EnterpriseDocBench, a new framework for evaluating the end-to-end performance of enterprise AI document processing pipelines. The framework assesses parsing fidelity, indexing efficiency, retr…
-
New RAG research tackles tabular data, cost, and cross-lingual knowledge
Several recent research papers explore advancements in Retrieval-Augmented Generation (RAG) systems. One paper introduces Orthogonal Subspace Decomposition (OSD) to separate task-specific behavior from document knowledg…
-
PostgreSQL extension adds BM25 relevance-ranked full-text search
A new open-source PostgreSQL extension, pg_textsearch, has been released, offering advanced BM25 relevance-ranked full-text search capabilities. This extension integrates seamlessly with PostgreSQL's existing text searc…
-
Apple Intelligence debuts, enhancing devices with generative AI and RAG
Apple has unveiled "Apple Intelligence," a new personal intelligence system integrating generative models into its devices like the iPhone, iPad, and Mac. This announcement was a key topic at WWDC 2024, highlighting App…
-
Eugene Yan explains how to bootstrap labels for search relevance
Eugene Yan's blog post addresses a reader's question about bootstrapping labels for semantic search systems without relying on expensive human annotators. Yan suggests starting with traditional lexical search methods li…