Researchers have developed CleanBase, a novel method to identify malicious documents within retrieval-augmented generation (RAG) knowledge databases. The system leverages the high semantic similarity often found among malicious documents crafted for prompt injection attacks. CleanBase constructs a similarity graph where documents forming cliques are flagged as malicious, thereby enhancing the security and integrity of RAG systems. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
IMPACT Enhances RAG system security by detecting and mitigating prompt injection attacks through malicious document identification.
RANK_REASON This is a research paper detailing a new method for detecting malicious documents in RAG systems.