Researchers have introduced MMM-Bench, a new benchmark designed to address the limitations of existing document classification systems. This benchmark features a five-level hierarchical taxonomy and a dataset of 5,990 real-world multi-modal documents from 12 commercial domains within Alibaba. MMM-Bench aims to better reflect the complexity of practical document intelligence by incorporating multi-level, multi-domain, and multi-modal aspects, and the team has released the data and evaluation toolkit to facilitate further research. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Establishes a more realistic benchmark for document intelligence, potentially accelerating progress in enterprise content management.
RANK_REASON The cluster describes the release of a new academic benchmark and dataset for document classification. [lever_c_demoted from research: ic=1 ai=1.0]