AI copilots match pathologists on digital pathology tasks, study finds

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 2 sources

A new benchmark called DALPHIN has been developed to evaluate AI copilots in digital pathology. The benchmark includes over 1200 images and a performance comparison with 31 human pathologists. General-purpose models like GPT-5 and Gemini 2.5 Pro, along with a specialized copilot, PathChat+, were tested on various diagnostic tasks. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Establishes a new standard for evaluating AI's diagnostic capabilities in a specialized medical field, potentially guiding future development and adoption.

RANK_REASON The cluster describes a new academic paper introducing a benchmark dataset and evaluation methodology for AI in digital pathology.

Read on arXiv cs.AI →

paper
other

COVERAGE [2]

arXiv cs.AI TIER_1 · Francesco Ciompi · 2026-05-05 09:15

DALPHIN: Benchmarking Digital Pathology AI Copilots Against Pathologists on an Open Multicentric Dataset

Foundation models with visual question answering capabilities for digital pathology are emerging. Such unprecedented technology requires independent benchmarking to assess its potential in assisting pathologists in routine diagnostics. We created DALPHIN, the first multicentric o…
arXiv cs.CV TIER_1 · Carlijn Lems, Sander Moonemans, Nat\'alie Klub\'i\v{c}kov\'a, Biagio Brattoli, Taebum Lee, Seokhwi Kim, Veronica Vilaplana, Laura Pons, Sapir Hochman, Mauricio Eduardo Su\'arez-Franck, Pedro Luis Fernandez, Julius Drachneris, Donatas Petroska, Renaldas Au · 2026-05-06 04:00

DALPHIN: Benchmarking Digital Pathology AI Copilots Against Pathologists on an Open Multicentric Dataset

arXiv:2605.03544v1 Announce Type: new Abstract: Foundation models with visual question answering capabilities for digital pathology are emerging. Such unprecedented technology requires independent benchmarking to assess its potential in assisting pathologists in routine diagnosti…

COVERAGE [2]

DALPHIN: Benchmarking Digital Pathology AI Copilots Against Pathologists on an Open Multicentric Dataset

DALPHIN: Benchmarking Digital Pathology AI Copilots Against Pathologists on an Open Multicentric Dataset

RELATED ENTITIES

RELATED TOPICS