MiniCPM-o 4.5 is a new 9B-parameter omni-modal large language model designed for real-time, full-duplex interaction. It can process and generate audio, video, and text simultaneously, enabling proactive behaviors and continuous environmental understanding. The model uses the Omni-Flow framework for time-aligned processing and is optimized for efficient inference, allowing it to run on edge devices with less than 12 GB of RAM.
Summary written from 3 sources.
IMPACT Enables real-time, full-duplex omni-modal interaction on consumer hardware, lowering the barrier for advanced AI applications.
RANK_REASON Release of a technical report and an open-source model, with performance claims and a new framework.