PulseAugur
EN
LIVE 19:54:00
ENTITY MMDiT

MMDiT

PulseAugur coverage of MMDiT — every cluster mentioning MMDiT across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
5
5 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
5
5 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL
  1. RESEARCH · CL_53474 ·

    New frameworks and benchmarks advance audio-visual generation

    Researchers have introduced OmniCustom, a framework for customizing both video identity and audio timbre simultaneously from reference images and audio. This DiT-based model uses separate LoRA modules for identity and t…

  2. RESEARCH · CL_40805 ·

    New framework creates lightweight diffusion models via knowledge distillation

    Researchers have developed a new knowledge distillation framework called LIFT and PLACE to create more efficient diffusion models. This method addresses the difficulty students have in mimicking complex teacher models b…

  3. RESEARCH · CL_15684 ·

    New benchmarks challenge MLLMs' spatial and functional reasoning abilities

    Researchers have introduced new benchmarks to evaluate the spatial and functional reasoning capabilities of multimodal large language models (MLLMs). These benchmarks aim to move beyond basic geometric perception to ass…

  4. TOOL · CL_15629 ·

    AttnRouter enhances image editing on MMDiT with per-category attention routing

    Researchers have developed AttnRouter, a novel method for training-free image editing on the MMDiT model. This approach utilizes KVInject, a single-forward attention manipulation that blends source-image key/value proje…

  5. RESEARCH · CL_04941 ·

    OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

    Researchers have introduced OccDirector, a new framework designed to generate complex 4D occupancy dynamics for autonomous driving simulations based solely on natural language instructions. This system acts as a "scenar…