PulseAugur
LIVE 08:57:34
research · [2 sources] ·
0
research

New AI models generate high-quality 3D human motion in real-time

Researchers have developed new transformer-based frameworks for generating high-quality 3D human motion from text. MOGO utilizes a hierarchical vector quantization and a single-pass causal transformer for real-time generation, demonstrating competitive quality and improved performance. MotionHiFlow employs a hierarchical flow matching approach, progressively generating motion from coarse semantics to fine temporal details, incorporating cross-scale transitions and explicit structural modeling for precise alignment. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Advances in text-to-motion generation could enable more realistic virtual environments and character animations in gaming and film.

RANK_REASON Two new research papers introduce novel transformer-based architectures for text-to-3D human motion generation.

Read on arXiv cs.CV →

COVERAGE [2]

  1. arXiv cs.CV TIER_1 · Dongjie Fu, Tengjiao Sun, Pengcheng Fang, Xiaohao Cai, Hansung Kim ·

    MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation

    arXiv:2506.05952v4 Announce Type: replace Abstract: Recent advances in transformer-based text-to-motion generation have led to impressive progress in synthesizing high-quality human motion. Nevertheless, jointly achieving high fidelity, streaming capability, real-time responsiven…

  2. arXiv cs.CV TIER_1 · Heng Li, Xiaotong Lin, Ling-An Zeng, Yulei Kang, Shuai Li, Jian-Fang Hu ·

    MotionHiFlow: Text-to-motion via hierarchical flow matching

    arXiv:2604.23264v1 Announce Type: new Abstract: Text-to-motion generation aims to generate 3D human motions that are tightly aligned with the input text while remaining physically plausible and rich in fine-grained detail. Although recent approaches can produce complex and natura…