Researchers have introduced Variable Codebook Size Quantization (VCQ) to address limitations in autoregressive visual generation models. VCQ varies the codebook size dynamically along the token sequence, improving reconstruction quality and significantly reducing gFID on benchmarks such as ImageNet. In parallel, new methods like VVS and Speculative Coupled Decoding (SCD) accelerate inference for these models by refining speculative decoding, cutting the number of forward passes through the large target model while preserving generation quality.
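To illustrate the general draft-and-verify idea behind speculative decoding (not the specific VVS or SCD algorithms from these papers), here is a minimal sketch. The `draft_model` and `target_model` below are hypothetical toy stand-ins: a cheap draft proposes a block of tokens, and a single verification pass of the expensive target accepts the matching prefix, so the target runs far fewer times than in plain token-by-token decoding.

```python
# Minimal sketch of speculative decoding's draft-and-verify loop.
# Both "models" are deterministic toys invented for this example,
# not the methods from the papers summarized above.

def target_model(prefix):
    # Hypothetical expensive target: next token is (sum of prefix) % 7.
    return sum(prefix) % 7

def draft_model(prefix):
    # Hypothetical cheap draft: usually agrees with the target,
    # but drifts whenever the prefix sum is divisible by 5.
    guess = sum(prefix) % 7
    return (guess + 1) % 7 if sum(prefix) % 5 == 0 else guess

def speculative_decode(prompt, n_tokens, k=4):
    """Generate n_tokens after prompt; return (tokens, target_call_count)."""
    seq = list(prompt)
    target_calls = 0
    while len(seq) - len(prompt) < n_tokens:
        # The draft proposes k tokens autoregressively (cheap).
        drafted = []
        for _ in range(k):
            drafted.append(draft_model(seq + drafted))
        # One target pass scores all drafted positions (in a real model,
        # this is a single batched forward pass over the block).
        target_calls += 1
        accepted = []
        for tok in drafted:
            correct = target_model(seq + accepted)
            if tok == correct:
                accepted.append(tok)
            else:
                accepted.append(correct)  # fix first mismatch, then stop
                break
        seq.extend(accepted)
    return seq[len(prompt):len(prompt) + n_tokens], target_calls
```

When the draft agrees often, each verification pass commits several tokens at once, which is the source of the speedup; the accepted output is identical to what target-only greedy decoding would produce.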
Summary written by gemini-2.5-flash-lite from 4 sources.
IMPACT These advancements in quantization and speculative decoding promise faster and more efficient visual generation models, potentially lowering inference costs and enabling new applications.
RANK_REASON This cluster contains multiple arXiv papers detailing novel research in autoregressive visual generation and speculative decoding techniques.