Researchers have developed DiBA, a novel method for compressing neural network weights by approximating dense matrices with a combination of diagonal and binary matrices. Because multiplying by a binary matrix requires only additions, the factorization cuts the multiplications in a matrix-vector product from mn to m + k + n. DiBA also introduces efficient optimization strategies for fitting the factors, including DiBA-Greedy and DiBARD, which have shown substantial accuracy improvements on downstream tasks such as text classification and audio recognition, without extensive retraining.
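The summary does not spell out the factorization, so the sketch below is a guess at the shape of the idea rather than the paper's method: it assumes a two-diagonal form W ≈ D_out · B · D_in with a ±1 binary factor, fitted by a crude sign-and-scale heuristic (not DiBA-Greedy or DiBARD, whose procedures are not described here). In this form the multiplications per matrix-vector product come only from the two diagonal scalings (m + n); the reported m + k + n count presumably reflects an extra intermediate dimension k in the paper's actual factorization.

```python
import numpy as np

# Hypothetical sketch, not the paper's exact factorization:
# approximate an (m x n) dense weight W as D_out @ B @ D_in,
# where D_out, D_in are diagonal and B has entries in {-1, +1}.

rng = np.random.default_rng(0)
m, n = 64, 128
W = rng.standard_normal((m, n))

# Crude one-shot fit (illustrative only): take signs of W as the
# binary factor, then use per-row mean magnitudes as output scales.
B = np.sign(W)
d_in = np.ones(n)               # placeholder input scales
d_out = np.abs(W).mean(axis=1)  # per-row magnitude as output scale

def diba_matvec(x):
    """Matvec through the factored form.

    Multiplications: n (scale by d_in) + m (scale by d_out).
    The binary matvec B @ z needs only additions/subtractions,
    since B's entries are +/-1, so it contributes no multiplications.
    """
    z = d_in * x     # n multiplications
    z = B @ z        # additions/subtractions only
    return d_out * z # m multiplications

x = rng.standard_normal(n)
approx = diba_matvec(x)
exact = W @ x
print("relative error:", np.linalg.norm(approx - exact) / np.linalg.norm(exact))
```

Even this one-shot fit captures a rough rank-like structure; the paper's optimization strategies would presumably drive the approximation error far lower than this heuristic does.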
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT Introduces a novel compression technique that could lead to more efficient deployment of large neural networks on resource-constrained devices.
RANK_REASON This is a research paper detailing a new method for neural network weight compression.