Researchers have identified a new method for analyzing how neural networks learn by examining loss gradients instead of optimizer updates. This approach, termed Gradient-Direction Sensitivity (GDS), reveals a stronger coupling between specific feature directions and linear centroids than previously observed. The study found that GDS significantly increases the measured coupling by one to two orders of magnitude, offering a clearer diagnostic of feature formation in parameter space. Furthermore, constraining attention updates to a rank-3 subspace using GDS accelerated model grokking by approximately 2.3 times. AI
Summary written by gemini-2.5-flash-lite from 3 sources. How we write summaries →
IMPACT Introduces a novel diagnostic for understanding feature formation in neural networks, potentially improving training efficiency.
RANK_REASON This is a research paper detailing a new diagnostic method for analyzing neural network training.