Author: Denis Avetisyan
New research reveals a surprising phenomenon where deep regression networks, when properly trained, exhibit a predictable internal structure that enhances their ability to generalize.

This paper demonstrates that deep regression networks undergo a ‘collapse’ to low-rank representations, aligning features with weights and improving generalization, a process heavily influenced by weight decay.
While deep neural networks excel at regression tasks, the underlying structural principles governing their learning remain incompletely understood. This paper investigates Deep Neural Regression Collapse, extending the established phenomenon of Neural Collapse, previously observed primarily in classification, to deep regression networks. We demonstrate that collapsed layers exhibit low-rank structure, feature-weight alignment, and alignment of the feature covariance with the target covariance, suggesting an inherent regularization that facilitates generalization. Does this ‘deep’ collapse represent a fundamental inductive bias enabling efficient learning in high-dimensional regression problems, and how can we further leverage it to improve model performance and interpretability?
The Paradox of Depth: Unraveling Neural Network Collapse
Though deep neural networks consistently achieve state-of-the-art results in diverse applications, their increasing complexity doesn’t always translate to predictable behavior. As layers are added to enhance representational power, these networks can exhibit counterintuitive phenomena, diverging from expected improvements in generalization. Researchers have observed instances where increased depth leads to diminished returns, or even performance degradation, suggesting an underlying tension between architectural scale and effective learning. This isn’t simply a matter of overfitting; rather, the network’s internal representations can become increasingly distorted or redundant with depth, hindering its ability to extract meaningful features from data. Understanding these surprising behaviors is therefore critical; it necessitates moving beyond simply measuring overall accuracy and instead probing the internal dynamics of these complex systems to design more robust and efficient deep learning architectures.
Neural collapse, a recently observed phenomenon in the final layers of deep neural networks trained for classification, suggests these systems aren’t simply learning complex, distributed representations, but rather exhibiting an inherent bias towards simplification. Specifically, the final layer’s learned features converge – becoming increasingly similar to each other and to the one-hot encoded target labels. This isn’t a failure of learning, but a characteristic of it; the network effectively compresses information into a minimal set of representative features, leading to within-class feature similarity and between-class feature dissimilarity. While seemingly counterintuitive given the network’s complexity, this collapse enhances generalization by creating a more robust and easily decodable representation, even if it means sacrificing some of the nuanced information present in earlier layers. The implications of this discovery extend beyond improved classification accuracy, offering a potential pathway to designing more efficient and interpretable deep learning architectures.
A comprehensive understanding of neural collapse, the tendency of deep neural networks to converge on simplified representations, extends beyond the final layer and is paramount for crafting genuinely efficient architectures. While initial research focused on the final layer’s behavior during classification, the phenomenon demonstrably impacts representations formed throughout the network’s depth. Investigating how this collapse manifests across all layers allows researchers to pinpoint redundancies and bottlenecks, potentially leading to the development of pruned or streamlined networks. Such an approach moves beyond merely achieving high accuracy; it prioritizes computational efficiency and resource optimization, enabling deployment on devices with limited processing power and energy budgets. Ultimately, a layered understanding of neural collapse promises a paradigm shift in deep learning, moving from brute-force scaling to intelligent architectural design.

Deep Neural Regression Collapse: A Layer-Wise Formulation
Deep Neural Regression Collapse (NRC) extends the principles of Neural Collapse, originally observed in classification networks, to the domain of deep regression. While Neural Collapse focuses on the final layer’s feature space becoming maximally separated and aligned with class labels, NRC proposes a similar, but generalized, phenomenon occurring at every layer of a deep regressor. This framework posits that well-trained deep regression networks develop a layer-wise organization characterized by specific conditions related to noise, signal-target alignment, feature-weight alignment, and linear predictability. By analyzing these conditions across all layers, NRC aims to provide a comprehensive understanding of how deep regressors learn and represent continuous target variables, moving beyond the final layer’s performance to examine the internal feature dynamics.
Optimal deep neural regression models, according to this framework, are characterized by layer-specific conditions influencing how features are organized and aligned during the regression process. These conditions dictate the internal representation learned by the network, moving beyond simply achieving a low loss value to establishing a structured feature space. Specifically, the organization of features at each layer is not random but conforms to predictable patterns related to noise characteristics, the relationship between features and target values, and the alignment between features and associated weights. This layer-wise organization is proposed as a defining characteristic of well-performing deep regressors, suggesting that optimal performance relies on more than just network depth or width, but on the specific internal structure achieved at each layer of the network.
Optimal deep neural regressors, as defined by Deep Neural Regression Collapse, are characterized by four layer-wise conditions. Noise Suppression (NRC1) requires that the noise component of the layer activations be small, bounded by a constant ε₁, across all layers. Signal-Target Alignment (NRC2) requires consistency between the signal component of the layer activations and the target regression values. Feature-Weight Alignment (NRC3) requires alignment between the layer features and the corresponding weights used for prediction. Finally, Linear Predictability (NRC4) requires that the targets be accurately predicted by a linear combination of the layer features, indicating a degree of linear structure in the learned representations.

Quantifying Alignment: Methodological Foundations
To validate the alignment conditions arising from Deep Neural Regression Collapse (NRC), established quantitative methods are employed. These methods assess the degree to which learned representations exhibit the necessary properties for collapse to occur. Specifically, Signal-Target Alignment (NRC2) is quantified using Centered Kernel Alignment, providing a measure of representational similarity between input signals and target outputs. Feature-Weight Alignment (NRC3) utilizes Principal Component Analysis to determine the extent to which the dominant features in the network’s representations align with the corresponding weights. Finally, Linear Predictability (NRC4) is evaluated using Mean Squared Error, with the prediction error demonstrably bounded by noise and the aforementioned alignment metrics via the bound ||H - P_Y H||_F / ||H||_F ≤ √ε₁ + √ε₂.
Signal-Target Alignment (NRC2) is quantified using Centered Kernel Alignment (CKA), which measures the similarity between the representations of input signals and their corresponding targets by comparing the alignment of their respective Gram matrices. Feature-Weight Alignment (NRC3), in turn, uses Principal Component Analysis (PCA): PCA is applied to the feature maps, and the cosine similarity between the leading eigenvectors of the feature covariance matrix and the corresponding weight vectors serves as the alignment measure. The two methods provide complementary perspectives on the alignment conditions, one focused on input-target relationships and one on feature-weight relationships.
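Both metrics can be sketched in a few lines of NumPy. The helper names (`linear_cka`, `feature_weight_alignment`) are illustrative rather than taken from the paper, which may use debiased CKA or multiple principal components; this is a minimal sketch of the standard linear-CKA and PCA-cosine constructions:

```python
import numpy as np

def linear_cka(X, Y):
    """Linear Centered Kernel Alignment between two sets of activations.

    X: (n, d1), Y: (n, d2). Returns a value in [0, 1]; 1 means the
    centered Gram matrices of X and Y are perfectly aligned.
    """
    X = X - X.mean(axis=0)                      # center each dimension
    Y = Y - Y.mean(axis=0)
    num = np.linalg.norm(X.T @ Y, "fro") ** 2   # ||X^T Y||_F^2
    den = np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")
    return num / den

def feature_weight_alignment(H, w):
    """|cosine| between the top principal direction of H and weight vector w."""
    Hc = H - H.mean(axis=0)
    _, _, Vt = np.linalg.svd(Hc, full_matrices=False)  # rows of Vt = PCs
    return abs(Vt[0] @ (w / np.linalg.norm(w)))

# Stand-in layer features and targets linearly tied to them.
rng = np.random.default_rng(0)
H = rng.standard_normal((200, 16))
y = H @ rng.standard_normal((16, 1))
print(linear_cka(H, y), feature_weight_alignment(H, rng.standard_normal(16)))
```

Since CKA compares Gram matrices rather than raw activations, it is invariant to orthogonal transformations and isotropic rescaling of either representation, which is what makes it a reasonable similarity measure across layers of different widths.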
Linear Predictability (NRC4) is quantified using Mean Squared Error (MSE), assessing how closely features and targets are related by a linear map. The prediction error is mathematically bounded by the levels of noise and alignment present in the learned representation: ||H - P_Y H||_F / ||H||_F ≤ √ε₁ + √ε₂, where H is the matrix of layer features, P_Y H is its orthogonal projection onto the span of the targets, ||·||_F denotes the Frobenius norm, ε₁ quantifies the noise level, and ε₂ the alignment error. This bound demonstrates that minimizing noise and maximizing alignment are jointly critical for achieving high linear predictability and, consequently, robust generalization performance.
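The relative residual in the bound can be checked directly on concrete data by projecting the feature matrix onto the span of the targets and measuring what remains. The helper below is a hypothetical sketch: the function name and the least-squares route to the projection are illustrative choices, not the paper's implementation.

```python
import numpy as np

def linear_predictability_residual(H, Y):
    """Relative residual ||H - P_Y H||_F / ||H||_F.

    P_Y H is the orthogonal projection of the feature columns onto the
    span of the target columns, computed via least squares. A value near
    zero means features and targets are related by a linear map.
    """
    coef, *_ = np.linalg.lstsq(Y, H, rcond=None)   # best linear map Y -> H
    residual = H - Y @ coef
    return np.linalg.norm(residual, "fro") / np.linalg.norm(H, "fro")

rng = np.random.default_rng(0)
Y = rng.standard_normal((100, 2))                  # 2-dimensional targets
H_collapsed = Y @ rng.standard_normal((2, 32))     # features lie in span(Y)
print(linear_predictability_residual(H_collapsed, Y))  # ~ 0 (machine precision)
```

Because the projection is orthogonal, the residual is always between 0 (features fully determined by a linear map of the targets) and 1 (features orthogonal to the target span), which makes it directly comparable to the √ε₁ + √ε₂ budget.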

Empirical Validation and Architectural Implications: A Convergence of Theory and Practice
Investigations across common deep learning architectures, specifically ResNet and Multi-Layer Perceptron (MLP) networks, confirm the phenomenon of Deep Neural Regression Collapse. When these networks are trained with Weight Decay as a regularizer, they consistently converge to solutions satisfying the alignment conditions predicted by the theory: the learned representations develop a low-rank structure, with residual noise bounded by ε₁, and the prediction error obeys ||H - P_Y H||_F / ||H||_F ≤ √ε₁ + √ε₂. The consistency of this behavior across architectures suggests that Weight Decay does not merely prevent overfitting; it actively steers the network toward a specific, low-rank representational state in which feature dimensionality is effectively reduced.
This empirical validation establishes that Deep Neural Regression Collapse is not simply a theoretical construct, but a phenomenon actively governing the behavior of trained deep networks. The tight link between the predicted alignment conditions and the observed low-rank, error-bounded representations suggests that these principles offer a powerful framework for understanding, and potentially controlling, the learning dynamics of practical deep networks.
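The low-rank claim can be probed numerically by counting how many singular directions are needed to capture most of a representation's energy. A minimal sketch, assuming layer activations have been extracted into a NumPy array; the helper name `effective_rank` and the 1% energy tolerance are illustrative choices, not the paper's metric:

```python
import numpy as np

def effective_rank(H, tol=0.01):
    """Smallest number of singular directions capturing a (1 - tol)
    fraction of the squared Frobenius norm of H: a crude proxy for the
    low-rank structure that collapsed layers are reported to develop."""
    s = np.linalg.svd(H, compute_uv=False)        # singular values, descending
    energy = np.cumsum(s ** 2) / np.sum(s ** 2)   # cumulative energy fraction
    return int(np.searchsorted(energy, 1.0 - tol) + 1)

# Features that are rank-3 up to small additive noise: the effective
# rank stays far below the ambient dimension of 64.
rng = np.random.default_rng(0)
planted = rng.standard_normal((256, 3)) @ rng.standard_normal((3, 64))
features = planted + 1e-3 * rng.standard_normal((256, 64))
print(effective_rank(features))
```

Tracking this quantity per layer over training, with and without weight decay, would be one way to turn the qualitative collapse claim into a measurable curve.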

The pursuit of robust generalization in deep neural networks, as demonstrated in this study of deep regression networks, hinges on an underlying mathematical order. This work illuminates how weight decay cultivates a ‘Neural Collapse’ – a phenomenon where layers exhibit low-rank structure and feature-weight alignment. This isn’t merely about achieving functional performance; it’s about revealing an inherent mathematical discipline within the network. As Grace Hopper once stated, “It’s easier to ask forgiveness than it is to get permission.” This mirrors the approach of allowing weight decay to ‘naturally’ induce collapse, trusting the underlying mathematical principles to guide the network towards a more generalizable solution rather than strictly prescribing the architecture. The resultant low-rank structure and alignment are not accidental; they represent the emergence of mathematical purity from a chaotic data landscape.
The Road Ahead
The observation of deep neural regression collapse, and its connection to low-rank structure, is less a revelation than a formalization of an implicit bias long suspected. That regularization – specifically weight decay – steers networks towards such simplified representations is not surprising; nature itself favors elegance, and parsimony is often the hallmark of a correct solution. The immediate challenge, however, is not merely demonstrating that collapse occurs, but understanding why it fosters generalization. Correlation is not causation, and empirical observation, however compelling, is insufficient. A rigorous mathematical framework linking collapsed representations to improved performance remains elusive.
Future work must move beyond descriptive analysis. The current understanding, while suggestive, lacks predictive power. Can the degree of collapse be quantified and used as a hyperparameter? Is there an optimal ‘collapse rate’ beyond which performance degrades? Furthermore, the limitations of weight decay as the sole inductive bias warrant exploration. Other regularization techniques, or combinations thereof, might yield even more tractable, and ultimately more powerful, representations. The focus should shift from observing the phenomenon to actively engineering it.
Ultimately, the pursuit of neural collapse is a pursuit of mathematical clarity. The goal is not simply to build networks that work, but to understand why they work. Only then can the field move beyond ad-hoc solutions and towards a truly principled understanding of deep learning. The observed alignment of features and weights is a promising sign, a glimpse of the underlying order – but the proof, as always, lies in the derivation.
Original article: https://arxiv.org/pdf/2603.23805.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/
2026-03-26 22:57