Smarter Turbulence Models: Harnessing Data and Symmetry

Author: Denis Avetisyan


New research explores how data-driven approaches, particularly those respecting fundamental physical symmetries, can improve the accuracy of large-eddy simulations.

At an initial time, Large Eddy Simulation (LES) solutions align with filtered Direct Numerical Simulation (DNS) data; however, for the Clark model, this alignment breaks down around <span class="katex-eq" data-katex-display="false">t = 2.1</span>, leading to computational instability and preventing solution completion.

This review compares data-driven symmetry-preserving closure models for large-eddy simulation, demonstrating enhanced performance over unconstrained neural networks.

Maintaining physically consistent behavior is a long-standing challenge in turbulence modeling, particularly when employing data-driven approaches. This is addressed in ‘Comparison of data-driven symmetry-preserving closure models for large-eddy simulation’, which investigates the impact of enforcing inherent symmetries within neural network-based closures for large-eddy simulation. The study demonstrates that symmetry-preserving models, built on tensor bases and group convolutions, outperform unconstrained networks not only in prediction accuracy but also in generating more physically realistic velocity-gradient statistics. Does prioritizing structural consistency through symmetry enforcement represent a crucial step towards truly generalizable and robust turbulence closures?


The Inherent Instability of Sub-Grid Modeling

Large Eddy Simulation, a powerful technique for studying turbulent flows, operates on the principle of explicitly resolving the large, energy-containing eddies while approximating the effects of the smaller, unresolved scales – these effects are known as sub-filter-scale stresses (SFS). Because directly simulating all scales of turbulence is computationally prohibitive, LES necessitates modeling how these SFS transfer energy and momentum within the flow. This modeling is not merely a simplification; it’s a crucial step in accurately capturing the overall behavior of the turbulence. The fidelity of an LES prediction hinges directly on the ability of the chosen model to represent these SFS, as inaccuracies in their estimation can propagate through the simulation, leading to substantial errors in quantities like drag, heat transfer, and mixing rates. Therefore, the development and validation of robust SFS models remain a central challenge in the field of turbulence research.
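In the standard LES formalism, the governing equations are filtered and the unclosed term that must be modeled appears as the sub-filter stress tensor. A minimal statement of this decomposition, in conventional notation rather than notation drawn from the paper under review, is:

```latex
% Filtered incompressible momentum equation; \bar{\cdot} denotes the LES filter.
% The sub-filter stress tau_ij is the term that requires a closure model.
\partial_t \bar{u}_i + \partial_j\left(\bar{u}_i \bar{u}_j\right)
  = -\partial_i \bar{p} + \nu\,\partial_j \partial_j \bar{u}_i - \partial_j \tau_{ij},
\qquad
\tau_{ij} = \overline{u_i u_j} - \bar{u}_i \bar{u}_j
```

Everything on the left and the first two terms on the right involve only resolved quantities; all of the modeling burden discussed in this section is concentrated in \(\tau_{ij}\).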

Conventional turbulence closure models, designed to represent the impact of unresolved scales in Large Eddy Simulation (LES), frequently fall short when dealing with the intricate physics of sub-grid stresses. These models typically rely on simplifying assumptions about the nature of turbulence, often relating the stress tensor to the mean strain rate through eddy viscosity or similar concepts. However, real turbulent flows exhibit non-local effects, anisotropy, and complex interactions between different scales, which these simplified models struggle to capture accurately. Consequently, LES predictions can be significantly compromised, manifesting as errors in quantities like Reynolds stresses, turbulent kinetic energy, and ultimately, the overall flow field. The inability of these models to properly account for the full range of turbulent phenomena underscores a persistent challenge in achieving high-fidelity simulations of complex flows, particularly those involving separation, rotation, or strong streamline curvature.
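A representative example of the eddy-viscosity closures described above is the classical Smagorinsky model, sketched here in standard notation (not a formulation taken from the paper):

```latex
% Smagorinsky eddy-viscosity closure: the deviatoric sub-filter stress is
% aligned with the resolved strain rate, with a viscosity set by the filter
% width Delta and a model constant C_s.
\tau_{ij} - \tfrac{1}{3}\tau_{kk}\,\delta_{ij} = -2\,\nu_t\,\bar{S}_{ij},
\qquad
\nu_t = \left(C_s \Delta\right)^2 |\bar{S}|,
\qquad
|\bar{S}| = \sqrt{2\,\bar{S}_{ij}\bar{S}_{ij}}
```

The alignment assumption between stress and strain rate is precisely the kind of simplification that fails to capture anisotropy and non-local effects.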

Accurate prediction of turbulent flows remains a significant challenge in numerous scientific and engineering disciplines, and progress is fundamentally linked to the development of robust closure models. These models address the inherent limitation of simulating all scales of turbulence, instead representing the effects of the smallest, unresolved scales – known as sub-grid stresses – on the larger, resolved motions. Current modeling approaches often rely on simplifying assumptions that fail to capture the complex, anisotropic, and non-local physics governing these stresses, leading to inaccuracies in simulations of phenomena ranging from weather patterns and aircraft design to combustion processes and geophysical flows. Consequently, a persistent drive exists to refine these models, incorporating more sophisticated representations of sub-grid stress transport and accounting for the intricate interplay between resolved and unresolved turbulence, ultimately enabling more reliable and insightful predictions of fluid behavior.

The relative errors of the Large Eddy Simulation (LES) solution, as calculated by <span class="katex-eq" data-katex-display="false">\text{eq. 37}</span>, demonstrate the solution's accuracy over time.

Neural Networks: A Shift Towards Empiricism

Neural network-based closure modeling represents a departure from traditional approaches that rely on hand-crafted constitutive relationships or statistical assumptions. These networks function as universal approximators, trained on high-fidelity simulation or experimental data to directly map resolved flow variables – such as strain rate or velocity gradients – to unresolved sub-grid stresses. This data-driven methodology allows the network to capture non-linear and complex interactions between flow features without requiring a priori assumptions about their functional form. The flexibility inherent in neural networks enables them to represent a wider range of physical phenomena and adapt to diverse flow conditions, potentially improving the accuracy and robustness of Large Eddy Simulations (LES) and other turbulence models. The learned mappings effectively replace the parameterized models traditionally used to represent the effects of unresolved scales on the resolved flow.
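As a minimal illustration of such a learned mapping (a toy sketch, not one of the architectures studied in the paper), a small fully connected network can map the pointwise velocity-gradient components to the independent components of the sub-filter stress tensor:

```python
import numpy as np

# Toy sketch of a pointwise neural closure: the 9 resolved velocity-gradient
# components at a grid point are mapped to the 6 independent components of the
# symmetric sub-filter stress tensor. Weights are random stand-ins for a
# trained network; the sizes and names here are illustrative assumptions.
rng = np.random.default_rng(0)

W1 = rng.standard_normal((9, 32)) * 0.1   # input -> hidden weights
b1 = np.zeros(32)
W2 = rng.standard_normal((32, 6)) * 0.1   # hidden -> stress components
b2 = np.zeros(6)

def closure(grad_u):
    """grad_u: (..., 9) flattened velocity gradient; returns (..., 6) stresses."""
    h = np.tanh(grad_u @ W1 + b1)
    return h @ W2 + b2

# A batch of 4 grid points:
tau = closure(rng.standard_normal((4, 9)))
print(tau.shape)  # (4, 6)
```

In practice the inputs come from filtered high-fidelity data and the weights are fit by minimizing a loss against the corresponding ground-truth stresses.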

Current research explores the application of multiple neural network architectures to turbulence modeling, specifically for representing sub-grid stresses in Large Eddy Simulation (LES). Standard Convolutional Networks (Conv) provide a baseline approach leveraging established image processing techniques. Tensor Basis Neural Networks (TBNN) utilize tensor decompositions to reduce the number of learnable parameters and potentially improve generalization. Group Convolutional Networks (G-conv) introduce group convolutions, aiming to enforce symmetries and reduce computational cost. These architectures differ in their internal structure and parameterization, but all are being investigated for their ability to learn a mapping between resolved flow field data and the corresponding sub-grid stress tensor, ultimately impacting LES accuracy and efficiency.
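The tensor-basis idea behind TBNN can be sketched as follows. The coefficients below are fixed stand-ins for what a trained network would predict from scalar invariants, and the particular three-tensor basis is illustrative, not the paper's:

```python
import numpy as np

# Sketch of a tensor-basis closure in the spirit of TBNN: the predicted stress
# is a linear combination of basis tensors built from the resolved strain-rate
# S and rotation-rate R. Symmetry of the output is guaranteed by construction.
rng = np.random.default_rng(1)
A = rng.standard_normal((3, 3))          # resolved velocity gradient
S = 0.5 * (A + A.T)                      # strain-rate tensor (symmetric)
R = 0.5 * (A - A.T)                      # rotation-rate tensor (antisymmetric)

basis = [S,
         S @ R - R @ S,                  # symmetric commutator term
         S @ S - np.trace(S @ S) / 3 * np.eye(3)]  # deviatoric S^2 term
coeffs = [0.3, -0.1, 0.05]               # in a TBNN these come from the network

tau = sum(c * T for c, T in zip(coeffs, basis))
print(np.allclose(tau, tau.T))  # True: symmetry is built into the basis
```

Because each basis tensor is symmetric, the predicted stress is symmetric for any coefficients, which is one way such architectures reduce the space of learnable functions to physically admissible ones.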

Neural network closure models function by establishing a learned relationship between the resolved scales of a turbulent flow and the unresolved sub-grid stresses. This is achieved through training the network on high-fidelity data, allowing it to predict the statistical contribution of the sub-grid scales to the overall stress tensor. Evaluations using a-posteriori Large Eddy Simulation (LES) have demonstrated that different network architectures – including standard Convolutional Networks (Conv), Tensor Basis Neural Networks (TBNN), and Group Convolutional Networks (G-conv) – yield comparable solution accuracy. This is evidenced by approximately equal normalized root-mean-squared errors calculated across all three architectures when compared against reference data, indicating that the specific network structure does not significantly impact predictive capability within the tested parameters.

Similar to weight projection in 1D convolutional neural networks, our octahedral group-convolutions utilize a comparable procedure but remove the locality constraint, allowing for non-zero values across the entire weight space.

Symmetry as a Fundamental Constraint

The Navier-Stokes equations governing turbulent flows are invariant under spatial translations and rotations. This inherent symmetry implies that statistical properties of the turbulence, such as the energy spectrum and two-point correlation functions, should remain unchanged under these transformations. Closure models, which approximate unclosed terms in the Reynolds-Averaged Navier-Stokes (RANS) or Large Eddy Simulation (LES) equations, often introduce asymmetry through their functional form or training data. Failure to preserve these symmetries can lead to non-physical predictions, numerical instability, and reduced accuracy, particularly when modeling complex flows or when extrapolating beyond the training dataset. Consequently, maintaining symmetry in closure models is crucial for ensuring both physical realism and computational robustness.

Group Convolutional Networks (G-conv) are designed to explicitly incorporate known symmetries into turbulence closure models, contrasting with standard, unconstrained Convolutional Networks (Conv). Quantitative analysis reveals that the unconstrained Conv network demonstrates an equivariance error of 5.8% when applied to turbulent flow simulations, indicating a deviation from physically realistic behavior. This error stems from the network’s inability to consistently maintain symmetry under transformations of the input data. G-conv addresses this by leveraging group theory to constrain the convolutional operations, ensuring that the modeled turbulence adheres to fundamental physical principles related to symmetry and rotational invariance.

Preservation of symmetry within turbulence closure models directly impacts the modeled energy spectrum, resulting in greater adherence to the <span class="katex-eq" data-katex-display="false">k^{-5/3}</span> scaling law described by the Kolmogorov spectrum. This spectral alignment indicates a more physically realistic representation of energy transfer across scales within the turbulent flow. Models that fail to preserve symmetry demonstrate deviations from this <span class="katex-eq" data-katex-display="false">k^{-5/3}</span> behavior, particularly at higher wave numbers, signifying an inaccurate depiction of small-scale turbulence and potentially leading to instability in simulations. The degree to which a model preserves symmetry correlates with the accuracy of its representation of the energy cascade and the overall statistical properties of the simulated turbulence.
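The scaling law referenced here is the Kolmogorov inertial-range spectrum, stated below in standard notation, with the Kolmogorov constant and the mean dissipation rate as the only parameters:

```latex
% Kolmogorov inertial-range scaling: C_K is the Kolmogorov constant
% (empirically about 1.5) and \varepsilon the mean dissipation rate.
E(k) = C_K \, \varepsilon^{2/3} \, k^{-5/3}
```

A modeled spectrum that tracks this power law across the resolved wavenumbers is the spectral benchmark against which the closure models are compared.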

Kernel density estimates of normalized <span class="katex-eq" data-katex-display="false">(r, q)</span> pairs reveal that both filtered DNS and LES solutions closely follow the Vieillefosse tail <span class="katex-eq" data-katex-display="false">(r/2)^{2}+(q/3)^{3}=0</span>, indicating accurate representation of turbulent kinetic energy transfer.
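The <span class="katex-eq" data-katex-display="false">(r, q)</span> plane in this figure is built from the invariants of the velocity-gradient tensor. A short sketch of how they are computed, using the standard definitions rather than code from the paper:

```python
import numpy as np

# Invariants of a traceless velocity-gradient tensor A (standard definitions):
# q = -tr(A^2)/2, r = -tr(A^3)/3. The Vieillefosse tail is the zero level set
# of the discriminant d = (r/2)^2 + (q/3)^3.
rng = np.random.default_rng(3)
A = rng.standard_normal((3, 3))
A -= np.trace(A) / 3 * np.eye(3)         # enforce incompressibility: tr(A) = 0

q = -0.5 * np.trace(A @ A)
r = -np.trace(A @ A @ A) / 3.0
d = (r / 2) ** 2 + (q / 3) ** 3
print(q, r, d)
```

For a traceless tensor the eigenvalues satisfy the characteristic equation with these invariants as coefficients, so the sign of the discriminant classifies the local flow topology, and the density concentrating along the tail reflects the characteristic teardrop shape of turbulent velocity-gradient statistics.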

The Imperative of Rigorous Training

The creation of robust neural network closure models hinges on the quality of the training data, making data generation a foundational step in the process. These models, designed to predict sub-grid scale phenomena in complex simulations, require extensive datasets that accurately represent the underlying physics. Researchers therefore rely on high-fidelity simulations – computationally expensive but essential for capturing the nuances of turbulent flows and other intricate systems. These simulations generate the training and testing data, providing the neural networks with examples to learn from, and enabling them to generalize effectively to unseen scenarios. The accuracy of these initial simulations directly impacts the predictive power of the closure models, emphasizing the critical need for rigorous validation and careful consideration of the underlying physical assumptions during the data generation phase.

The efficacy of training neural network closure models hinges on a well-defined loss function, which serves as the critical metric for evaluating performance and directing the learning process. This function quantifies the discrepancy between the model’s predicted sub-grid stresses – the unresolved forces within turbulent flows – and the actual, ground-truth values obtained from high-resolution simulations. By minimizing this error, the neural network iteratively refines its ability to accurately represent these complex stresses. A carefully chosen loss function not only accelerates the training process but also ensures the model generalizes effectively to unseen data, ultimately improving the accuracy and reliability of large-eddy simulations. The selection of an appropriate function, therefore, is paramount to successfully bridging the gap between computational efficiency and physical realism in fluid dynamics.
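A common concrete choice for such a loss, shown here as an illustrative sketch with hypothetical names rather than the paper's exact formulation, is the mean squared error over stress components:

```python
import numpy as np

# Illustrative mean-squared-error loss over sub-grid stress components:
# the average squared discrepancy between predicted and filtered-DNS stresses.
def sgs_loss(tau_pred, tau_true):
    return np.mean((tau_pred - tau_true) ** 2)

tau_true = np.array([[0.1, -0.2, 0.05],
                     [0.0,  0.3, -0.1]])
tau_pred = tau_true + 0.01               # uniform offset of 0.01 per component
print(sgs_loss(tau_pred, tau_true))      # approximately 1e-4 (0.01 squared)
```

Variants weight components differently or add physics-informed penalty terms, but all play the same role of steering the optimizer toward the ground-truth stresses.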

A standardized training protocol was employed across all neural network architectures, utilizing the Adam optimizer for 55 epochs. This consistent regime, a fixed number of passes through the entire training dataset, ensures a fair comparison of performance between different network designs. The Adam optimizer, known for its adaptive learning rate capabilities, efficiently navigates the complex error landscape during training, seeking to minimize the discrepancy between predicted and actual sub-grid stresses. Maintaining uniformity in the training duration and optimization algorithm minimizes the influence of these hyperparameters, allowing researchers to more confidently attribute performance differences to the inherent capabilities of each network’s structure and connectivity.
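For reference, a single Adam update has the form below. This toy sketch on a scalar quadratic loss uses the common default hyperparameters; the paper's actual learning rate and schedule are not reproduced here:

```python
import numpy as np

# Minimal sketch of the Adam update rule on loss(theta) = theta**2, run for
# 55 steps to echo the 55-epoch regime described in the text. Hyperparameters
# are the widely used defaults, assumed for illustration.
beta1, beta2, lr, eps = 0.9, 0.999, 1e-3, 1e-8
theta, m, v = 5.0, 0.0, 0.0

for t in range(1, 56):
    g = 2 * theta                        # gradient of theta**2
    m = beta1 * m + (1 - beta1) * g      # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * g**2   # second-moment (variance) estimate
    m_hat = m / (1 - beta1**t)           # bias correction
    v_hat = v / (1 - beta2**t)
    theta -= lr * m_hat / (np.sqrt(v_hat) + eps)

print(theta)  # moves from 5.0 toward the minimum at 0
```

The bias-corrected moment estimates give the per-parameter adaptive step sizes that make the optimizer robust across the differently parameterized architectures being compared.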

Towards Predictive Fidelity in Turbulence Modeling

Large Eddy Simulation (LES) relies on models to approximate the effects of unresolved turbulent scales, and recent advancements demonstrate a pathway to significantly improved predictive power. Researchers are integrating neural network closure models – trained to learn the complex relationships within turbulent flows – with techniques that enforce physical symmetries, ensuring the simulations remain stable and realistic. Crucially, these models are developed using robust training methodologies, exposing them to diverse flow conditions to enhance generalization and reliability. This combined approach circumvents limitations of traditional modeling, leading to more accurate representation of turbulent kinetic energy and dissipation rates, ultimately yielding LES results that better align with experimental observations and high-fidelity simulations.

The refinement of turbulence prediction models promises to reshape understanding across a diverse range of scientific and engineering disciplines. Accurate simulation of turbulent flows is paramount in aerospace, where it directly impacts drag reduction, lift optimization, and overall vehicle performance. Similarly, climate modeling relies heavily on representing atmospheric and oceanic turbulence to predict weather patterns, assess climate change impacts, and refine long-term forecasts. Beyond these, engineering design – encompassing everything from efficient mixing processes to the design of quieter engines and streamlined infrastructure – benefits significantly from the enhanced accuracy these models provide, enabling optimization and innovation while reducing reliance on costly physical prototyping and experimentation. Ultimately, a more precise grasp of turbulent phenomena translates to more effective solutions and designs across a broad spectrum of applications.

Continued development centers on embedding these refined turbulence models within larger, more holistic simulation environments, moving beyond isolated tests to encompass the full complexity of real-world flows. This integration aims to evaluate performance not just on idealized cases, but also in scenarios demanding higher fidelity and computational resources. A key metric for this advancement involves achieving an average Taylor-scale Reynolds number of 268, representing a substantial leap in the ability to resolve small-scale turbulent structures and capture the nuances of energy transfer within the flow. Successfully navigating these increasingly complex simulations will unlock predictive capabilities crucial for advancements across diverse fields, from optimizing aircraft design to refining climate projections and enhancing engineering processes.
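For context, the Taylor-scale Reynolds number quoted above is defined, in standard homogeneous-turbulence notation, as:

```latex
% Taylor-scale Reynolds number: u' is the rms velocity fluctuation,
% nu the kinematic viscosity, and lambda the Taylor microscale.
Re_\lambda = \frac{u' \lambda}{\nu},
\qquad
\lambda = u' \sqrt{\frac{15\,\nu}{\varepsilon}}
```

Larger values of \(Re_\lambda\) imply a wider separation between the energy-containing and dissipative scales, which is why reaching 268 represents a meaningful increase in the range of turbulent structure that must be modeled.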

Different large eddy simulation (LES) models produce varying time-averaged energy spectra, demonstrating their influence on resolving turbulent energy distribution.

The pursuit of robust closure models, as detailed in this study, echoes a fundamental tenet of mathematical rigor. The investigation into symmetry-preserving neural networks highlights the importance of inherent physical consistency within computational frameworks. This aligns with the notion that a truly elegant solution isn’t merely one that appears to function, but one provably grounded in established principles. As Erwin Schrödinger once stated, “The task is, first, to understand the correct mathematical formulation of the problem.” The research demonstrates that enforcing symmetry constraints, a form of mathematical purity, can demonstrably improve the accuracy and reliability of large-eddy simulations, moving beyond empirical success towards a more deterministic and verifiable outcome.

What’s Next?

The pursuit of turbulence modeling, as evidenced by this work, continues to resemble an exercise in controlled approximation. Symmetry-preserving neural networks represent a step towards a more principled approach, yet the fundamental question remains: are these models learning physical law, or merely skillful pattern recognition? The demonstrated improvements, while encouraging, do not address the core issue of a priori uncertainty inherent in any data-driven closure. A model that faithfully reproduces training data is not necessarily a model that generalizes to unseen states, nor is it necessarily correct.

Future work must move beyond simply demonstrating improved statistical performance. Rigorous analysis of the learned representations is crucial: can these networks genuinely capture the underlying symmetries of the Navier-Stokes equations, or are they simply exploiting spurious correlations? Furthermore, the computational expense associated with these complex models remains a practical limitation. The field must grapple with the trade-off between accuracy and efficiency, recognizing that a marginally more accurate model is of little value if it renders simulations intractable.

Ultimately, the goal is not to create models that appear to work, but models that are provably consistent with fundamental physical principles. Heuristics, while convenient, are compromises: elegant shortcuts around intractable problems, not solutions in themselves. The true measure of progress will be a shift from empirical validation to mathematical certainty.


Original article: https://arxiv.org/pdf/2603.05325.pdf

Contact the author: https://www.linkedin.com/in/avetisyan/

2026-03-08 02:07