Convolutional causal learning for aerodynamic flows | Journal of Fluid Mechanics

1. Introduction

Understanding the interaction between vortical structures and the exerted force response is essential for efficient flow control and modelling of aerodynamic flows. The process by which the surrounding flows induce forces is regarded as a causal relationship, where the vortical structures act as the cause and the aerodynamic force as the effect. This study discusses how such an aerodynamic relationship can be captured in a data-driven manner.

To examine the contribution of specific vortical structures to aerodynamic forces, a range of approaches have been considered, including the force elemental method (Chang Reference Chang1992) and the vortex force map approach (Ōtomo et al. Reference Ōtomo, Gehlert, Babinsky and Li2025), which provide spatial locations of force-generating structures. Modal analyses, such as proper orthogonal decomposition (Lumley Reference Lumley1967) and dynamic mode decomposition (Schmid Reference Schmid2010), have been employed widely to extract dominant coherent structures from fluid flows. Such techniques can serve to compress the inherently high-dimensional flow data as a foundation for reduced-order modelling, which is followed by nonlinear machine-learning techniques, capturing intrinsic nonlinearity into a low-dimensional representation (Brunton, Noack & Koumoutsakos Reference Brunton, Noack and Koumoutsakos2020; Taira, Rigas & Fukami Reference Taira, Rigas and Fukami2025).

Data-driven analysis for fluid dynamics has evolved to address complex flow physics in several directions. For example, to extract coherent structures that are highly correlated with a specific target variable, such as aerodynamic forces, extended proper orthogonal decomposition has been considered (Boree Reference Boree2003; Discetti et al. Reference Discetti, Bellani, Örlü, Serpieri, Vila, Raiola, Zheng, Mascotelli, Talamelli and Ianiro2019). To capture the nonlinear temporal evolution of such dynamics, time-delay embedding (Takens Reference Takens2006; Arbabi & Mezic Reference Arbabi and Mezic2017; Bakarji et al. Reference Bakarji, Champion, Kutz and Brunton2023) can be employed, elevating modal analysis from feature extraction to the prediction of complex dynamics (Schmid Reference Schmid2022). Furthermore, spectral linear stochastic estimation (Adrian & Moin Reference Adrian and Moin1988; Tinney et al. Reference Tinney, Coiffet, Delville, Hall, Jordan and Glauser2006) provides the mean square linear estimate of a complex turbulent component from a predictor field based on the correlation, which has been applied to identify large-scale coherent structures in turbulent jets (Tinney et al. Reference Tinney, Coiffet, Delville, Hall, Jordan and Glauser2006). While conventional data-driven modal analyses are designed primarily for statistically steady flows, recent studies have presented that data-driven time-dependent bases analysis enables the analysis of transient flow dynamics with an unsteady base state (Zamani Ashtiani, Malik & Babaee Reference Zamani Ashtiani, Malik and Babaee2022; Zamani Ashtiani & Fukami Reference Zamani Ashtiani and Fukami2025), which has also been examined with operator-based techniques (Babaee & Sapsis Reference Babaee and Sapsis2016; Kern et al. Reference Kern, Negi, Hanifi and Henningson2024; Zhong et al. Reference Zhong, Amiri-Margavi, Babaee and Taira2025).

Moving beyond correlation-based analysis, a causality-inspired approach has recently emerged to identify the causal drivers of unsteady flows, thereby assisting in a deeper understanding of complex fluid motion and dynamics. For example, the use of the Shapley additive explanations (SHAP) algorithm for turbulent flow prediction has been shown as an effective data-driven approach to identify vortical structures (Cremades et al. Reference Cremades, Hoyas and Vinuesa2025b
, Reference Cremades, Martinez-Sanchez, Lozano-Duran and Vinuesa2026), with respect to the conveyed information about the temporal flow development. Causality-based modal analysis, referred to as informative and non-informative mode decomposition, has also been introduced (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024). The formulation has been extended to aerodynamic flows by considering the lift-generating mechanism as a cause-and-effect relationship (Fukami & Araki Reference Fukami and Araki2026). It is important to note that such approaches enable the modal analysis of flow with aperiodic or transient base states, while traditional techniques are often limited to the dynamics with a statistically stationary base flow (Linot et al. Reference Linot, Lopez-Doriga, Zhong and Taira2025). Furthermore, the inherent nonlinearity offers an advantage to capture dominant features in a nonlinear system over the conventional linear method (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024).

This study considers information-theoretic convolutional learning that achieves mode decomposition based on the informatics point of view, providing modal structures related to aerodynamic force while identifying reduced-order representations of important vortical structures. While existing information-theoretic mode decompositions operate in a point-wise manner, which yields spatial discontinuity along with an expensive inference cost (Fukami & Araki Reference Fukami and Araki2026), the current technique, based on convolutional networks (LeCun et al. Reference LeCun, Bottou, Bengio and Haffner1998), enables the extraction of coherent modal structures and identifies a submanifold that represents the relationship between vortical flows and aerodynamic forces. The proposed method is applied to a range of aerodynamic flow examples, exhibiting that the present information-theory-assisted model is capable of extracting the relationship between vortical structures and exerted lift in a transient manner.

The present paper is organised as follows. The formulation of informative mode decomposition is described in § 2. Results are discussed in § 3. Conclusions are remarked in § 4.

2. Approach

This study aims to capture the causal relationship between vortical flows and aerodynamic response in a data-driven manner. To achieve this, we consider decomposing a given state

${\boldsymbol q}({\boldsymbol x},t)$

based on the contribution to the target variables at a future time step

$\boldsymbol{\lambda }(\boldsymbol{x},t+\Delta t)$

. The present approach, illustrated in figure 1, decomposes a given flow snapshot in a manner,

(2.1)

\begin{equation} \boldsymbol{q}(\boldsymbol x, t) = \boldsymbol{q}_I(\boldsymbol x, t) +\boldsymbol{q}_R(\boldsymbol x, t), \end{equation}

where

${\boldsymbol{q}}_{I}$

and

${\boldsymbol{q}}_{R}$

are the informative and residual components, respectively (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024). In this study, the field of vorticity

$\boldsymbol \omega$

and the second invariant

$Q$

of the velocity gradient tensor are considered as the given state

$\boldsymbol q(\boldsymbol x, t)$

, while the lift coefficient

$C_L \equiv F_L/(0.5\rho u_\infty ^2 c)$

is selected as the target variable

$\boldsymbol \lambda$

. Here,

$F_L$

,

$u_\infty$

,

$c$

and

$\rho$

denote the lift force, free-stream velocity, chord length and fluid density, respectively. This set-up is motivated by the fundamental aerodynamic relationship between lift and circulation

$\varGamma$

, i.e.

$\varGamma \propto C_L$

, which may enable us to examine how the present method captures such underlying physics in a data-driven manner.

Figure 1.

An example of the given state

$\boldsymbol q$

and the informative component

$\boldsymbol q _I$

decomposed by a data-driven technique.

An illustration of a data-driven technique decomposing a given state into an informative component, showing input data, a convolutional neural network, and the resulting informative mode.

Figure 1. Long description

The illustration depicts a data-driven technique used to decompose a given state into an informative component. On the left, the input data is represented by a complex, colorful, three-dimensional structure. In the center, a series of blocks and layers symbolize a convolutional neural network, indicating the processing steps involved. On the right, the resulting informative mode is shown as a simplified, structured form of the original input data. The image highlights the transformation and extraction of key features from the input data through the neural network, emphasizing the reduction of complexity and the identification of significant patterns.

While traditional data-driven approaches extract dominant features based on correlation, this study aims to perform modal extraction based on the predefined causal relationship. To achieve this, we consider measuring the amount of information quantitatively with the concept of Shannon entropy, which assesses uncertainty and randomness of arbitrary variables (Shannon Reference Shannon1948; Martínez-Sánchez et al. Reference Martínez-Sánchez, Arranz and Lozano-Durán2024), following the formulation of informative and non-informative decomposition originally introduced by Arranz & Lozano-Durán (Reference Arranz and Lozano-Durán2024). The Shannon entropy

$H(\boldsymbol{\lambda })$

for a target variable

$\boldsymbol{\lambda }$

in the future is described as

(2.2)

\begin{align} H(\boldsymbol{\lambda })=-\sum _{\boldsymbol S \in \mathcal{S} } p_{\boldsymbol{\lambda }}(\boldsymbol{\lambda }=\boldsymbol S)\log p_{\boldsymbol{\lambda }}(\boldsymbol{\lambda }=\boldsymbol S), \end{align}

where

$p_{\boldsymbol{\lambda }}$

is the probability of

$\boldsymbol \lambda$

being in state

$\boldsymbol S$

and

$\mathcal{S}$

represents the set of all possible states of

$\boldsymbol \lambda$

. The remaining information about

$\boldsymbol{\lambda }$

, not contained in

$\boldsymbol q_I$

, is measured by the conditional Shannon entropy described by

(2.3)

\begin{align} H(\boldsymbol{\lambda }|\boldsymbol{q}_I)=-\sum _{\boldsymbol S \in \mathcal S} \sum _{\boldsymbol R \in \mathcal R} p_{\boldsymbol{\lambda },\boldsymbol{q}_I}(\boldsymbol S, \boldsymbol R)\log \frac {{p_{\boldsymbol{\lambda }, \boldsymbol q_I}(\boldsymbol S, \boldsymbol R)}}{p_{\boldsymbol{q}_I}(\boldsymbol R)}, \end{align}

where

$p_{\boldsymbol \lambda , \boldsymbol q_I}$

is the joint probability distribution of

$\boldsymbol \lambda$

and

$\boldsymbol p$

,

$\boldsymbol R$

is a state of the informative component

$\boldsymbol q_I$

, and

$\mathcal R$

is all the possible states of

$\boldsymbol q_I$

. The difference between

$H(\boldsymbol{\lambda }|\boldsymbol{q}_I)$

and

$H(\boldsymbol{\lambda })$

is called the mutual information such that

$ I(\boldsymbol \lambda ; \boldsymbol q) = H(\boldsymbol{\lambda }) – H(\boldsymbol{\lambda }|\boldsymbol{q}_I)$

, expressing the amount of information shared between

$\boldsymbol q_I$

and

$\boldsymbol \lambda$

. The informative component is defined as the state that maximises mutual information with

$\boldsymbol \lambda$

at a future time stamp. This is achieved when the conditional Shannon entropy of

$\boldsymbol \lambda (t+\Delta t)$

and the informative component

$\boldsymbol q_I$

is zero,

$H(\boldsymbol{\lambda }|\boldsymbol{q}_I) = 0$

, which means

$\boldsymbol q_I$

completely determines the target variables

$\boldsymbol{\lambda }$

. Furthermore, the mutual information between the informative and residual components should be zero,

(2.4)

\begin{equation} I\left(\boldsymbol q_R; \boldsymbol q_I\right) = 0, \end{equation}

since each decomposed component should be statistically independent.

Figure 2.

Informative mode extractor

$\mathcal F$

based on (

$a$

) convolutional autoencoder and (

$b$

) convolutional neural network.

A diagram showing two types of informative mode extractors: convolutional autoencoder and convolutional neural network.

Figure 2. Long description

The diagram illustrates two types of informative mode extractors. On the left, a convolutional autoencoder is depicted with a series of blue blocks representing input data, a network of interconnected nodes in the middle, and another series of blue blocks representing output data. A green circle labeled ‘Target variable’ is connected to the network. On the right, a convolutional neural network is shown with a series of gray blocks, each containing a small red square, representing different layers of the network. A green circle labeled ‘Target variable’ is connected to one of the layers.

To extract the informative vortical structure

$\boldsymbol q_I$

from the given vorticity field

$\boldsymbol q$

with respect to a future target of lift, i.e.

$\lambda = C_L$

, we construct an informative mode extractor

$\mathcal F$

,

(2.5)

\begin{align} \boldsymbol q_I

where

$\boldsymbol w$

is the weight parameter of the mode extractor. This extractor is implemented as a certain type of neural network, convolutional deep sigmoidal flow (Huang et al. Reference Huang, Krueger, Lacoste and Courville2018). This model is constrained to possess non-negative weight values with bijective activation functions, which guarantees a bijective transformation to offer the solution

$\boldsymbol q _I$

satisfying an information-theoretic condition of

$H(C_L|\boldsymbol{q}_I) = 0$

. This extractor decomposes the given state at each time stamp, yielding time-varying informative modes. The optimisation for the weight distribution is performed with

(2.6)

\begin{align} \boldsymbol w^* = \operatorname {argmin}_{\boldsymbol w}||\boldsymbol q – \boldsymbol q_I||_2+\beta ||I(\boldsymbol q_R; \boldsymbol q_I)||_2, \end{align}

where the regular regression loss and mutual information loss are balanced with a constant parameter

$\beta$

(Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024). Without the second term, the present model simply replicates the input field, i.e.

$\boldsymbol q_I \approx \boldsymbol q$

. The value of

$\beta$

is determined based on the L-curve analysis (Hansen & O’Leary Reference Hansen and O’Leary1993), facilitating the identification of the trade-off relationship between two terms, which will be discussed later. By minimising the cost function, the model is designed to tune

$\boldsymbol w$

to extract vortical structures

$\boldsymbol q_I$

that contain information about aerodynamic response. Note that the current formulation differs from conventional definitions of causal inference (Imbens & Rubin Reference Imbens and Rubin2015) as this study does not consider any manipulations or perturbations in the system. Rather, this study focuses on the force generation dynamics due to the presence of vortical structures.

This study chooses two neural-network architectures as a mode extractor, depending on the flow of interest. We first consider a convolutional-autoencoder-based model (LeCun et al. Reference LeCun, Bottou, Bengio and Haffner1998; Hinton & Salakhutdino Reference Hinton and Salakhutdino2006), as illustrated in figure 2(

$a$

). This is used for cases where the embedding latent dimensions are recognised to be few, i.e.

${\mathcal O}(10^0)$

. In other words, the model provides a low-order representation, in addition to performing the current decomposition. A series of convolutional neural networks without compression is then employed for the analysis of turbulent vortical structures, as shown in figure 2(

$b$

).

Distinct from previous studies performing the information-based decomposition via multi-layer perceptrons that require flattening of the input data (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024; Fukami & Araki Reference Fukami and Araki2026), both models in this study are based on the convolutional operation, allowing the decomposition across the entire snapshot with one shot while preserving the spatial arrangement of vortical structures (Morimoto et al. Reference Morimoto, Fukami, Zhang, Nair and Fukagata2021). This mitigates an issue of point-wise decomposition, often yielding spatially discontinuous modal structures (Fukami & Araki Reference Fukami and Araki2026). These information-theoretic models are trained to learn the relationship between two inputs: the current flow field snapshot

${\boldsymbol q}

and the lift coefficient over the time interval

$\Delta t$

,

${C_L}(t+\Delta t)$

. In other words, the models are designed to remove redundant structures, which do not contribute to the lift after the time gap

$\Delta t$

, thereby providing the time-varying informative mode

${\boldsymbol q}_I

, expected to vary with the value for

$\Delta t$

. We also discuss the dependence of the informative mode on

$\Delta t$

.

Note that the ‘mode’ produced with the current technique differs from the time-independent basis in linear modal analyses. The current informative mode

${\boldsymbol q}_I

would be analogous to modes multiplied or amplified with time-varying temporal coefficients in a linear context. While some previous studies have examined the isolation of explicit nonlinear bases from the internal architecture of machine-learning models (Erichson et al. Reference Erichson, Mathelin, Yao, Brunton, Mahoney and Kutz2020), we hereafter follow previous studies that refer to nonlinearly reconstructed flow fields, obtained from the time-varying coefficients and the basis of a nonlinear network, as ‘modes’ (Murata, Fukami & Fukagata Reference Murata, Fukami and Fukagata2020; Fukami, Nakamura & Fukagata Reference Fukami, Nakamura and Fukagata2020; Eivazi et al. Reference Eivazi, Le Clainche, Hoyas and Vinuesa2022; Cremades et al. Reference Cremades, Martinez-Sanchez, Lozano-Duran and Vinuesa2026).

3. Results and discussion

To examine how the current informative mode decomposition (IMD) extracts time-varying modes based on the contribution to the future lift response, we consider three examples of aerodynamic flows, covering a range of spatiotemporal complexity across the chord-based Reynolds number

${\textit{Re}}$

: (

$\mathrm{i}$

) extreme vortex-gust–aerofoil interactions at Re = 100 (Fukami & Taira Reference Fukami and Taira2023), (

$\mathrm{ii}$

) experimentally measured transverse-gust–wing interactions at

${\textit{Re}} = 20\,000$

(Towne et al. Reference Towne2023) and (

$\mathrm{iii}$

) a separated turbulent wake over a wing section at

${\textit{Re}} = 20\,000$

(Liu et al. Reference Liu, Trujillo Corona, Espinoza, Shu and Gross2025).

The first example with a discrete vortex gust highlights the applicability of the present method in capturing highly unsteady transient dynamics. With the second example of experimental datasets, the robustness against experimental noise of the present model is examined. We also consider a separated turbulent wake in a three-dimensional domain under quasi-cyclic behaviour, discussing how the underlying physics is extracted for aerodynamic scenarios exhibiting a range of spatial length scales.

3.1. Example 1: extreme vortex-gust–aerofoil interactions

Let us consider an extreme vortex-gust–aerofoil interaction around an NACA0012 aerofoil with an angle of attack

$\alpha = 40^\circ$

at

${\textit{Re}} =100$

. Data sets are produced by fully validated and verified direct numerical simulations (Fukami & Taira Reference Fukami and Taira2023; Fukami, Nakao & Taira Reference Fukami, Nakao and Taira2024
b). Unsteady periodic shedding is observed for the case without the presence of a vortex gust. A Taylor vortex (Taylor Reference Taylor1918) is introduced upstream of an aerofoil, producing transient and drastic excitation of aerodynamic characteristics. Gust ratio

$G\equiv u_{\theta ,{\textit{max}}}/u_{\infty }$

, where

$u_{\theta ,{\textit{max}}}$

is the maximum rotational velocity, and the gust diameter

$D$

normalised by the chord length are set to

$(G,D) = (2,0.5)$

. Note that the current condition with

$|G|\gt 1$

, which can happen in wind shear, mountain-covering areas and severe atmospheric turbulence, is classified as extreme aerodynamics, traditionally thought of as unflyable for small-sized aircraft (Jones, Cetiner & Smith Reference Jones, Cetiner and Smith2022; Fukami & Taira Reference Fukami and Taira2023; Taira Reference Taira2026). Understanding the process of lift generation in such transient dynamics is crucial for achieving stable flight operations. This example, hence, can serve as an ideal case to examine how the model identifies time-varying relationships between vortical motion and aerodynamic response.

Multiple graphs depict vorticity field, extracted modes, lift coefficient, and latent-variable evolution in vortex-aerofoil interaction.

Figure 3. Long description

The image contains multiple graphs and visualizations related to vortex-aerofoil interaction. The top section shows three columns of vorticity fields and extracted modes at different time points, labeled as Input and IMD with different time intervals. The bottom section includes a line graph of the lift coefficient over time, with a zoomed-in view of an extracted mode and another graph showing the evolution of latent variables. The vorticity fields are color-coded to represent different values, with red indicating positive vorticity and blue indicating negative vorticity. The line graph shows the lift coefficient with a grey line representing the undisturbed case and red dots indicating specific data points. The zoomed-in view highlights the detailed structure of the extracted mode, and the latent-variable evolution graph shows the changes over time with different line styles representing different variables. All values are approximated.

We perform the present decomposition on the vorticity field

$\boldsymbol{\omega } (\boldsymbol x,t)$

, as exhibited in figure 3. Here, the balancing parameter

$\beta$

is set to

$0.05$

, although the effect of

$\beta$

will be examined later. The present convolutional learning-based approach offers time-varying modes that capture the effect of gust on the lift response, as shown in figure 3(

$a$

). With

$\Delta t = 0.0085$

, where an almost instantaneous contribution is considered, the current approach offers a time-dependent informative mode capturing the influence of the gust on the lift response, as shown in figure 3(

$a$

). At

$t = -0.724$

, when the gust is still distant from the leading edge, the gust does not appear in the informative mode, implying that the temporal lift variation is determined by flow around a wing, rather than the gust itself. Strong impingement of the counter-clockwise vortex near the leading edge introduces a sharp peak in the lift response, as presented in figure 3(

$b$

). Near the peak at

$t = -0.299$

, the gust appears in the extracted mode since the effect of the gust becomes dominant. In contrast, when the lift fluctuation heads to the level of the undisturbed scenario at

$t = 0.977$

, the separating structure is recognised as less informative than the structure around the aerofoil. The presence of the gust in the informative mode is observed within a limited time duration, exhibiting that the exerted lift is governed by a finite time interaction.

The dependence of the informative mode on the time interval

$\Delta t$

is then examined. With a larger time window of

$\Delta t = 0.255$

, the vortex core appears in the informative mode at

$t = -0.724$

, whereas the model with

$\Delta t = 0.0085$

disregards it. Furthermore, the vortex core appears to split into left and right halves near the leading edge at

$t = -0.299$

, as highlighted in figure 3(

$c$

). The left part is also highlighted in the informative mode with

$\Delta t = 0.0085$

, suggesting that this structure directly affects the aerofoil. On the other hand, the right part, which disappears

$0.7$

convective time later than the left one, seems to contribute to the lift response by interacting with the separated wake. This suggests that the model captures two types of lift-generation processes in the current transient aerodynamic scenario.

Figure 4.

The dependence of the mutual information loss on the number of training snapshots

$n_{ {\textit{snapshot}}}$

among all the snapshots

$n_{ {all}}$

. (a–d) Informative components at

$t = -0.299$

.

A 3D plot shows the dependence of mutual information loss on the number of training snapshots and beta values, with four subplots illustrating informative components.

Figure 4. Long description

A 3D plot depicts the relationship between mutual information loss, the ratio of training snapshots to total snapshots, and beta values. The plot includes four distinct lines, each representing different beta values: 0.001, 0.005, 0.01, and 0.05. The lines show how mutual information loss varies with the number of training snapshots. Four subplots (a, b, c, d) illustrate informative components at different snapshot ratios: 1/16, 1/8, 1/8, and 1, respectively. Each subplot features color-coded regions indicating different levels of information, with blue and red areas representing varying degrees of informativeness. The subplots provide visual insights into the distribution of informative components at specific snapshot ratios.

Figure 5.

The dependence of the informative components on the value of

$\beta$

. The decomposed informative modes at

$t = -0.299$

are shown with the L-curve plot.

A scatter plot and three images showing fluid dynamics analysis with varying beta values.

Figure 5. Long description

The image contains a scatter plot and three fluid dynamics visualizations. The scatter plot on the left shows the dependence of informative components on the value of beta, with different colored dots representing various beta values. The x-axis is labeled with the norm of the difference between two vectors, and the y-axis is labeled with the informative components. The three images on the right display the decomposed informative modes at different beta values: 0.05, 0.01, and 0.1. Each image shows a fluid dynamics pattern with color gradients indicating different flow characteristics. The first image is labeled ‘Input,’ the second ‘IMD: beta = 0.05,’ the third ‘IMD: beta = 0.01,’ and the fourth ‘IMD: beta = 0.1.’ The color bar next to the scatter plot indicates the range of beta values used in the analysis.

A low-dimensional representation identified through the present autoencoder-based model is further analysed in figures 3(

$d$

) and 3(

$e$

). The latent dimension is set to be three, following the previous study that discusses the manifold discovery of extreme vortex–aerofoil interactions (Fukami & Taira Reference Fukami and Taira2023). The red trajectory for the disturbed case deviates from the grey trajectory for the undisturbed flow and shows peaks, highlighting when the effect of the gust is prominent. With

$\Delta t = 0.0085$

, these peaks coincide with those of the lift coefficient induced by interaction with the gust, suggesting that the current latent space contains an instantaneous relationship with aerodynamic response. On the other hand, with

$\Delta t = 0.225$

, time gaps are observed between these peaks. The assessment of the gust contribution to the lift response is embedded in the low-order representation based on the information about the target variable. The other two latent variables exhibit a similar trend, although not shown here. The proposed method learns the relationship between vortical motions and future lift response in a transient manner and extracts coherent structures as a time-varying mode in highly unsteady aerodynamic environments.

To extract informative components from snapshot data by learning the relationship between vortical structures and future lift response, an appropriate amount of training snapshots needs to be provided to the present model. The influence of the number of snapshots on the mutual information loss

$I(\boldsymbol q_R; \boldsymbol q_I)$

is evaluated for

$n_{ {\textit{snapshot}}} /n_{ {all}} = 1/16$

to

$1$

in figure 4, where

$n_{ {all}} (=2400)$

represents the total number of prepared snapshots. In general, the mutual information loss increases as

$n_{ {\textit{snapshot}}}$

decreases, indicating that the resulting extraction is affected by the decrease in the training data set. Here, we also examine the decomposed informative components for

$n_{ {\textit{snapshot}}} /n_{ {all}} = 1/16$

and

$1/8$

with different values for the balancing parameter

$\beta$

. While the extracted mode with

$n_{ {\textit{snapshot}}}/n_{ {all}} = 1/8$

and

$\beta = 0.001$

exhibits high mutual information loss and deformed structures, by carefully tuning

$\beta$

, the present model is capable of extracting vortical structures with a mutual information loss comparable to that obtained using all snapshots, thereby identifying similar structures.

The image contains multiple graphs and visualizations related to fluid dynamics analysis.

Figure 6. Long description

The image consists of four subfigures labeled (a) through (d). Subfigure (a) displays three rows of vorticity snapshots at different time points, showing the input data, reconstructed flow field via convolutional autoencoder, and extracted informative fields. Subfigure (b) presents a line graph of the time series of the lift coefficient, with key points marked. Subfigures (c) and (d) show scatter plots of the latent space identified by the models without and with additional geometric constraints, respectively, with color coding to represent time. The x and y axes in subfigures (c) and (d) represent different latent space dimensions, and the dashed circle in subfigure (d) indicates a geometric constraint. All values are approximated.

To determine the value of the balancing parameter

$\beta$

, we employ L-curve analysis (Hansen & O’Leary Reference Hansen and O’Leary1993), facilitating the identification of the trade-off relationship between the two terms in (2.6). The relationship between the reconstruction loss and the mutual information loss across several values of

$\beta$

, along with the resulting decomposed modes, is examined, as shown in figure 5. With a small

$\beta$

of

$0.01$

, the model prioritises the regular regression loss, yielding modes almost identical to the given state, while increasing

$\beta$

allows the model to output modal structures distinct from the input field. Excessive penalty for the mutual information term hinders the reconstruction. By following this procedure, the balancing parameter

$\beta$

is systematically determined for each case.

3.2. Example 2: experimental measurements of large-amplitude transverse gust encounter

To examine the applicability of the current techniques to experimental measurements, the flow around a flat plate at a constant angle of attack

$\alpha = 0^{\circ }$

with

${\textit{Re}} = 20\,000$

passing through a large-amplitude transverse jet at gust ratio

$G = 1.5$

, made available by Towne et al. (Reference Towne2023), is considered. Details on the data curation and experimental set-ups are referred to Andreu-Angulo et al. (Reference Andreu-Angulo, abinsky, Biler, Sedky and Jones2020) and Biler et al. (Reference Biler, Sedky, Jones, Saritas and Cetiner2021). Here, we consider the spanwise vorticity field

$\boldsymbol{\omega } _z (\boldsymbol x,t)$

as a given state. For this case, we also employ a convolutional-neural-network-based autoencoder without the information-theoretic term (Fukagata & Fukami Reference Fukagata and Fukami2025). This comparison is intended to distinguish the effects of denoising and removal of redundant components, as an autoencoder composed solely of regression loss can remove noise and extract coherent structures through the compression process, as shown in figure 6(

$a$

). The difference in reconstruction between the regular autoencoder and the current method derives from introducing a mutual information term and target variable input, exhibiting how the inferred mutual information works in the present data-driven modal analysis.

Let us perform the present IMD for the experimental data set. Here, the parameters of the time window

$\Delta t$

and the balancing parameter

$\beta$

are set to

$0.005$

and

$1$

, respectively. The current model extracts coherent structures capturing the effect of aerodynamic response, as shown in figure 6(

$a$

). At

$t = 0.250$

, when the lift coefficient is nearly zero, the model disregards the majority of the given state, including the approaching gust structures. In other words, the lift force is recognised as statistically independent from the surrounding vortical flow. When the large-scale separation occurs along with the formation of positive vorticity at the leading edge, the time series of the lift coefficient shows a sharp increase for

$0.5\lt t\lt 1.74$

, as exhibited in figure 6(

$b$

). At

$t = 1.74$

, large-scale structures with positive vorticity at the leading edge and separated wake with negative vorticity at the trailing edge are assessed as informative. Once the transverse gust transitions to turbulence and dissipates at

$t = 3.06$

, the fine-scale structures that appear above the flat plate are disregarded. Similar to the case of the vortex–aerofoil interaction, vortical structures modified extensively by interaction with the gust are dominant contributors to the lift response.

Figure 7.

Probability density functions of the vorticity (grey, input; red, IMD) are shown with informative components at representative time

$t = 0.250$

,

$1.00$

and

$1.74$

.

Three graphs show probability density functions of vorticity at different times with input and IMD data.

Figure 7. Long description

The image contains three side-by-side graphs displaying probability density functions of vorticity at different times. Each graph compares input data in grey and IMD data in red. The x-axis represents normalized vorticity values ranging from -1 to 1, while the y-axis represents the probability density function on a logarithmic scale. The three graphs correspond to different time points: t = 0.250, t = 1.00, and t = 1.74. At each time point, the graphs show how the vorticity distribution evolves, with notable peaks and variations in the data. The red and grey lines indicate the differences between the input and IMD data, highlighting the changes in vorticity over time. All values are approximated.

Low-order representations are also examined. The latent dimension is set to be 2, following Smith et al. (Reference Smith, Fukami, Sedky, Jones and Taira2024), showing that the transverse gust encounter is possible to be considered as a cyclic event and compressed into a circle-shaped latent space. The compression result with the cost function in (2.6) is shown in figure 6(

$c$

). The trajectory collapses into a line with temporal discontinuity, which is challenging to interpret. To address this, we constrain the latent space to lie on a circle by adding a geometry-related cost function (Smith et al. Reference Smith, Fukami, Sedky, Jones and Taira2024). The resulting latent variables form a circular-like trajectory that evolves continuously over time, as shown in figure 6(

$d$

). Note that despite such an additional constraint, the present model successfully extracts the informative structure in a similar manner to that shown earlier. This indicates that the present approach allows identification of physically interpretable low-order representations in addition to providing time-varying modal structures based on the relationship with the target variable.

Let us examine the probability density function of the input vorticity and the decomposed field, p.d.f.

$(w_z)$

, as shown in figure 7. The probability density function of informative components exhibits a sharper peak near zero compared with the input data, suggesting that experimental noise contained in the input snapshots is removed through the extraction process. In addition, the tail of the probability density function for the informative component, corresponding to the extreme positive and negative vorticity, is shrunk compared with that for the input. This implies that intensely rotating structures do not necessarily act as the dominant drivers of future lift. Furthermore, at

$t = 1.74$

, when strong impingement causes a sharp excitation of the lift force, we observe that the informative components retain the negative values in the probability density function. This indicates that the extraction process reconstructs the transverse jet reaching the lower surface of the flat plate, a lift-related phenomenon obscured by experimental noise. The current assessment highlights the capability of the present model to selectively preserve lift-related vortical structures while mitigating experimental noise.

3.3. Example 3: separated turbulent wake over a wing section of NLF(1)-0115 aerofoil

Let us also examine turbulent flow with the current decomposition. Turbulent wake over a wing section, NLF(1)-0115 aerofoil at an angle of attack of

$\alpha = 5^\circ$

with

${\textit{Re}}= 20\,000$

is considered. Separation occurs at the leading edge, producing a quasi-cyclic turbulent wake. We discuss how the proposed method captures informative structures responsible for lift with the presence of a range of scales in length. The datasets are produced by three-dimensional large-eddy simulations. Further details on data curation are seen in Liu et al. (Reference Liu, Trujillo Corona, Espinoza, Shu and Gross2025). The field of the second invariant for the velocity gradient tensor,

$\boldsymbol{Q}(\boldsymbol x, t)$

, is considered as the source state. In this example, a series of convolutional neural networks without data compression is employed, unlike the other two cases. This is based on the prior knowledge that fine-scale structures could be lost due to two reasons: their small contribution to aerodynamic force (Fukami & Araki Reference Fukami and Araki2026), and the compression process inside the network (Fukami, Smith & Taira Reference Fukami, Smith and Taira2025). By removing the compression layers inside the network, we aim to isolate the physical contribution of these structures.

Figure 8.

Informative modal structure of spanwise-averaged separated flow over wing section at

${\textit{Re}} = 20\, 000$

and dependence of decomposed mode on time window.

A combination of line graphs and scatter plots showing fluid dynamics analysis over time with different time-delay embeddings.

Figure 8. Long description

The image contains a combination of line graphs and scatter plots. The top section features a line graph with the y-axis labeled C_L and the x-axis labeled t, displaying oscillating data points with three red dots highlighting specific values. Below, three rows of scatter plots illustrate fluid dynamics at different time-delay embeddings (Delta t = 0.05, Delta t = 0.75). Each row shows the evolution of fluid structures over time, with color gradients indicating different values of Q. The first row represents the input data, while the second and third rows show the results of time-delay embedding analysis. The scatter plots depict the spatial distribution and temporal evolution of fluid structures, with insets zooming in on specific regions for detailed examination. All values are approximated.

The current decomposition is performed into the two-dimensional spanwise-averaged field based on an instantaneous and a delayed contribution where convective time windows

$\Delta t$

are set to

$0.05$

and

$0.75$

, respectively, as shown in figure 8. The balancing parameter

$\beta$

is set to

$1 \times 10^4$

. For the case of

$\Delta t = 0.05$

, large-scale vortex cores and structures around the leading edge are identified primarily due to their significant contribution. The result with

$\Delta t = 0.75$

shows that the present model selectively isolates the contribution of the leading edge and the vortex core near the trailing edge to the lift response, while identifying the convection process of the vortex cores and shear to disregard them. Furthermore, the vortex cores in the informative mode are observed to be located between structures with negative

$Q$

. This shear is captured mainly on both the upstream and downstream sides, indicating that the shear induced by the convection of the vortex core in the streamwise direction is recognised as informative in addition to the core itself.

The present decomposition is finally extended to the three-dimensional flow field, as shown in figure 9(

$a$

). Similar to the spanwise-averaged case, the large-scale vortex cores are extracted as informative, while fine-scale structures are disregarded due to their small contribution to the lift.

Three graphs depict the structure of three-dimensional separated flow over a wing section.

Figure 9. Long description

The image contains three graphs. The first graph on the left shows the informative modal structure of three-dimensional separated flow over a wing section, visualized with an iso-surface colored by streamwise velocity. The second graph in the middle displays the probability density function of the Q-criterion field. The third graph on the right illustrates scale-decomposed fields with two cuts of length scales. The graphs collectively examine the contribution of specific vortical structures to aerodynamic forces using various analytical techniques.

We next examine the probability density function of the reference and the decomposed

$Q$

-criterion field, p.d.f.

$(\boldsymbol Q)$

, as shown in figure 9(

$b$

). The probability density function of the decomposed informative mode exhibits that the present model predominantly retains the structures with

$Q \gt 0$

. The current rotation-dominated extraction suggests that the present model assesses rotating structures as the primary drivers of aerodynamic lift generation. It is also observed that the probability density of the informative component is lower than the reference at the extreme positive region, indicating that not all intensely rotating structures are regarded as informative for the future lift. The model selectively filters out these low-contribution structures, despite their intense magnitude, isolating the lift-related structures.

The length scale of structures extracted through the current method for the three-dimensional case is further discussed through the scale-decomposition analysis (Goto, Saito & Kawahara Reference Goto, Saito and Kawahara2017; Fujino, Motoori & Goto Reference Fujino, Motoori and Goto2023). A spatial band-pass filter based on a two-dimensional Gaussian kernel

$G$

is applied to the velocity field in the

$x$

and

$y$

directions. The cutoff length scale

$\sigma _{ {\textit{max}}}$

is set to

$\sigma _{ {\textit{max}}}=c/(2\pi St)$

, which is the diameter of the roller vortices, shed with the Strouhal number

$St$

. The operation is described as

(3.1)

\begin{align} \boldsymbol{u}^{[\sigma _1, \sigma _2]}(x, y, z, t) &= \int _{\mathcal D} \boldsymbol{u}(x, y, z, t) \left[G\left(x^{\prime}, y^{\prime}; x, y, \sigma _1\right) – G\left(x^{\prime}, y^{\prime}; x, y, \sigma _2\right)\right] {\rm d}x^{\prime} {\rm d}y^{\prime}, \end{align}

where

$\mathcal D$

is the domain of integration. The

$Q$

-criterion fields for a length-scale range of [

$\sigma _1, 2\sigma _1$

] with

$\sigma _1 =(\sigma _{ {\textit{max}}}, \sigma _{ { max}}/6)$

are computed from scale-decomposed velocity fields, as exhibited in figure 9(

$c$

).

With

$\sigma _1=\sigma _{ {\textit{max}}}$

, the scale-decomposed mode shows structures qualitatively similar in scale to those of the present method. In contrast, the scale-decomposed field for

$\sigma _1=\sigma _{ {\textit{max}}}/6$

contains the rib structures in addition to the dominant vortex cores, suggesting that the scale-based decomposition inherently conflates rib structures with vortex cores due to their overlapping length scales. Consequently, this comparison suggests that the present method successfully distinguishes large-scale vortical motions based on contribution to the future lift, not solely on the length-scale information.

Figure 10. Long description

A scatter plot showing the relationship between two variables, R and Q, with hundreds of data points color-coded from blue to red. The x-axis represents the variable R, ranging from -4000 to 4000, and the y-axis represents the variable Q, ranging from -750 to 750. The data points are color-coded based on the value of Q, with blue indicating lower values and red indicating higher values. The plot shows a dense cluster of points around the center, with a gradient transitioning from blue to red as the values of Q increase. There is a noticeable spread of points both above and below the central cluster, indicating variability in the data. The overall trend suggests a concentration of data points around the central region, with fewer points scattered towards the peripheries. All values are approximated.

Moreover, we evaluate the decomposed structures on the

$Q$

–

$R$

plane, where

$R$

is the third invariant of the velocity gradient tensor, as exhibited in figure 10. While the second invariant

$Q$

characterises the balance between the rotation and strain rates, the third invariant

$R$

governs the dynamics of vortex stretching and compression. According to these invariants, the local flow topologies are classified into four distinct states: vortex stretching (

$Q\gt 0, R\gt 0$

), vortex compression (

$Q\gt 0, R\lt 0$

), axial strain (

$Q\lt 0, R\gt 0$

) and biaxial strain (

$Q\lt 0, R\lt 0$

) (Davidson Reference Davidson2015). While the input field exhibits the well-known teardrop shape spanning both positive and negative

$Q$

-criterion (Ooi et al. Reference Ooi, Martin, Soria and Chong1999; Fukami et al. Reference Fukami, Goto and Taira2024a
), the present extraction is dominantly activated for rotation-dominated structures, in particular, those experiencing vortex compression. Note that since the present model takes only the

$Q$

-criterion field as input, the present model selectively distinguishes between vortex stretching and compression based on the flow of information with respect to future lift, indicating a relationship with the lift-generating mechanism in the current flow field.

We then discuss the dependence of informative mode on the time window

$\Delta t$

. The spanwise power spectra density of the

$Q$

-criterion, averaged over time

$P(Q)$

, is examined as a function of the spanwise wavenumber

$k_z$

, in figure 11(

$a$

). The present model generally underestimates informative components compared with the reference field, suggesting that redundant components are removed due to their smaller contribution to the lift generation. Furthermore, the influence of the time delay

$\Delta t$

on the power spectral density of the informative mode is concentrated predominantly in the low-wavenumber region. This implies that the contribution to the future lift is governed primarily by these large-scale structures.

To discuss the convection process of the vortex structure captured in extracted modal structures, the streamwise variation of the absolute value of

$Q$

-criterion,

$\bar {|Q|}(x) = \overline {|Q(x,y,z,t)|}^{y,z,t}$

, measuring the ensemble average over time and the

$y{-}z$

plane, is assessed, as shown in figure 11(

$b$

). The indicator of the

$\bar {|Q|}(x)$

downstream region decreases as

$\Delta t$

increases, suggesting that the contribution of the structures there is recognised as less significant due to the convection process. These findings support the applicability of the present method to turbulent flows, highlighting its potential to advance the physical interpretation of unsteady aerodynamics via data-driven modal analysis.

4. Concluding remarks

This study considered an information-theoretic machine-learning method that provides time-varying informative vortical structure related to the future lift coefficient. Through three example flows: extreme vortex-gust–aerofoil interactions, experimentally measured transverse gust encounter, and separated turbulent wake, the present method extracts structures associated with lift generation from the snapshot data. Built upon the previous studies (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024; Fukami & Araki Reference Fukami and Araki2026), which perform the decomposition locally, the present data-driven approach enables the extraction of the dynamic relationship between vortical structures and the lift response by framing the cause-and-effect association within the lift-generation mechanism. The current convolutional network-based deep sigmoidal flow decomposes the input flow field globally with a spatiotemporal arrangement of vortical structures preserved. Furthermore, a low-order representation of the informative vortical structures and the aerodynamic coefficient is identified while performing decomposition. The proposed method is capable of visualising the relationships and offers key insights into the mechanism of force production under highly unsteady aerodynamics, based solely on flow field data and information metrics.

The current global extraction approach enables the extraction of coherent structure across the snapshots since the present information-theoretic convolutional machine-learning model has access to the spatial arrangement of vortical structures. Due to the capability of the present method to account for such spatial correlations, one promising future prospect is to apply this global approach to flow fields characterised by a broader range of structural scales, including wall-bounded turbulence. Furthermore, it will be of interest to conduct a direct comparison between the present global framework and the conventional local decomposition (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024; Fukami & Araki Reference Fukami and Araki2026). By examining how the incorporation of spatial arrangements influences the mode extraction process compared with an isolated point-wise analysis, we could further discuss the coupling effects of vortical structures on the target variable.

We interpreted the lift-generation mechanism as a cause-and-effect relationship, where vortical structures act as the cause, while the resulting aerodynamic force acts as the effect. To elucidate this relationship from an aerodynamic approach, one can consider using the force elemental method (Chang Reference Chang1992) or the vortex force map approach (Ōtomo et al. Reference Ōtomo, Gehlert, Babinsky and Li2025). Comparing or incorporating them would be of interest to deepen our understanding of vortex-induced lift generation. Since how the input and output, along with the optimisation set-up, are prepared greatly affects the resulting structures assessed as informative (Cremades et al. Reference Cremades, Hoyas and Vinuesa2025a
; Hoyas et al. Reference Hoyas, Benedikt, Cremades and Vinuesa2025), examining the dependence of identified structures on the selected variable and the mathematical definition of the cause-and-effect relationship would be of interest. Although this study considered the relationship between lift and vortical structures over time based on prior knowledge of aerodynamics, evaluating them based on emerging approaches in the field of causal inference may clarify the rather ambiguous concepts of causality in the current vortex-induced force generation problem (Imbens & Rubin Reference Imbens and Rubin2015).

Furthermore, the application of extracted ‘informativeness’ in terms of information theory can be explored by incorporating the current approach to the reinforcement learning-based active flow control. While a previous study has utilised informative components for instantaneous opposition control for drag reduction in turbulent channel flows (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024), extending this concept to lift-enhancement strategies will provide insights for aerodynamic flow control.

Source link

創建binance帳戶 commented on MEGA sconto del 34% su Amazon: Your article helped me a lot, is there any more re
binance registrering commented on Global Industrial Automation Services Market Size to Reach: Your point of view caught my eye and was very inte
binance commented on WestMetric Defends Controversial On-Page SEO Services for the Era of AI: I don't think the title of your article matches th
创建个人账户 commented on AI in CMO Strategy: Transforming Marketing Leadership: Can you be more specific about the content of your
binance account creation commented on The rise of Artificial Intelligence in Film & TV: Thank you for your sharing. I am worried that I la

Convolutional causal learning for aerodynamic flows | Journal of Fluid Mechanics

1. Introduction

2. Approach

3. Results and discussion

3.1. Example 1: extreme vortex-gust–aerofoil interactions

3.2. Example 2: experimental measurements of large-amplitude transverse gust encounter

3.3. Example 3: separated turbulent wake over a wing section of NLF(1)-0115 aerofoil

4. Concluding remarks

RECENT POSTS

AI learned the laws of the universe and it became a problem

Should the community be “excited” about AI data center projects?

AI company Firmus will become Tasmania’s biggest power user if three planned sites come to fruition

1. Introduction

2. Approach

3. Results and discussion

3.1. Example 1: extreme vortex-gust–aerofoil interactions

3.2. Example 2: experimental measurements of large-amplitude transverse gust encounter

3.3. Example 3: separated turbulent wake over a wing section of NLF(1)-0115 aerofoil

4. Concluding remarks

Related Posts