Convolutional causal learning for aerodynamic flows | Journal of Fluid Mechanics

Machine Learning


1. Introduction

Understanding the interaction between vortical structures and the exerted force response is essential for efficient flow control and modelling of aerodynamic flows. The process by which the surrounding flows induce forces is regarded as a causal relationship, where the vortical structures act as the cause and the aerodynamic force as the effect. This study discusses how such an aerodynamic relationship can be captured in a data-driven manner.

To examine the contribution of specific vortical structures to aerodynamic forces, a range of approaches have been considered, including the force elemental method (Chang Reference Chang1992) and the vortex force map approach (Ōtomo et al. Reference Ōtomo, Gehlert, Babinsky and Li2025), which provide spatial locations of force-generating structures. Modal analyses, such as proper orthogonal decomposition (Lumley Reference Lumley1967) and dynamic mode decomposition (Schmid Reference Schmid2010), have been employed widely to extract dominant coherent structures from fluid flows. Such techniques can serve to compress the inherently high-dimensional flow data as a foundation for reduced-order modelling, which is followed by nonlinear machine-learning techniques, capturing intrinsic nonlinearity into a low-dimensional representation (Brunton, Noack & Koumoutsakos Reference Brunton, Noack and Koumoutsakos2020; Taira, Rigas & Fukami Reference Taira, Rigas and Fukami2025).

Data-driven analysis for fluid dynamics has evolved to address complex flow physics in several directions. For example, to extract coherent structures that are highly correlated with a specific target variable, such as aerodynamic forces, extended proper orthogonal decomposition has been considered (Boree Reference Boree2003; Discetti et al. Reference Discetti, Bellani, Örlü, Serpieri, Vila, Raiola, Zheng, Mascotelli, Talamelli and Ianiro2019). To capture the nonlinear temporal evolution of such dynamics, time-delay embedding (Takens Reference Takens2006; Arbabi & Mezic Reference Arbabi and Mezic2017; Bakarji et al. Reference Bakarji, Champion, Kutz and Brunton2023) can be employed, elevating modal analysis from feature extraction to the prediction of complex dynamics (Schmid Reference Schmid2022). Furthermore, spectral linear stochastic estimation (Adrian & Moin Reference Adrian and Moin1988; Tinney et al. Reference Tinney, Coiffet, Delville, Hall, Jordan and Glauser2006) provides the mean square linear estimate of a complex turbulent component from a predictor field based on the correlation, which has been applied to identify large-scale coherent structures in turbulent jets (Tinney et al. Reference Tinney, Coiffet, Delville, Hall, Jordan and Glauser2006). While conventional data-driven modal analyses are designed primarily for statistically steady flows, recent studies have presented that data-driven time-dependent bases analysis enables the analysis of transient flow dynamics with an unsteady base state (Zamani Ashtiani, Malik & Babaee Reference Zamani Ashtiani, Malik and Babaee2022; Zamani Ashtiani & Fukami Reference Zamani Ashtiani and Fukami2025), which has also been examined with operator-based techniques (Babaee & Sapsis Reference Babaee and Sapsis2016; Kern et al. Reference Kern, Negi, Hanifi and Henningson2024; Zhong et al. Reference Zhong, Amiri-Margavi, Babaee and Taira2025).

Moving beyond correlation-based analysis, a causality-inspired approach has recently emerged to identify the causal drivers of unsteady flows, thereby assisting in a deeper understanding of complex fluid motion and dynamics. For example, the use of the Shapley additive explanations (SHAP) algorithm for turbulent flow prediction has been shown as an effective data-driven approach to identify vortical structures (Cremades et al. Reference Cremades, Hoyas and Vinuesa2025b
, Reference Cremades, Martinez-Sanchez, Lozano-Duran and Vinuesa2026), with respect to the conveyed information about the temporal flow development. Causality-based modal analysis, referred to as informative and non-informative mode decomposition, has also been introduced (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024). The formulation has been extended to aerodynamic flows by considering the lift-generating mechanism as a cause-and-effect relationship (Fukami & Araki Reference Fukami and Araki2026). It is important to note that such approaches enable the modal analysis of flow with aperiodic or transient base states, while traditional techniques are often limited to the dynamics with a statistically stationary base flow (Linot et al. Reference Linot, Lopez-Doriga, Zhong and Taira2025). Furthermore, the inherent nonlinearity offers an advantage to capture dominant features in a nonlinear system over the conventional linear method (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024).

This study considers information-theoretic convolutional learning that achieves mode decomposition based on the informatics point of view, providing modal structures related to aerodynamic force while identifying reduced-order representations of important vortical structures. While existing information-theoretic mode decompositions operate in a point-wise manner, which yields spatial discontinuity along with an expensive inference cost (Fukami & Araki Reference Fukami and Araki2026), the current technique, based on convolutional networks (LeCun et al. Reference LeCun, Bottou, Bengio and Haffner1998), enables the extraction of coherent modal structures and identifies a submanifold that represents the relationship between vortical flows and aerodynamic forces. The proposed method is applied to a range of aerodynamic flow examples, exhibiting that the present information-theory-assisted model is capable of extracting the relationship between vortical structures and exerted lift in a transient manner.

The present paper is organised as follows. The formulation of informative mode decomposition is described in § 2. Results are discussed in § 3. Conclusions are remarked in § 4.

2. Approach

This study aims to capture the causal relationship between vortical flows and aerodynamic response in a data-driven manner. To achieve this, we consider decomposing a given state

bold italic q left parenthesis bold italic x comma t right parenthesis


${\boldsymbol q}({\boldsymbol x},t)$


based on the contribution to the target variables at a future time step

bold italic lamda left parenthesis bold italic x comma t plus normal upper Delta t right parenthesis


$\boldsymbol{\lambda }(\boldsymbol{x},t+\Delta t)$


. The present approach, illustrated in figure 1, decomposes a given flow snapshot in a manner,

(2.1)

bold italic q left parenthesis bold italic x comma t right parenthesis equals bold italic q Subscript upper I Baseline left parenthesis bold italic x comma t right parenthesis plus bold italic q Subscript upper R Baseline left parenthesis bold italic x comma t right parenthesis comma


\begin{equation} \boldsymbol{q}(\boldsymbol x, t) = \boldsymbol{q}_I(\boldsymbol x, t) +\boldsymbol{q}_R(\boldsymbol x, t), \end{equation}

where

bold italic q Subscript upper I


${\boldsymbol{q}}_{I}$


and

bold italic q Subscript upper R


${\boldsymbol{q}}_{R}$


are the informative and residual components, respectively (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024). In this study, the field of vorticity

bold italic omega


$\boldsymbol \omega$


and the second invariant

upper Q


$Q$


of the velocity gradient tensor are considered as the given state

bold italic q left parenthesis bold italic x comma t right parenthesis


$\boldsymbol q(\boldsymbol x, t)$


, while the lift coefficient

upper C Subscript upper L Baseline identical to upper F Subscript upper L Baseline divided by left parenthesis 0.5 rho u Subscript normal infinity Superscript 2 Baseline c right parenthesis


$C_L \equiv F_L/(0.5\rho u_\infty ^2 c)$


is selected as the target variable

bold italic lamda


$\boldsymbol \lambda$


. Here,

upper F Subscript upper L


$F_L$


,

u Subscript normal infinity


$u_\infty$


,

c


$c$


and

rho


$\rho$


denote the lift force, free-stream velocity, chord length and fluid density, respectively. This set-up is motivated by the fundamental aerodynamic relationship between lift and circulation

upper Gamma


$\varGamma$


, i.e.

upper Gamma proportional to upper C Subscript upper L


$\varGamma \propto C_L$


, which may enable us to examine how the present method captures such underlying physics in a data-driven manner.

Figure 1.

An example of the given state

bold italic q


$\boldsymbol q$


and the informative component

bold italic q Subscript upper I


$\boldsymbol q _I$


decomposed by a data-driven technique.

An illustration of a data-driven technique decomposing a given state into an informative component, showing input data, a convolutional neural network, and the resulting informative mode.
Figure 1. Long description

The illustration depicts a data-driven technique used to decompose a given state into an informative component. On the left, the input data is represented by a complex, colorful, three-dimensional structure. In the center, a series of blocks and layers symbolize a convolutional neural network, indicating the processing steps involved. On the right, the resulting informative mode is shown as a simplified, structured form of the original input data. The image highlights the transformation and extraction of key features from the input data through the neural network, emphasizing the reduction of complexity and the identification of significant patterns.

While traditional data-driven approaches extract dominant features based on correlation, this study aims to perform modal extraction based on the predefined causal relationship. To achieve this, we consider measuring the amount of information quantitatively with the concept of Shannon entropy, which assesses uncertainty and randomness of arbitrary variables (Shannon Reference Shannon1948; Martínez-Sánchez et al. Reference Martínez-Sánchez, Arranz and Lozano-Durán2024), following the formulation of informative and non-informative decomposition originally introduced by Arranz & Lozano-Durán (Reference Arranz and Lozano-Durán2024). The Shannon entropy

upper H left parenthesis bold italic lamda right parenthesis


$H(\boldsymbol{\lambda })$


for a target variable

bold italic lamda


$\boldsymbol{\lambda }$


in the future is described as

(2.2)

StartLayout 1st Row  upper H left parenthesis bold italic lamda right parenthesis equals minus sigma summation Underscript bold italic upper S element of script upper S Endscripts p Subscript bold italic lamda Baseline left parenthesis bold italic lamda equals bold italic upper S right parenthesis log p Subscript bold italic lamda Baseline left parenthesis bold italic lamda equals bold italic upper S right parenthesis comma EndLayout


\begin{align} H(\boldsymbol{\lambda })=-\sum _{\boldsymbol S \in \mathcal{S} } p_{\boldsymbol{\lambda }}(\boldsymbol{\lambda }=\boldsymbol S)\log p_{\boldsymbol{\lambda }}(\boldsymbol{\lambda }=\boldsymbol S), \end{align}

where

p Subscript bold italic lamda


$p_{\boldsymbol{\lambda }}$


is the probability of

bold italic lamda


$\boldsymbol \lambda$


being in state

bold italic upper S


$\boldsymbol S$


and

script upper S


$\mathcal{S}$


represents the set of all possible states of

bold italic lamda


$\boldsymbol \lambda$


. The remaining information about

bold italic lamda


$\boldsymbol{\lambda }$


, not contained in

bold italic q Subscript upper I


$\boldsymbol q_I$


, is measured by the conditional Shannon entropy described by

(2.3)

StartLayout 1st Row  upper H left parenthesis bold italic lamda vertical bar bold italic q Subscript upper I Baseline right parenthesis equals minus sigma summation Underscript bold italic upper S element of script upper S Endscripts sigma summation Underscript bold italic upper R element of script upper R Endscripts p Subscript bold italic lamda comma bold italic q Sub Subscript upper I Subscript Baseline left parenthesis bold italic upper S comma bold italic upper R right parenthesis log StartFraction p Subscript bold italic lamda comma bold italic q Sub Subscript upper I Subscript Baseline left parenthesis bold italic upper S comma bold italic upper R right parenthesis Over p Subscript bold italic q Sub Subscript upper I Subscript Baseline left parenthesis bold italic upper R right parenthesis EndFraction comma EndLayout


\begin{align} H(\boldsymbol{\lambda }|\boldsymbol{q}_I)=-\sum _{\boldsymbol S \in \mathcal S} \sum _{\boldsymbol R \in \mathcal R} p_{\boldsymbol{\lambda },\boldsymbol{q}_I}(\boldsymbol S, \boldsymbol R)\log \frac {{p_{\boldsymbol{\lambda }, \boldsymbol q_I}(\boldsymbol S, \boldsymbol R)}}{p_{\boldsymbol{q}_I}(\boldsymbol R)}, \end{align}

where

p Subscript bold italic lamda comma bold italic q Sub Subscript upper I Subscript


$p_{\boldsymbol \lambda , \boldsymbol q_I}$


is the joint probability distribution of

bold italic lamda


$\boldsymbol \lambda$


and

bold italic p


$\boldsymbol p$


,

bold italic upper R


$\boldsymbol R$


is a state of the informative component

bold italic q Subscript upper I


$\boldsymbol q_I$


, and

script upper R


$\mathcal R$


is all the possible states of

bold italic q Subscript upper I


$\boldsymbol q_I$


. The difference between

upper H left parenthesis bold italic lamda vertical bar bold italic q Subscript upper I Baseline right parenthesis


$H(\boldsymbol{\lambda }|\boldsymbol{q}_I)$


and

upper H left parenthesis bold italic lamda right parenthesis


$H(\boldsymbol{\lambda })$


is called the mutual information such that

upper I left parenthesis bold italic lamda semicolon bold italic q right parenthesis equals upper H left parenthesis bold italic lamda right parenthesis minus upper H left parenthesis bold italic lamda vertical bar bold italic q Subscript upper I Baseline right parenthesis


$ I(\boldsymbol \lambda ; \boldsymbol q) = H(\boldsymbol{\lambda }) – H(\boldsymbol{\lambda }|\boldsymbol{q}_I)$


, expressing the amount of information shared between

bold italic q Subscript upper I


$\boldsymbol q_I$


and

bold italic lamda


$\boldsymbol \lambda$


. The informative component is defined as the state that maximises mutual information with

bold italic lamda


$\boldsymbol \lambda$


at a future time stamp. This is achieved when the conditional Shannon entropy of

bold italic lamda left parenthesis t plus normal upper Delta t right parenthesis


$\boldsymbol \lambda (t+\Delta t)$


and the informative component

bold italic q Subscript upper I


$\boldsymbol q_I$


is zero,

upper H left parenthesis bold italic lamda vertical bar bold italic q Subscript upper I Baseline right parenthesis equals 0


$H(\boldsymbol{\lambda }|\boldsymbol{q}_I) = 0$


, which means

bold italic q Subscript upper I


$\boldsymbol q_I$


completely determines the target variables

bold italic lamda


$\boldsymbol{\lambda }$


. Furthermore, the mutual information between the informative and residual components should be zero,

(2.4)

upper I left parenthesis bold italic q Subscript upper R Baseline semicolon bold italic q Subscript upper I Baseline right parenthesis equals 0 comma


\begin{equation} I\left(\boldsymbol q_R; \boldsymbol q_I\right) = 0, \end{equation}

since each decomposed component should be statistically independent.

Figure 2.

Informative mode extractor

script upper F


$\mathcal F$


based on (

a


$a$


) convolutional autoencoder and (

b


$b$


) convolutional neural network.

A diagram showing two types of informative mode extractors: convolutional autoencoder and convolutional neural network.
Figure 2. Long description

The diagram illustrates two types of informative mode extractors. On the left, a convolutional autoencoder is depicted with a series of blue blocks representing input data, a network of interconnected nodes in the middle, and another series of blue blocks representing output data. A green circle labeled ‘Target variable’ is connected to the network. On the right, a convolutional neural network is shown with a series of gray blocks, each containing a small red square, representing different layers of the network. A green circle labeled ‘Target variable’ is connected to one of the layers.

To extract the informative vortical structure

bold italic q Subscript upper I


$\boldsymbol q_I$


from the given vorticity field

bold italic q


$\boldsymbol q$


with respect to a future target of lift, i.e.

lamda equals upper C Subscript upper L


$\lambda = C_L$


, we construct an informative mode extractor

script upper F


$\mathcal F$


,

(2.5)

StartLayout 1st Row  bold italic q Subscript upper I Baseline left parenthesis t right parenthesis equals script upper F left parenthesis upper C Subscript upper L Baseline left parenthesis t plus normal upper Delta t right parenthesis comma bold italic q left parenthesis t right parenthesis semicolon bold italic w right parenthesis comma EndLayout


\begin{align} \boldsymbol q_I

where

bold italic w


$\boldsymbol w$


is the weight parameter of the mode extractor. This extractor is implemented as a certain type of neural network, convolutional deep sigmoidal flow (Huang et al. Reference Huang, Krueger, Lacoste and Courville2018). This model is constrained to possess non-negative weight values with bijective activation functions, which guarantees a bijective transformation to offer the solution

bold italic q Subscript upper I


$\boldsymbol q _I$


satisfying an information-theoretic condition of

upper H left parenthesis upper C Subscript upper L Baseline vertical bar bold italic q Subscript upper I Baseline right parenthesis equals 0


$H(C_L|\boldsymbol{q}_I) = 0$


. This extractor decomposes the given state at each time stamp, yielding time-varying informative modes. The optimisation for the weight distribution is performed with

(2.6)

StartLayout 1st Row  bold italic w Superscript asterisk Baseline equals a r g m i n Subscript bold italic w Baseline StartAbsoluteValue EndAbsoluteValue bold italic q minus bold italic q Subscript upper I Baseline StartAbsoluteValue EndAbsoluteValue Subscript 2 Baseline plus beta StartAbsoluteValue EndAbsoluteValue upper I left parenthesis bold italic q Subscript upper R Baseline semicolon bold italic q Subscript upper I Baseline right parenthesis StartAbsoluteValue EndAbsoluteValue Subscript 2 Baseline comma EndLayout


\begin{align} \boldsymbol w^* = \operatorname {argmin}_{\boldsymbol w}||\boldsymbol q – \boldsymbol q_I||_2+\beta ||I(\boldsymbol q_R; \boldsymbol q_I)||_2, \end{align}

where the regular regression loss and mutual information loss are balanced with a constant parameter

beta


$\beta$


(Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024). Without the second term, the present model simply replicates the input field, i.e.

bold italic q Subscript upper I Baseline almost equals bold italic q


$\boldsymbol q_I \approx \boldsymbol q$


. The value of

beta


$\beta$


is determined based on the L-curve analysis (Hansen & O’Leary Reference Hansen and O’Leary1993), facilitating the identification of the trade-off relationship between two terms, which will be discussed later. By minimising the cost function, the model is designed to tune

bold italic w


$\boldsymbol w$


to extract vortical structures

bold italic q Subscript upper I


$\boldsymbol q_I$


that contain information about aerodynamic response. Note that the current formulation differs from conventional definitions of causal inference (Imbens & Rubin Reference Imbens and Rubin2015) as this study does not consider any manipulations or perturbations in the system. Rather, this study focuses on the force generation dynamics due to the presence of vortical structures.

This study chooses two neural-network architectures as a mode extractor, depending on the flow of interest. We first consider a convolutional-autoencoder-based model (LeCun et al. Reference LeCun, Bottou, Bengio and Haffner1998; Hinton & Salakhutdino Reference Hinton and Salakhutdino2006), as illustrated in figure 2(

a


$a$


). This is used for cases where the embedding latent dimensions are recognised to be few, i.e.

script upper O left parenthesis 10 Superscript 0 Baseline right parenthesis


${\mathcal O}(10^0)$


. In other words, the model provides a low-order representation, in addition to performing the current decomposition. A series of convolutional neural networks without compression is then employed for the analysis of turbulent vortical structures, as shown in figure 2(

b


$b$


).

Distinct from previous studies performing the information-based decomposition via multi-layer perceptrons that require flattening of the input data (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024; Fukami & Araki Reference Fukami and Araki2026), both models in this study are based on the convolutional operation, allowing the decomposition across the entire snapshot with one shot while preserving the spatial arrangement of vortical structures (Morimoto et al. Reference Morimoto, Fukami, Zhang, Nair and Fukagata2021). This mitigates an issue of point-wise decomposition, often yielding spatially discontinuous modal structures (Fukami & Araki Reference Fukami and Araki2026). These information-theoretic models are trained to learn the relationship between two inputs: the current flow field snapshot

bold italic q left parenthesis t right parenthesis


${\boldsymbol q}


and the lift coefficient over the time interval

normal upper Delta t


$\Delta t$


,

upper C Subscript upper L Baseline left parenthesis t plus normal upper Delta t right parenthesis


${C_L}(t+\Delta t)$


. In other words, the models are designed to remove redundant structures, which do not contribute to the lift after the time gap

normal upper Delta t


$\Delta t$


, thereby providing the time-varying informative mode

bold italic q Subscript upper I Baseline left parenthesis t right parenthesis


${\boldsymbol q}_I


, expected to vary with the value for

normal upper Delta t


$\Delta t$


. We also discuss the dependence of the informative mode on

normal upper Delta t


$\Delta t$


.

Note that the ‘mode’ produced with the current technique differs from the time-independent basis in linear modal analyses. The current informative mode

bold italic q Subscript upper I Baseline left parenthesis t right parenthesis


${\boldsymbol q}_I


would be analogous to modes multiplied or amplified with time-varying temporal coefficients in a linear context. While some previous studies have examined the isolation of explicit nonlinear bases from the internal architecture of machine-learning models (Erichson et al. Reference Erichson, Mathelin, Yao, Brunton, Mahoney and Kutz2020), we hereafter follow previous studies that refer to nonlinearly reconstructed flow fields, obtained from the time-varying coefficients and the basis of a nonlinear network, as ‘modes’ (Murata, Fukami & Fukagata Reference Murata, Fukami and Fukagata2020; Fukami, Nakamura & Fukagata Reference Fukami, Nakamura and Fukagata2020; Eivazi et al. Reference Eivazi, Le Clainche, Hoyas and Vinuesa2022; Cremades et al. Reference Cremades, Martinez-Sanchez, Lozano-Duran and Vinuesa2026).

3. Results and discussion

To examine how the current informative mode decomposition (IMD) extracts time-varying modes based on the contribution to the future lift response, we consider three examples of aerodynamic flows, covering a range of spatiotemporal complexity across the chord-based Reynolds number

italic Re


${\textit{Re}}$


: (

normal i


$\mathrm{i}$


) extreme vortex-gust–aerofoil interactions at Re = 100 (Fukami & Taira Reference Fukami and Taira2023), (

normal i normal i


$\mathrm{ii}$


) experimentally measured transverse-gust–wing interactions at

italic Re equals 20 000


${\textit{Re}} = 20\,000$


(Towne et al. Reference Towne2023) and (

normal i normal i normal i


$\mathrm{iii}$


) a separated turbulent wake over a wing section at

italic Re equals 20 000


${\textit{Re}} = 20\,000$


(Liu et al. Reference Liu, Trujillo Corona, Espinoza, Shu and Gross2025).

The first example with a discrete vortex gust highlights the applicability of the present method in capturing highly unsteady transient dynamics. With the second example of experimental datasets, the robustness against experimental noise of the present model is examined. We also consider a separated turbulent wake in a three-dimensional domain under quasi-cyclic behaviour, discussing how the underlying physics is extracted for aerodynamic scenarios exhibiting a range of spatial length scales.

3.1. Example 1: extreme vortex-gust–aerofoil interactions

Let us consider an extreme vortex-gust–aerofoil interaction around an NACA0012 aerofoil with an angle of attack

alpha equals 40 Superscript ring


$\alpha = 40^\circ$


at

italic Re equals 100


${\textit{Re}} =100$


. Data sets are produced by fully validated and verified direct numerical simulations (Fukami & Taira Reference Fukami and Taira2023; Fukami, Nakao & Taira Reference Fukami, Nakao and Taira2024
b). Unsteady periodic shedding is observed for the case without the presence of a vortex gust. A Taylor vortex (Taylor Reference Taylor1918) is introduced upstream of an aerofoil, producing transient and drastic excitation of aerodynamic characteristics. Gust ratio

upper G identical to u Subscript theta comma italic max Baseline divided by u Subscript normal infinity


$G\equiv u_{\theta ,{\textit{max}}}/u_{\infty }$


, where

u Subscript theta comma italic max


$u_{\theta ,{\textit{max}}}$


is the maximum rotational velocity, and the gust diameter

upper D


$D$


normalised by the chord length are set to

left parenthesis upper G comma upper D right parenthesis equals left parenthesis 2 comma 0.5 right parenthesis


$(G,D) = (2,0.5)$


. Note that the current condition with

StartAbsoluteValue upper G EndAbsoluteValue greater than 1


$|G|\gt 1$


, which can happen in wind shear, mountain-covering areas and severe atmospheric turbulence, is classified as extreme aerodynamics, traditionally thought of as unflyable for small-sized aircraft (Jones, Cetiner & Smith Reference Jones, Cetiner and Smith2022; Fukami & Taira Reference Fukami and Taira2023; Taira Reference Taira2026). Understanding the process of lift generation in such transient dynamics is crucial for achieving stable flight operations. This example, hence, can serve as an ideal case to examine how the model identifies time-varying relationships between vortical motion and aerodynamic response.

Multiple graphs depict vorticity field, extracted modes, lift coefficient, and latent-variable evolution in vortex-aerofoil interaction.
Figure 3. Long description

The image contains multiple graphs and visualizations related to vortex-aerofoil interaction. The top section shows three columns of vorticity fields and extracted modes at different time points, labeled as Input and IMD with different time intervals. The bottom section includes a line graph of the lift coefficient over time, with a zoomed-in view of an extracted mode and another graph showing the evolution of latent variables. The vorticity fields are color-coded to represent different values, with red indicating positive vorticity and blue indicating negative vorticity. The line graph shows the lift coefficient with a grey line representing the undisturbed case and red dots indicating specific data points. The zoomed-in view highlights the detailed structure of the extracted mode, and the latent-variable evolution graph shows the changes over time with different line styles representing different variables. All values are approximated.

We perform the present decomposition on the vorticity field

bold italic omega left parenthesis bold italic x comma t right parenthesis


$\boldsymbol{\omega } (\boldsymbol x,t)$


, as exhibited in figure 3. Here, the balancing parameter

beta


$\beta$


is set to

0.05


$0.05$


, although the effect of

beta


$\beta$


will be examined later. The present convolutional learning-based approach offers time-varying modes that capture the effect of gust on the lift response, as shown in figure 3(

a


$a$


). With

normal upper Delta t equals 0.0085


$\Delta t = 0.0085$


, where an almost instantaneous contribution is considered, the current approach offers a time-dependent informative mode capturing the influence of the gust on the lift response, as shown in figure 3(

a


$a$


). At

t equals negative 0.724


$t = -0.724$


, when the gust is still distant from the leading edge, the gust does not appear in the informative mode, implying that the temporal lift variation is determined by flow around a wing, rather than the gust itself. Strong impingement of the counter-clockwise vortex near the leading edge introduces a sharp peak in the lift response, as presented in figure 3(

b


$b$


). Near the peak at

t equals negative 0.299


$t = -0.299$


, the gust appears in the extracted mode since the effect of the gust becomes dominant. In contrast, when the lift fluctuation heads to the level of the undisturbed scenario at

t equals 0.977


$t = 0.977$


, the separating structure is recognised as less informative than the structure around the aerofoil. The presence of the gust in the informative mode is observed within a limited time duration, exhibiting that the exerted lift is governed by a finite time interaction.

The dependence of the informative mode on the time interval

normal upper Delta t


$\Delta t$


is then examined. With a larger time window of

normal upper Delta t equals 0.255


$\Delta t = 0.255$


, the vortex core appears in the informative mode at

t equals negative 0.724


$t = -0.724$


, whereas the model with

normal upper Delta t equals 0.0085


$\Delta t = 0.0085$


disregards it. Furthermore, the vortex core appears to split into left and right halves near the leading edge at

t equals negative 0.299


$t = -0.299$


, as highlighted in figure 3(

c


$c$


). The left part is also highlighted in the informative mode with

normal upper Delta t equals 0.0085


$\Delta t = 0.0085$


, suggesting that this structure directly affects the aerofoil. On the other hand, the right part, which disappears

0.7


$0.7$


convective time later than the left one, seems to contribute to the lift response by interacting with the separated wake. This suggests that the model captures two types of lift-generation processes in the current transient aerodynamic scenario.

Figure 4.

The dependence of the mutual information loss on the number of training snapshots

n Subscript italic snapshot


$n_{ {\textit{snapshot}}}$


among all the snapshots

n Subscript a l l


$n_{ {all}}$


. (ad) Informative components at

t equals negative 0.299


$t = -0.299$


.

A 3D plot shows the dependence of mutual information loss on the number of training snapshots and beta values, with four subplots illustrating informative components.
Figure 4. Long description

A 3D plot depicts the relationship between mutual information loss, the ratio of training snapshots to total snapshots, and beta values. The plot includes four distinct lines, each representing different beta values: 0.001, 0.005, 0.01, and 0.05. The lines show how mutual information loss varies with the number of training snapshots. Four subplots (a, b, c, d) illustrate informative components at different snapshot ratios: 1/16, 1/8, 1/8, and 1, respectively. Each subplot features color-coded regions indicating different levels of information, with blue and red areas representing varying degrees of informativeness. The subplots provide visual insights into the distribution of informative components at specific snapshot ratios.

Figure 5.

The dependence of the informative components on the value of

beta


$\beta$


. The decomposed informative modes at

t equals negative 0.299


$t = -0.299$


are shown with the L-curve plot.

A scatter plot and three images showing fluid dynamics analysis with varying beta values.
Figure 5. Long description

The image contains a scatter plot and three fluid dynamics visualizations. The scatter plot on the left shows the dependence of informative components on the value of beta, with different colored dots representing various beta values. The x-axis is labeled with the norm of the difference between two vectors, and the y-axis is labeled with the informative components. The three images on the right display the decomposed informative modes at different beta values: 0.05, 0.01, and 0.1. Each image shows a fluid dynamics pattern with color gradients indicating different flow characteristics. The first image is labeled ‘Input,’ the second ‘IMD: beta = 0.05,’ the third ‘IMD: beta = 0.01,’ and the fourth ‘IMD: beta = 0.1.’ The color bar next to the scatter plot indicates the range of beta values used in the analysis.

A low-dimensional representation identified through the present autoencoder-based model is further analysed in figures 3(

d


$d$


) and 3(

e


$e$


). The latent dimension is set to be three, following the previous study that discusses the manifold discovery of extreme vortex–aerofoil interactions (Fukami & Taira Reference Fukami and Taira2023). The red trajectory for the disturbed case deviates from the grey trajectory for the undisturbed flow and shows peaks, highlighting when the effect of the gust is prominent. With

normal upper Delta t equals 0.0085


$\Delta t = 0.0085$


, these peaks coincide with those of the lift coefficient induced by interaction with the gust, suggesting that the current latent space contains an instantaneous relationship with aerodynamic response. On the other hand, with

normal upper Delta t equals 0.225


$\Delta t = 0.225$


, time gaps are observed between these peaks. The assessment of the gust contribution to the lift response is embedded in the low-order representation based on the information about the target variable. The other two latent variables exhibit a similar trend, although not shown here. The proposed method learns the relationship between vortical motions and future lift response in a transient manner and extracts coherent structures as a time-varying mode in highly unsteady aerodynamic environments.

To extract informative components from snapshot data by learning the relationship between vortical structures and future lift response, an appropriate amount of training snapshots needs to be provided to the present model. The influence of the number of snapshots on the mutual information loss

upper I left parenthesis bold italic q Subscript upper R Baseline semicolon bold italic q Subscript upper I Baseline right parenthesis


$I(\boldsymbol q_R; \boldsymbol q_I)$


is evaluated for

n Subscript italic snapshot Baseline divided by n Subscript a l l Baseline equals 1 divided by 16


$n_{ {\textit{snapshot}}} /n_{ {all}} = 1/16$


to

1


$1$


in figure 4, where

n Subscript a l l Baseline left parenthesis equals 2400 right parenthesis


$n_{ {all}} (=2400)$


represents the total number of prepared snapshots. In general, the mutual information loss increases as

n Subscript italic snapshot


$n_{ {\textit{snapshot}}}$


decreases, indicating that the resulting extraction is affected by the decrease in the training data set. Here, we also examine the decomposed informative components for

n Subscript italic snapshot Baseline divided by n Subscript a l l Baseline equals 1 divided by 16


$n_{ {\textit{snapshot}}} /n_{ {all}} = 1/16$


and

1 divided by 8


$1/8$


with different values for the balancing parameter

beta


$\beta$


. While the extracted mode with

n Subscript italic snapshot Baseline divided by n Subscript a l l Baseline equals 1 divided by 8


$n_{ {\textit{snapshot}}}/n_{ {all}} = 1/8$


and

beta equals 0.001


$\beta = 0.001$


exhibits high mutual information loss and deformed structures, by carefully tuning

beta


$\beta$


, the present model is capable of extracting vortical structures with a mutual information loss comparable to that obtained using all snapshots, thereby identifying similar structures.

The image contains multiple graphs and visualizations related to fluid dynamics analysis.
Figure 6. Long description

The image consists of four subfigures labeled (a) through (d). Subfigure (a) displays three rows of vorticity snapshots at different time points, showing the input data, reconstructed flow field via convolutional autoencoder, and extracted informative fields. Subfigure (b) presents a line graph of the time series of the lift coefficient, with key points marked. Subfigures (c) and (d) show scatter plots of the latent space identified by the models without and with additional geometric constraints, respectively, with color coding to represent time. The x and y axes in subfigures (c) and (d) represent different latent space dimensions, and the dashed circle in subfigure (d) indicates a geometric constraint. All values are approximated.

To determine the value of the balancing parameter

beta


$\beta$


, we employ L-curve analysis (Hansen & O’Leary Reference Hansen and O’Leary1993), facilitating the identification of the trade-off relationship between the two terms in (2.6). The relationship between the reconstruction loss and the mutual information loss across several values of

beta


$\beta$


, along with the resulting decomposed modes, is examined, as shown in figure 5. With a small

beta


$\beta$


of

0.01


$0.01$


, the model prioritises the regular regression loss, yielding modes almost identical to the given state, while increasing

beta


$\beta$


allows the model to output modal structures distinct from the input field. Excessive penalty for the mutual information term hinders the reconstruction. By following this procedure, the balancing parameter

beta


$\beta$


is systematically determined for each case.

3.2. Example 2: experimental measurements of large-amplitude transverse gust encounter

To examine the applicability of the current techniques to experimental measurements, the flow around a flat plate at a constant angle of attack

alpha equals 0 Superscript ring


$\alpha = 0^{\circ }$


with

italic Re equals 20 000


${\textit{Re}} = 20\,000$


passing through a large-amplitude transverse jet at gust ratio

upper G equals 1.5


$G = 1.5$


, made available by Towne et al. (Reference Towne2023), is considered. Details on the data curation and experimental set-ups are referred to Andreu-Angulo et al. (Reference Andreu-Angulo, abinsky, Biler, Sedky and Jones2020) and Biler et al. (Reference Biler, Sedky, Jones, Saritas and Cetiner2021). Here, we consider the spanwise vorticity field

bold italic omega Subscript z Baseline left parenthesis bold italic x comma t right parenthesis


$\boldsymbol{\omega } _z (\boldsymbol x,t)$


as a given state. For this case, we also employ a convolutional-neural-network-based autoencoder without the information-theoretic term (Fukagata & Fukami Reference Fukagata and Fukami2025). This comparison is intended to distinguish the effects of denoising and removal of redundant components, as an autoencoder composed solely of regression loss can remove noise and extract coherent structures through the compression process, as shown in figure 6(

a


$a$


). The difference in reconstruction between the regular autoencoder and the current method derives from introducing a mutual information term and target variable input, exhibiting how the inferred mutual information works in the present data-driven modal analysis.

Let us perform the present IMD for the experimental data set. Here, the parameters of the time window

normal upper Delta t


$\Delta t$


and the balancing parameter

beta


$\beta$


are set to

0.005


$0.005$


and

1


$1$


, respectively. The current model extracts coherent structures capturing the effect of aerodynamic response, as shown in figure 6(

a


$a$


). At

t equals 0.250


$t = 0.250$


, when the lift coefficient is nearly zero, the model disregards the majority of the given state, including the approaching gust structures. In other words, the lift force is recognised as statistically independent from the surrounding vortical flow. When the large-scale separation occurs along with the formation of positive vorticity at the leading edge, the time series of the lift coefficient shows a sharp increase for

0.5 less than t less than 1.74


$0.5\lt t\lt 1.74$


, as exhibited in figure 6(

b


$b$


). At

t equals 1.74


$t = 1.74$


, large-scale structures with positive vorticity at the leading edge and separated wake with negative vorticity at the trailing edge are assessed as informative. Once the transverse gust transitions to turbulence and dissipates at

t equals 3.06


$t = 3.06$


, the fine-scale structures that appear above the flat plate are disregarded. Similar to the case of the vortex–aerofoil interaction, vortical structures modified extensively by interaction with the gust are dominant contributors to the lift response.

Figure 7.

Probability density functions of the vorticity (grey, input; red, IMD) are shown with informative components at representative time

t equals 0.250


$t = 0.250$


,

1.00


$1.00$


and

1.74


$1.74$


.

Three graphs show probability density functions of vorticity at different times with input and IMD data.
Figure 7. Long description

The image contains three side-by-side graphs displaying probability density functions of vorticity at different times. Each graph compares input data in grey and IMD data in red. The x-axis represents normalized vorticity values ranging from -1 to 1, while the y-axis represents the probability density function on a logarithmic scale. The three graphs correspond to different time points: t = 0.250, t = 1.00, and t = 1.74. At each time point, the graphs show how the vorticity distribution evolves, with notable peaks and variations in the data. The red and grey lines indicate the differences between the input and IMD data, highlighting the changes in vorticity over time. All values are approximated.

Low-order representations are also examined. The latent dimension is set to be 2, following Smith et al. (Reference Smith, Fukami, Sedky, Jones and Taira2024), showing that the transverse gust encounter is possible to be considered as a cyclic event and compressed into a circle-shaped latent space. The compression result with the cost function in (2.6) is shown in figure 6(

c


$c$


). The trajectory collapses into a line with temporal discontinuity, which is challenging to interpret. To address this, we constrain the latent space to lie on a circle by adding a geometry-related cost function (Smith et al. Reference Smith, Fukami, Sedky, Jones and Taira2024). The resulting latent variables form a circular-like trajectory that evolves continuously over time, as shown in figure 6(

d


$d$


). Note that despite such an additional constraint, the present model successfully extracts the informative structure in a similar manner to that shown earlier. This indicates that the present approach allows identification of physically interpretable low-order representations in addition to providing time-varying modal structures based on the relationship with the target variable.

Let us examine the probability density function of the input vorticity and the decomposed field, p.d.f.

left parenthesis w Subscript z Baseline right parenthesis


$(w_z)$


, as shown in figure 7. The probability density function of informative components exhibits a sharper peak near zero compared with the input data, suggesting that experimental noise contained in the input snapshots is removed through the extraction process. In addition, the tail of the probability density function for the informative component, corresponding to the extreme positive and negative vorticity, is shrunk compared with that for the input. This implies that intensely rotating structures do not necessarily act as the dominant drivers of future lift. Furthermore, at

t equals 1.74


$t = 1.74$


, when strong impingement causes a sharp excitation of the lift force, we observe that the informative components retain the negative values in the probability density function. This indicates that the extraction process reconstructs the transverse jet reaching the lower surface of the flat plate, a lift-related phenomenon obscured by experimental noise. The current assessment highlights the capability of the present model to selectively preserve lift-related vortical structures while mitigating experimental noise.

3.3. Example 3: separated turbulent wake over a wing section of NLF(1)-0115 aerofoil

Let us also examine turbulent flow with the current decomposition. Turbulent wake over a wing section, NLF(1)-0115 aerofoil at an angle of attack of

alpha equals 5 Superscript ring


$\alpha = 5^\circ$


with

italic Re equals 20 000


${\textit{Re}}= 20\,000$


is considered. Separation occurs at the leading edge, producing a quasi-cyclic turbulent wake. We discuss how the proposed method captures informative structures responsible for lift with the presence of a range of scales in length. The datasets are produced by three-dimensional large-eddy simulations. Further details on data curation are seen in Liu et al. (Reference Liu, Trujillo Corona, Espinoza, Shu and Gross2025). The field of the second invariant for the velocity gradient tensor,

bold italic upper Q left parenthesis bold italic x comma t right parenthesis


$\boldsymbol{Q}(\boldsymbol x, t)$


, is considered as the source state. In this example, a series of convolutional neural networks without data compression is employed, unlike the other two cases. This is based on the prior knowledge that fine-scale structures could be lost due to two reasons: their small contribution to aerodynamic force (Fukami & Araki Reference Fukami and Araki2026), and the compression process inside the network (Fukami, Smith & Taira Reference Fukami, Smith and Taira2025). By removing the compression layers inside the network, we aim to isolate the physical contribution of these structures.

Figure 8.

Informative modal structure of spanwise-averaged separated flow over wing section at

italic Re equals 20 000


${\textit{Re}} = 20\, 000$


and dependence of decomposed mode on time window.

A combination of line graphs and scatter plots showing fluid dynamics analysis over time with different time-delay embeddings.
Figure 8. Long description

The image contains a combination of line graphs and scatter plots. The top section features a line graph with the y-axis labeled C_L and the x-axis labeled t, displaying oscillating data points with three red dots highlighting specific values. Below, three rows of scatter plots illustrate fluid dynamics at different time-delay embeddings (Delta t = 0.05, Delta t = 0.75). Each row shows the evolution of fluid structures over time, with color gradients indicating different values of Q. The first row represents the input data, while the second and third rows show the results of time-delay embedding analysis. The scatter plots depict the spatial distribution and temporal evolution of fluid structures, with insets zooming in on specific regions for detailed examination. All values are approximated.

The current decomposition is performed into the two-dimensional spanwise-averaged field based on an instantaneous and a delayed contribution where convective time windows

normal upper Delta t


$\Delta t$


are set to

0.05


$0.05$


and

0.75


$0.75$


, respectively, as shown in figure 8. The balancing parameter

beta


$\beta$


is set to

1 times 10 Superscript 4


$1 \times 10^4$


. For the case of

normal upper Delta t equals 0.05


$\Delta t = 0.05$


, large-scale vortex cores and structures around the leading edge are identified primarily due to their significant contribution. The result with

normal upper Delta t equals 0.75


$\Delta t = 0.75$


shows that the present model selectively isolates the contribution of the leading edge and the vortex core near the trailing edge to the lift response, while identifying the convection process of the vortex cores and shear to disregard them. Furthermore, the vortex cores in the informative mode are observed to be located between structures with negative

upper Q


$Q$


. This shear is captured mainly on both the upstream and downstream sides, indicating that the shear induced by the convection of the vortex core in the streamwise direction is recognised as informative in addition to the core itself.

The present decomposition is finally extended to the three-dimensional flow field, as shown in figure 9(

a


$a$


). Similar to the spanwise-averaged case, the large-scale vortex cores are extracted as informative, while fine-scale structures are disregarded due to their small contribution to the lift.

Three graphs depict the structure of three-dimensional separated flow over a wing section.
Figure 9. Long description

The image contains three graphs. The first graph on the left shows the informative modal structure of three-dimensional separated flow over a wing section, visualized with an iso-surface colored by streamwise velocity. The second graph in the middle displays the probability density function of the Q-criterion field. The third graph on the right illustrates scale-decomposed fields with two cuts of length scales. The graphs collectively examine the contribution of specific vortical structures to aerodynamic forces using various analytical techniques.

We next examine the probability density function of the reference and the decomposed

upper Q


$Q$


-criterion field, p.d.f.

left parenthesis bold italic upper Q right parenthesis


$(\boldsymbol Q)$


, as shown in figure 9(

b


$b$


). The probability density function of the decomposed informative mode exhibits that the present model predominantly retains the structures with

upper Q greater than 0


$Q \gt 0$


. The current rotation-dominated extraction suggests that the present model assesses rotating structures as the primary drivers of aerodynamic lift generation. It is also observed that the probability density of the informative component is lower than the reference at the extreme positive region, indicating that not all intensely rotating structures are regarded as informative for the future lift. The model selectively filters out these low-contribution structures, despite their intense magnitude, isolating the lift-related structures.

The length scale of structures extracted through the current method for the three-dimensional case is further discussed through the scale-decomposition analysis (Goto, Saito & Kawahara Reference Goto, Saito and Kawahara2017; Fujino, Motoori & Goto Reference Fujino, Motoori and Goto2023). A spatial band-pass filter based on a two-dimensional Gaussian kernel

upper G


$G$


is applied to the velocity field in the

x


$x$


and

y


$y$


directions. The cutoff length scale

sigma Subscript italic max


$\sigma _{ {\textit{max}}}$


is set to

sigma Subscript italic max Baseline equals c divided by left parenthesis 2 pi upper S t right parenthesis


$\sigma _{ {\textit{max}}}=c/(2\pi St)$


, which is the diameter of the roller vortices, shed with the Strouhal number

upper S t


$St$


. The operation is described as

(3.1)

StartLayout 1st Row 1st Column bold italic u Superscript left bracket sigma 1 comma sigma 2 right bracket Baseline left parenthesis x comma y comma z comma t right parenthesis 2nd Column equals integral Underscript script upper D Endscripts bold italic u left parenthesis x comma y comma z comma t right parenthesis left bracket upper G left parenthesis x prime comma y prime semicolon x comma y comma sigma 1 right parenthesis minus upper G left parenthesis x prime comma y prime semicolon x comma y comma sigma 2 right parenthesis right bracket normal d x prime normal d y Superscript prime Baseline comma EndLayout


\begin{align} \boldsymbol{u}^{[\sigma _1, \sigma _2]}(x, y, z, t) &= \int _{\mathcal D} \boldsymbol{u}(x, y, z, t) \left[G\left(x^{\prime}, y^{\prime}; x, y, \sigma _1\right) – G\left(x^{\prime}, y^{\prime}; x, y, \sigma _2\right)\right] {\rm d}x^{\prime} {\rm d}y^{\prime}, \end{align}

where

script upper D


$\mathcal D$


is the domain of integration. The

upper Q


$Q$


-criterion fields for a length-scale range of [

sigma 1 comma 2 sigma 1


$\sigma _1, 2\sigma _1$


] with

sigma 1 equals left parenthesis sigma Subscript italic max Baseline comma sigma Subscript m a x Baseline divided by 6 right parenthesis


$\sigma _1 =(\sigma _{ {\textit{max}}}, \sigma _{ { max}}/6)$


are computed from scale-decomposed velocity fields, as exhibited in figure 9(

c


$c$


).

With

sigma 1 equals sigma Subscript italic max


$\sigma _1=\sigma _{ {\textit{max}}}$


, the scale-decomposed mode shows structures qualitatively similar in scale to those of the present method. In contrast, the scale-decomposed field for

sigma 1 equals sigma Subscript italic max Baseline divided by 6


$\sigma _1=\sigma _{ {\textit{max}}}/6$


contains the rib structures in addition to the dominant vortex cores, suggesting that the scale-based decomposition inherently conflates rib structures with vortex cores due to their overlapping length scales. Consequently, this comparison suggests that the present method successfully distinguishes large-scale vortical motions based on contribution to the future lift, not solely on the length-scale information.

A scatter plot showing the relationship between two variables, R and Q, with hundreds of data points color-coded from blue to red.
Figure 10. Long description

A scatter plot showing the relationship between two variables, R and Q, with hundreds of data points color-coded from blue to red. The x-axis represents the variable R, ranging from -4000 to 4000, and the y-axis represents the variable Q, ranging from -750 to 750. The data points are color-coded based on the value of Q, with blue indicating lower values and red indicating higher values. The plot shows a dense cluster of points around the center, with a gradient transitioning from blue to red as the values of Q increase. There is a noticeable spread of points both above and below the central cluster, indicating variability in the data. The overall trend suggests a concentration of data points around the central region, with fewer points scattered towards the peripheries. All values are approximated.

Moreover, we evaluate the decomposed structures on the

upper Q


$Q$




upper R


$R$


plane, where

upper R


$R$


is the third invariant of the velocity gradient tensor, as exhibited in figure 10. While the second invariant

upper Q


$Q$


characterises the balance between the rotation and strain rates, the third invariant

upper R


$R$


governs the dynamics of vortex stretching and compression. According to these invariants, the local flow topologies are classified into four distinct states: vortex stretching (

upper Q greater than 0 comma upper R greater than 0


$Q\gt 0, R\gt 0$


), vortex compression (

upper Q greater than 0 comma upper R less than 0


$Q\gt 0, R\lt 0$


), axial strain (

upper Q less than 0 comma upper R greater than 0


$Q\lt 0, R\gt 0$


) and biaxial strain (

upper Q less than 0 comma upper R less than 0


$Q\lt 0, R\lt 0$


) (Davidson Reference Davidson2015). While the input field exhibits the well-known teardrop shape spanning both positive and negative

upper Q


$Q$


-criterion (Ooi et al. Reference Ooi, Martin, Soria and Chong1999; Fukami et al. Reference Fukami, Goto and Taira2024a
), the present extraction is dominantly activated for rotation-dominated structures, in particular, those experiencing vortex compression. Note that since the present model takes only the

upper Q


$Q$


-criterion field as input, the present model selectively distinguishes between vortex stretching and compression based on the flow of information with respect to future lift, indicating a relationship with the lift-generating mechanism in the current flow field.

We then discuss the dependence of informative mode on the time window

normal upper Delta t


$\Delta t$


. The spanwise power spectra density of the

upper Q


$Q$


-criterion, averaged over time

upper P left parenthesis upper Q right parenthesis


$P(Q)$


, is examined as a function of the spanwise wavenumber

k Subscript z


$k_z$


, in figure 11(

a


$a$


). The present model generally underestimates informative components compared with the reference field, suggesting that redundant components are removed due to their smaller contribution to the lift generation. Furthermore, the influence of the time delay

normal upper Delta t


$\Delta t$


on the power spectral density of the informative mode is concentrated predominantly in the low-wavenumber region. This implies that the contribution to the future lift is governed primarily by these large-scale structures.

To discuss the convection process of the vortex structure captured in extracted modal structures, the streamwise variation of the absolute value of

upper Q


$Q$


-criterion,

ModifyingAbove StartAbsoluteValue upper Q EndAbsoluteValue With bar left parenthesis x right parenthesis equals ModifyingAbove StartAbsoluteValue upper Q left parenthesis x comma y comma z comma t right parenthesis EndAbsoluteValue With bar Superscript y comma z comma t


$\bar {|Q|}(x) = \overline {|Q(x,y,z,t)|}^{y,z,t}$


, measuring the ensemble average over time and the

y minus z


$y{-}z$


plane, is assessed, as shown in figure 11(

b


$b$


). The indicator of the

ModifyingAbove StartAbsoluteValue upper Q EndAbsoluteValue With bar left parenthesis x right parenthesis


$\bar {|Q|}(x)$


downstream region decreases as

normal upper Delta t


$\Delta t$


increases, suggesting that the contribution of the structures there is recognised as less significant due to the convection process. These findings support the applicability of the present method to turbulent flows, highlighting its potential to advance the physical interpretation of unsteady aerodynamics via data-driven modal analysis.

4. Concluding remarks

This study considered an information-theoretic machine-learning method that provides time-varying informative vortical structure related to the future lift coefficient. Through three example flows: extreme vortex-gust–aerofoil interactions, experimentally measured transverse gust encounter, and separated turbulent wake, the present method extracts structures associated with lift generation from the snapshot data. Built upon the previous studies (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024; Fukami & Araki Reference Fukami and Araki2026), which perform the decomposition locally, the present data-driven approach enables the extraction of the dynamic relationship between vortical structures and the lift response by framing the cause-and-effect association within the lift-generation mechanism. The current convolutional network-based deep sigmoidal flow decomposes the input flow field globally with a spatiotemporal arrangement of vortical structures preserved. Furthermore, a low-order representation of the informative vortical structures and the aerodynamic coefficient is identified while performing decomposition. The proposed method is capable of visualising the relationships and offers key insights into the mechanism of force production under highly unsteady aerodynamics, based solely on flow field data and information metrics.

The current global extraction approach enables the extraction of coherent structure across the snapshots since the present information-theoretic convolutional machine-learning model has access to the spatial arrangement of vortical structures. Due to the capability of the present method to account for such spatial correlations, one promising future prospect is to apply this global approach to flow fields characterised by a broader range of structural scales, including wall-bounded turbulence. Furthermore, it will be of interest to conduct a direct comparison between the present global framework and the conventional local decomposition (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024; Fukami & Araki Reference Fukami and Araki2026). By examining how the incorporation of spatial arrangements influences the mode extraction process compared with an isolated point-wise analysis, we could further discuss the coupling effects of vortical structures on the target variable.

We interpreted the lift-generation mechanism as a cause-and-effect relationship, where vortical structures act as the cause, while the resulting aerodynamic force acts as the effect. To elucidate this relationship from an aerodynamic approach, one can consider using the force elemental method (Chang Reference Chang1992) or the vortex force map approach (Ōtomo et al. Reference Ōtomo, Gehlert, Babinsky and Li2025). Comparing or incorporating them would be of interest to deepen our understanding of vortex-induced lift generation. Since how the input and output, along with the optimisation set-up, are prepared greatly affects the resulting structures assessed as informative (Cremades et al. Reference Cremades, Hoyas and Vinuesa2025a
; Hoyas et al. Reference Hoyas, Benedikt, Cremades and Vinuesa2025), examining the dependence of identified structures on the selected variable and the mathematical definition of the cause-and-effect relationship would be of interest. Although this study considered the relationship between lift and vortical structures over time based on prior knowledge of aerodynamics, evaluating them based on emerging approaches in the field of causal inference may clarify the rather ambiguous concepts of causality in the current vortex-induced force generation problem (Imbens & Rubin Reference Imbens and Rubin2015).

Furthermore, the application of extracted ‘informativeness’ in terms of information theory can be explored by incorporating the current approach to the reinforcement learning-based active flow control. While a previous study has utilised informative components for instantaneous opposition control for drag reduction in turbulent channel flows (Arranz & Lozano-Durán Reference Arranz and Lozano-Durán2024), extending this concept to lift-enhancement strategies will provide insights for aerodynamic flow control.



Source link