image:
Overview of CNN architecture: The first dimension of every layer “?” refers to the batch size. This is left as an unknown or unspecified variable within the network architecture for selection during training. All layers were 4-dimensional before planarization. The second and third dimensions corresponded to the dimensions of the matrix. As explained earlier, the dimension of the input matrix before convolution was 8 × 11. The last dimension corresponded to the channel dimension. In the input layer, the value 1 was taken because we only input a two-dimensional matrix. Conv2D used 32 filters and generated 32 activations from the original 1D input. Therefore, after Conv2D, the last dimension was expanded to 32. After flattening the data, two dimensions remained. The first was the batch size as before, and the second was the array length. Finally, we coded a common dense layer and five different layers to predict each model parameter. The final layer simply concatenated the predictions for each model parameter into one array.
view more
Credit: Brigo, D. et al.
Deep learning has become a powerful tool in quantitative finance, used for a variety of applications, from option pricing to model tuning. However, despite its accuracy and speed, one major concern remains. That said, neural networks often operate like “black boxes” and it’s difficult to understand how they reach their conclusions. This means a lack of validation, accountability and risk management in financial decisions.
In a new study published in risk scienceA team of Italian and British researchers is investigating how deep learning models can be created that are interpretable in financial environments. Their goal was to understand whether their interpretation tools could truly explain what neural networks had learned, rather than simply producing visually appealing but potentially misleading explanations.
The researchers focused on calibrating the Heston model, one of the most widely used stochastic volatility models in option pricing. The mathematical and financial properties of the Heston model are well understood. This makes it an ideal benchmark to test whether interpretability techniques provide explanations that are consistent with established financial intuition.
“We used synthetic data generated from the model itself to train a neural network that learned the relationship between volatility smiles and the underlying parameters of the Heston model,” said lead author Damiano Brigo, professor of mathematical finance at Imperial College London. “We then applied various interpretability techniques to explain how the network maps inputs to outputs.”
These techniques included local techniques such as LIME, DeepLIFT, and layer-wise association propagation, as well as global techniques based on Shapley values, originally developed in cooperative game theory.
The results showed that there is a clear difference between local and global interpretability approaches. “Local approaches that explain individual predictions by locally fitting a model often resulted in unstable or financially unintuitive explanations,” Brigo says. “In contrast, global methods based on Shapley values consistently emphasized input features such as option expiration and exercise in a manner consistent with the known behavior of the Heston model.”
The team’s analysis also revealed that Shapley values can be used as a practical diagnostic tool for model design. By comparing different neural network architectures, the researchers found that fully connected neural networks outperform convolutional neural networks in both accuracy and interpretability for this calibration task, unlike what is commonly observed in image recognition.
“Shapley values not only help explain the model’s predictions, but also help choose better neural network architectures that reflect the true financial structure of the problem,” explains co-author Xiaoshan Huang, a quantitative analyst at Barclays.
By demonstrating that global interpretability techniques can significantly reduce the black-box nature of deep learning in finance, this study provides a path towards more transparent, reliable, and robust machine learning tools for financial modeling.
###
Please contact the author:
Damiano Brigo
Department of Mathematics
Imperial College London, UK
Email: damiano.brigo@ imperial.ac.uk
KeAi Publishers was founded by Elsevier and China Science Publishing & Media Ltd to deliver high-quality research globally. In 2013, our focus shifted to open access publishing. We currently proudly publish over 200 world-class, open access, English-language journals across all scientific disciplines. Many of these are titles published by us in partnership with prestigious societies and academic institutions such as the National Natural Science Foundation of China (NSFC).
Research method
Data/statistical analysis
Research theme
not applicable
Article title
Interpretability of deep learning for finance: A case study of the Heston model
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Disclaimer: This document reflects the personal views of Mr. Andrea Pallavicini and does not represent the official position of his employer, Banca IMI.
Disclaimer: AAAS and EurekAlert! We are not responsible for the accuracy of news releases posted on EurekAlert! Use of Information by Contributing Institutions or via the EurekAlert System.
