MatterSim: Deep learning models of materials under real-world conditions

Machine Learning


The image features a complex network of interconnected nodes with a molecular structure, illuminated in blue against a dark background.

In the search for breakthrough materials essential for nanoelectronics, energy storage, and healthcare, a key challenge looms is predicting material properties before they are created. This is no small feat, given the 118 elements on the periodic table and the arbitrary combinations of temperature and pressure ranges used to synthesize and manipulate the materials. These factors have a significant impact on the atomic interactions within the material, making accurate property prediction and behavior simulation highly desirable.

Here at Microsoft Research, we developed MatterSim, a deep learning model for accurate and efficient materials simulation and property prediction across a wide range of elements, temperatures, and pressures. in silico material design. MatterSim employs deep learning to understand atomic interactions from the very fundamental principles of quantum mechanics across a comprehensive range of elements and conditions from 0 to 5,000 Kelvin (K) and from standard atmospheric pressure to 10,000,000 atmospheres. Masu. In our experiments, MatterSim efficiently handles the simulation of a variety of materials, including metals, oxides, sulfides, halides, and various states such as crystals, amorphous solids, and liquids. Additionally, it also provides customization options for complex prediction tasks by incorporating user-provided data.

Figure 1: There are two partial views. On the left, the atomic structures of 12 substances are shown, belonging to metals, oxides, sulfides, halides, and organic molecules. On the right, the temperature and pressure ranges for material application and synthesis are plotted.
Figure 1. MatterSim can model material properties and behavior under realistic temperature and pressure conditions for a wide range of applications.

Simulation of materials under realistic conditions across the periodic table

MatterSim's learning foundation is built on large-scale synthetic data generated through a combination of active learning, generative models, and molecular dynamics simulations. This data generation strategy provides extensive coverage of the materials space and enables models to predict energy, atomic forces, and stresses. This acts as a machine learning force field with a level of accuracy compatible with ab initio predictions. Specifically, MatterSim is now 10x more accurate in predicting material properties at finite temperatures and pressures compared to previous state-of-the-art models. Our research has demonstrated proficiency in simulating a vast array of material properties, including thermal, mechanical, and transport properties, and is also capable of predicting phase diagrams.

Figure 2: There are three partial views. The left panel shows a comparison of the highest phonon frequencies predicted by MatterSim and the ab initio method. The two values ​​for each material are very close and form an almost straight line in the parity plot. The middle panel shows the same relationship for free energy for about 50 materials and a comparison of MatterSim and ab initio results. The right panel shows the phase diagram of MgO predicted using MatterSim.  The x-axis shows temperature and the y-axis shows pressure.  The pressure range of the B1 phase of MgO is less than 500 GPa, and this range decreases with increasing temperature. The blue line shows the prediction from MatterSim, which is in good agreement with the shaded area that is the result of the experimental measurements.
Figure 2. MatterSim delivers high accuracy in predicting material mechanical properties, vibrational properties, and phase diagrams that rivals quantum mechanics and experimental measurements. The figure shows a comparison between the predicted properties and experimental measurements.

Adaptation to complex design tasks

While MatterSim is trained on a wide range of synthetic datasets, it can also be adapted to specific design requirements by incorporating additional data. The model uses active learning and fine-tuning to customize predictions with high data efficiency. For example, simulating water properties is a seemingly simple but computationally intensive task that is greatly optimized by MatterSim's adaptive capabilities. The model requires only 3% of the data compared to traditional methods to match experimental accuracy. Otherwise, specialized models would require 30 times more resources, and ab initio methods would require exponentially more resources.

Figure 3: This diagram has two panels. The right panel shows the structure of Li2B12H12, a complex material system used in solid-state batteries. This system is used to benchmark the performance of MatterSim. The left panel shows a comparison of the number of data points required when training the model from scratch versus customizing it from MatterSim to achieve the same accuracy.  MatterSim requires 3% and 10% more data for the two tasks compared to training from scratch.
Figure 3. MatterSim delivers high data efficiency with 90% to 97% data savings for complex simulation tasks.

microsoft research podcast

AI Frontiers: Models and Systems by Ece Kamir

Ece Kamar explores short-term mitigation techniques to share long-term research questions that help make these models viable components of AI systems, give them purpose, and maximize their value.


Bridging the gap between atomic models and real-world measurements

Translating material properties from atomic structure is a complex task, often too complex for current methods based on statistics such as molecular dynamics. MatterSim addresses this problem by directly mapping these relationships through machine learning. It includes a custom adapter module that refines the model and predicts material properties from structural data, eliminating the need for complex simulations. Benchmark against MatBench (Opens in new tab)MatterSim, a well-known material property prediction benchmark set, shows significant accuracy improvements, outperforms all specialized property-specific models, and demonstrates a robust ability to predict material properties directly from domain-specific data.

Looking to the future

As MatterSim's research progresses, it will focus on experimental validation to strengthen its potential role in important areas such as designing catalysts for sustainability, breakthroughs in energy storage, and advances in nanotechnology. It is written. The planned integration of MatterSim with generative AI models and reinforcement learning heralds a new era in the systematic pursuit of novel materials. This synergy is expected to revolutionize the field and streamline the guided creation of materials for diverse applications ranging from semiconductor technology to biomedical engineering. Such advances promise to accelerate materials development, strengthen sustainable industrial practices, and thereby accelerate technological advances that benefit society.





Source link

Leave a Reply

Your email address will not be published. Required fields are marked *