Snyder, J. C., Rupp, M., Hansen, K., Müller, K.-R. & Burke, K. Finding Density Functionals with Machine Learning. Phys. Rev. Lett. 108, 253002 (2012).
Google Scholar
Yao, K. & Parkhill, J. Kinetic Energy of Hydrocarbons as a Function of Electron Density and Convolutional Neural Networks. J. Chem. Theory Comput. 12, 1139–1147 (2016).
Google Scholar
Kalita, B., Li, L., McCarty, R. J. & Burke, K. Learning to Approximate Density Functionals. Acc. Chem. Res. 54, 818–826 (2021).
Google Scholar
Welborn, M., Cheng, L. & Miller, T. F. Transferability in Machine Learning for Electronic Structure via the Molecular Orbital Basis. J. Chem. Theory Comput. 14, 4772–4779 (2018).
Google Scholar
Cheng, L., Sun, J., Deustua, J. E., Bhethanabotla, V. C. & Miller, T. F. Molecular-orbital-based machine learning for open-shell and multi-reference systems with kernel addition Gaussian process regression. The Journal of Chemical Physics 157, 154105 (2022).
Google Scholar
Chen, Y., Zhang, L., Wang, H. & E, W. Ground State Energy Functional with Hartree–Fock Efficiency and Chemical Accuracy. J. Phys. Chem. A 124, 7155–7165 (2020).
Google Scholar
Qiao, Z., Welborn, M., Anandkumar, A., Manby, F. R. & Miller, T. F. OrbNet: Deep learning for quantum chemistry using symmetry-adapted atomic-orbital features. The Journal of Chemical Physics 153, 124111 (2020).
Google Scholar
Ramakrishnan, R., Dral, P. O., Rupp, M. & Von Lilienfeld, O. A. Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. J. Chem. Theory Comput. 11, 2087–2096 (2015).
Google Scholar
Rupp, M., Tkatchenko, A., Müller, K.-R. & Von Lilienfeld, O. A. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 108, 058301 (2012).
Google Scholar
Christensen, A. S., Bratholm, L. A., Faber, F. A. & Anatole Von Lilienfeld, O. FCHL revisited: Faster and more accurate quantum machine learning. The Journal of Chemical Physics 152, 044107 (2020).
Google Scholar
Smith, J. S., Isayev, O. & Roitberg, A. E. ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chem. Sci. 8, 3192–3203 (2017).
Google Scholar
Smith, J. S. et al. Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning. Nat Commun 10, 2903 (2019).
Google Scholar
Khan, D., Heinen, S. & Von Lilienfeld, O. A. Kernel based quantum machine learning at record rate: Many-body distribution functionals as compact representations. The Journal of Chemical Physics 159, 034106 (2023).
Google Scholar
Lubbers, N., Smith, J. S. & Barros, K. Hierarchical modeling of molecular energies using a deep neural network. The Journal of Chemical Physics 148, 241715 (2018).
Google Scholar
Allen, A. E. A. et al. Learning together: Towards foundation models for machine learning interatomic potentials with meta-learning. npj Comput Mater 10, 154 (2024).
Google Scholar
Liu, Z. et al. Transferable Multilevel Attention Neural Network for Accurate Prediction of Quantum Chemistry Properties via Multitask Learning. J. Chem. Inf. Model. 61, 1066–1082 (2021).
Google Scholar
Montavon, G. et al. Machine learning of molecular electronic properties in chemical compound space. New J. Phys. 15, 095003 (2013).
Google Scholar
Grisafi, A. et al. Transferable Machine-Learning Model of the Electron Density. ACS Cent. Sci. 5, 57–64 (2019).
Google Scholar
Atz, K., Isert, C., Böcker, M. N. A., Jiménez-Luna, J. & Schneider, G. Δ-Quantum machine-learning for medicinal chemistry. Phys. Chem. Chem. Phys. 24, 10775–10783 (2022).
Google Scholar
Zhu, J., Vuong, V. Q., Sumpter, B. G. & Irle, S. Artificial neural network correction for density-functional tight-binding molecular dynamics simulations. MRS Communications 9, 867–873 (2019).
Google Scholar
Nandi, A., Qu, C., Houston, P. L., Conte, R. & Bowman, J. M. Δ -machine learning for potential energy surfaces: A PIP approach to bring a DFT-based PES to CCSD(T) level of theory. The Journal of Chemical Physics 154, 051102 (2021).
Google Scholar
Pattnaik, P. et al. Machine Learning for Accurate Force Calculations in Molecular Dynamics Simulations. J. Phys. Chem. A 124, 6954–6967 (2020).
Google Scholar
Bowman, J. M. et al. Δ-Machine Learned Potential Energy Surfaces and Force Fields. J. Chem. Theory Comput. 19, 1–17 (2023).
Google Scholar
Chen, X., Li, P., Hruska, E. & Liu, F. Δ-Machine learning for quantum chemistry prediction of solution-phase molecular properties at the ground and excited states. Phys. Chem. Chem. Phys. 25, 13417–13428 (2023).
Google Scholar
Unzueta, P. A., Greenwell, C. S. & Beran, G. J. O. Predicting Density Functional Theory-Quality Nuclear Magnetic Resonance Chemical Shifts via Δ-Machine Learning. J. Chem. Theory Comput. 17, 826–840 (2021).
Google Scholar
Kleine Büning, J. B. & Grimme, S. Computation of CCSD(T)-Quality NMR Chemical Shifts via Δ-Machine Learning from DFT. J. Chem. Theory Comput. 19, 3601–3615 (2023).
Google Scholar
Grumet, M., Von Scarpatetti, C., Bučko, T. & Egger, D. A. Delta Machine Learning for Predicting Dielectric Properties and Raman Spectra. J. Phys. Chem. C 128, 6464–6470 (2024).
Google Scholar
Zhao, Q., Anstine, D. M., Isayev, O. & Savoie, B. M. Δ2 machine learning for reaction property prediction. Chem. Sci. 14, 13392–13401 (2023).
Google Scholar
Farrar, E. H. E. & Grayson, M. N. Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction. Chem. Sci. 13, 7594–7603 (2022).
Google Scholar
Zaspel, P., Huang, B., Harbrecht, H. & Von Lilienfeld, O. A. Boosting Quantum Machine Learning Models with a Multilevel Combination Technique: Pople Diagrams Revisited. J. Chem. Theory Comput. 15, 1546–1559 (2019).
Google Scholar
Vinod, V., Maity, S., Zaspel, P. & Kleinekathöfer, U. Multifidelity Machine Learning for Molecular Excitation Energies. J. Chem. Theory Comput. 19, 7658–7670 (2023).
Google Scholar
Fisher, K. E., Herbst, M. F. & Marzouk, Y. M. Multitask methods for predicting molecular properties from heterogeneous data. The Journal of Chemical Physics 161, 014114 (2024).
Google Scholar
Nakata, M. & Maeda, T. PubChemQC B3LYP/6-31G*//PM6 Data Set: The Electronic Structures of 86 Million Molecules Using B3LYP/6-31G* Calculations. J. Chem. Inf. Model. 63, 5734–5754 (2023).
Google Scholar
Isert, C., Atz, K., Jiménez-Luna, J. & Schneider, G. QMugs, quantum mechanical properties of drug-like molecules. Sci Data 9, 273 (2022).
Google Scholar
Balcells, D. & Skjelstad, B. B. tmQM Dataset—Quantum Geometries and Properties of 86k Transition Metal Complexes. J. Chem. Inf. Model. 60, 6135–6146 (2020).
Google Scholar
Nilsson, U. L. & Oestman, C. E. Chlorinated polycyclic aromatic hydrocarbons: method of analysis and their occurrence in urban air. Environ. Sci. Technol. 27, 1826–1831 (1993).
Google Scholar
Ma, J. et al. Chlorinated and Parent Polycyclic Aromatic Hydrocarbons in Environmental Samples from an Electronic Waste Recycling Facility and a Chemical Industrial Complex in China. Environ. Sci. Technol. 43, 643–649 (2009).
Google Scholar
Jin, R., Zheng, M., Lammel, G., Bandowe, B. A. M. & Liu, G. Chlorinated and brominated polycyclic aromatic hydrocarbons: Sources, formation mechanisms, and occurrence in the environment. Progress in Energy and Combustion Science 76, 100803 (2020).
Google Scholar
Ohura, T. Environmental Behavior, Sources, and Effects of Chlorinated Polycyclic Aromatic Hydrocarbons. The Scientific World JOURNAL 7, 372–380 (2007).
Google Scholar
Kamiya, Y., Iijima, A., Ikemori, F., Okuda, T. & Ohura, T. Source apportionment of chlorinated polycyclic aromatic hydrocarbons associated with ambient particles in a Japanese megacity. Sci Rep 6, 38358 (2016).
Google Scholar
Li, W., Wu, S. & Zhang, W. Insights into the Formation of Chlorinated Polycyclic Aromatic Hydrocarbons Related to Chlorine in Salt-Tolerant Rice: Profiles in Market Samples, Effects of Saline Cultivation, and Household Cooking. J. Agric. Food Chem. 72, 24833–24846 (2024).
Google Scholar
Liu, Q., Xu, X., Wang, L. & Wang, D. Transformation reactivity of parent polycyclic aromatic hydrocarbons and the formation trend of halogenated polycyclic aromatic hydrocarbons in the presence of bromide ion during chlorination. Chemical Engineering Journal 400, 125901 (2020).
Google Scholar
Wahab, A., Pfuderer, L., Paenurk, E. & Gershoni-Poranne, R. The COMPAS Project: A Computational Database of Polycyclic Aromatic Systems. Phase 1: cata -Condensed Polybenzenoid Hydrocarbons. J. Chem. Inf. Model. 62, 3704–3713 (2022).
Google Scholar
Mayo Yanes, E., Chakraborty, S. & Gershoni-Poranne, R. COMPAS-2: a dataset of cata-condensed hetero-polycyclic aromatic systems. Sci Data 11, 97 (2024).
Google Scholar
Wahab, A. & Gershoni-Poranne, R. COMPAS-3: a dataset of peri -condensed polybenzenoid hydrocarbons. Phys. Chem. Chem. Phys. 26, 15344–15357 (2024).
Google Scholar
Allison, T. C. & Burgess, D. R. First-Principles Prediction of Enthalpies of Formation for Polycyclic Aromatic Hydrocarbons and Derivatives. J. Phys. Chem. A 119, 11329–11365 (2015).
Google Scholar
Xu, S., Wang, Q.-D., Sun, M.-M., Yin, G. & Liang, J. Benchmark calculations for bond dissociation energies and enthalpy of formation of chlorinated and brominated polycyclic aromatic hydrocarbons. RSC Adv. 11, 29690–29701 (2021).
Google Scholar
Huang, C. et al. The aryl hydrocarbon receptor (AhR) activity and DNA-damaging effects of chlorinated polycyclic aromatic hydrocarbons (Cl-PAHs). Chemosphere 211, 640–647 (2018).
Google Scholar
Goedtke, L. et al. Polycyclic Aromatic Hydrocarbons Activate the Aryl Hydrocarbon Receptor and the Constitutive Androstane Receptor to Regulate Xenobiotic Metabolism in Human Liver Cells. IJMS 22, 372 (2020).
Google Scholar
Li, X. et al. Chlorinated Polycyclic Aromatic Hydrocarbons Induce Immunosuppression in THP-1 Macrophages Characterized by Disrupted Amino Acid Metabolism. Environ. Sci. Technol. 56, 16012–16023 (2022).
Google Scholar
Putz, M. V. & Putz, A.-M. DFT Chemical Reactivity Driven by Biological Activity: Applications for the Toxicological Fate of Chlorinated PAHs. in Applications of Density Functional Theory to Biological and Bioinorganic Chemistry (eds. Putz, M. V. & Mingos, D. M. P.) vol. 150 181–231 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2013).
Mekenyan, O. G., Veith, G. D., Call, D. J. & Ankley, G. T. A QSAR evaluation of Ah receptor binding of halogenated aromatic xenobiotics. Environ Health Perspect 104, 1302–1310 (1996).
Google Scholar
Ohura, T., Morita, M., Makino, M., Amagai, T. & Shimoi, K. Aryl Hydrocarbon Receptor-Mediated Effects of Chlorinated Polycyclic Aromatic Hydrocarbons. Chem. Res. Toxicol. 20, 1237–1241 (2007).
Google Scholar
Kim, S. et al. PubChem 2023 update. Nucleic Acids Research 51, D1373–D1380 (2023).
Google Scholar
RDKit: Open-source cheminformatics; http://www.rdkit.org.
Halgren, T. A. Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. J. Comput. Chem. 17, 490–519 (1996).
Google Scholar
O’Boyle, N. M. et al. Open Babel: An open chemical toolbox. J Cheminform 3, 33 (2011).
Google Scholar
Bannwarth, C., Ehlert, S. & Grimme, S. GFN2-xTB—An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions. J. Chem. Theory Comput. 15, 1652–1671 (2019).
Google Scholar
Grimme, S., Hansen, A., Ehlert, S. & Mewes, J.-M. r2SCAN-3c: A “Swiss army knife” composite electronic-structure method. The Journal of Chemical Physics 154, 064103 (2021).
Google Scholar
Neese, F. Software update: The ORCA program system—Version 5.0. WIREs Comput Mol Sci 12, e1606 (2022).
Google Scholar
Paton, R. Shree Sowndarya Santhanalakkshmi Vejaykummar, Guilian Luchini & aligfellow. patonlab/pyQRC: 1.0.4. Zenodo https://doi.org/10.5281/ZENODO.12510821 (2024).
Najibi, A. & Goerigk, L. DFT‐D4 counterparts of leading meta-generalized‐gradient approximation and hybrid density functionals for energetics and geometries. J Comput Chem 41, 2562–2572 (2020).
Google Scholar
Weigend, F. & Ahlrichs, R. Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy. Phys. Chem. Chem. Phys. 7, 3297 (2005).
Google Scholar
Neese, F., Wennmohs, F., Hansen, A. & Becker, U. Efficient, approximate and parallel Hartree–Fock and hybrid DFT calculations. A ‘chain-of-spheres’ algorithm for the Hartree–Fock exchange. Chemical Physics 356, 98–109 (2009).
Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine Learning in Python https://doi.org/10.48550/ARXIV.1201.0490 (2012).
linear-tree; https://github.com/cerlymarco/linear-tree.
Frolov, D. & Sedov, I. Science Data Bank. https://doi.org/10.57760/sciencedb.18703 (2024).
Google Scholar
Chen, G. et al. Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models. Preprint at https://doi.org/10.48550/arXiv.1906.09427 (2019).
Medrano Sandonas, L. et al. Dataset for quantum-mechanical exploration of conformers and solvent effects in large drug-like molecules. Sci Data 11, 742 (2024).
Google Scholar
Eastman, P. et al. SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials. Sci Data 10, 11 (2023).
Google Scholar
Hoja, J. et al. QM7-X, a comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules. Sci Data 8, 43 (2021).
Google Scholar
Ramakrishnan, R., Dral, P. O., Rupp, M. & Von Lilienfeld, O. A. Quantum chemistry structures and properties of 134 kilo molecules. Sci Data 1, 140022 (2014).
Google Scholar
Khan, D., Benali, A., Kim, S. Y. H., Rudorff, G. F. von & von Lilienfeld, O. A. Quantum mechanical dataset of 836k neutral closed shell molecules with upto 5 heavy atoms from CNOFSiPSClBr. Preprint at https://doi.org/10.48550/arXiv.2405.05961 (2024).
Gussoni, M., Rui, M. & Zerbi, G. Electronic and relaxation contribution to linear molecular polarizability. An analysis of the experimental values. Journal of Molecular Structure 447, 163–215 (1998).
Google Scholar
Firth, N. C., Brown, N. & Blagg, J. Plane of Best Fit: A Novel Method to Characterize the Three-Dimensionality of Molecules. J. Chem. Inf. Model. 52, 2516–2525 (2012).
Google Scholar
Taylor, C. R., Butler, P. W. V. & Day, G. M. Predictive crystallography at scale: mapping, validating, and learning from 1000 crystal energy landscapes. Faraday Discuss. https://doi.org/10.1039/D4FD00105B (2025).
Groom, C. R., Bruno, I. J., Lightfoot, M. P. & Ward, S. C. The Cambridge Structural Database. Acta Crystallogr B Struct Sci Cryst Eng Mater 72, 171–179 (2016).
Google Scholar
Zhang, H., Nettleton, D. & Zhu, Z. Regression-Enhanced Random Forests. https://doi.org/10.48550/ARXIV.1904.10416 (2019).
