Quantum chemical properties of chlorinated polycyclic aromatic hydrocarbons for delta machine learning

Machine Learning


  • Snyder, J. C., Rupp, M., Hansen, K., Müller, K.-R. & Burke, K. Finding Density Functionals with Machine Learning. Phys. Rev. Lett. 108, 253002 (2012).

    Article 
    ADS 
    PubMed 

    Google Scholar 

  • Yao, K. & Parkhill, J. Kinetic Energy of Hydrocarbons as a Function of Electron Density and Convolutional Neural Networks. J. Chem. Theory Comput. 12, 1139–1147 (2016).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Kalita, B., Li, L., McCarty, R. J. & Burke, K. Learning to Approximate Density Functionals. Acc. Chem. Res. 54, 818–826 (2021).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Welborn, M., Cheng, L. & Miller, T. F. Transferability in Machine Learning for Electronic Structure via the Molecular Orbital Basis. J. Chem. Theory Comput. 14, 4772–4779 (2018).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Cheng, L., Sun, J., Deustua, J. E., Bhethanabotla, V. C. & Miller, T. F. Molecular-orbital-based machine learning for open-shell and multi-reference systems with kernel addition Gaussian process regression. The Journal of Chemical Physics 157, 154105 (2022).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Chen, Y., Zhang, L., Wang, H. & E, W. Ground State Energy Functional with Hartree–Fock Efficiency and Chemical Accuracy. J. Phys. Chem. A 124, 7155–7165 (2020).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Qiao, Z., Welborn, M., Anandkumar, A., Manby, F. R. & Miller, T. F. OrbNet: Deep learning for quantum chemistry using symmetry-adapted atomic-orbital features. The Journal of Chemical Physics 153, 124111 (2020).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Ramakrishnan, R., Dral, P. O., Rupp, M. & Von Lilienfeld, O. A. Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. J. Chem. Theory Comput. 11, 2087–2096 (2015).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Rupp, M., Tkatchenko, A., Müller, K.-R. & Von Lilienfeld, O. A. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 108, 058301 (2012).

    Article 
    ADS 
    PubMed 

    Google Scholar 

  • Christensen, A. S., Bratholm, L. A., Faber, F. A. & Anatole Von Lilienfeld, O. FCHL revisited: Faster and more accurate quantum machine learning. The Journal of Chemical Physics 152, 044107 (2020).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Smith, J. S., Isayev, O. & Roitberg, A. E. ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chem. Sci. 8, 3192–3203 (2017).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Smith, J. S. et al. Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning. Nat Commun 10, 2903 (2019).

    Article 
    ADS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Khan, D., Heinen, S. & Von Lilienfeld, O. A. Kernel based quantum machine learning at record rate: Many-body distribution functionals as compact representations. The Journal of Chemical Physics 159, 034106 (2023).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Lubbers, N., Smith, J. S. & Barros, K. Hierarchical modeling of molecular energies using a deep neural network. The Journal of Chemical Physics 148, 241715 (2018).

    Article 
    ADS 
    PubMed 

    Google Scholar 

  • Allen, A. E. A. et al. Learning together: Towards foundation models for machine learning interatomic potentials with meta-learning. npj Comput Mater 10, 154 (2024).

    Article 
    CAS 

    Google Scholar 

  • Liu, Z. et al. Transferable Multilevel Attention Neural Network for Accurate Prediction of Quantum Chemistry Properties via Multitask Learning. J. Chem. Inf. Model. 61, 1066–1082 (2021).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Montavon, G. et al. Machine learning of molecular electronic properties in chemical compound space. New J. Phys. 15, 095003 (2013).

    Article 
    ADS 
    CAS 

    Google Scholar 

  • Grisafi, A. et al. Transferable Machine-Learning Model of the Electron Density. ACS Cent. Sci. 5, 57–64 (2019).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Atz, K., Isert, C., Böcker, M. N. A., Jiménez-Luna, J. & Schneider, G. Δ-Quantum machine-learning for medicinal chemistry. Phys. Chem. Chem. Phys. 24, 10775–10783 (2022).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Zhu, J., Vuong, V. Q., Sumpter, B. G. & Irle, S. Artificial neural network correction for density-functional tight-binding molecular dynamics simulations. MRS Communications 9, 867–873 (2019).

    Article 
    CAS 

    Google Scholar 

  • Nandi, A., Qu, C., Houston, P. L., Conte, R. & Bowman, J. M. Δ -machine learning for potential energy surfaces: A PIP approach to bring a DFT-based PES to CCSD(T) level of theory. The Journal of Chemical Physics 154, 051102 (2021).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Pattnaik, P. et al. Machine Learning for Accurate Force Calculations in Molecular Dynamics Simulations. J. Phys. Chem. A 124, 6954–6967 (2020).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Bowman, J. M. et al. Δ-Machine Learned Potential Energy Surfaces and Force Fields. J. Chem. Theory Comput. 19, 1–17 (2023).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Chen, X., Li, P., Hruska, E. & Liu, F. Δ-Machine learning for quantum chemistry prediction of solution-phase molecular properties at the ground and excited states. Phys. Chem. Chem. Phys. 25, 13417–13428 (2023).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Unzueta, P. A., Greenwell, C. S. & Beran, G. J. O. Predicting Density Functional Theory-Quality Nuclear Magnetic Resonance Chemical Shifts via Δ-Machine Learning. J. Chem. Theory Comput. 17, 826–840 (2021).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Kleine Büning, J. B. & Grimme, S. Computation of CCSD(T)-Quality NMR Chemical Shifts via Δ-Machine Learning from DFT. J. Chem. Theory Comput. 19, 3601–3615 (2023).

    Article 
    PubMed 

    Google Scholar 

  • Grumet, M., Von Scarpatetti, C., Bučko, T. & Egger, D. A. Delta Machine Learning for Predicting Dielectric Properties and Raman Spectra. J. Phys. Chem. C 128, 6464–6470 (2024).

    Article 
    CAS 

    Google Scholar 

  • Zhao, Q., Anstine, D. M., Isayev, O. & Savoie, B. M. Δ2 machine learning for reaction property prediction. Chem. Sci. 14, 13392–13401 (2023).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Farrar, E. H. E. & Grayson, M. N. Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction. Chem. Sci. 13, 7594–7603 (2022).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Zaspel, P., Huang, B., Harbrecht, H. & Von Lilienfeld, O. A. Boosting Quantum Machine Learning Models with a Multilevel Combination Technique: Pople Diagrams Revisited. J. Chem. Theory Comput. 15, 1546–1559 (2019).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Vinod, V., Maity, S., Zaspel, P. & Kleinekathöfer, U. Multifidelity Machine Learning for Molecular Excitation Energies. J. Chem. Theory Comput. 19, 7658–7670 (2023).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Fisher, K. E., Herbst, M. F. & Marzouk, Y. M. Multitask methods for predicting molecular properties from heterogeneous data. The Journal of Chemical Physics 161, 014114 (2024).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Nakata, M. & Maeda, T. PubChemQC B3LYP/6-31G*//PM6 Data Set: The Electronic Structures of 86 Million Molecules Using B3LYP/6-31G* Calculations. J. Chem. Inf. Model. 63, 5734–5754 (2023).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Isert, C., Atz, K., Jiménez-Luna, J. & Schneider, G. QMugs, quantum mechanical properties of drug-like molecules. Sci Data 9, 273 (2022).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Balcells, D. & Skjelstad, B. B. tmQM Dataset—Quantum Geometries and Properties of 86k Transition Metal Complexes. J. Chem. Inf. Model. 60, 6135–6146 (2020).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Nilsson, U. L. & Oestman, C. E. Chlorinated polycyclic aromatic hydrocarbons: method of analysis and their occurrence in urban air. Environ. Sci. Technol. 27, 1826–1831 (1993).

    Article 
    ADS 
    CAS 

    Google Scholar 

  • Ma, J. et al. Chlorinated and Parent Polycyclic Aromatic Hydrocarbons in Environmental Samples from an Electronic Waste Recycling Facility and a Chemical Industrial Complex in China. Environ. Sci. Technol. 43, 643–649 (2009).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Jin, R., Zheng, M., Lammel, G., Bandowe, B. A. M. & Liu, G. Chlorinated and brominated polycyclic aromatic hydrocarbons: Sources, formation mechanisms, and occurrence in the environment. Progress in Energy and Combustion Science 76, 100803 (2020).

    Article 

    Google Scholar 

  • Ohura, T. Environmental Behavior, Sources, and Effects of Chlorinated Polycyclic Aromatic Hydrocarbons. The Scientific World JOURNAL 7, 372–380 (2007).

    Article 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Kamiya, Y., Iijima, A., Ikemori, F., Okuda, T. & Ohura, T. Source apportionment of chlorinated polycyclic aromatic hydrocarbons associated with ambient particles in a Japanese megacity. Sci Rep 6, 38358 (2016).

    Article 
    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Li, W., Wu, S. & Zhang, W. Insights into the Formation of Chlorinated Polycyclic Aromatic Hydrocarbons Related to Chlorine in Salt-Tolerant Rice: Profiles in Market Samples, Effects of Saline Cultivation, and Household Cooking. J. Agric. Food Chem. 72, 24833–24846 (2024).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Liu, Q., Xu, X., Wang, L. & Wang, D. Transformation reactivity of parent polycyclic aromatic hydrocarbons and the formation trend of halogenated polycyclic aromatic hydrocarbons in the presence of bromide ion during chlorination. Chemical Engineering Journal 400, 125901 (2020).

    Article 
    CAS 

    Google Scholar 

  • Wahab, A., Pfuderer, L., Paenurk, E. & Gershoni-Poranne, R. The COMPAS Project: A Computational Database of Polycyclic Aromatic Systems. Phase 1: cata -Condensed Polybenzenoid Hydrocarbons. J. Chem. Inf. Model. 62, 3704–3713 (2022).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Mayo Yanes, E., Chakraborty, S. & Gershoni-Poranne, R. COMPAS-2: a dataset of cata-condensed hetero-polycyclic aromatic systems. Sci Data 11, 97 (2024).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Wahab, A. & Gershoni-Poranne, R. COMPAS-3: a dataset of peri -condensed polybenzenoid hydrocarbons. Phys. Chem. Chem. Phys. 26, 15344–15357 (2024).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Allison, T. C. & Burgess, D. R. First-Principles Prediction of Enthalpies of Formation for Polycyclic Aromatic Hydrocarbons and Derivatives. J. Phys. Chem. A 119, 11329–11365 (2015).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Xu, S., Wang, Q.-D., Sun, M.-M., Yin, G. & Liang, J. Benchmark calculations for bond dissociation energies and enthalpy of formation of chlorinated and brominated polycyclic aromatic hydrocarbons. RSC Adv. 11, 29690–29701 (2021).

    Article 
    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Huang, C. et al. The aryl hydrocarbon receptor (AhR) activity and DNA-damaging effects of chlorinated polycyclic aromatic hydrocarbons (Cl-PAHs). Chemosphere 211, 640–647 (2018).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Goedtke, L. et al. Polycyclic Aromatic Hydrocarbons Activate the Aryl Hydrocarbon Receptor and the Constitutive Androstane Receptor to Regulate Xenobiotic Metabolism in Human Liver Cells. IJMS 22, 372 (2020).

    Article 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Li, X. et al. Chlorinated Polycyclic Aromatic Hydrocarbons Induce Immunosuppression in THP-1 Macrophages Characterized by Disrupted Amino Acid Metabolism. Environ. Sci. Technol. 56, 16012–16023 (2022).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Putz, M. V. & Putz, A.-M. DFT Chemical Reactivity Driven by Biological Activity: Applications for the Toxicological Fate of Chlorinated PAHs. in Applications of Density Functional Theory to Biological and Bioinorganic Chemistry (eds. Putz, M. V. & Mingos, D. M. P.) vol. 150 181–231 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2013).

  • Mekenyan, O. G., Veith, G. D., Call, D. J. & Ankley, G. T. A QSAR evaluation of Ah receptor binding of halogenated aromatic xenobiotics. Environ Health Perspect 104, 1302–1310 (1996).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Ohura, T., Morita, M., Makino, M., Amagai, T. & Shimoi, K. Aryl Hydrocarbon Receptor-Mediated Effects of Chlorinated Polycyclic Aromatic Hydrocarbons. Chem. Res. Toxicol. 20, 1237–1241 (2007).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Kim, S. et al. PubChem 2023 update. Nucleic Acids Research 51, D1373–D1380 (2023).

    Article 
    PubMed 

    Google Scholar 

  • RDKit: Open-source cheminformatics; http://www.rdkit.org.

  • Halgren, T. A. Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. J. Comput. Chem. 17, 490–519 (1996).

    Article 
    CAS 

    Google Scholar 

  • O’Boyle, N. M. et al. Open Babel: An open chemical toolbox. J Cheminform 3, 33 (2011).

    Article 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Bannwarth, C., Ehlert, S. & Grimme, S. GFN2-xTB—An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions. J. Chem. Theory Comput. 15, 1652–1671 (2019).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Grimme, S., Hansen, A., Ehlert, S. & Mewes, J.-M. r2SCAN-3c: A “Swiss army knife” composite electronic-structure method. The Journal of Chemical Physics 154, 064103 (2021).

    Article 
    ADS 
    CAS 
    PubMed 

    Google Scholar 

  • Neese, F. Software update: The ORCA program system—Version 5.0. WIREs Comput Mol Sci 12, e1606 (2022).

    Article 

    Google Scholar 

  • Paton, R. Shree Sowndarya Santhanalakkshmi Vejaykummar, Guilian Luchini & aligfellow. patonlab/pyQRC: 1.0.4. Zenodo https://doi.org/10.5281/ZENODO.12510821 (2024).

  • Najibi, A. & Goerigk, L. DFT‐D4 counterparts of leading meta-generalized‐gradient approximation and hybrid density functionals for energetics and geometries. J Comput Chem 41, 2562–2572 (2020).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Weigend, F. & Ahlrichs, R. Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy. Phys. Chem. Chem. Phys. 7, 3297 (2005).

    Article 
    CAS 
    PubMed 

    Google Scholar 

  • Neese, F., Wennmohs, F., Hansen, A. & Becker, U. Efficient, approximate and parallel Hartree–Fock and hybrid DFT calculations. A ‘chain-of-spheres’ algorithm for the Hartree–Fock exchange. Chemical Physics 356, 98–109 (2009).

    Article 
    ADS 
    CAS 

    Google Scholar 

  • Pedregosa, F. et al. Scikit-learn: Machine Learning in Python https://doi.org/10.48550/ARXIV.1201.0490 (2012).

  • linear-tree; https://github.com/cerlymarco/linear-tree.

  • Frolov, D. & Sedov, I. Science Data Bank. https://doi.org/10.57760/sciencedb.18703 (2024).

    Article 

    Google Scholar 

  • Chen, G. et al. Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models. Preprint at https://doi.org/10.48550/arXiv.1906.09427 (2019).

  • Medrano Sandonas, L. et al. Dataset for quantum-mechanical exploration of conformers and solvent effects in large drug-like molecules. Sci Data 11, 742 (2024).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Eastman, P. et al. SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials. Sci Data 10, 11 (2023).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Hoja, J. et al. QM7-X, a comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules. Sci Data 8, 43 (2021).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Ramakrishnan, R., Dral, P. O., Rupp, M. & Von Lilienfeld, O. A. Quantum chemistry structures and properties of 134 kilo molecules. Sci Data 1, 140022 (2014).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Khan, D., Benali, A., Kim, S. Y. H., Rudorff, G. F. von & von Lilienfeld, O. A. Quantum mechanical dataset of 836k neutral closed shell molecules with upto 5 heavy atoms from CNOFSiPSClBr. Preprint at https://doi.org/10.48550/arXiv.2405.05961 (2024).

  • Gussoni, M., Rui, M. & Zerbi, G. Electronic and relaxation contribution to linear molecular polarizability. An analysis of the experimental values. Journal of Molecular Structure 447, 163–215 (1998).

    Article 
    ADS 
    CAS 

    Google Scholar 

  • Firth, N. C., Brown, N. & Blagg, J. Plane of Best Fit: A Novel Method to Characterize the Three-Dimensionality of Molecules. J. Chem. Inf. Model. 52, 2516–2525 (2012).

    Article 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Taylor, C. R., Butler, P. W. V. & Day, G. M. Predictive crystallography at scale: mapping, validating, and learning from 1000 crystal energy landscapes. Faraday Discuss. https://doi.org/10.1039/D4FD00105B (2025).

  • Groom, C. R., Bruno, I. J., Lightfoot, M. P. & Ward, S. C. The Cambridge Structural Database. Acta Crystallogr B Struct Sci Cryst Eng Mater 72, 171–179 (2016).

    Article 
    ADS 
    CAS 
    PubMed 
    PubMed Central 

    Google Scholar 

  • Zhang, H., Nettleton, D. & Zhu, Z. Regression-Enhanced Random Forests. https://doi.org/10.48550/ARXIV.1904.10416 (2019).



  • Source link

    Leave a Reply

    Your email address will not be published. Required fields are marked *