Insights into the deviation from piecewise linearity in transition metal complexes from supervised machine learning models

Abstract

Virtual high-throughput screening (VHTS) and machine learning (ML) with density functional theory (DFT) suffer from inaccuracies from the underlying density functional approximation (DFA). Many of these inaccuracies can be traced to the lack of derivative discontinuity that leads to a curvature in the energy with electron addition or removal. Over a dataset of nearly one thousand transition metal complexes typical of VHTS applications, we computed and analyzed the average curvature (i.e., deviation from piecewise linearity) for 23 density functional approximations spanning multiple rungs of “Jacobs ladder”. While we observe the expected dependence of the curvatures on HF exchange, we note limited correlation of curvature values between different rungs of “Jacobs ladder”. We train ML models (i.e., artificial neural networks or ANNs) to predict the curvature and the associated frontier orbital energies for each of these 23 functionals and then interpret differences in curvature among the different DFAs through analysis of the ML models. Notably, we observe spin to play a much more important role in determining the curvature of range-separated and double hybrids in comparison to semi-local functionals, explaining why curvature values are weakly correlated between these and other families of functionals. Over a space of 187.2k hypothetical compounds, we use our ANNs to pinpoint DFAs for which representative transition metal complexes have near-zero curvature with low uncertainty, demonstrating an approach to accelerate screening of complexes with targeted optical gaps.

Publication
Phys. Chem. Chem. Phys., 25, 8103-8116 (2023)
Yael Cytter
Yael Cytter
Research Scientist
Chenru Duan
Chenru Duan
Chemistry PhD
Heather J. Kulik
Heather J. Kulik
Professor of Chemical Engineering and Chemistry