Bayesian regression for
multi-level machine-learned potentials
Subproject P03
The first-principles description of the properties of multi-component metal oxides is an exceedingly challenging problem. The reasons are that the configurational space grows exponentially with the number of species and standard Density Functional Theory (DFT) is often not accurate enough. The long-term objective of P03 is to accelerate first-principles calculations by developing machine-learning approaches for the description of the interatomic forces, Born effective charges, and other tensorial properties of multivalent oxides. The project will rely on kernel-based methods and Bayesian inference to implement fully automatic “on-the-fly” learning.
In the first project period, we will develop machine-learned force fields (MLFF) for DFT and DFT+U, whereby the number of components in the FF will be gradually increased. A concise framework for learning tensorial properties will be implemented. We will use this to simulate infrared spectra of oxide materials, which can be readily compared to the finite-temperature spectra measured by the experimental groups.
The difference between DFT and hybrid functionals will be machine-learned to go beyond semi-local functionals (Delta-learning). The long-term perspective is to extend this approach to highly accurate beyond-DFT methods, such as the random phase approximation and quantum chemistry (coupled cluster) methods. Although kernel-based methods are exceedingly accurate, they are often less efficient than NN. We will collaborate with other projects to recast the on-the-fly trained FF into NN potentials to address this issue.
In the second funding period, the project is structured around four principal components:
Δ-Learning Beyond Local Density Functional Theory:
Significant progress was made in the first funding period by integrating the random phase approximation (RPA) with machine learning, yielding improved agreement with experimental data for various materials, including surface science applications. To enhance robustness, the approach will be refined by adopting message-passing neural networks, which are less data-intensive than kernel-based methods. This methodology will be applied to multivalent oxides, specifically iron oxides and other complex oxide surfaces. Furthermore, the project intends to explore advanced methods beyond RPA, such as coupled cluster and auxiliary field quantum Monte Carlo methods.
Transition from Kernel-Based Methods to Message-Passing Networks:
Equivariant message-passing networks demonstrate substantial accuracy gains over traditional approaches, albeit at a higher computational cost, necessitating GPU-based computations. The utilization of graph-ACE (atomic cluster expansion) is expected to advance the learning of force fields for complex materials, the differentiation between RPA and DFT, and the learning of tensorial quantities. A transition to graph-ACE will occur whenever the current methodology exhibits unsatisfactory accuracy.
Learning Polarization, Polarizabilities, and Simulating IR Spectra:
During the first funding period, polipy4vasp was developed for predicting IR spectra. While prediction accuracy for polarization and electric field gradient (NMR data) is generally very good to excellent across many systems, challenges persist. Kernel-based methods encounter some difficulties in learning tensorial quantities. Plans include improving accuracy through the use of higher-order body descriptors and ACE. Additionally, the project aims to explore advanced message-passing approaches like graph-ACE and migrate to contemporary software platforms such as PyTorch. Finally, training will be conducted on the electrical enthalpy and its derivatives.
Bayesian Regression and Global Optimization:
Active learning strategies, including Bayesian regression, are routinely and reliably employed. However, the predicted errors occasionally display an unsatisfactory distribution, which is partly attributed to the use of local descriptors. The project will investigate global descriptors to mitigate prediction errors and generate more accurate error estimates that align with expected distributions. This refined methodology will extend and enhance the active learning strategy, increasing its robustness. We also plan to implement global optimization strategies based on Bayesian regression and global descriptor-based force fields.
Expertise
The main research efforts of the group are directed towards the development of quantum-mechanical tools for atomic-scale simulations of properties and processes in materials and the application of these methodologies to key areas of condensed matter physics and materials research. An important pillar of the research is the Vienna Ab initio Simulation Package (VASP), a general-purpose ab initio code for solving the many-electron Schrödinger equation. The code is among the world leaders in its field, with more than 3500 licensees worldwide. We have expertise with simulations for a vast number of properties using many different techniques:
- Density functional theory (DFT), including spin and non-collinear DFT
- Linear response theory to calculate phonons and dielectric properties
- Hartree-Fock techniques and many flavors of hybrid functionals
- Many-body perturbation theory, including GW and Bethe-Salpeter
- Wavefunction-based correlated methods (Møller-Plesset perturbation theory)
- Surface science, including growth and oxide formation
- Simulation of nanostructures
- Semiconductor physics: charge trapping, polarons
- Electronic excitations
- Defect energies in extended systems
For TACO, we will adapt our machine-learning techniques to tensorial properties and correlated wavefunction techniques. These techniques are directly integrated into VASP and allow to accelerate finite-temperature simulations by many orders of magnitudes.
Team
Former Members
Publications
2025

Schmiedmayer, Bernhard; Wolffs, Jop W.; de Wijs, Gilles A.; Kentgens, Arno P. M.; Lahnsteiner, Jonathan; Kresse, Georg
Journal ArticleOpen AccessIn: The Journal of Chemical Physics, vol. 163, pp. 214110, 2025.
Abstract | Links | BibTeX | Tags: P03
@article{Schmiedmayer_2025a,
title = {Equivariant machine learning of electric field gradients—Predicting the quadrupolar coupling constant in the MAPbI_{3} phase transition},
author = {Bernhard Schmiedmayer and Jop W. Wolffs and Gilles A. de Wijs and Arno P. M. Kentgens and Jonathan Lahnsteiner and Georg Kresse},
doi = {10.1063/5.0301056},
year = {2025},
date = {2025-12-02},
journal = {The Journal of Chemical Physics},
volume = {163},
pages = {214110},
abstract = {We present a strategy combining machine learning and first-principle calculations to achieve highly accurate nuclear quadrupolar coupling constant predictions. Our approach employs two distinct machine-learning frameworks: a machine-learned force field to generate molecular dynamics trajectories and a second model for electric field gradients that preserves rotational and translational symmetries. By incorporating thermostat-driven molecular dynamics sampling, we enable the prediction of quadrupolar coupling constants in highly disordered materials at finite temperatures. We validate our method by predicting the tetragonal-to-cubic phase transition temperature of the organic–inorganic halide perovskite MAPbI_{3}, obtaining results that closely match experimental data.},
keywords = {P03},
pubstate = {published},
tppubtype = {article}
}

Wadhwa, Payal; Schmid, Michael; Kresse, Georg
Machine learning study of surface reconstructions of the Cu2O(111) surface
Journal ArticleOpen AccessIn: Physical Review B, vol. 112, iss. 20, pp. 205420, 2025.
Abstract | Links | BibTeX | Tags: P02, P03
@article{Wadhwa_2025a,
title = {Machine learning study of surface reconstructions of the Cu_{2}O(111) surface},
author = {Payal Wadhwa and Michael Schmid and Georg Kresse},
doi = {10.1103/sfjm-1gyr},
year = {2025},
date = {2025-11-17},
journal = {Physical Review B},
volume = {112},
issue = {20},
pages = {205420},
abstract = {The atomic structure of the most stable reconstructed surface of cuprous oxide (Cu_{2}O)(111) surface has been a longstanding topic of debate. In this study, we develop on-the-fly machine-learned force fields (MLFFs) to systematically investigate the various reconstructions of the Cu_{2}O(111) surface under stoichiometric as well as O- and Cu-deficient or rich conditions, focusing on both (√3×√3)R30∘ and (2×2) supercells. By utilizing parallel tempering simulations supported by MLFFs, we confirm that the previously described nanopyramidal and Cu-deficient (1×1) structures are the lowest energy structures from moderately to strongly oxidizing conditions. In addition, we identify two promising nanopyramidal reconstructions at highly reducing conditions, a stoichiometric one and a Cu-rich one. Surface energy calculations performed using spin-polarized PBE, PBE+𝑈, r2SCAN, and HSE06 functionals show that the previously known Cu-deficient configuration and nanopyramidal configurations are at the convex hull (and, thus, equilibrium structures) for all functionals, whereas the stability of the other structures depends on the functional and is therefore uncertain. Our findings demonstrate that on-the-fly trained MLFFs provide a simple, efficient, and rapid approach to explore the complex surface reconstructions commonly encountered in experimental studies, and also enhance our understanding of the stability of Cu_{2}O(111) surfaces.},
keywords = {P02, P03},
pubstate = {published},
tppubtype = {article}
}

Cao, Yu; Wang, Jiantao; Liu, Mingfeng; Liu, Yan; Ma, Hui; Franchini, Cesare; Sun, Yan; Kresse, Georg; Chen, Xing-Qiu; Liu, Peitao
Quantum Delocalization Enables Water Dissociation on Ru(0001)
Journal ArticleOpen AccessIn: Physical Review Letters, vol. 134, iss. 17, pp. 178001, 2025.
Abstract | Links | BibTeX | Tags: P03, P07
@article{Cao_2025a,
title = {Quantum Delocalization Enables Water Dissociation on Ru(0001)},
author = {Yu Cao and Jiantao Wang and Mingfeng Liu and Yan Liu and Hui Ma and Cesare Franchini and Yan Sun and Georg Kresse and Xing-Qiu Chen and Peitao Liu},
doi = {10.1103/PhysRevLett.134.178001},
year = {2025},
date = {2025-04-30},
journal = {Physical Review Letters},
volume = {134},
issue = {17},
pages = {178001},
abstract = {We revisit the long-standing question of whether water molecules dissociate on the Ru(0001) surface through nanosecond-scale path-integral molecular dynamics simulations on a sizable supercell. This is made possible through the development of an efficient and reliable machine-learning potential with near first-principles accuracy, overcoming the limitations of previous ab initio studies. We show that the quantum delocalization associated with nuclear quantum effects enables rapid and frequent proton transfers between water molecules, thereby facilitating the water dissociation on Ru(0001). This work provides the direct theoretical evidence of water dissociation on Ru(0001), resolving the enduring issue in surface sciences and offering crucial atomistic insights into water-metal interfaces.},
keywords = {P03, P07},
pubstate = {published},
tppubtype = {article}
}

Romano, Salvatore; de Hijes, Pablo Montero; Meier, Matthias; Kresse, Georg; Franchini, Cesare; Dellago, Christoph
Journal ArticleOpen AccessIn: Journal of Chemical Theory and Computation, vol. 21, iss. 4, pp. 1951–1960, 2025.
Abstract | Links | BibTeX | Tags: P03, P07, P12
@article{Romano_2024a,
title = {Structure and Dynamics of the Magnetite(001)/Water Interface from Molecular Dynamics Simulations Based on a Neural Network Potential},
author = {Salvatore Romano and Pablo Montero de Hijes and Matthias Meier and Georg Kresse and Cesare Franchini and Christoph Dellago},
doi = {10.1021/acs.jctc.4c01507},
year = {2025},
date = {2025-02-13},
urldate = {2024-08-21},
journal = {Journal of Chemical Theory and Computation},
volume = {21},
issue = {4},
pages = {1951–1960},
abstract = {The magnetite/water interface is commonly found in nature and plays a crucial role in various technological applications. However, our understanding of its structural and dynamical properties at the molecular scale remains still limited. In this study, we developed an efficient Behler-Parrinello neural network potential (NNP) for the magnetite/water system, paying particular attention to the accurate generation of reference data with density functional theory. Using this NNP, we performed extensive molecular dynamics simulations of the magnetite (001) surface across a wide range of water coverages, from single molecules to bulk water. Our simulations revealed several new ground states of low coverage water on the Subsurface Cation Vacancy (SCV) model and yielded a density profile of water at the surface that exhibits marked layering. By calculating mean square displacements, we obtained quantitative information on the diffusion of water molecules on the SCV for different coverages, revealing significant anisotropy. Additionally, our simulations provided qualitative insights into the dissociation mechanisms of water molecules at the surface.},
keywords = {P03, P07, P12},
pubstate = {published},
tppubtype = {article}
}
2024

Faller, Carolin; Kaltak, Merzuk; Kresse, Georg
Density-Based Long-Range Electrostatic Descriptors for Machine Learning Force Fields
Journal ArticleOpen AccessIn: The Journal of Chemical Physics, vol. 161, iss. 21, pp. 214701, 2024, (submitted to the Journal of Chemical Physics).
Abstract | Links | BibTeX | Tags: P03
@article{Faller_2024a,
title = {Density-Based Long-Range Electrostatic Descriptors for Machine Learning Force Fields},
author = {Carolin Faller and Merzuk Kaltak and Georg Kresse},
doi = {10.1063/5.0245615},
year = {2024},
date = {2024-12-02},
urldate = {2024-06-25},
journal = {The Journal of Chemical Physics},
volume = {161},
issue = {21},
pages = {214701},
abstract = {This study presents a long-range descriptor for machine learning force fields that maintains translational and rotational symmetry, similar to short-range descriptors while being able to incorporate long-range electrostatic interactions. The proposed descriptor is based on an atomic density representation and is structurally similar to classical short-range atom-centered descriptors, making it straightforward to integrate into machine learning schemes. The effectiveness of our model is demonstrated through comparative analysis with the long-distance equivariant (LODE) [Grisafi and Ceriotti, J. Chem. Phys. 151, 204105 (2019)] descriptor. In a toy model with purely electrostatic interactions, our model achieves errors below 0.1%, worse than LODE but still very good. For real materials, we perform tests for liquid NaCl, rock salt NaCl, and solid zirconia. For NaCl, the present descriptors improve on short-range density descriptors, reducing errors by a factor of two to three and coming close to message-passing networks. However, for solid zirconia, no improvements are observed with the present approach, while message-passing networks reduce the error by almost a factor of two to three. Possible shortcomings of the present model are briefly discussed.},
note = {submitted to the Journal of Chemical Physics},
keywords = {P03},
pubstate = {published},
tppubtype = {article}
}

de Hijes, Pablo Montero; Dellago, Christoph; Jinnouchi, Ryosuke; Kresse, Georg
Density isobar of water and melting temperature of ice: Assessing common density functionals
Journal ArticleOpen AccessIn: The Journal of Chemical Physics, vol. 161, pp. 131102, 2024.
Abstract | Links | BibTeX | Tags: P03, P12
@article{Montero-de-Hijes_2024b,
title = {Density isobar of water and melting temperature of ice: Assessing common density functionals},
author = {Pablo Montero de Hijes and Christoph Dellago and Ryosuke Jinnouchi and Georg Kresse},
url = {https://doi.org/10.1063/5.0227514},
year = {2024},
date = {2024-10-03},
urldate = {2024-06-06},
journal = {The Journal of Chemical Physics},
volume = {161},
pages = {131102},
abstract = {We investigate the density isobar of water and the melting temperature of ice using six different density functionals. Machine-learning potentials are employed to ensure computational affordability. Our findings reveal significant discrepancies between various base functionals. Notably, even the choice of damping can result in substantial differences. Overall, the outcomes obtained through density functional theory are not entirely satisfactory across most utilized functionals. All functionals exhibit significant deviations either in the melting temperature or equilibrium volume, with most of them even predicting an incorrect volume difference between ice and water. Our heuristic analysis indicates that a hybrid functional with 25% exact exchange and van der Waals damping averaged between zero and Becke–Johnson dampings yields the closest agreement with experimental data. This study underscores the necessity for further enhancements in the treatment of van der Waals interactions and, more broadly, density functional theory to enable accurate quantitative predictions for molecular liquids.},
keywords = {P03, P12},
pubstate = {published},
tppubtype = {article}
}

Hütner, Johanna I.; Conti, Andrea; Kugler, David; Mittendorfer, Florian; Kresse, Georg; Schmid, Michael; Diebold, Ulrike; Balajka, Jan
Stoichiometric reconstruction of the Al2O3(0001) surface
Journal ArticleOpen AccessIn: Science, vol. 385, pp. 1241–1244, 2024, ISSN: 1095-9203.
Abstract | Links | BibTeX | Tags: P02, P03
@article{Huetner2024,
title = {Stoichiometric reconstruction of the Al_{2}O_{3}(0001) surface},
author = {Johanna I. Hütner and Andrea Conti and David Kugler and Florian Mittendorfer and Georg Kresse and Michael Schmid and Ulrike Diebold and Jan Balajka},
url = {https://www.science.org/doi/10.1126/science.adq4744
https://arxiv.org/abs/2405.19263},
issn = {1095-9203},
year = {2024},
date = {2024-09-12},
urldate = {2024-09-12},
journal = {Science},
volume = {385},
pages = {1241--1244},
publisher = {American Association for the Advancement of Science (AAAS)},
abstract = {Macroscopic properties of materials stem from fundamental atomic-scale details, yet for insulators, resolving surface structures remains a challenge. We imaged the basal (0001) plane of α–aluminum oxide (α-Al_{2}O_{3}) using noncontact atomic force microscopy with an atomically defined tip apex. The surface formed a complex (√31 × √31)R±9° reconstruction. The lateral positions of the individual oxygen and aluminum surface atoms come directly from experiment; we determined with computational modeling how these connect to the underlying crystal bulk. Before the restructuring, the surface Al atoms assume an unfavorable, threefold planar coordination; the reconstruction allows a rehybridization with subsurface O that leads to a substantial energy gain. The reconstructed surface remains stoichiometric, Al_{2}O_{3}.},
keywords = {P02, P03},
pubstate = {published},
tppubtype = {article}
}

Schmiedmayer, Bernhard; Kresse, Georg
Journal ArticleOpen AccessIn: The Journal of Chemical Physics, vol. 161, iss. 8, pp. 084703, 2024.
Abstract | Links | BibTeX | Tags: P03
@article{Schmiedmayer_2024a,
title = {Derivative learning of tensorial quantities—Predicting finite temperature infrared spectra from first principles},
author = {Bernhard Schmiedmayer and Georg Kresse},
doi = {https://doi.org/10.1063/5.0217243},
year = {2024},
date = {2024-08-28},
journal = {The Journal of Chemical Physics},
volume = {161},
issue = {8},
pages = {084703},
abstract = {We develop a strategy that integrates machine learning and first-principles calculations to achieve technically accurate predictions of infrared spectra. In particular, the methodology allows one to predict infrared spectra for complex systems at finite temperatures. The method’s effectiveness is demonstrated in challenging scenarios, such as the analysis of water and the organic–inorganic halide perovskite MAPbI_{3}, where our results consistently align with experimental data. A distinctive feature of the methodology is the incorporation of derivative learning, which proves indispensable for obtaining accurate polarization data in bulk materials and facilitates the training of a machine learning surrogate model of the polarization adapted to rotational and translational symmetries. We achieve polarization prediction accuracies of about 1% for the water dimer by training only on the predicted Born effective charges.},
keywords = {P03},
pubstate = {published},
tppubtype = {article}
}
de Hijes, Pablo Montero; Dellago, Christoph; Jinnouchi, Ryosuke; Schmiedmayer, Bernhard; Kresse, Georg
Journal ArticleOpen AccessIn: The Journal of Chemical Physics, vol. 160, iss. 11, no. 114107, 2024.
Abstract | Links | BibTeX | Tags: P03, P12
@article{10.1063/5.0197105,
title = {Comparing machine learning potentials for water: Kernel-based regression and Behler–Parrinello neural networks},
author = {Pablo Montero de Hijes and Christoph Dellago and Ryosuke Jinnouchi and Bernhard Schmiedmayer and Georg Kresse},
doi = {https://doi.org/10.1063/5.0197105},
year = {2024},
date = {2024-03-20},
urldate = {2024-03-20},
journal = {The Journal of Chemical Physics},
volume = {160},
number = {114107},
issue = {11},
abstract = {In this paper, we investigate the performance of different machine learning potentials (MLPs) in predicting key thermodynamic properties of water using RPBE + D3. Specifically, we scrutinize kernel-based regression and high-dimensional neural networks trained on a highly accurate dataset consisting of about 1500 structures, as well as a smaller dataset, about half the size, obtained using only on-the-fly learning. This study reveals that despite minor differences between the MLPs, their agreement on observables such as the diffusion constant and pair-correlation functions is excellent, especially for the large training dataset. Variations in the predicted density isobars, albeit somewhat larger, are also acceptable, particularly given the errors inherent to approximate density functional theory. Overall, this study emphasizes the relevance of the database over the fitting method. Finally, this study underscores the limitations of root mean square errors and the need for comprehensive testing, advocating the use of multiple MLPs for enhanced certainty, particularly when simulating complex thermodynamic properties that may not be fully captured by simpler tests.},
keywords = {P03, P12},
pubstate = {published},
tppubtype = {article}
}
2023

Verdi, Carla; Ranalli, Luigi; Franchini, Cesare; Kresse, Georg
Journal ArticleIn: Physical Review Materials, vol. 7, no. 3, pp. l030801, 2023.
Abstract | Links | BibTeX | Tags: P03, P07
@article{Verdi2023,
title = {Quantum paraelectricity and structural phase transitions in strontium titanate beyond density functional theory},
author = {Carla Verdi and Luigi Ranalli and Cesare Franchini and Georg Kresse},
doi = {10.1103/physrevmaterials.7.l030801},
year = {2023},
date = {2023-03-16},
journal = {Physical Review Materials},
volume = {7},
number = {3},
pages = {l030801},
publisher = {American Physical Society (APS)},
abstract = {We demonstrate an approach for calculating temperature-dependent quantum and anharmonic effects with beyond density-functional theory accuracy. By combining machine-learned potentials and the stochastic self-consistent harmonic approximation, we investigate the cubic to tetragonal transition in strontium titanate and show that the paraelectric phase is stabilized by anharmonic quantum fluctuations. We find that a quantitative understanding of the quantum paraelectric behavior requires a higher-level treatment of electronic correlation effects via the random phase approximation. This approach enables detailed studies of emergent properties in strongly anharmonic materials beyond density-functional theory.},
keywords = {P03, P07},
pubstate = {published},
tppubtype = {article}
}
