Orders of Coupling Representations as a Versatile Framework for Machine Learning from Sparse Data in High-Dimensional Spaces

dc.contributor.authorManzhos, Sergei
dc.contributor.authorCarrington, Tucker Jr.
dc.contributor.authorIhara, Manabu
dc.date.accessioned2024-01-12T15:21:16Z
dc.date.available2024-01-12T15:21:16Z
dc.date.issued2023-07-17
dc.description© 2023 The Author(s). Published by Elsevier B.V. and also available at Elsevier via [DOI] © 2023. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
dc.description.abstractMachine learning (ML) techniques are already widely and increasingly used in diverse applications in science and technology, including computational chemistry. Specifically in computational chemistry, neural networks (NN) and kernel methods such as Gaussian process regressions (GPR) have been increasingly used for the construction of potential functions and functionals for density functional theory. While ML techniques have a number of advantages vs intuition-based models, notably their generality and black-box nature, they are still challenged when faced with high dimensionality of the feature space or low and uneven data density – in part because of their general nature. We review recent works using methods such as NNs and GPR as building blocks of composite methods in the framework of an expansion over orders of coupling. We introduce models using NN or GPR-based components as part of HDMR (high-dimensional model representations)-based structures. HDMR is a formalization of orders-of-coupling representations that include the many-body and N-mode representations well known in computational chemistry and allows, in particular, building all terms from one dataset of arbitrarily distributed data. The resulting HDMR-NN and HDMR-GPR combinations and NN with HDMR-GPR derived neuron activation functions not requiring non-linear optimization enhance machine learning capabilities in high dimensional spaces and or with sparse data.
dc.identifier.citationSergei Manzhos, Tucker Carrington, Manabu Ihara. Orders of coupling representations as a versatile framework for machine learning from sparse data in high-dimensional spaces. Artificial Intelligence Chemistry, 2023 1(2), 100008, https://doi.org/10.1016/j.aichem.2023.100008.
dc.identifier.doihttps://doi.org/10.1016/j.aichem.2023.100008
dc.identifier.urihttps://hdl.handle.net/1974/32725
dc.language.isoen
dc.publisherElsevier
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.titleOrders of Coupling Representations as a Versatile Framework for Machine Learning from Sparse Data in High-Dimensional Spaces
dc.typejournal article
oaire.awardNumberGrant Number JPMJMI22H1
project.funder.identifierhttp://dx.doi.org/501100020959
project.funder.identifierhttp://dx.doi.org/10.13039/501100000038
project.funder.nameJST-Mirai Program, Japan
project.funder.nameNatural Sciences and Engineering Research Council of Canada
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
jrnl version.pdf
Size:
653.6 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.56 KB
Format:
Item-specific license agreed upon to submission
Description: