Machine learning (ML) and artificial intelligence (AI) methods for
modeling useful materials properties are now important technologies for
rational design and optimization of bespoke functional materials.
Although these methods make good predictions of the properties of new
materials, current modeling methods use efficient but rather arcane
(difficult-to-interpret) mathematical features (descriptors) to
characterize materials. Data-driven ML models are considerably more
useful if more chemically interpretable descriptors are used to train
them, as long as these models also accurately recapitulate the
properties of materials in training and test sets used to generate and
validate the models. Herein, how a particular type of molecular fragment
descriptor, the signature descriptor, achieves these joint aims of
accuracy and interpretability is described. Seven different types of
materials properties are modeled, and the performance of models
generated from signature descriptors is compared with those generated by
widely used Dragon descriptors. The key descriptors in the model
represent functionalities that make chemical sense. Mapping these
fragments back on to exemplar materials provides a useful guide to
chemists wishing to modify promising lead materials to improve their
properties. This is one of the first applications of signature
descriptors to the modeling of complex materials properties.
History
Publication Date
2019-01-01
Journal
Advanced Intelligent Systems
Volume
1
Issue
8
Article Number
1900045
Pagination
16p.
Publisher
Wiley
ISSN
2640-4567
Rights Statement
The Authors reserves all moral rights over the deposited text and must be credited if any re-use occurs.