Science 2018-04-13

Predicting reaction performance in C–N cross-coupling using machine learning

Derek T. Ahneman, Jesús G. Estrada, Shishi Lin, Spencer D. Dreher, Abigail G. Doyle

DOI: 10.1126/science.aar5169

Abstract

Machine learning methods are becoming integral to scientific inquiry in numerous disciplines. We demonstrated that machine learning can be used to predict the performance of a synthetic reaction in multidimensional chemical space using data obtained via high-throughput experimentation. We created scripts to compute and extract atomic, molecular, and vibrational descriptors for the components of a palladium-catalyzed Buchwald-Hartwig cross-coupling of aryl halides with 4-methylaniline in the presence of various potentially inhibitory additives. Using these descriptors as inputs and reaction yield as output, we showed that a random forest algorithm provides significantly improved predictive performance over linear regression analysis. The random forest model was also successfully applied to sparse training sets and out-of-sample prediction, suggesting its value in facilitating adoption of synthetic methodology.
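The comparison described in the abstract (descriptors as inputs, yield as output, random forest versus linear regression) can be illustrated with a minimal sketch. This is not the authors' code or data: the descriptor matrix and the nonlinear "yield" function below are synthetic stand-ins invented for illustration, and the use of scikit-learn, the hyperparameters, and the R² metric on a held-out split are all assumptions.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (NOT the paper's dataset): 2000 hypothetical
# reactions, each described by 8 made-up numeric descriptors.
rng = np.random.default_rng(0)
n_reactions, n_descriptors = 2000, 8
X = rng.uniform(-1.0, 1.0, size=(n_reactions, n_descriptors))

# Hypothetical yield (%) with an interaction term and a nonlinear term,
# plus noise, clipped to the physically meaningful 0-100 range.
raw_yield = (
    50.0
    + 40.0 * X[:, 0] * X[:, 1]
    + 30.0 * np.sin(3.0 * X[:, 2])
    + rng.normal(0.0, 2.0, size=n_reactions)
)
yield_pct = np.clip(raw_yield, 0.0, 100.0)

# Hold out 30% of the reactions to evaluate predictions on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, yield_pct, test_size=0.3, random_state=0
)


def fit_and_score(model):
    """Fit on the training split and return R^2 on the held-out split."""
    model.fit(X_train, y_train)
    return r2_score(y_test, model.predict(X_test))


r2_linear = fit_and_score(LinearRegression())
r2_forest = fit_and_score(RandomForestRegressor(n_estimators=200, random_state=0))

print(f"linear regression R^2: {r2_linear:.3f}")
print(f"random forest R^2:     {r2_forest:.3f}")
```

On data like this, where yield depends on descriptor interactions, a linear model cannot represent the product term, while an ensemble of decision trees can approximate it. The size of any gap on real reaction data depends on the descriptors and the dataset, so the numbers printed here say nothing about the paper's reported results.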

