The Materials Simulation Toolkit for Machine Learning (MAST-ML): Automating Development and Evaluation of Machine Learning Models for Materials Property Prediction

By Ryan Jacobs

University of Wisconsin - Madison, Madison, WI


Abstract

This tutorial introduces the Materials Simulation Toolkit for Machine Learning (MAST-ML), a Python package designed to broaden and accelerate the use of machine learning and data science methods for materials property prediction. Through hands-on activities, we will use MAST-ML to:

  1. import materials datasets from online databases and clean and examine our input data,
  2. conduct feature engineering analysis, including generation, preprocessing, and selection of features,
  3. construct, evaluate and compare the performance of different model types and data splitting techniques, and
  4. conduct a preliminary analysis of model errors and uncertainty quantification (UQ).
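
For readers who want a concrete picture of the four steps before opening the tool, the following is a minimal sketch of the same kind of workflow written directly in scikit-learn. It does not use the MAST-ML API. The synthetic dataset, the two model choices, the number of selected features, and the per-tree spread used as an uncertainty estimate are all assumptions made for illustration, not the tutorial's actual settings.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import KFold, cross_val_predict
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Step 1 (stand-in): synthetic regression data replaces a real materials
# dataset; a real workflow would load and clean database records here.
X, y = make_regression(n_samples=200, n_features=20, n_informative=5,
                       noise=10.0, random_state=0)


def build(model):
    """Step 2: scale and select features inside a pipeline, so both are
    refit on every training fold and never see the held-out data."""
    return make_pipeline(StandardScaler(), SelectKBest(f_regression, k=5), model)


models = {
    "ridge": build(Ridge(alpha=1.0)),
    "random_forest": build(RandomForestRegressor(n_estimators=50, random_state=0)),
}

# Step 3: compare model types under the same random 5-fold split.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = {}
for name, model in models.items():
    predictions = cross_val_predict(model, X, y, cv=cv)
    scores[name] = mean_absolute_error(y, predictions)

# Step 4: a crude uncertainty estimate, taken as the spread of the
# individual tree predictions in the fitted random forest.
forest_pipeline = models["random_forest"].fit(X, y)
X_selected = forest_pipeline[:-1].transform(X)
tree_predictions = np.stack(
    [tree.predict(X_selected) for tree in forest_pipeline[-1].estimators_]
)
uncertainty = tree_predictions.std(axis=0)

for name, mae in scores.items():
    print(f"{name}: cross-validated MAE = {mae:.2f}")
print(f"mean per-sample uncertainty = {uncertainty.mean():.2f}")
```

Keeping preprocessing and feature selection inside the pipeline is the main design choice here: it keeps the cross-validated error comparison between models free of information leaked from the held-out folds.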

MAST-ML resources:

MAST-ML on nanoHUB: https://nanohub.org/tools/mastmltutorial
MAST-ML code: https://github.com/uw-cmg/MAST-ML
Publication: https://doi.org/10.1016/j.commatsci.2020.109544
MAST-ML tutorials: https://github.com/uw-cmg/MAST-ML/tree/master/examples


Cite this work

Researchers should cite this work as follows:

  • Ryan Jacobs (2021), "The Materials Simulation Toolkit for Machine Learning (MAST-ML): Automating Development and Evaluation of Machine Learning Models for Materials Property Prediction," https://nanohub.org/resources/35142.



The Materials Simulation Toolkit for Machine Learning (MAST-ML): Automating Development and Evaluation of Machine Learning Models for Materials Property Prediction
  1. The MAterials Simulation Toolkit for Machine Learning (MAST-ML): Automating Development and Evaluation of Machine Learning Models for Materials Property Prediction (0:00)
  2. Machine learning in Materials Science is Exploding (2:09)
  3. A Basic Materials Design Workflow (3:08)
  4. What is MAST-ML? (5:15)
  5. MAST-ML automates the supervised learning workflow (6:22)
  6. (NSF CSSI) Machine Learning Materials Innovation Infrastructure (9:21)
  7. (NSF CSSI) Machine Learning Materials Innovation Infrastructure (10:03)
  8. Test Problem: Impurity Diffusion Database (10:57)
  9. Getting Started with the MAST-ML tutorial on NanoHub (12:27)
  10. Demo (12:49)
  11. Q&A (55:10)