Introduction to a Basic Machine Learning Workflow for Predicting Materials Properties

By Benjamin Afflerbach

Materials Science and Engineering, University of Wisconsin-Madison, Madison, WI

Published on

Abstract

Run the Tool: Machine Learning Lab Module This tutorial will introduce core concepts of machine learning through the lens of a basic workflow to predict material bandgaps from material compositions. As we progress through this workflow we will highlight key steps, challenges that can come up with materials data, and potential solutions to these challenges. The core workflow we’ll introduce includes: Data Cleaning, Feature Generation, Feature Engineering, Establishing Model Assessment, Training a Default Model, Hyperparameter Optimization, and Making Predictions. By the end of the tutorial I hope that you’ll have a better understanding of these core concepts, and how they can all fit together.

Bio

Benjamin Afflerbach >Benjamin Afflerbach received a B.S in mechanical engineering from Texas A&M University in 2014. He then obtained a Ph. D. in Materials Science and Engineering from the University of Wisconsin – Madison in 2021.

Dr. Afflerbach is currently a Research Associate in the Materials Science and Engineering department at the University of Wisconsin – Madison. His research is focused on using materials informatics to supplement traditional materials science research, with the main focus of this research being on metallic glasses. In addition to research he is heavily involved in management of the undergraduate research group the Informatics Skunkworks through which he has mentored multiple undergraduate research groups, developed educational materials and curriculum for onboarding undergraduate researchers, and built community infrastructure to help grow the group.

Sponsored by

Cite this work

Researchers should cite this work as follows:

  • Benjamin Afflerbach (2022), "Introduction to a Basic Machine Learning Workflow for Predicting Materials Properties," https://nanohub.org/resources/36489.

    BibTex | EndNote

Time

Tags

An Introduction to Machine Learning for Materials Science: A Basic Workflow for Predicting Materials Properties
  • A Basic Workflow for Predicting Materials Properties 1. A Basic Workflow for Predictin… 0
    00:00/00:00
  • Summary 2. Summary 125.59225892559226
    00:00/00:00
  • An Application: Predict a Materials Property 3. An Application: Predict a Mate… 229.89656322989657
    00:00/00:00
  • A Basic Materials Design Workflow 4. A Basic Materials Design Workf… 306.63997330664
    00:00/00:00
  • Machine Learning for Pattern Matching 5. Machine Learning for Pattern M… 405.17183850517188
    00:00/00:00
  • Key Distinction in ML 6. Key Distinction in ML 483.11644978311648
    00:00/00:00
  • Key Distinction in ML 7. Key Distinction in ML 554.12078745412077
    00:00/00:00
  • Model Types 8. Model Types 645.77911244577911
    00:00/00:00
  • Decision Trees: Structure 9. Decision Trees: Structure 672.57257257257265
    00:00/00:00
  • Decision Trees: Inputs 10. Decision Trees: Inputs 727.52752752752758
    00:00/00:00
  • Decision Trees: Outputs 11. Decision Trees: Outputs 862.06206206206207
    00:00/00:00
  • Summary 12. Summary 922.12212212212216
    00:00/00:00
  • Machine Learning Lab Module Demo 13. Machine Learning Lab Module De… 1022.4557891224558
    00:00/00:00
  • 1. Data Cleaning and Inspection 14. 1. Data Cleaning and Inspectio… 1354.3543543543544
    00:00/00:00
  • 2. Feature Generation 15. 2. Feature Generation 1822.288955622289
    00:00/00:00
  • 3. Feature Engineering 16. 3. Feature Engineering 2136.5031698365033
    00:00/00:00
  • 4. Setup for Model Evaluation 17. 4. Setup for Model Evaluation 2423.4901568234905
    00:00/00:00
  • 5. Fitting and Evaluating a Default Model 18. 5. Fitting and Evaluating a De… 2616.7834501167836
    00:00/00:00
  • 6. Improving the Model by Optimizing Hyperparameters 19. 6. Improving the Model by Opti… 2903.4701368034703
    00:00/00:00
  • 7. Making Predictions 20. 7. Making Predictions 3096.2295628962297
    00:00/00:00
  • Questions 21. Questions 3196.4964964964965
    00:00/00:00