Post 2 companion ยท Adjust parameters to see how the combined DFT+ML pipeline saves time
Download candidate structures from databases: Materials Project, ICSD, AFLOW, or generate them computationally.
Run DFT (VASP, WIEN2k, QE) on a representative subset. Calculate target properties: band gap, formation energy, etc. These become the labels for ML.
Train a model (e.g. ALIGNN, random forest, neural network) on the DFT-labelled data. Validate on a held-out test set. Tune hyperparameters.
Apply the trained model to all remaining structures. Predict the target property for each in milliseconds. Rank by predicted value and filter.
Take the top 50โ100 candidates from ML screening. Run full DFT calculations to verify. These are your final, reliable predictions.