Skip to article frontmatterSkip to article content

Codes

Computational Codes for High-Throughput Simulations

Here we will discuss the computational codes used in high-throughput (HT) simulations, focusing on density functional theory (DFT), molecular dynamics, and Monte Carlo methods. These codes are essential for calculating the properties of materials in a high-throughput manner, enabling the exploration of large materials databases and the prediction of new materials with desired properties.

Density Functional Theory

In HT, DFT is mainly used for:

Popular DFT codes include:

CodeLicencingFeature
VASPShared-source commercialKnown for accuracy and efficiency, especially for periodic systems. Suitable for transition metals and heavy elements. Used in databases like the Materials Project.
Quantum ESPRESSOOpen-sourceFlexible and scalable suite for electronic structure calculations and materials modeling. Suitable for a wide range of systems and properties.
CASTEPShared-source commercial/academicPowerful for computing spectroscopy properties. Used in the Materials Studio software suite.
GPAWOpen-sourcePython-based DFT code. Works well with ASE and uses the PAW method for pseudopotentials.
GaussianCommercialWidely used for quantum chemistry calculations, including DFT. Known for its accuracy and extensive range of methods.

Molecular Dynamics

Recall that force fields are used for studying time-dependent phenomena, thermal properties, and materials behavior under realistic conditions.

Popular force field codes include:

CodeLicencingFeature
LAMMPSOpen-sourceHighly flexible and scalable molecular dynamics code. Suitable for a wide range of materials and simulation types.
GROMACSOpen-sourceSpecialized for biomolecular simulations but also used for materials modeling. Known for its performance and efficiency.

For Monte Carlo, the codes are usually developed in-house or adapted from existing libraries due to the specific nature of the simulations.

Performance and Reproducibility

These codes were written in Fortran or Python calling Fortran/C++ Libraries for high performance. They are usually run on high-performance computing (HPC) clusters due to their high computational cost. For such purpose, they are optimized for parallel computing using MPI (Message Passing Interface) or OpenMP (Open Multi-Processing) to distribute the workload across multiple processors.

Verification of the precision of DFT codes. Reproduced from Bosoni et al..

Verification of the precision of DFT codes. Reproduced from Bosoni et al..

The codes are extremely complicated, especially for DFT codes, which requires a deep understanding of DFT theory and computational methods to use effectively. The development of these codes usually needs collaborations between different research group. Most of the DFT codes can show consistent results (Δ value), which is important for HT calculations.

However, be careful when publishing results for comparison between different codes, as it might violate the license agreement (especially for commercial softwares).

Workflow Management Systems

Some workflows from Atomate 2 for high-throughput calculations. Reproduced from Ganose et al..

Some workflows from Atomate 2 for high-throughput calculations. Reproduced from Ganose et al..

Workflow management systems are essential for coordinating and automating the various steps in a high-throughput simulation workflow. These systems help manage the complexity of running large numbers of simulations. They can handle tasks such as setting up simulations, running calculations, and analyzing results, allowing researchers to focus on the scientific aspects of their work.

References
  1. Lejaeghere, K., Bihlmayer, G., Björkman, T., Blaha, P., Blügel, S., Blum, V., Caliste, D., Castelli, I. E., Clark, S. J., Dal Corso, A., de Gironcoli, S., Deutsch, T., Dewhurst, J. K., Di Marco, I., Draxl, C., Dułak, M., Eriksson, O., Flores-Livas, J. A., Garrity, K. F., … Cottenier, S. (2016). Reproducibility in density functional theory calculations of solids. Science, 351(6280). 10.1126/science.aad3000
  2. Bosoni, E., Beal, L., Bercx, M., Blaha, P., Blügel, S., Bröder, J., Callsen, M., Cottenier, S., Degomme, A., Dikan, V., Eimre, K., Flage-Larsen, E., Fornari, M., Garcia, A., Genovese, L., Giantomassi, M., Huber, S. P., Janssen, H., Kastlunger, G., … Pizzi, G. (2023). How to verify the precision of density-functional-theory implementations via reproducible and universal workflows. Nature Reviews Physics, 6(1), 45–58. 10.1038/s42254-023-00655-3
  3. Ganose, A., Sahasrabuddhe, H., Asta, M., Beck, K., Biswas, T., Bonkowski, A., Bustamante, J., Chen, X., Chiang, Y., Chrzan, D., Clary, J., Cohen, O., Ertural, C., Gallant, M., George, J., Gerits, S., Goodall, R., Guha, R., Hautier, G., … Jain, A. (2025). Atomate2: Modular workflows for materials science. 10.26434/chemrxiv-2025-tcr5h