金属学报, 2026, 62(5): 785-802 DOI: 10.11900/0412.1961.2025.00289

综述

矩张量机器学习势模型的发展与应用

陈星秋,1, 王建韬1,2, 刘培涛,1

1 中国科学院金属研究所 沈阳材料科学国家研究中心 沈阳 110016

2 中国科学技术大学 材料科学与工程学院 沈阳 110016

Moment Tensor Machine-Learning Potential: Development and Applications

CHEN Xing-Qiu,1, WANG Jiantao1,2, LIU Peitao,1

1 Shenyang National Laboratory for Materials Science, Institute of Metal Research, Chinese Academy of Sciences, Shenyang 110016, China

2 School of Materials Science and Engineering, University of Science and Technology of China, Shenyang 110016, China

通讯作者: 陈星秋,xingqiu.chen@imr.ac.cn,主要从事合金计算和智能设计研究; 刘培涛,ptliu@imr.ac.cn,主要从事材料电子结构计算和原子模拟研究

责任编辑: 梁烨

收稿日期: 2025-09-28   修回日期: 2025-11-18  

基金资助: 国家自然科学基金项目(52422112)
国家自然科学基金项目(52188101)
新材料重大专项项目(2025ZD0618901)
中国科学院战略性先导科技专项项目(XDA041040402)
辽宁省重大专项项目(2024JH1/11700032)

Corresponding authors: CHEN Xing-Qiu, professor, Tel: 15002491386, E-mail:xingqiu.chen@imr.ac.cn; LIU Peitao, professor, Tel:17696608803, E-mail:ptliu@imr.ac.cn

Received: 2025-09-28   Revised: 2025-11-18  

Fund supported: National Natural Science Foundation of China(52422112)
National Natural Science Foundation of China(52188101)
Advanced Materials-National Science and Technology Major Project(2025ZD0618901)
Strategic Priority Research Program of Chinese Academy of Sciences(XDA041040402)
Science and Technology Major Project of Liaoning Province(2024JH1/11700032)

作者简介 About authors

陈星秋,男,1975年生,研究员,博士

摘要

近年来,基于人工智能的材料计算模拟技术快速发展,机器学习的原子间相互作用势函数是其中最为重要的分支之一。机器学习势函数通过机器学习算法对第一性原理计算数据进行高精度拟合,将其转化为连续可微的数学表达,使其在保持量子力学计算精度的同时显著降低计算成本,有效解决了第一性原理方法难以处理大体系和长时间模拟的瓶颈问题。凭借这一独特优势,机器学习势函数已成为衔接微观原子模拟与宏观材料性能预测的关键桥梁,为材料计算领域带来了革命性机遇。本文聚焦于兼顾精度和效率的矩张量势(MTP)模型,从理论框架、算法优化和应用实践三个层面进行系统评述:深入解析了MTP的数学基础和架构设计原理;详细讨论了其精度提升策略和计算加速技术;通过典型材料体系的模拟案例验证了模型的优越性能,并对其发展趋势进行了展望。

关键词: 原子间相互作用势; 机器学习; 矩张量; 原子模拟

Abstract

In recent years, artificial intelligence-based computational materials modeling has advanced rapidly, with machine learning potentials (MLPs) emerging as a central research direction. By fitting ab initio reference data into continuous and differentiable functional forms, MLPs retain near-quantum-mechanical accuracy while substantially reducing computational cost. This capability alleviates the limitations of ab initio methods in simulations of large-scale systems and long timescales. Consequently, MLPs serve as a critical link between atomistic simulations and macroscopic material property predictions, enabling new possibilities in computational materials science. This review focuses on the moment tensor potential (MTP), which offers an excellent balance between accuracy and computational efficiency. This paper provides a systematic overview from three perspectives: theoretical framework, algorithmic optimization, and practical applications. First, the mathematical foundations and design principles of MTP are analyzed. Next, strategies for improving accuracy and accelerating computation are discussed. Finally, representative case studies on typical material systems are presented to demonstrate the performance of MTP, and future development directions are outlined.

Keywords: interatomic potential; machine learning; moment tensor; atomistic simulation

PDF (3459KB) 元数据 多维度评价 相关文章 导出 EndNote| Ris| Bibtex  收藏本文

本文引用格式

陈星秋, 王建韬, 刘培涛. 矩张量机器学习势模型的发展与应用[J]. 金属学报, 2026, 62(5): 785-802 DOI:10.11900/0412.1961.2025.00289

CHEN Xing-Qiu, WANG Jiantao, LIU Peitao. Moment Tensor Machine-Learning Potential: Development and Applications[J]. Acta Metallurgica Sinica, 2026, 62(5): 785-802 DOI:10.11900/0412.1961.2025.00289

从最初依赖经验、观察和实验研究,到热力学定律等以数学建模为特征的理论研究,到结合材料理论、物理、化学等交叉学科的计算模拟研究,再到以材料高通量实验、计算和表征以及数据库为基础的材料基因工程,每一次进展都拓展了我们对材料学领域的认知边界。如今,蓬勃发展的人工智能方法已然成为推动材料研究的新范式[1],旨在通过机器学习等人工智能手段,深度挖掘材料成分-工艺-组织-性能之间的复杂关联,显著加速传统材料的改良,以及新材料、新性能的发现和设计。

在材料的计算模拟研究中,原子尺度的计算模拟是连接微观结构与宏观性能的关键桥梁[2~4]。然而,该领域长期面临着核心挑战:计算精度与效率的“两难困境”。一方面,基于第一性原理的计算方法虽然精度高,但其高昂的计算成本将其应用限制在数百个原子和皮秒级的时间尺度内;另一方面,诸如嵌入原子势(embedded atom method,EAM)[5,6]、修正嵌入原子势(modified embedded atom method,MEAM)[7]、Tersoff[8]等经验力场,虽然能够实现百万甚至上亿原子体系的纳秒级模拟,但其往往采用基于物理近似的简化模型,导致精度和适用性存在固有局限。此外,这类力场的构建过程依赖对体系物理特性的先验认知,难以实现普适性应用。

为解决上述瓶颈问题,研究者将数据和智能驱动新范式思想与传统的计算模拟深度融合,催生了机器学习势(machine learning potential,MLP)函数,又称作机器学习力场(machine-learned force field,MLFF)。其核心思想不再是依赖物理直觉去构建固定的函数形式,而是借助灵活的机器学习模型(如神经网络),直接从海量的第一性原理计算数据中“学习”高维、高精度的势能面(potential energy surface,PES)。这种数据驱动的方法为原子间势函数的开发带来了革命性突破,有效解决了计算模拟中长期存在的精度与效率不可兼得的难题。

回顾MLFF的发展历程,其早期的探索可追溯到20世纪90年代,研究人员[9~14]尝试使用神经网络拟合小分子体系的势能面。然而,这些早期模型通常采用体系的整体坐标作为输入,导致模型仅限于特定的小体系,缺乏处理原子置换不变性的内在机制和向大体系迁移的能力。2007年,Behler和Parrinello[15]在其开创性的工作中提出了一种高维神经网络势(high dimensional neural network potential,HDNNP),标志着MLFF发展的范式转变。他们提出的两大核心思想——“局域性假设”(将体系总能量分解为各原子能量贡献之和)和原子中心对称函数(atom-centered symmetry function,ACSF)描述符,成功解决了早期方法的根本局限。ACSF将每个原子的局部化学环境编码为满足平移、旋转及同类原子置换不变性的固定维度特征向量,从而攻克了维度灾难和迁移性难题,使MLFF从理论探索正式迈入大规模实际应用阶段。MLFF的主要构建流程见图1,包括构建训练结构集合,对训练集进行第一性原理计算以得到其能量、力、应力张量等信息,选择合适的回归模型进行拟合,以及验证模型的准确性及泛化性能。

图1

图1   机器学习力场构建流程

Fig.1   Workflow of machine-learned force field (MLFF) construction (HEA—high-entropy alloy; F(x*)—atomic energy; x1-xn —reference local configurations; x*—local configuration to be predicted; ω1-ωN —model weights; K—kernel function; RC1 and RC2—coordinates; G1 and G2—local structure descriptors; γji and aijij—neural network model parameters)


在Behler-Parrinello框架的启发下,MLFF的发展分化为两条主流技术路线。第一条路线致力于改进和发展更优的局域环境描述符和回归模型。在此路线中,基于核函数的方法取得了巨大成功,特别是Bartók等[16,17]发展的Gaussian近似势(Gaussian approximation potential,GAP),其采用的平滑原子位置重叠(smooth overlap of atomic positions,SOAP)描述符[18]已成为局域结构分析的最重要工具之一。Jinnouchi等[19,20]对该模型进行重要改进,通过在第一性原理分子动力学模拟中引入基于Bayesian力误差的实时主动学习机制,显著提升了训练集构建的智能化程度。除基于核函数的方法外,Shapeev[21]提出的基于线性回归的矩张量势(MTP)模型,Thompson等[22]提出的基于线性回归的SNAP (spectral neighbor analysis potential)模型,Wood和Thompson[23]提出的二次多项式回归的qSNAP (quaderatic spectral neighbor analysis potential)模型,以及Drautz[24]提出的原子团簇展开(atomic cluster expansion,ACE)模型,均以兼备高精度和高效率而获得了巨大成功。此外,刘智攀团队基于ACSF类型描述符开发了指数型描述符(power type structural descriptors,PTSDs),开发了LASP软件[25,26],在催化[27,28]、结构搜索[29,30]等领域获得了广泛应用。Behler团队[31~33]也进一步发展了他们构建的HDNNP模型,先后提出了四代模型,进一步引入电荷转移,整合了长程的Coulomb相互作用及磁矩等物理量。

第二条技术路线则寻求摆脱人为设计描述符的束缚,发展“端到端”的深度学习模型,让网络直接从原子坐标和种类中自动学习特征表示。深度势能(deep potential,DP)模型[34,35]是这一路线的杰出代表,它采用一个可训练的嵌入网络生成描述符,并在凝固、相变等诸多体系中取得了巨大成功[36~38]。图神经网络(graph neural network,GNN)模型则是该路线的另一类重要范式。其核心思想是将分子或材料结构表示为图,其中原子作为节点、化学键或邻近关系作为边,再通过迭代的信息传递和聚合机制更新节点特征,从而捕捉复杂的局域和非局域相互作用。典型的消息传递神经网络(message passing neural network,MPNN)框架[39]为后续工作奠定了基础,它通过在节点间交换“消息”,实现原子间相互作用的建模,并能够自然满足平移、旋转和置换等物理对称性。此方面开创性的工作为2018年Schütt等[40]提出的SchNet,并进一步发展出了PhysNet[41]、DimeNet (directional message passing neural network)[42]等一系列改进版本,这些方法通过设计更精细的消息函数或结合角度与三体信息进一步提升了准确性和泛化性。

近年来,MLFF的发展前沿聚焦于将物理知识嵌入模型架构。等变神经网络(equivariant neural network)为这一趋势的典型代表,其通过在网络结构中内置旋转等变性,显著提升了模型的数据利用效率和预测精度。这方面的代表性工作包括PaiNN[43]、NEQUIP[44]、Allegro[45]、MACE[46]、EquiRANN[47]、BOTNet[48]等,均展现出优于传统方法的性能。另一方面,针对长程相互作用建模的挑战,CACE-LES (cartesian atomic cluster expansion with latent Ewald summation)等模型[49]通过引入隐式Ewald求和技术,在不依赖显式电荷模型的前提下成功实现了对Coulomb、Van der Waals等长程相互作用的描述,并进一步拓展至材料极化性质的建模,为复杂材料体系的精确原子模拟提供了新的技术路径。

上述的机器学习势函数模型针对特定体系构建,开发和验证过程需要大量的人力和计算资源。针对该问题,机器学习势函数发展的另一个重要方向在于构建能够涵盖周期表元素的通用大模型(universal/foundation models)。此类模型旨在通过统一架构,实现对多种材料体系的普适性建模。借助Materials Project (MP)[50]等公开数据库中丰富的第一性原理数据,研究人员成功开发出诸如MEGNet[51]、M3GNet[52]、CHGNet[53]、GNoME[54]等一系列通用势函数模型。在DP模型基础上发展的DPA-1[55]和DPA-2[56]等模型进一步引入了注意力机制和预训练策略,显著增强了模型的泛化能力。基于MACE框架构建的MACE-MP-0模型[57],具备直接用于定性或半定量模拟的能力,也可作为基础模型,通过在特定任务数据上进行微调以达到更高的精度要求。此外,Xie等[58]开发的GPTFF (graph-based pre-trained transformer force field)模型将Transformer架构引入了模型中,进一步提升了模型的表达能力。典型的机器学习势函数模型发展过程见图2

图2

图2   典型的机器学习势函数发展过程

Fig.2   Evolution of typical MLFF models (NN—neural network, MPNN—message passing neural network, KRR—kernel ridge regression, ETN—equivariant tensor network, HDNNP—high dimensional neural network potential, MSA—monomial symmetrization algorithm, GAP—Gaussian approximation potential, SNAP—spectral neighbor analysis method, MTP—moment tensor potential, Amp—atomistic machine-learning package, GDML—gradient domain machine learning, PROPhet—PROPerty prophet, DTNN—deep tensor neural network, qSNAP—quaderatic spectral neighbor analysis method, DP—deep potential, ACE—atomic cluster expansion, FCHL—Fabe-Christensen-Huang-Lilienfeld, LASP—large-scale atomistic simulation with neural network potential, EANN—embedded atom neural network, AIMNet—atoms-in-molecules Net, PIP—permutationally invariant polynomial, PaiNN—polarizable atom interaction neural network, RANN—rapid artificial neural network, NequIP—neural equivariant interatomic potential, NEP—Neuro evolution potential, GemNet—geometric message passing neural network, PFP—PreFerred potential, ml-ACE—multi-layer atomic cluster expansion, BIGDML—Bravais-inspired gradient domain machine learning, REANN—recursively embedded atom neural network, HermNet—heterogeneous relational message passing network, BOTNet—body-ordered-tensor-network, ETNs—equivalent tensor networks, PESPIP—potential energy surface PIP, GNoME—graph networks for materials exploration, EDDP—ephemeral data derived potential, CHGNet—crystal Hamiltonian graph neural network, ALIGNN-FF—atomistic line graph neural network-based force field, AIMNet2—2nd generation of atoms-in-molecules neural network potential, ViSNet—vector-scalar interactive graph neural network, SEGNN—spindistance edge graph neural network, MB-PIPNet—ManyBody PIPNet, HotPP—high-order tensor message passing interatomic potential, GPTFF—graph-based pre-trained transformer force field, EquiREANN—equivalent REANN, CAMP—Cartesian atomic moment machine learning interatomic potential, CACE—Cartesian atomic cluster expansion)


在第一类基于局域结构描述符的模型中,基于对称多项式构建的模型因其数学上的完备性和形式上的简洁性而备受关注。其中,MTP模型采用一组完备的对称多项式对原子环境进行描述,可视为传统多项式不变势(permutationally invariant polynomial,PIP)模型在局域描述框架下的重要发展。相较于其他常见的MLFF模型,MTP模型在精度与效率之间取得了优异的平衡。Han等[59]在C和S体系中,比较了包含2体、3体项的余弦特征(MTP、ACSF、Bispectrum、DP-Chebyshev、DP-Gaussian及ACE等描述符),发现在采用相同描述符长度和线性回归模型的条件下,MTP描述符所构建的线性模型拟合精度最优。进一步,Zuo等[60]针对Ni、Li、Si、Cu、Mo、Ge等多种材料体系,系统评估了MTP、NNP、GAP、SNAP、qSNAP等主流势函数模型的精度及计算效率,评估结果如图3[60]所示。可见,MTP在保持同等精度的前提下,具备更优的计算效率,其数据点紧贴Pareto边界,展现出模型精度与计算效率的优异平衡。值得强调的是,MTP模型与其他多项式形式的模型存在深刻的联系。Drautz[24]和Dusson等[61]证明了MTP模型与ACE模型之间存在等价性。除此之外,MTP模型也可以看作Hodapp和Shapeev[62]提出的等变张量网络(equivariant tensor network,ETN)模型的特殊形式。

图3

图3   各种势函数模型的计算精度和效率评估[60]

Fig.3   Assessment of computational accuracy and efficiency for various MLFFs[60] (JMax—maximum angular momentum channel in the SNAP model, MD—molecular dynamics)


本文聚焦Shapeev[21]提出的MTP势函数模型,回顾了MTP势函数模型的理论基础,介绍了基于MTP的模型发展,以及该模型在一些典型材料科学体系中的应用,最后对MTP模型未来的发展方向进行展望。

1 矩张量模型

矩张量模型由Shapeev团队[21,63,64]提出,该模型具有对称多项式形式,在同等精度下具有较高的计算效率。模型采用矩张量(moment tensor)描述符来描述原子周围的局部结构。对于编号为i的原子的局部环境,其矩张量描述符Mμ,νni可表示为[21]

Mμ,νni=jfμrij, zi, zjrijν timesrij

式中,为两个矢量的外积,ni表示i原子的局域环境,μ为径向基函数编号,ν为矩张量的阶,rij为中心原子i到原子j的距离矢量,zizj为原子类别标号,fμ为径向分布函数。由 式(1)可见,Mμ,νν阶对称张量。fμ具有如下形式:

fμrij, zi, zj=kcμ,zi,zjkQkrijfcrij

式中,Qk为径向基函数,通常可选择第一类Chebyshev多项式;fc为平滑截断函数,在截断半径平滑归零;cμ,zi,zjk为径向函数中的待拟合参数;k为Chebyshev多项式级数。若将f视作原子的某种性质,譬如“质量”,那么可以对Mμ,ν进行如下理解:Mμ,ν可视作原子性质分布关于中心原子的ν级矩。例如,当ν=0时,有:Mμ,0=jfμ|rij|, zi, zj,反映了所有原子属性的加和,即统计学中的“零级矩”。当ν=1时,Mμ,1=jfμ|rij|, zi, zjrij,反映了f分布的期望和。当ν=2时,Mμ,2=jfμ|rij|, zi, zjrijrij,反映了f分布相对于中心的标准差。同理,更高阶的矩张量可以看作是f更细致的统计分布情况。

为解决张量形式下的参考坐标系依赖性问题,Shapeev[21]提出通过张量缩并构造旋转不变的标量基组Bα。由于矩张量具有全对称性:

ms1, s2, , sn=mσs1, s2, , sn

式中,m为矩张量M的分量元素,si为指标,σ表示任意指标置换。故对两个矩张量缩并相同数量的任意指标得到的结果都是相同的,且剩余指标仍然满足指标交换的不变性,换言之,对一组矩张量,缩并操作的顺序不影响缩并的结果。现考虑n个矩张量构成的集合{Mμi,νi},缩并至标量的规则可以通过一个n×n的对称矩阵(α)表示:

α=0α12α1nα12α2nα1nα2n0

式中,αij为第i与第j个矩张量间缩并αij个指标。由于指标间具有交换不变性,张量间缩并任意指标结果相同,且由于对ν1ν2阶张量缩并p个指标后得到的张量阶数为ν1+ν2-2p,当α满足jαij=νi时,缩并结果必定为标量,故α标记了矩张量集合{Mμi,νi}缩并为标量的规则。这里举一个简单的例子,考虑三个矩张量{M1,3, M2,2, M3,1},以及α=021200100,那么在该指标下,标量基组为Bα=M1,3ijkM2,2ijM3,1k Einstein求和约定。

从上面讨论可以看到,Bα是关于坐标的高次对称多项式。Shapeev[21]已经证明,对于所有不等价的α{Bα}具有完备性,通常而言,采用线性回归的方式构建机器学习势函数即可获得良好的精度:

U=iαξαBαni

式中,U为体系势能,ξα为线性拟合参数。

选择一组合适的标量基组{Bα}对保证势函数精度及效率至关重要。在Shapeev团队[65]提出的方案中,定义Mμ,ν的层级(level)为levMμ,ν=wμμ+wνν+wo,其中wμwνwo为各项权重。对于由矩张量组{Mμ1,ν1, , Mμn,νn}构成的Bα,其层级取各张量层级之和levBα=inlevMμi,νi,并采用预设阈值d筛选满足levBαd的标量基函数。Shapeev团队[65]通常选择wμ = 4、wν = 1、wo = 2;但Wang等[66]研究表明,wμ = 2、wν = 1、wo = 1可能是更优选择。

2 矩张量模型的优化和改进

首先讨论矩张量模型的计算过程。如果直接计算张量缩并,由于两个张量间缩并l个指标时,需要计算的求和数为3l,呈现指数增长。Shapeev[21]针对该问题提出了一种创新性的解决方案。如图4[66]所示,该方案将缩并过程抽象为树形结构:从矩张量缩并定义的标量函数出发,递归地将张量分解为两个中间张量的缩并运算,直至中间张量分为两个矩张量缩并为止。具体而言,对于一个Bα,将其分解为两个中间张量BϕBη的张量缩并的过程可用下式表示:

α=γ'ϕϕTη'
γi,i'=γi,i-jϕi,j, ηi,i'=ηi,i-jϕi,jT
γi,j'=γi,j,  ηi,j'=ηi,j    ij

式中,γ'η'分别是指标矩阵α的对角子块,其维度分别为kγ×kγkη×kηϕα的非对角子块,维度为kγ×kη。对于非标量指标α,其非零对角元αii表明矩张量Mi中有αii个指标未参与缩并运算,而非对角元αij则代表MiMj之间需缩并αij个指标。特别地,矩阵ϕ中的非零元素ϕi,j表示BγMi继承的指标与BηMj继承的指标之间需缩并ϕi,j个指标。通过αγηϕ这些指标,可定义从BγBη生成Bα的缩并规则。这种分解方法不仅明确了缩并运算的顺序,还保留了所有中间结果。值得注意的是,不同标量函数的分解过程可能共享相同的中间张量。因此,通过预计算并复用这些共享的中间张量,可以大幅降低整体张量缩并运算的计算开销。

图4

图4   标量基组BαBβ分解路径示意[66]

Fig.4   Decomposition routines of scalar basis functions Bα and Bβ[66] (Bη1-Bη5—intermediate tensors, M1-M6—moment tensors)


需特别说明的是,Bα的分解方式并非唯一。对于一组选定的标量函数集合{Bα},各函数采用不同分解策略会导致不等价中间张量数(nb)和规则应用次数(nt)的差异,从而显著影响模型训练和实际应用的计算效率。由于可能的分解路径组合数量极其庞大,在实际计算中无法穷举所有情况。以Podryabinkin和Shapeev[63]定义的28阶矩张量为例,通过统计基组中各标量函数的独立分解方式总数可知,其可能的分解路径组合超过102195种。Shapeev[21]提出的优化准则指出:应最小化分解过程中的总维度累加值,同时最大化中间张量复用率。虽然该准则通常能得到较优解,但当存在多个分解路径具有相同维度累加值或中间张量时,可能导致选择歧义,因而无法严格保证全局最优性。

针对上述问题,Wang等[66]采用遗传算法为每个标量函数确定最优分解路径。具体实施步骤如下:首先,为每个Bα枚举所有可能的计算树结构,形成该函数对应的“计算树集合”{Tiα}。通过为每个Bα分配一个选择基因p (指定选用集合中的第p棵树),标量函数集的完整分解方案可由基因序列{p1, p2, , pn}表征,该序列构成遗传算法中的一个个体。优化目标是最小化不等价张量分量数(ϑ)和乘法规则调用次数(ς),为此构建成本函数(F(R)):

FR=ϑ+3ς

式中,R为特定分解方案。在 式(9)中对ς赋予3倍权重,源于基组规模较大时乘法规则的计算开销占主导地位。基于此,定义个体的适应度函数(f(R))为Sigmoid型表达式:

fR=1-11+exp-κFR-FRmin+c

式中,FRmin为当前FR的最小值,并在优化过程中动态更新;κc为调控常数。该函数通过动态调整选择压力,确保算法能高效收敛至近优解。具体优化流程如下。(1) 根据给定的约束条件,包括考虑的多体相互作用,矩张量的最大阶,径向函数的数量,生成对应的标量基组,并依据参与缩并运算的矩张量数目升序排列。(2) 为各标量函数建立全部计算树组成的搜索空间。(3) 初始化种群个体。采用加权评分机制筛选优质初始解:当某计算树满足维度总和最小化准则时,赋予基准分10分;中间张量复用每次计1分;张量对称分解额外奖励1分。优先选择总分最高的分解树作为初始候选方案。(4) 遗传操作生成新种群:变异操作随机替换单个标量函数的计算树;交叉操作交换两个个体间的部分计算树序列。(5) 通过适应度函数评估个体优劣,实施精英保留策略。(6) 迭代执行步骤(4)和(5),直到满足最大迭代限制或适应度变化达到收敛阈值。

Wang等[66]研究表明,该算法针对复杂基组可显著减少中间计算项,并提升近一倍的计算效率。在Ni-Al二元体系上的测试表明,在相似的基组大小下,wμ = 2、wν = 1、wo = 1策略相较于Shapeev团队[65]采用的wμ = 4、wν = 1、wo = 2策略,在相同精度前提下的性能提升可超过一个数量级[66]。Wang等[66]已将上述算法植入到了软件包IMR-MLP中。

3 基于主动学习的训练集生成

MLFF的精度和泛化能力不仅取决于模型本身的复杂度,完备的高质量训练集同样至关重要。基于对体系的先验认识,从而手动构建训练集的方法难以完备地覆盖相空间,如尚未了解的亚稳相、相变路径中的过渡态等结构,导致所拟合的势函数泛化能力较差。采用主动学习(active learning)方法对势函数及训练集进行迭代优化是目前解决此问题最为常用的策略[67~69]。对线性参数化的MTP模型而言,一种常见的采样策略为Shapeev团队[63,64]提出的广义D-最优化(D-optimality)准则。

下面简要介绍一下D-最优化准则。对于具有线性回归形式的势函数,将 式(5)改写为U=iφξibi,其中φ为基组大小,设计矩阵的元素bi=kNBinkN为体系大小。假设已有一个训练集XTS={x1, , xn},可构建矩阵B,其中每一行为b1xi, , bφxi。为降低计算量,首先在B中选出包含φ行样本的最大线性不相关的子矩阵AR φ×φ,同时可以得到对应训练集的子集XTS'。对于一个新样本x*,用其依次替换XTS'中的每一个样本,并计算γk=detAk / detA (其中,γ表示矩阵的行列式,Ak为替换第k个样本后的矩阵),选择其中最大值定义为外推度γx*=max1kφγk (其中,γx*为新样本x*的外推值)。对于γx*>1的情况,可视作当前模型对结构x*是外推的,即可标记该结构,进行第一性原理计算并更新训练集。同时,x*将替换掉对应的训练集结构xk以供进一步的主动学习选择。在实际计算中会设定一个阈值γthr,该值常用的取值范围为2~5,当γx*>γthr时,标记该新结构。图5为结合主动学习采样方法的MTP模型开发流程。其核心是一个迭代过程:通过在相空间中不断采样来扩充训练集,直至模型收敛,即没有新的结构被选中。随后,将基于此收敛后的训练集重新训练,以得到最终的MTP模型。在很多情况下,例如研究点缺陷、界面、离子迁移等情况时,选择外推程度高的局域环境更为重要。在此情形下,上述过程同样适用,只需将对应整体结构的bi展开为局域结构的Bink即可。为使采样过程能够充分遍历高能亚稳相及过渡态,还可进一步结合增强采样方法[70~74]辅助生成完备的训练集。

图5

图5   结合主动学习采样方法的矩张量势(MTP)模型开发流程

Fig.5   Workflow of developing a (MTP) model combined with active learning sampling method (γ(x*)—extrapolation grade for configuration x*, γthr—grade threshold)


为凸显主动学习的高效性,Gubaev等[64]基于QM9公开数据集[75],分别采用随机采样和主动学习策略构建了相应的MTP势函数模型。结果表明,基于主动学习训练集的MTP模型,其均方根误差和最大误差均显著低于随机采样模型(图6ab[64])。此外,从焓值的预测结果(图6cd[64])来看,主动学习模型未出现随机采样模型中的异常值,表明其泛化能力更强。

图6

图6   MTP模型在QM9数据集上的主动学习与随机选择策略的效率和性能比较[64]

Fig.6   Efficiency and property comparisons of active learning vs random sampling for training a MTP model on the QM9 dataset[64] (a, b) root-mean-square (RMS) error (a) and maximal error (b) as a function of training set size (c) parity plot of MTP-predicted enthalpy (HMT) vs reference enthalpy (Href) for a model trained with random selection (d) parity plot for a model trained with active learning


4 MTP模型在不同体系中的应用

自提出以来,MTP模型已在材料科学的诸多子领域得到成功验证。本节从结构搜索、金属和合金、固-固相变、表面和界面现象等几个典型方面,系统回顾MTP模型的代表性应用研究,以展示其强大的计算模拟能力和广泛的适用性。

4.1 结构搜索

基于第一性原理的结构搜索受限于计算成本,通常仅能处理至多数十原子规模的体系,而机器学习势函数的应用则有效突破了这一限制。目前,MTP模型已在USPEX[76],MAGUS[77]等结构搜索软件中应用。Podryabinkin等[78]将MTP模型与USPEX软件及主动学习策略相结合,提出了一种动态构建势函数的高效搜索框架。该框架在结构搜索过程中通过主动学习不断优化势函数,并将其应用于单质C、B及高压Na体系的结构探索。该方案能够高效筛选出基态附近的稳定构型,并成功发现了一种由54个原子组成的新型硼晶体结构。Gubaev等[79]将该方案应用于Cu-Pd、Co-Ni-V、Al-Ni-Ti等合金体系,高效地搜索凸包附近的结构,并在Al-Ni-Ti体系中成功预测了AlNi11、Al4Ni8、AlNi9Ti2三种新的热力学稳定结构。

4.2 金属和传统合金

MTP模型已经广泛应用于金属和合金体系的模拟中。Novoselov等[80]利用MTP模型计算了Al、Mo、Si中的空位扩散行为,其结果与密度泛函理论(DFT)结果高度吻合。Shapeev等[81]利用MTP模型研究了bcc-Ti的弹性不变效应。该工作计算了900~1700 K下bcc-Ti的弹性常数,发现该温度区间内bcc-Ti弹性模量的温度依赖性极弱,展现出单质金属中独特的Elinvar效应(弹性不变效应)。Mukhamedov等[82]β-Ti9 - x Nb x Zr6体系中使用MTP模型研究了弹性性能,预测了该体系在宽温域内的Elinvar效应。hcp-Zn具有异常的c / a比(其中,ca为晶格常数),这一点在经验势函数中难以体现。Mei等[83]采用MTP模型准确复现了该特性,并进一步研究了Zn中不同晶相的相对稳定性、弹性性能、缺陷特性及熔体结构。

对于以W为代表的难熔元素,其在高温下强烈的晶格非谐性对传统模拟方法构成挑战。Forslund和Ruban[84]采用基于Langevin动力学的两阶段升采样热力学积分(TU-TILD)方法,利用MTP作为中间模型,以DFT精度实现了金属W七个不同晶面在温度达到熔点前的表面自由能的高效计算。计算结果表明,对于更开放的表面,非谐效应在3000 K时显著增强,致使表面能各向异性随温度升高而减弱,而最密排(110)表面非谐性较弱。计算得到的Wulff形状和表面能数据在接近熔点时与实验结果高度吻合。Forslund等[85]采用基于上述方法改进的直接升采样方法进一步研究了V、Ta、Mo和W四种bcc结构难熔金属直至熔点的热容、热胀系数、体模量等热力学性质。结果表明,这四种元素电子贡献(电子热激发及电子-振动耦合)及非谐效应贡献均十分显著,并可结合电子结构计算阐释第V族和VI族难熔金属的热力学性质变化规律。基于熔点构建的同源温度标度分析表明,计算得到的同源温度依赖性与实验数据高度吻合,证明了采用MTP作为中间模型的直接升采样算法在第一性原理精度的热力学性质计算方面的能力。Srinivasan等[86]同样利用MTP作为中间模型,采用直接升采样方法计算了Nb、Mo、Ta和W四种难熔金属的显式非谐自由能和熵值。结果表明,四种元素的显式非谐性差异并非源于高温行为,而是源自其基态状态,即温度效应会使得Mo和W的bcc结构失稳,从而对于Nb和Ta则起到稳定作用。Zhu等[87]利用MTP模型结合TU-TILD方法研究了V、W及无序VW合金的熔化特性,发现热振动对电子态密度具有显著影响,进而显著改变电子自由能贡献。该效应在W中表现明显,在V中则被组态熵抵消,在VW合金中该效应较W稍弱。该结果表明,电子自由能贡献对于难熔合金热力学性质影响不可忽视。Jung等[88]采用相同的方法,以DFT精度计算了fcc结构的Nb、Ni、Al及hcp-Mg单质基态至熔点的平衡态热力学性质。Zhang等[89]提出了用于计算包含热激发效应的空位扩散过渡态Gibbs自由能的过渡态热力学积分(TSTI)方案,并采用MTP模型作为全非谐性的模型,研究了bcc-W中非谐性对温度依赖的空位形成和迁移Gibbs自由能的影响,计算结果与实验结果高度吻合,成功解释了W自扩散非Arrhenius行为的物理起源。上述研究均采用了升采样方法[90],其中MTP模型以其精度和效率兼备的特点,成为该类型方案中中间参考态计算的不二之选[91]

合金中的缺陷对其性能具有关键影响。因此,势函数能否精确模拟这些缺陷的性质,是评估其有效性和可靠性的核心指标。Zotov等[92]构建了bcc-Nb体系的MTP模型,该模型不仅精确复现了DFT计算的位错核心结构及位错迁移的Peierls能垒,还精确描述了位错易核能量及临界分切应力(CRSS)的孪生-反孪生不对称性,充分展示了MTP模型对位错缺陷的建模能力。在辐照损伤模拟方面,Wang等[93]针对bcc-Fe的辐照损伤问题设计了混合MTP模型。该模型通过光滑的切换函数,将适用于极短程碰撞的Ziegler-Biersack-Littmark (ZBL)核排斥势、描述短程相互作用的MTP-S势以及描述中长程相互作用的MTP-L势进行了无缝的跨尺度耦合,实现了从高能级联碰撞到缺陷演化的全过程精确模拟。Kwon等[94]采用MTP和路径积分分子动力学(PIMD)模拟方法,系统研究了三种bcc金属(Nb、Fe和W)中的H扩散行为。在合理温度范围内,其计算的扩散系数和观测到的同位素效应均与实验数据高度吻合,证实了MTP模型在准确表征bcc金属中的H扩散动力学方面的可靠性。Kumar等[95]利用MTP模型研究了H在C15立方及C14六方TiCr2H x (0<x6) Laves相中的吸/放氢机制。该研究采用盆地跳跃Monte Carlo (BHMC)模型确定了不同H浓度下的H构型、形成焓及H有序排列,揭示了间隙位点的H占位规律。该工作展现了MTP模型在描述复杂合金相与H相互作用方面的能力。

Wang等[66]针对Ni-Al二元体系,采用两阶段主动学习策略构建了高精度的MTP势函数模型,该模型使用优化后的MTP基组,在保持高精度的同时也具备极高的计算效率。该模型可以高精度地预测Ni及Ni3Al相的弹性常数、各类点缺陷及点缺陷团簇形成能/结合能、界面、广义层错能、声子谱等基本物理性质,且预测精度显著优于现有的EAM等经验势函数。基于此势函数,该工作系统研究了镍合金中不同大小团簇的稳定构型及空位团簇形成能/结合能随团簇大小的变化关系,并研究了堆垛层错四面体(SFT)类型空位团簇的扩散过程。分子动力学(MD)模拟结果表明,六空位SFT在小空位团簇中具有更高的稳定性和更高的迁移速率,并表征了SFT迁移原子尺度上的动力学过程(图7c[66])。Xu等[96]研究了Ni3Al中的反常屈服行为,利用MTP模型高效地计算了包含非简谐效应的复杂堆垛层错(CSF)滑移的自由能面,发现了Ni3Al中交滑移过程能垒随温度升高而增大的现象,并揭示了该现象主要来自非谐振动(图7a[96])和纵向的自旋涨落。Xu等[97]进一步考虑了Ni3Al体系的屈服应力反常现象,利用MTP势函数研究了Ni3Al中的位错行为,观测并分析了Kear-Wilsdorf锁(KWL)的形成与解锁过程(图7b[97]),发现其解锁应力呈现显著的温度依赖性。该发现与现有解析模型中普遍认为解锁过程非温度依赖的基本假设相矛盾,不仅为理解KWL的形成和解锁机制提供了新的原子尺度视角,也为揭示屈服应力反常现象的物理起源提供了更深入的理论依据,同时充分展现了MTP模型在描述复杂位错核心结构和热激活过程方面的强大能力。

图7

图7   MTP模型在Ni-Al合金体系中的应用[66,96,97]

Fig.7   Applications of MTP model on Ni-Al alloys

(a) distributions of projected first-nearest-neighboring pairs for both the complex stacking fault (CSF) and bulk Ni3Al at 1454 K[96] (d—displacement)

(b) temperature-dependent critical stresses to unlock a Kear-Wilsdorf lock (KWL)[97] (τ1, τ2—critical stresses for unlocking the KWL)

(c) migration of stacking fault tetrahedra (SFT) cluster in Ni[66]


Fe、Co、Ni等金属元素具有复杂的磁性特征,但常规的MTP模型中并没有包含磁矩。对此通常的处理方式为直接拟合自旋极化的第一性原理计算结果,但该方式无法应对晶格结构相同但磁组态不同的情况(如fcc-Fe的高自旋、低自旋、铁磁、反铁磁、顺磁)。针对此问题,Shapeev课题组[98~100]进一步将MTP模型扩展到共线磁性体系的研究中,将MTP模型拓展为包含磁性基组的mMTP模型。该模型将径向函数部分,即 式(2)进一步扩展为:

fμrij, zi, zj, si, sj=
kβγcμ, zi, zjk, β, γQkrijψβsiψγsjfcrij

式中,s为原子磁矩,ψ为磁基组,同样可选择Chebyshev多项式形式。模型其余部分与MTP仍保持一致。Kotykhov等[100]研究进一步表明,拟合mMTP模型时,在能量、力、应力张量的基础上进一步考虑“磁力”,即能量(E)对磁矩的负梯度𝒯=-E / s。相对不考虑𝒯的情况,虽然其没有显著降低能量、力和应力的均方根误差,但可以有效提高mMTP模型的稳定性,使得mMTP模型对结构弛豫后得到的平衡磁矩结果与DFT更为吻合,在不同磁组态下计算的形成能及晶格常数等物理量的预测也更为准确。在Fe单质体系中,该模型准确地复现了DFT计算的铁磁、反铁磁、顺磁态下bcc-Fe的声子色散关系,以及三种磁组态下bcc、fcc和hcp相各自的能量-体积曲线以及磁矩-体积关系[98]。Kotykhov等[99]进一步通过约束密度泛函理论(constrained DFT,cDFT)方法考虑非平衡磁矩构型。在Fe-Al二元合金体系中,该工作选择了23种结构原型,并对其施加结构、晶格及磁矩的微扰,构建了包含2012个数据的训练集,在此基础上拟合了mMTP模型。该模型成功复现了实验观察到的Fe-Al体系中反常的体积-成分依赖关系[101],充分展示了mMTP模型在复杂磁性合金体系中的准确描述能力和应用潜力。

在先前的案例中,电子自由能仍需通过DFT计算得到。进一步,Srinivasan等[102]将电子温度效应纳入了MTP框架中,将MTP模型扩展为eMTP模型。在eMTP框架下, 式(5)变为:

UeMTP , Tel=iEiMTPPiTel

式中,UeMTP为体系势能,为体系坐标,Tel为电子温度,PiTel为在[0, Telmax]上的Chebyshev多项式(其中,Telmax为最大电子温度),EMTP为经典的MTP模型。在构建用于该模型的训练集时,需给出不同电子温度下的第一性原理计算结果。相较于经典模型,利用eMTP可获得电子自由能,以及原子振动与电子激发的耦合能,且各贡献项可独立解析。Srinivasan等[102]将模型用于Nb和TaVCrW两种难熔体系,并准确复现了DFT的热力学性质计算结果。

4.3 高熵合金

高熵合金(HEA)因其近乎无限的成分空间而具有广阔的性质可调控性,但这种复杂性使得原子间势函数的开发极具挑战性。本节针对MTP模型在高熵合金体系的应用(图8[103~105])进行综述。Gubaev等[103]对比了Gaussian矩神经网络(GM-NN)与MTP模型在TaVCrW体系中的性能,在成分、构型及振动自由等多个方面展开分析(图8a[103]),发现两者具有相当的预测能力,但MTP模型在训练规模收敛速率方面表现更优。Jafary-zadeh等[106]采用MTP模型揭示了Co-Fe-Ni三元多主元合金中,静态/动态晶格畸变、热膨胀及化学短程有序(SRO)对弹性性能的耦合影响机制。Gubaev等[107]利用MTP模型研究了不同Ta浓度下,bcc-TiZrHfTa x 体系在有限温度下的结构-弹性相关关系,发现合金弹性性能与结构ω相稳定性之间存在强关联。基于该机理,Gubaev等[107]在宽温度域内系统搜索了合金的成分,成功找到了具有Elinvar效应的合金成分,为设计弹性温度不变的多组分合金开辟了新途径。Hodapp和Shapeev[108]开发了一种随机合金的采样方法,并利用MTP模型计算了MoNbTa中熵合金的1/4[111]非稳态层错能。Zhu等[109]利用MTP模型结合直接升采样技术准确计算了TaVCrW合金的熔点、熔化熵、熔化焓及熔点处体积变化等熔化特性,同时获得了固态和液态TaVCrW的热容数据。如图8b[104]所示,Shuang等[104]采用MTP模型研究了MoNbTaW体系的强化机制,发现在超过尺寸阈值时,强化主要来源于位错运动过程中的固溶强化效应,而在临界尺度下以位错形核和晶界变形为主。Grabowski等[91]首次将TU-TILD方法与MTP势函数结合,以VNbMoTaW高熵合金体系非谐自由能计算为例验证了该方案的优越性能。

图8

图8   MTP模型在高熵合金体系中的应用[103~105]

Fig.8   Applications of MTP model in HEAs

(a) RMS errors in predicted energies and forces for the MTP model evaluated on the test data for different in-distribution subsystems of Ta-V-Cr-W[103]

(b) scale and plasticity mechanism-dependent strengths of MoNbTaW and W in polycrystal and single-crystalline nanopillar[104] (dc, dc1-dc3—critical values; MPEA—multi-principal element alloy)

(c) kink nucleation mechanism and the observation of cross-slip locking[105] (d110—interplanar spacing of the (110) plane)


难熔高熵合金(RHEA)具有优异的高温强度,合金中刃型位错和螺型位错对其塑性变形具有重要影响。Yin等[105]利用MTP模型系统研究了bcc-MoNbTaW RHEA中刃型和螺型位错在宽温域内的迁移机制及SRO对这些机制的影响规律,发现螺位错的交叉滑移锁定机制是RHEA强化作用的关键(图8c[105]),SRO的存在会促进刃型位错的迁移,但抑制螺型位错运动中的双扭折形核速率,而螺型位错运动中的交滑移锁定机制与SRO无关。该工作展示了MTP模型对复杂合金位错缺陷良好的描述能力。Song等[110]采用MTP模型研究了钛基RHEAs在有限温度下的弹性性能,发现软化声子模式是导致钛基RHEAs弹性常数、模量异常以及各向异性随温度升高的主要原因。

4.4-固相变

MTP模型在固-固相变领域也获得了广泛应用,如图9[111~113]所示。Jung等[114]采用MTP模型结合直接升采样算法,实现了体积-温度相空间稳定区域的高效扫描,并在稳定区间内的密集网格上进行热力学积分。利用该方案,Jung等[114]计算了Ti、Zr、Hf体系中hcp-bcc相变特性及两相的热力学性质。在Ti3O5体系中,Liu等[111]利用结构因子作为集合变量(collective variable,CV),结合增强采样与主动学习技术构建了完备的MTP,并借此研究了β-Ti3O5λ-Ti3O5的超快可逆重构型相变过程。该工作采用巨分子动力学(metadynamics)方法计算了两相的温压相图,预测在0 GPa下相变温度为535 K,与实验值(470 K)高度吻合。Liu等[111]还观察到,β相及λ相沿c方向的任意的乐高式堆垛结构均具有动力学稳定性(图9a[111])。图9b[111]为对自由能面的计算。可见,相对于协同转变,逐层转变在动力学上更具有优势。在拉伸应变条件下的大尺度直接MD模拟中观察到了层内转变的面内形核长大过程,进一步的静态能垒计算表明,面内转变更倾向于形核机制,故而β-λ相变为面内形核、多步逐层转变的能垒降低的过程。该研究[111]不仅阐明了具体的相变路径,更展示了MTP结合增强采样技术在探索复杂重构型相变自由能面方面的巨大潜力。Liu等[112]利用MTP模型研究了Van der Waals层状材料In2Se3的多晶型相变过程,构建了In2Se3体系热力学相图(图9c[112]),并系统研究了该体系中αββ'β等相在温度、压力及应变等条件下的相变机制,在MD模拟中成功捕捉到成核过程、畴壁形成及堆垛缺陷演化等决定相变路径的关键微观机制(图9d[112])。Wang等[113]针对宽温域压卡材料体系KPF6[115],利用MTP模型研究了该体系中室温面心立方(C)相、中温单斜(M-II)相、低温单斜(M-I)相和高压菱方(R)相的微观相变机制。结果表明,该体系中F原子取向无序、非谐晶格动力学与压力驱动八面体协同旋转的耦合机制,使KPF6能够通过压力诱导相变在宽温域内实现持续的等温熵变(图9e[113])。

图9

图9   MTP模型在固-固相变中的应用[111~113]

Fig.9   Applications of MTP model on solid-solid phase transitions

(a) lego-like metastable phases in Ti3O5[111] (c and a—lattice constants)

(b) free energy as a function of collective variables during β-λ phase transition in Ti3O5[111] (The dashed line denotes the PES for a concerted β-λ phase transition, whereas the solid line represents the PES for the transition pathway via intermediate metastable phases)

(c) phase diagram of In2Se3[112]

(d) domain wall configurations in β'-2H In2Se3 phase[112] (t—time)

(e) vibrational, orientational, and total entropy of low-temperature monoclinic (M-I), intermediate-temperature monoclinic (M-II), fcc (C), and high-pressure rhombohedral (R) phases of KPF6 as a function of temperature[113] (Sori—orientational contribution, Svib—vibrational contribution, Stot—total contribution)


4.5 表界面体系

MTP模型在金属表面分子吸附及表面化学领域也展现出突出的应用潜力。Chen等[116]结合MD模拟与增强采样技术,构建了Cu表面溶剂模型的MTP函数,精确预测了CO*、OH*、COH*、HCO*及OCCHO*在Cu表面的吸附能,以及乙二醇在Cu和Pd表面发生C—H键断裂所需的自由能势垒,这些结果与DFT结果高度吻合,展现出了MTP模型在表面催化问题中的应用潜力。Klimanova等[117]利用MTP模型结合主动学习算法,系统研究了CO/Pd(111)、NO/Pd(111)、NH3/Cu(100)、C6H6/Ag(111)及CH2CO/Rh(211)表面吸附体系的稳定吸附位点,结果与DFT计算结果相吻合。

在表界面催化研究中,过渡金属表面CO吸附是一个经典谜题(the CO puzzle),即传统的DFT计算会错误预测过渡金属表面CO的稳定吸附位点,并系统性地低估表面能、高估吸附能[118,119]。尽管多体无规相近似(RPA)方法不仅能给出与定性实验一致的结果,还能给出准确的定量预测[120],但是该方法因十分昂贵而无法用于大体系、高通量计算。为此,Liu等[121]针对Rh(0001)表面CO吸附问题,利用MTP模型结合差分机器学习(Δ-learning)算法,在低成本下构建了具备RPA精度的机器学习势函数模型,成功预测了不同CO覆盖度下的吸附行为。如图10a[121]所示,基于RPA构建的MLFF能准确预测Rh(111)表面能、CO吸附位点偏好以及不同覆盖度下的吸附能,预测结果均与实验数据高度吻合。此外,Liu等[121]还预测了覆盖度依赖的基态吸附构型及吸附饱和覆盖度13/16 ML (monolayer),即表面吸附位点中13/16被占据。该工作利用MTP作为沟通高精度RPA计算与大规模计算模拟的桥梁,通过Δ-learning策略将RPA级别的精度“迁移”到高效的MTP势函数上,解决了低成本下高精度势函数开发“老大难”的问题。

图10

图10   MTP模型在表面及界面体系中的应用[121,122,125]

Fig.10   Applications of MTP model on surface and interface systems

(a) CO adsorption/binding energies and adsorption configurations on Rh(0001) surface[121] (ML—monolayer, PBE—Perdew-Burke-Ernzerhof functional theory, RPA—random phase approximation, vdW-DF—Van der Waals density functional theory, MTP-RPAΔ—MTP with RPA accuracy generated by the Δ-learning approach)

(b) Arrhenius plot of Li areal density at the interfacial regions Li-S cathode interface[125] (Ea—Li-Fe diffusion energy barrier, D—diffusivity)

(c) free energy profile for the protons along the intermolecular axes, and snapshots from the path integral molecular dynamics (PIMD) trajectory representing proton transfer process[122] (δ—reaction coordinate, ROaH and RObH—distances between H and Oa/Ob atom)


H2O分子在Ru(0001)表面是否存在解离是一个争论已久的问题,许多实验观测和理论计算都给出了不同的结论。Cao等[122]基于优化的MTP模型针对该体系开发了高效的MLFF,并采用该模型在大超胞体系上进行了纳秒级PIMD模拟,重新审视了H2O分子在Ru(0001)表面是否解离的问题。结果表明,核量子效应导致的质子量子离域化促使H2O分子间发生快速、频繁的质子转移,进而促进H2O分子在Ru(0001)表面的解离,PIMD观测到的质子转移能垒及水解离过程见图10c[122]。该工作为Ru(0001)表面H2O分子解离现象提供了直接理论证据,为H2O/金属界面的原子尺度机理研究提供了关键见解[122]

Li6PS5X (X = Cl、Br、I)是一种全固态锂离子电池中固态电解质的有力候选材料,Ou等[123]构建了Li6PS5Cl体系的MTP函数。计算结果表明,倾斜晶界Σ3(11¯2)[110]、Σ3(1¯11)[110]与扭转晶界Σ5(001)[001]的晶界的形成能均低于2000 meV/nm2,在多晶Li6PS5Cl中具有高稳定性。该工作进一步通过构建的MTP对阴离子有序/无序体相及三种晶界体系(包含超过16000个原子)进行了5 ns的分子动力学模拟,并获得了相应的扩散系数。Kim等[124]在Li6PS5X体系中采用MTP模型进行大规模分子动力学研究,发现在扭转晶界Σ5(001)[001]处,锂离子聚集会抑制离子电导率,且这种影响会延伸至距晶界(GB)界面约20 × 10-1 nm的内部区域。Holekevi Chandrappa等[125]采用MTP模型研究了Li-S电池(LSB)中S8/β-Li3PS4界面。MD模拟结果表明,最稳定的Li3PS4(100)面倾向于与S8形成具有二维通道的界面结构,这种结构能显著降低锂离子扩散的活化势垒(图10b[125]),这些发现为下一代全固态LSB的阴极-电解质界面设计提供了关键理论依据,同时也展现出MTP模型在复杂界面问题上的出色描述能力。

5 总结与展望

本文系统梳理了矩张量机器学习势模型的数学理论基础和模型发展历程,详细阐述了基于MTP的主动学习方法,并全面综述了矩张量机器学习势在单质、合金、相变材料、表面/界面等多元材料体系模拟中的应用进展。尽管MTP在精度与效率之间展现出优良的平衡性,其进一步发展仍面临若干关键挑战,具体体现在以下方面。

(1) 模型表达能力与计算成本的权衡。MTP模型作为对称多项式模型,其模型复杂度存在上限,相较于DP、SchNet、MACE等基于神经网络/图神经网络的复杂模型,MTP模型的全相空间描述能力存在不足。尽管引入更高阶的矩张量、更多的径向基组、更高的多项式幂次可以有效拓展MTP模型的描述能力,但是计算成本急剧增加,使得MTP模型的优势不复存在。针对此问题,未来可通过优化获得描述性更强的标量基组,并耦合等变神经网络[48]等具有高数据效率的非线性回归模型,在保留MTP物理解释性的同时突破线性模型表达瓶颈。这一技术路线有望高效处理非晶态材料、高熵合金及复杂有机/无机界面等化学环境高度异质的体系。另一条技术路线为结合MTP的高效特性与通用大模型(如MACE-MP-0模型[57])的泛化能力,进而构建“大模型迁移到小模型”的研究范式:通过大模型获得体系结构的能量、力、应力等高质量训练数据,进而训练专用MTP函数进行大规模、长时程模拟。该方法有望形成“泛化筛选-精准模拟”的新研究框架,为特定复杂材料体系的深度研究提供高效解决方案。

(2) 长程相互作用建模能力有限。尽管MTP在局部化学环境的描述上表现出色[59,64],但其本质上仍属于局域描述符模型,对于静电相互作用、Van der Waals力等长程效应的直接建模能力不足。在未来的发展中,可参考MPNN的思想,通过引入多层次聚合机制,扩展矩张量描述符的感知范围,从而有效捕捉长程相互作用。另一可行路径是引入隐式Ewald求和技术框架[49],对MTP模型所采用的标量基组额外构建一个线性回归模型作为描述电荷分布的隐变量,进而在MTP中实现对长程相互作用的直接建模。

(3) 并行计算架构适配不足。随着计算模拟领域对算力需求的不断提升,图形处理器(GPU)、张量处理器(TPU)、深算单元(DCU)等大规模并行计算设备正日益成为主流硬件平台。尽管MTP模型在中央处理器(CPU)架构上已展现出卓越的计算性能,但其在GPU等加速设备上的移植和优化仍显不足。特别是在张量分量的计算过程中,尚未充分利用GPU的并行计算特性,存在显著的性能提升空间。因此,开发面向现代并行计算架构的高效实现方案,已成为推动MTP模型持续发展的重要方向。

参考文献

Agrawal A, Choudhary A.

Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science

[J]. APL Mater., 2016, 4: 053208

[本文引用: 1]

Meyer M, Pontikis V. Computer Simulation in Materials Science [M]. Dordrecht: Springer, 1991, 205: 3

[本文引用: 1]

Hafner J.

Atomic-scale computational materials science

[J]. Acta Mater., 2000, 48: 71

DOI      URL    

Steinhauser M O, Hiermaier S.

A review of computational methods in materials science: Examples from shock-wave and polymer physics

[J]. Int. J. Mol. Sci., 2009, 10: 5135

DOI      PMID      [本文引用: 1]

This review discusses several computational methods used on different length and time scales for the simulation of material behavior. First, the importance of physical modeling and its relation to computer simulation on multiscales is discussed. Then, computational methods used on different scales are shortly reviewed, before we focus on the molecular dynamics (MD) method. Here we survey in a tutorial-like fashion some key issues including several MD optimization techniques. Thereafter, computational examples for the capabilities of numerical simulations in materials research are discussed. We focus on recent results of shock wave simulations of a solid which are based on two different modeling approaches and we discuss their respective assets and drawbacks with a view to their application on multiscales. Then, the prospects of computer simulations on the molecular length scale using coarse-grained MD methods are covered by means of examples pertaining to complex topological polymer structures including star-polymers, biomacromolecules such as polyelectrolytes and polymers with intrinsic stiffness. This review ends by highlighting new emerging interdisciplinary applications of computational methods in the field of medical engineering where the application of concepts of polymer physics and of shock waves to biological systems holds a lot of promise for improving medical applications such as extracorporeal shock wave lithotripsy or tumor treatment.

Daw M S, Baskes M I.

Embedded-atom method: Derivation and application to impurities, surfaces, and other defects in metals

[J]. Phys. Rev., 1984, 29B: 6443

[本文引用: 1]

Finnis M W, Sinclair J E.

A simple empirical N-body potential for transition metals

[J]. Philos. Mag., 1984, 50A: 45

[本文引用: 1]

Baskes M I.

Modified embedded-atom potentials for cubic materials and impurities

[J]. Phys. Rev., 1992, 46B: 2727

[本文引用: 1]

Tersoff J.

New empirical approach for the structure and energy of covalent systems

[J]. Phys. Rev., 1988, 37B: 6991

[本文引用: 1]

Blank T B, Brown S D, Calhoun A W, et al.

Neural network models of potential energy surfaces

[J]. J. Chem. Phys., 1995, 103: 4129

[本文引用: 1]

Brown D F R, Gibbs M N, Clary D C.

Combining ab initio computations, neural networks, and diffusion Monte Carlo: An efficient method to treat weakly bound molecules

[J]. J. Chem. Phys., 1996, 105: 7597

DOI      URL    

We describe a new method to calculate the vibrational ground state properties of weakly bound molecular systems and apply it to (HF)2 and HF–HCl. A Bayesian Inference neural network is used to fit an analytic function to a set of ab initio data points, which may then be employed by the quantum diffusion Monte Carlo method to produce ground state vibrational wave functions and properties. The method is general and relatively simple to implement and will be attractive for calculations on systems for which no analytic potential energy surface exists.

Gassner H, Probst M, Lauenstein A, et al.

Representation of intermolecular potential functions by neural networks

[J]. J. Phys. Chem., 1998, 102A: 4596

Prudente F V, Acioli P H, Neto J J S.

The fitting of potential energy surfaces using neural networks: Application to the study of vibrational levels of H 3 +

[J]. J. Chem. Phys., 1998, 109: 8801

DOI      URL    

Huang X C, Braams B J, Bowman J M.

Ab initio potential energy and dipole moment surfaces for H 5 O 2 +

[J]. J. Chem. Phys., 2005, 122: 044308

Manzhos S, Carrington T.

Using neural networks to represent potential surfaces as sums of products

[J]. J. Chem. Phys., 2006, 125: 194105

DOI      URL     [本文引用: 1]

By using exponential activation functions with a neural network (NN) method we show that it is possible to fit potentials to a sum-of-products form. The sum-of-products form is desirable because it reduces the cost of doing the quadratures required for quantum dynamics calculations. It also greatly facilitates the use of the multiconfiguration time dependent Hartree method. Unlike potfit product representation algorithm, the new NN approach does not require using a grid of points. It also produces sum-of-products potentials with fewer terms. As the number of dimensions is increased, we expect the advantages of the exponential NN idea to become more significant.

Behler J, Parrinello M.

Generalized neural-network representation of high-dimensional potential-energy surfaces

[J]. Phys. Rev. Lett., 2007, 98: 146401

DOI      URL     [本文引用: 1]

Bartók A P, Payne M C, Kondor R, et al.

Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons

[J]. Phys. Rev. Lett., 2010, 104: 136403

DOI      URL     [本文引用: 1]

Bartόk A P.

The gaussian approximation potential: An interatomic potential derived from first principles quantum mechanics

[D]. Berlin, Heidelberg: Springer, 2010

[本文引用: 1]

Bartók A P, Kondor R, Csányi G.

On representing chemical environments

[J]. Phys. Rev., 2013, 87B: 184115

[本文引用: 1]

Jinnouchi R, Karsai F, Kresse G.

On-the-fly machine learning force field generation: Application to melting points

[J]. Phys. Rev., 2019, 100B: 014105

[本文引用: 1]

Jinnouchi R, Lahnsteiner J, Karsai F, et al.

Phase transitions of hybrid perovskites simulated by machine-learning force fields trained on the fly with Bayesian inference

[J]. Phys. Rev. Lett., 2019, 122: 225701

DOI      URL     [本文引用: 1]

Shapeev A V.

Moment tensor potentials: A class of systematically improvable interatomic potentials

[J]. Multiscale Model. Simul., 2016, 14: 1153

DOI      URL     [本文引用: 8]

Thompson A P, Swiler L P, Trott C R, et al.

Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials

[J]. J. Comput. Phys., 2015, 285: 316

[本文引用: 1]

Wood M A, Thompson A P.

Extending the accuracy of the SNAP interatomic potential form

[J]. J. Chem. Phys., 2018, 148: 241721

DOI      URL     [本文引用: 1]

The Spectral Neighbor Analysis Potential (SNAP) is a classical interatomic potential that expresses the energy of each atom as a linear function of selected bispectrum components of the neighbor atoms. An extension of the SNAP form is proposed that includes quadratic terms in the bispectrum components. The extension is shown to provide a large increase in accuracy relative to the linear form, while incurring only a modest increase in computational cost. The mathematical structure of the quadratic SNAP form is similar to the embedded atom method (EAM), with the SNAP bispectrum components serving as counterparts to the two-body density functions in EAM. The effectiveness of the new form is demonstrated using an extensive set of training data for tantalum structures. Similar to artificial neural network potentials, the quadratic SNAP form requires substantially more training data in order to prevent overfitting. The quality of this new potential form is measured through a robust cross-validation analysis.

Drautz R.

Atomic cluster expansion for accurate and transferable interatomic potentials

[J]. Phys. Rev., 2019, 99B: 014104

[本文引用: 2]

Huang S D, Shang C, Kang P L, et al.

LASP: Fast global potential energy surface exploration

[J]. WIREs Comput. Mol. Sci., 2019, 9: e1415

DOI      URL     [本文引用: 1]

Here we introduce the LASP code, which is designed for large‐scale atomistic simulation of complex materials with neural network (NN) potential. The software architecture and functionalities of LASP will be overviewed. LASP features with the global neural network (G‐NN) potential that is generated by learning the first principles dataset of global PES from stochastic surface walking (SSW) global optimization. The combination of the SSW method with global NN potential facilitates greatly the PES exploration for a wide range of complex materials. Not limited to SSW‐NN global optimization, the software implements standard interfaces to dock with other energy/force evaluation packages and can also perform common tasks for computing PES properties, such as single‐ended and double‐ended transition state search, the molecular dynamics simulation with and without restraints. A few examples are given to illustrate the efficiency and capabilities of LASP code. Our ongoing efforts for code developing and G‐NN potential library building are also presented.

Kang P L, Shang C, Liu Z P.

Recent implementations in LASP 3.0: Global neural network potential with multiple elements and better long-range description

[J]. Chin. J. Chem. Phys., 2021, 34: 583

DOI      URL     [本文引用: 1]

LASP (large-scale atomistic simulation with neural network potential) software developed by our group since 2018 is a powerful platform (www.lasphub.com) for performing atomic simulation of complex materials. The software integrates the neural network (NN) potential technique with the global potential energy surface exploration method, and thus can be utilized widely for structure prediction and reaction mechanism exploration. Here we introduce our recent update on the LASP program version 3.0, focusing on the new functionalities including the advanced neural network training based on the multi-network framework, the newly-introduced S7 and S8 power type structure descriptor (PTSD). These new functionalities are designed to further improve the accuracy of potentials and accelerate the neural network training for multiple-element systems. Taking Cu-C-H-O neural network potential and a heterogeneous catalytic model as the example, we show that these new functionalities can accelerate the training of multi-element neural network potential by using the existing single-network potential as the input. The obtained double-network potential CuCHO is robust in simulation and the introduction of S7 and S8 PTSDs can reduce the root-mean-square errors of energy by a factor of two.

Kang P L, Shang C, Liu Z P.

Glucose to 5-hydroxymethylfurfural: Origin of site-selectivity resolved by machine learning based reaction sampling

[J]. J. Am. Chem. Soc., 2019, 141: 20525

DOI      URL     [本文引用: 1]

Ma S C, Huang S D, Liu Z P.

Dynamic coordination of cations and catalytic selectivity on zinc-chromium oxide alloys during syngas conversion

[J]. Nat. Catal., 2019, 2: 671

DOI      [本文引用: 1]

Huang S D, Shang C, Kang P L, et al.

Atomic structure of boron resolved using machine learning and global sampling

[J]. Chem. Sci., 2018, 9: 8644

DOI      URL     [本文引用: 1]

Huang S D, Shang C, Zhang X J, et al.

Material discovery by combining stochastic surface walking global optimization with a neural network

[J]. Chem. Sci., 2017, 8: 6327

DOI      URL     [本文引用: 1]

Eckhoff M, Behler J.

High-dimensional neural network potentials for magnetic systems using spin-dependent atom-centered symmetry functions

[J]. npj Comput. Mater., 2021, 7: 170

DOI      [本文引用: 1]

Machine learning potentials have emerged as a powerful tool to extend the time and length scales of first-principles quality simulations. Still, most machine learning potentials cannot distinguish different electronic spin arrangements and thus are not applicable to materials in different magnetic states. Here we propose spin-dependent atom-centered symmetry functions as a type of descriptor taking the atomic spin degrees of freedom into account. When used as an input for a high-dimensional neural network potential (HDNNP), accurate potential energy surfaces of multicomponent systems can be constructed, describing multiple collinear magnetic states. We demonstrate the performance of these magnetic HDNNPs for the case of manganese oxide, MnO. The method predicts the magnetically distorted rhombohedral structure in excellent agreement with density functional theory and experiment. Its efficiency allows to determine the Néel temperature considering structural fluctuations, entropic effects, and defects. The method is general and is expected to be useful also for other types of systems such as oligonuclear transition metal complexes.

Behler J.

Four generations of high-dimensional neural network potentials

[J]. Chem. Rev., 2021, 121: 10037

DOI      PMID     

Since their introduction about 25 years ago, machine learning (ML) potentials have become an important tool in the field of atomistic simulations. After the initial decade, in which neural networks were successfully used to construct potentials for rather small molecular systems, the development of high-dimensional neural network potentials (HDNNPs) in 2007 opened the way for the application of ML potentials in simulations of large systems containing thousands of atoms. To date, many other types of ML potentials have been proposed continuously increasing the range of problems that can be studied. In this review, the methodology of the family of HDNNPs including new recent developments will be discussed using a classification scheme into four generations of potentials, which is also applicable to many other types of ML potentials. The first generation is formed by early neural network potentials designed for low-dimensional systems. High-dimensional neural network potentials established the second generation and are based on three key steps: first, the expression of the total energy as a sum of environment-dependent atomic energy contributions; second, the description of the atomic environments by atom-centered symmetry functions as descriptors fulfilling the requirements of rotational, translational, and permutation invariance; and third, the iterative construction of the reference electronic structure data sets by active learning. In third-generation HDNNPs, in addition, long-range interactions are included employing environment-dependent partial charges expressed by atomic neural networks. In fourth-generation HDNNPs, which are just emerging, in addition, nonlocal phenomena such as long-range charge transfer can be included. The applicability and remaining limitations of HDNNPs are discussed along with an outlook at possible future developments.

Ko T W, Finkler J A, Goedecker S, et al.

A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer

[J]. Nat. Commun., 2021, 12: 398

DOI      PMID      [本文引用: 1]

Machine learning potentials have become an important tool for atomistic simulations in many fields, from chemistry via molecular biology to materials science. Most of the established methods, however, rely on local properties and are thus unable to take global changes in the electronic structure into account, which result from long-range charge transfer or different charge states. In this work we overcome this limitation by introducing a fourth-generation high-dimensional neural network potential that combines a charge equilibration scheme employing environment-dependent atomic electronegativities with accurate atomic energies. The method, which is able to correctly describe global charge distributions in arbitrary systems, yields much improved energies and substantially extends the applicability of modern machine learning potentials. This is demonstrated for a series of systems representing typical scenarios in chemistry and materials science that are incorrectly described by current methods, while the fourth-generation neural network potential is in excellent agreement with electronic structure calculations.

Han J Q, Zhang L F, Car R, et al.

Deep potential: A general representation of a many-body potential energy surface

[J]. Commun. Comput. Phys., 2018, 23: 629

DOI      URL     [本文引用: 1]

Wang H, Zhang L F, Han J Q, et al.

DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics

[J]. Comput. Phys. Commun., 2018, 228: 178

DOI      URL     [本文引用: 1]

Chen M Y, Hu J W, Yu Y C, et al.

Advances in machine learning molecular dynamics to assist materials nucleation and solidification research

[J]. Acta Metall. Sin., 2024, 60: 1329

DOI      [本文引用: 1]

Solidification nucleation is an everlasting research topic in the fields of materials science and condensed matter physics. Molecular dynamics (MD) and enhanced sampling methods provide a powerful means to observe the microscopic mechanisms of solidification processes in situ at the atomic level and to analyze the thermodynamic and kinetic properties of phase transitions. Recent advancements in MD simulations, particularly those incorporating machine learning (ML) techniques, have remarkably advanced our understanding of nucleation across different systems. This paper first reviews the basic theory of solidification nucleation and introduces common methods used in solidification nucleation simulation studies. It then delves into the application of ML techniques in three key areas: force fields, enhanced sampling, and order parameters. The paper further highlights several representative systems to demonstrate the practical applications of these methods. Finally, a summary and outlook on the future of ML-assisted MD simulations for studying material solidification were provided.

陈名毅, 胡俊伟, 余耀辰 .

机器学习分子动力学辅助材料凝固形核研究进展

[J]. 金属学报, 2024, 60: 1329

DOI      [本文引用: 1]

凝固形核是材料科学、凝聚态物理等领域长盛不衰的研究热点。分子动力学以及增强采样方法为从原子尺度原位观察凝固过程微观机理、解析相变热力学与动力学性质提供了有力手段。近年来,该领域的研究者们开发了一些融合机器学习技术的先进分子动力学模拟新方法,在多种体系的形核研究中取得了一定成果。本文首先回顾了凝固形核的基本理论,并从势函数、增强采样、形核序参量3个方面介绍凝固形核模拟研究中的常用方法以及机器学习技术在其中的应用。然后,选取了几个具有代表性的体系并介绍相关方法的实际应用。最后,对机器学习分子动力学辅助材料凝固形核模拟研究领域进行了总结与展望。

Liu S, Huang J W, Wu J.

Application of machine learning force fields for modeling ferroelectric materials

[J]. Acta Metall. Sin., 2024, 60: 1312

DOI     

Ferroelectric materials, which are characterized by tunable spontaneous polarization, show remarkable application potential for nonvolatile information storage; however they present various challenges. The performance of these materials is strongly influenced by their dynamic polarization behavior under multiple external fields. Due to the limited precision of experimental observations, precise atomic-level material simulations are crucial. Although molecular dynamics (MD) offers an ideal method for investigating material dynamics over a wide spatiotemporal range, its application to new materials is often limited by challenges such as low accuracy, complex development, and limited portability of conventional classical force fields. Advances in machine learning have provided new possibilities for developing force fields. Among different machine learning potentials, deep potential (DP) based on deep neural networks stands out. DP offers accuracy comparable to that of density functional theory while providing computational efficiency similar to that of conventional classical force fields. This review primarily focused on the development and application of DP in ferroelectric materials, specifically examining the phase transition mechanisms and polarization reversal processes at the atomic scale. Considerable efforts have been made to develop and evaluate DP for crucial ferroelectric materials such as hafnium dioxide (HfO2) and classic perovskite ferroelectric materials. Furthermore, this review explores the high oxygen-ion migration kinetics in HfO2 using DP and investigates the flexoelectricity induced by polar domain boundaries and the bulk photovoltaic effects in strontium titanate. By highlighting the use of DP molecular dynamics approaches in ferroelectric materials, this review emphasizes the role of machine learning approaches in optimizing and accelerating material simulations to facilitate further breakthroughs and discoveries.

刘 仕, 黄佳玮, 武 静.

机器学习势在铁电材料研究中的应用

[J]. 金属学报, 2024, 60: 1312

Wen T Q, Zhang L F, Wang H, et al.

Deep potentials for materials science

[J]. Mater. Futures, 2022, 1: 022601

[本文引用: 1]

Gilmer J, Schoenholz S S, Riley P F, et al.

Neural message passing for quantum chemistry

[A]. Proceedings of the 34th International Conference on Machine Learning [C]. Sydney: PMLR, 2017

[本文引用: 1]

Schütt K T, Sauceda H E, Kindermans P J, et al.

SchNet—A deep learning architecture for molecules and materials

[J]. J. Chem. Phys., 2018, 148: 241722

DOI      URL     [本文引用: 1]

Deep learning has led to a paradigm shift in artificial intelligence, including web, text, and image search, speech recognition, as well as bioinformatics, with growing impact in chemical physics. Machine learning, in general, and deep learning, in particular, are ideally suitable for representing quantum-mechanical interactions, enabling us to model nonlinear potential-energy surfaces or enhancing the exploration of chemical compound space. Here we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chemical space for molecules and materials, where our model learns chemically plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for molecular dynamics simulations of small molecules and perform an exemplary study on the quantum-mechanical properties of C20-fullerene that would have been infeasible with regular ab initio molecular dynamics.

Unke O T, Meuwly M.

PhysNet: A neural network for predicting energies, forces, dipole moments, and partial charges

[J]. J. Chem. Theory Comput., 2019, 15: 3678

DOI      PMID      [本文引用: 1]

In recent years, machine learning (ML) methods have become increasingly popular in computational chemistry. After being trained on appropriate ab initio reference data, these methods allow for accurately predicting the properties of chemical systems, circumventing the need for explicitly solving the electronic Schrödinger equation. Because of their computational efficiency and scalability to large data sets, deep neural networks (DNNs) are a particularly promising ML algorithm for chemical applications. This work introduces PhysNet, a DNN architecture designed for predicting energies, forces, and dipole moments of chemical systems. PhysNet achieves state-of-the-art performance on the QM9, MD17, and ISO17 benchmarks. Further, two new data sets are generated in order to probe the performance of ML models for describing chemical reactions, long-range interactions, and condensed phase systems. It is shown that explicitly including electrostatics in energy predictions is crucial for a qualitatively correct description of the asymptotic regions of a potential energy surface (PES). PhysNet models trained on a systematically constructed set of small peptide fragments (at most eight heavy atoms) are able to generalize to considerably larger proteins like deca-alanine (Ala): The optimized geometry of helical Ala predicted by PhysNet is virtually identical to ab initio results (RMSD = 0.21 Å). By running unbiased molecular dynamics (MD) simulations of Ala on the PhysNet-PES in gas phase, it is found that instead of a helical structure, Ala folds into a "wreath-shaped" configuration, which is more stable than the helical form by 0.46 kcal mol according to the reference ab initio calculations.

Klicpera J, Groß J, Günnemann S.

Directional message passing for molecular graphs

[A]. Proceedings of the 8th International Conference on Learning Representations [C]. Addis Ababa: OpenReview.net, 2020

[本文引用: 1]

Schütt K T, Unke O T, Gastegger M.

Equivariant message passing for the prediction of tensorial properties and molecular spectra

[J]. arXiv: 2102. 03150, 2021

[本文引用: 1]

Batzner S, Musaelian A, Sun L X, et al.

E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials

[J]. Nat. Commun., 2022, 13: 2453

DOI      PMID      [本文引用: 1]

This work presents Neural Equivariant Interatomic Potentials (NequIP), an E(3)-equivariant neural network approach for learning interatomic potentials from ab-initio calculations for molecular dynamics simulations. While most contemporary symmetry-aware models use invariant convolutions and only act on scalars, NequIP employs E(3)-equivariant convolutions for interactions of geometric tensors, resulting in a more information-rich and faithful representation of atomic environments. The method achieves state-of-the-art accuracy on a challenging and diverse set of molecules and materials while exhibiting remarkable data efficiency. NequIP outperforms existing models with up to three orders of magnitude fewer training data, challenging the widely held belief that deep neural networks require massive training sets. The high data efficiency of the method allows for the construction of accurate potentials using high-order quantum chemical level of theory as reference and enables high-fidelity molecular dynamics simulations over long time scales.© 2022. The Author(s).

Musaelian A, Batzner S, Johansson A, et al.

Learning local equivariant representations for large-scale atomistic dynamics

[J]. Nat. Commun., 2023, 14: 579

DOI      PMID      [本文引用: 1]

A simultaneously accurate and computationally efficient parametrization of the potential energy surface of molecules and materials is a long-standing goal in the natural sciences. While atom-centered message passing neural networks (MPNNs) have shown remarkable accuracy, their information propagation has limited the accessible length-scales. Local methods, conversely, scale to large simulations but have suffered from inferior accuracy. This work introduces Allegro, a strictly local equivariant deep neural network interatomic potential architecture that simultaneously exhibits excellent accuracy and scalability. Allegro represents a many-body potential using iterated tensor products of learned equivariant representations without atom-centered message passing. Allegro obtains improvements over state-of-the-art methods on QM9 and revMD17. A single tensor product layer outperforms existing deep MPNNs and transformers on QM9. Furthermore, Allegro displays remarkable generalization to out-of-distribution data. Molecular simulations using Allegro recover structural and kinetic properties of an amorphous electrolyte in excellent agreement with ab-initio simulations. Finally, we demonstrate parallelization with a simulation of 100 million atoms.© 2023. The Author(s).

Batatia I, Kovács D P, Simm G N C, et al.

MACE: Higher order equivariant message passing neural networks for fast and accurate force fields

[J]. arXiv: 2206. 07697, 2023

[本文引用: 1]

Wu Y B, Xia J F, Zhang Y L, et al.

A simple and efficient equivariant message-passing neural network model for non-local potential energy surface

[J]. J. Phys. Chem., 2024, 128A: 11061

[本文引用: 1]

Batatia I, Batzner S, Kovács D P, et al.

The design space of E(3)-equivariant atom-centered interatomic potentials

[J]. arXiv: 2205. 06643, 2022

[本文引用: 2]

Cheng B Q.

Latent Ewald summation for machine learning of long-range interactions

[J]. npj Comput. Mater., 2025, 11: 80

DOI      [本文引用: 2]

Jain A, Ong S P, Hautier G, et al.

Commentary: The materials project: A materials genome approach to accelerating materials innovation

[J]. APL Mater., 2013, 1: 011002

[本文引用: 1]

Chen C, Ye W K, Zuo Y X, et al.

Graph networks as a universal machine learning framework for molecules and crystals

[J]. Chem. Mater., 2019, 31: 3564

DOI      [本文引用: 1]

Graph networks are a new machine learning (ML) paradigm that supports both relational reasoning and combinatorial generalization. Here, we develop universal MatErials Graph Network (MEGNet) models for accurate property prediction in both molecules and crystals. We demonstrate that the MEGNet models outperform prior ML models such as the SchNet in 11 out of 13 properties of the QM9 molecule data set. Similarly, we show that MEGNet models trained on similar to 60 000 crystals in the Materials Project substantially outperform prior ML models in the prediction of the formation energies, band gaps, and elastic moduli of crystals, achieving better than density functional theory accuracy over a much larger data set. We present two new strategies to address data limitations common in materials science and chemistry. First, we demonstrate a physically intuitive approach to unify four separate molecular MEGNet models for the internal energy at 0 K and room temperature, enthalpy, and Gibbs free energy into a single free energy MEGNet model by incorporating the temperature, pressure, and entropy as global state inputs. Second, we show that the learned element embeddings in MEGNet models encode periodic chemical trends and can be transfer-learned from a property model trained on a larger data set (formation energies) to improve property models with smaller amounts of data (band gaps and elastic moduli).

Chen C, Ong S P.

A universal graph deep learning interatomic potential for the periodic table

[J]. Nat. Comput. Sci., 2022, 2: 718

DOI      PMID      [本文引用: 1]

Interatomic potentials (IAPs), which describe the potential energy surface of atoms, are a fundamental input for atomistic simulations. However, existing IAPs are either fitted to narrow chemistries or too inaccurate for general applications. Here we report a universal IAP for materials based on graph neural networks with three-body interactions (M3GNet). The M3GNet IAP was trained on the massive database of structural relaxations performed by the Materials Project over the past ten years and has broad applications in structural relaxation, dynamic simulations and property prediction of materials across diverse chemical spaces. About 1.8 million materials from a screening of 31 million hypothetical crystal structures were identified to be potentially stable against existing Materials Project crystals based on M3GNet energies. Of the top 2,000 materials with the lowest energies above the convex hull, 1,578 were verified to be stable using density functional theory calculations. These results demonstrate a machine learning-accelerated pathway to the discovery of synthesizable materials with exceptional properties.© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.

Deng B W, Zhong P C, Jun K, et al.

CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling

[J]. Nat. Mach. Intell., 2023, 5: 1031

DOI      [本文引用: 1]

Merchant A, Batzner S, Schoenholz S S, et al.

Scaling deep learning for materials discovery

[J]. Nature, 2023, 624: 80

DOI      [本文引用: 1]

Zhang D, Bi H R, Dai F Z, et al.

Pretraining of attention-based deep learning potential model for molecular simulation

[J]. npj Comput. Mater., 2024, 10: 94

DOI      [本文引用: 1]

Zhang D, Liu X Z J, Zhang X Y, et al.

DPA-2: A large atomic model as a multi-task learner

[J]. npj Comput. Mater., 2024, 10: 293

DOI      [本文引用: 1]

Batatia I, Benner P, Chiang Y, et al.

A foundation model for atomistic materials chemistry

[J]. J. Chem. Phys., 2025, 163: 184110

DOI      URL     [本文引用: 2]

Xie F K, Lu T L, Meng S, et al.

GPTFF: A high-accuracy out-of-the-box universal AI force field for arbitrary inorganic materials

[J]. Sci. Bull., 2024, 69: 3525

DOI      PMID      [本文引用: 1]

This study introduces a novel artificial intelligence (AI) force field, namely a graph-based pre-trained transformer force field (GPTFF), which can simulate arbitrary inorganic systems with good precision and generalizability. Harnessing a large trove of the data and the attention mechanism of transformer algorithms, the model can accurately predict energy, atomic force, and stress with mean absolute error (MAE) values of 32 meV/atom, 71 meV/Å, and 0.365 GPa, respectively. The dataset used to train the model includes 37.8 million single-point energies, 11.7 billion force pairs, and 340.2 million stresses. We also demonstrated that the GPTFF can be universally used to simulate various physical systems, such as crystal structure optimization, phase transition simulations, and mass transport. The model is publicly released with this paper, enabling anyone to use it immediately without needing to train it.Copyright © 2024 Science China Press. Published by Elsevier B.V. All rights reserved.

Han T, Li J, Liu L P, et al.

Accuracy evaluation of different machine learning force field features

[J]. New J. Phys., 2023, 25: 093007

[本文引用: 2]

Zuo Y X, Chen C, Li X G, et al.

A performance and cost assessment of machine learning interatomic potentials

[J]. J. Phys. Chem., 2020, 124A: 731

[本文引用: 4]

Dusson G, Bachmayr M, Csányi G, et al.

Atomic cluster expansion: Completeness, efficiency and stability

[J]. J. Comput. Phys., 2022, 454: 110946

DOI      URL     [本文引用: 1]

Hodapp M, Shapeev A.

Equivariant tensor network potentials

[J]. Mach. Learn.: Sci. Technol., 2024, 5: 035075

[本文引用: 1]

Podryabinkin E V, Shapeev A V.

Active learning of linearly parametrized interatomic potentials

[J]. Comput. Mater. Sci., 2017, 140: 171

DOI      URL     [本文引用: 3]

Gubaev K, Podryabinkin E V, Shapeev A V.

Machine learning of molecular properties: Locality and active learning

[J]. J. Chem. Phys., 2018, 148: 241727

DOI      URL     [本文引用: 8]

Novikov I S, Gubaev K, Podryabinkin E V, et al.

The MLIP package: Moment tensor potentials with MPI and active learning

[J]. Mach. Learn.: Sci. Technol., 2021, 2: 025002

[本文引用: 3]

Wang J T, Liu P T, Zhu H T, et al.

Efficient moment tensor machine-learning interatomic potential for accurate description of defects in Ni-Al Alloys

[J]. Phys. Rev. Mater., 2025, 9: 053805

[本文引用: 12]

Cohn D A, Ghahramani Z, Jordan M I.

Active learning with statistical models

[J]. J. Artif. Intell. Res., 1996, 4: 129

DOI      URL     [本文引用: 1]

For many types of machine learning algorithms, one can compute the statistically `optimal' way to select training data. In this paper, we review how optimal data selection techniques have been used with feedforward neural networks. We then show how the same principles may be used to select data for two alternative, statistically-based learning architectures: mixtures of Gaussians and locally weighted regression. While the techniques for neural networks are computationally expensive and approximate, the techniques for mixtures of Gaussians and locally weighted regression are both efficient and accurate. Empirically, we observe that the optimality criterion sharply decreases the number of training examples the learner needs in order to achieve good performance.

Jinnouchi R, Miwa K, Karsai F, et al.

On-the-fly active learning of interatomic potentials for large-scale atomistic simulations

[J]. J. Phys. Chem. Lett., 2020, 11: 6946

DOI      PMID     

The on-the-fly generation of machine-learning force fields by active-learning schemes attracts a great deal of attention in the community of atomistic simulations. The algorithms allow the machine to self-learn an interatomic potential and construct machine-learned models on the fly during simulations. State-of-the-art query strategies allow the machine to judge whether new structures are out of the training data set or not. Only when the machine judges the necessity of updating the data set with the new structures are first-principles calculations carried out. Otherwise, the yet available machine-learned model is used to update the atomic positions. In this manner, most of the first-principles calculations are bypassed during training, and overall, simulations are accelerated by several orders of magnitude while retaining almost first-principles accuracy. In this Perspective, after describing essential components of the active-learning algorithms, we demonstrate the power of the schemes by presenting recent applications.

Friederich P, Häse F, Proppe J, et al.

Machine-learned potentials for next-generation matter simulations

[J]. Nat. Mater., 2021, 20: 750

DOI      PMID      [本文引用: 1]

The choice of simulation methods in computational materials science is driven by a fundamental trade-off: bridging large time- and length-scales with highly accurate simulations at an affordable computational cost. Venturing the investigation of complex phenomena on large scales requires fast yet accurate computational methods. We review the emerging field of machine-learned potentials, which promises to reach the accuracy of quantum mechanical computations at a substantially reduced computational cost. This Review will summarize the basic principles of the underlying machine learning methods, the data acquisition process and active learning procedures. We highlight multiple recent applications of machine-learned potentials in various fields, ranging from organic chemistry and biomolecules to inorganic crystal structure predictions and surface science. We furthermore discuss the developments required to promote a broader use of ML potentials, and the possibility of using them to help solve open questions in materials science and facilitate fully computational materials design.

Laio A, Parrinello M.

Escaping free-energy minima

[J]. Proc. Natl. Acad. Sci. USA, 2002, 99: 12562

PMID      [本文引用: 1]

We introduce a powerful method for exploring the properties of the multidimensional free energy surfaces (FESs) of complex many-body systems by means of coarse-grained non-Markovian dynamics in the space defined by a few collective coordinates. A characteristic feature of these dynamics is the presence of a history-dependent potential term that, in time, fills the minima in the FES, allowing the efficient exploration and accurate determination of the FES as a function of the collective coordinates. We demonstrate the usefulness of this approach in the case of the dissociation of a NaCl molecule in water and in the study of the conformational changes of a dialanine in solution.

Allen R J, Frenkel D, ten Wolde P R.

Simulating rare events in equilibrium or nonequilibrium stochastic systems

[J]. J. Chem. Phys., 2006, 124: 024102

Barducci A, Bussi G, Parrinello M.

Well-tempered metadynamics: A smoothly converging and tunable free-energy method

[J]. Phys. Rev. Lett., 2008, 100: 020603

Han X, Lei Y K, Li M D, et al.

A brief review of integrated tempering sampling molecular simulation

[J]. Chem. Phys. Rev., 2024, 5: 011308

Castellano A, Béjaud R, Richard P, et al.

Machine learning assisted canonical sampling (MLACS)

[J]. Comput. Phys. Commun., 2025, 316: 109730

DOI      URL     [本文引用: 1]

Ramakrishnan R, Dral P O, Rupp M, et al.

Quantum chemistry structures and properties of 134 kilo molecules

[J]. Sci. Data, 2014, 1: 140022

DOI      [本文引用: 1]

Glass C W, Oganov A R, Hansen N.

USPEX—Evolutionary crystal structure prediction

[J]. Comput. Phys. Commun., 2006, 175: 713

DOI      URL     [本文引用: 1]

Wang J J, Gao H, Han Y, et al.

MAGUS: Machine learning and graph theory assisted universal structure searcher

[J]. Natl. Sci. Rev., 2023, 10: nwad128

DOI      URL     [本文引用: 1]

Crystal structure predictions based on first-principles calculations have gained great success in materials science and solid state physics. However, the remaining challenges still limit their applications in systems with a large number of atoms, especially the complexity of conformational space and the cost of local optimizations for big systems. Here, we introduce a crystal structure prediction method, MAGUS, based on the evolutionary algorithm, which addresses the above challenges with machine learning and graph theory. Techniques used in the program are summarized in detail and benchmark tests are provided. With intensive tests, we demonstrate that on-the-fly machine-learning potentials can be used to significantly reduce the number of expensive first-principles calculations, and the crystal decomposition based on graph theory can efficiently decrease the required configurations in order to find the target structures. We also summarized the representative applications of this method on several research topics, including unexpected compounds in the interior of planets and their exotic states at high pressure and high temperature (superionic, plastic, partially diffusive state, etc.); new functional materials (superhard, high-energy-density, superconducting, photoelectric materials), etc. These successful applications demonstrated that MAGUS code can help to accelerate the discovery of interesting materials and phenomena, as well as the significant value of crystal structure predictions in general.

Podryabinkin E V, Tikhonov E V, Shapeev A V, et al.

Accelerating crystal structure prediction by machine-learning interatomic potentials with active learning

[J]. Phys. Rev., 2019, 99B: 064114

[本文引用: 1]

Gubaev K, Podryabinkin E V, Hart G L W, et al.

Accelerating high-throughput searches for new alloys with active learning of interatomic potentials

[J]. Comput. Mater. Sci., 2019, 156: 148

DOI      URL     [本文引用: 1]

Novoselov I I, Yanilkin A V, Shapeev A V, et al.

Moment tensor potentials as a promising tool to study diffusion processes

[J]. Comput. Mater. Sci., 2019, 164: 46

DOI      URL     [本文引用: 1]

Shapeev A V, Podryabinkin E V, Gubaev K, et al.

Elinvar effect in β-Ti simulated by on-the-fly trained moment tensor potential

[J]. New J. Phys., 2020, 22: 113005

DOI      [本文引用: 1]

A combination of quantum mechanics calculations with machine learning techniques can lead to a paradigm shift in our ability to predict materials properties from first principles. Here we show that on-the-fly training of an interatomic potential described through moment tensors provides the same accuracy as state-of-the-art ab initio molecular dynamics in predicting high-temperature elastic properties of materials with two orders of magnitude less computational effort. Using the technique, we investigate high-temperature bcc phase of titanium and predict very weak, Elinvar, temperature dependence of its elastic moduli, similar to the behavior of the so-called GUM Ti-based alloys (Sato et al 2003 Science \n 300 464). Given the fact that GUM alloys have complex chemical compositions and operate at room temperature, Elinvar properties of elemental bcc-Ti observed in the wide temperature interval 1100–1700 K is unique.

Mukhamedov B, Tasnádi F, Abrikosov I A.

Machine learning interatomic potential for the low-modulus Ti-Nb-Zr alloys in the vicinity of dynamical instability

[J]. Mater. Des., 2025, 253: 113865

DOI      URL     [本文引用: 1]

Mei H J, Cheng L Y, Chen L, et al.

Development of machine learning interatomic potential for zinc

[J]. Comput. Mater. Sci., 2024, 233: 112723

DOI      URL     [本文引用: 1]

Forslund A, Ruban A.

Ab initio surface free energies of tungsten with full account of thermal excitations

[J]. Phys. Rev., 2022, 105B: 045403

[本文引用: 1]

Forslund A, Jung J H, Srinivasan P, et al.

Thermodynamic properties on the homologous temperature scale from direct upsampling: Understanding electron-vibration coupling and thermal vacancies in bcc refractory metals

[J]. Phys. Rev., 2023, 107B: 174309

[本文引用: 1]

Srinivasan P, Shapeev A, Neugebauer J, et al.

Anharmonicity in bcc refractory elements: A detailed ab initio analysis

[J]. Phys. Rev., 2023, 107B: 014301

[本文引用: 1]

Zhu L F, Srinivasan P, Gong Y L, et al.

Melting properties of the refractory metals V and W and the binary VW alloy fully from first principles

[J]. Phys. Rev., 2024, 109B: 094110

[本文引用: 1]

Jung J H, Srinivasan P, Forslund A, et al.

High-accuracy thermodynamic properties to the melting point from ab initio calculations aided by machine-learning potentials

[J]. npj Comput. Mater., 2023, 9: 3

DOI      [本文引用: 1]

Zhang X, Divinski S V, Grabowski B.

Ab initio machine-learning unveils strong anharmonicity in non-Arrhenius self-diffusion of tungsten

[J]. Nat. Commun., 2025, 16: 394

DOI      [本文引用: 1]

The knowledge of diffusion mechanisms in materials is crucial for predicting their high-temperature performance and stability, yet accurately capturing the underlying physics like thermal effects remains challenging. In particular, the origin of the experimentally observed non-Arrhenius diffusion behavior has remained elusive, largely due to the lack of effective computational tools. Here we propose an efficient ab initio framework to compute the Gibbs energy of the transition state in vacancy-mediated diffusion including the relevant thermal excitations at the density-functional-theory level. With the aid of a bespoke machine-learning interatomic potential, the temperature-dependent vacancy formation and migration Gibbs energies of the prototype system body-centered cubic (BCC) tungsten are shown to be strongly affected by anharmonicity. This finding explains the physical origin of the experimentally observed non-Arrhenius behavior of tungsten self-diffusion. A remarkable agreement between the calculated and experimental temperature-dependent self-diffusivity and, in particular, its curvature is revealed. The proposed computational framework is robust and broadly applicable, as evidenced by first tests for a hexagonal close-packed (HCP) multicomponent high-entropy alloy. The successful applications underscore the attainability of an accurate ab initio diffusion database.

Duff A I, Davey T, Korbmacher D, et al.

Improved method of calculating ab initio high-temperature thermodynamic properties with application to ZrC

[J]. Phys. Rev., 2015, 91B: 214311

[本文引用: 1]

Grabowski B, Ikeda Y, Srinivasan P, et al.

Ab initio vibrational free energies including anharmonicity for multicomponent alloys

[J]. npj Comput. Mater., 2019, 5: 80

DOI      [本文引用: 2]

The unique and unanticipated properties of multiple principal component alloys have reinvigorated the field of alloy design and drawn strong interest across scientific disciplines. The vast compositional parameter space makes these alloys a unique area of exploration by means of computational design. However, as of now a method to compute efficiently, yet with high accuracy the thermodynamic properties of such alloys has been missing. One of the underlying reasons is the lack of accurate and efficient approaches to compute vibrational free energies—including anharmonicity—for these chemically complex multicomponent alloys. In this work, a density-functional-theory based approach to overcome this issue is developed based on a combination of thermodynamic integration and a machine-learning potential. We demonstrate the performance of the approach by computing the anharmonic free energy of the prototypical five-component VNbMoTaW refractory high entropy alloy.

Zotov N, Gubaev K, Wörner J, et al.

Moment tensor potential for static and dynamic investigations of screw dislocations in bcc Nb

[J]. Modell. Simul. Mater. Sci. Eng., 2024, 32: 035032

[本文引用: 1]

Wang Y, Liu J B, Li J H, et al.

Machine-learning interatomic potential for radiation damage effects in bcc-iron

[J]. Comput. Mater. Sci., 2022, 202: 110960

DOI      URL     [本文引用: 1]

Kwon H, Shiga M, Kimizuka H, et al.

Accurate description of hydrogen diffusivity in bcc metals using machine-learning moment tensor potentials and path-integral methods

[J]. Acta Mater., 2023, 247: 118739

DOI      URL     [本文引用: 1]

Kumar P, Körmann F, Grabowski B, et al.

Machine learning potentials for hydrogen absorption in TiCr2 Laves phases

[J]. Acta Mater., 2025, 297: 121319

DOI      URL     [本文引用: 1]

Xu X, Zhang X, Ruban A, et al.

Accurate complex-stacking-fault Gibbs energy in Ni3Al at high temperatures

[J]. Scr. Mater., 2024, 242: 115934

DOI      URL     [本文引用: 4]

Xu X, Zhang X, Bitzek E, et al.

Origin of the yield stress anomaly in L12 intermetallics unveiled with physically informed machine-learning potentials

[J]. Acta Mater., 2024, 281: 120423

DOI      URL     [本文引用: 4]

Novikov B, Grabowski B, Körmann F, et al.

Magnetic Moment Tensor Potentials for collinear spin-polarized materials reproduce different magnetic states of bcc Fe

[J]. npj Comput. Mater., 2022, 8: 13

DOI      [本文引用: 2]

We present the magnetic Moment Tensor Potentials (mMTPs), a class of machine-learning interatomic potentials, accurately reproducing both vibrational and magnetic degrees of freedom as provided, e.g., from first-principles calculations. The accuracy is achieved by a two-step minimization scheme that coarse-grains the atomic and the spin space. The performance of the mMTPs is demonstrated for the prototype magnetic system bcc iron, with applications to phonon calculations for different magnetic states, and molecular-dynamics simulations with fluctuating magnetic moments.

Kotykhov A S, Gubaev K, Hodapp M, et al.

Constrained DFT-based magnetic machine-learning potentials for magnetic alloys: a case study of Fe-Al

[J]. Sci. Rep., 2023, 13: 19728

DOI      PMID      [本文引用: 1]

We propose a machine-learning interatomic potential for multi-component magnetic materials. In this potential we consider magnetic moments as degrees of freedom (features) along with atomic positions, atomic types, and lattice vectors. We create a training set with constrained DFT (cDFT) that allows us to calculate energies of configurations with non-equilibrium (excited) magnetic moments and, thus, it is possible to construct the training set in a wide configuration space with great variety of non-equilibrium atomic positions, magnetic moments, and lattice vectors. Such a training set makes possible to fit reliable potentials that will allow us to predict properties of configurations in the excited states (including the ones with non-equilibrium magnetic moments). We verify the trained potentials on the system of bcc Fe-Al with different concentrations of Al and Fe and different ways Al and Fe atoms occupy the supercell sites. Here, we show that the formation energies, the equilibrium lattice parameters, and the total magnetic moments of the unit cell for different Fe-Al structures calculated with machine-learning potentials are in good correspondence with the ones obtained with DFT. We also demonstrate that the theoretical calculations conducted in this study qualitatively reproduce the experimentally-observed anomalous volume-composition dependence in the Fe-Al system.© 2023. The Author(s).

Kotykhov A S, Gubaev K, Sotskov V, et al.

Fitting to magnetic forces improves the reliability of magnetic moment tensor potentials

[J]. Comput. Mater. Sci., 2024, 245: 113331

DOI      URL     [本文引用: 2]

Taylor A, Jones R M.

Constitution and magnetic properties of iron-rich iron-aluminum alloys

[J]. J. Phys. Chem. Solids, 1958, 6: 16

DOI      URL     [本文引用: 1]

Srinivasan P, Demuriya D, Grabowski B, et al.

Electronic moment tensor potentials include both electronic and vibrational degrees of freedom

[J]. npj Comput. Mater., 2024, 10: 41

DOI      [本文引用: 2]

We present the electronic moment tensor potentials (eMTPs), a class of machine-learning interatomic models and a generalization of the classical MTPs, reproducing both the electronic and vibrational degrees of freedom, up to the accuracy of ab initio calculations. Following the original polynomial interpolation idea of the MTPs, the eMTPs are defined as polynomials of vibrational and electronic degrees of freedom, corrected to have a finite interatomic cutoff. Practically, an eMTP is constructed from the classical MTPs fitted to a training set, whose energies and forces are calculated with electronic temperatures corresponding to the Chebyshev nodes on a given temperature interval. The eMTP energy is hence a Chebyshev interpolation of the classical MTPs. Using the eMTP, one can obtain the temperature-dependent vibrational free energy including anharmonicity coming from phonon interactions, the electronic free energy coming from electron interactions, and the coupling of atomic vibrations and electronic excitations. Each of the contributions can be accessed individually using the proposed formalism. The performance of eMTPs is demonstrated for two refractory systems which have a significant electronic, vibrational and coupling contribution up to the melting point—unary Nb, and a disordered TaVCrW high-entropy alloy. Highly accurate thermodynamic and kinetic quantities can now be obtained just by using eMTPs, without any further ab initio calculations. The proposed construction to include the electronic degree of freedom can also be applied to other machine-learning models.

Gubaev K, Zaverkin V, Srinivasan P, et al.

Performance of two complementary machine-learned potentials in modelling chemically complex systems

[J]. npj Comput. Mater., 2023, 9: 129

DOI      [本文引用: 5]

\n Chemically complex multicomponent alloys possess exceptional properties derived from an inexhaustible compositional space. The complexity however makes interatomic potential development challenging. We explore two complementary machine-learned potentials—the moment tensor potential (MTP) and the Gaussian moment neural network (GM-NN)—in simultaneously describing configurational\n and\n vibrational degrees of freedom in the Ta-V-Cr-W alloy family. Both models are equally accurate with excellent performance evaluated against density-functional-theory. They achieve root-mean-square-errors (RMSEs) in energies of less than a few meV/atom across 0 K ordered and high-temperature disordered configurations included in the training. Even for compositions not in training, relative energy RMSEs at high temperatures are within a few meV/atom. High-temperature molecular dynamics forces have similarly small RMSEs of about 0.15 eV/Å for the disordered quaternary included in, and ternaries not part of training. MTPs achieve faster convergence with training size; GM-NNs are faster in execution. Active learning is partially beneficial and should be complemented with conventional human-based training set generation.\n

Shuang F, Ji Y C, Laurenti L, et al.

Size-dependent strength superiority in multi-principal element alloys versus constituent metals: Insights from machine-learning atomistic simulations

[J]. Int. J. Plast., 2025, 188: 104308

DOI      URL     [本文引用: 3]

Yin S, Zuo Y X, Abu-Odeh A, et al.

Atomistic simulations of dislocation mobility in refractory high-entropy alloys and the effect of chemical short-range order

[J]. Nat. Commun., 2021, 12: 4873

DOI      PMID      [本文引用: 5]

Refractory high-entropy alloys (RHEAs) are designed for high elevated-temperature strength, with both edge and screw dislocations playing an important role for plastic deformation. However, they can also display a significant energetic driving force for chemical short-range ordering (SRO). Here, we investigate mechanisms underlying the mobilities of screw and edge dislocations in the body-centered cubic MoNbTaW RHEA over a wide temperature range using extensive molecular dynamics simulations based on a highly-accurate machine-learning interatomic potential. Further, we specifically evaluate how these mechanisms are affected by the presence of SRO. The mobility of edge dislocations is found to be enhanced by the presence of SRO, whereas the rate of double-kink nucleation in the motion of screw dislocations is reduced, although this influence of SRO appears to be attenuated at increasing temperature. Independent of the presence of SRO, a cross-slip locking mechanism is observed for the motion of screws, which provides for extra strengthening for refractory high-entropy alloy system.© 2021. The Author(s).

Jafary-Zadeh M, Khoo K H, Laskowski R, et al.

Applying a machine learning interatomic potential to unravel the effects of local lattice distortion on the elastic properties of multi-principal element alloys

[J]. J. Alloys Compd., 2019, 803: 1054

DOI      URL     [本文引用: 1]

Gubaev K, Ikeda Y, Tasnádi F, et al.

Finite-temperature interplay of structural stability, chemical complexity, and elastic properties of bcc multicomponent alloys from ab initio trained machine-learning potentials

[J]. Phys. Rev. Mater., 2021, 5: 073801

[本文引用: 2]

Hodapp M, Shapeev A.

Machine-learning potentials enable predictive and tractable high-throughput screening of random alloys

[J]. Phys. Rev. Mater., 2021, 5: 113802

[本文引用: 1]

Zhu L F, Körmann F, Chen Q, et al.

Accelerating ab initio melting property calculations with machine learning: Application to the high entropy alloy TaVCrW

[J]. npj Comput. Mater., 2024, 10: 274

DOI      [本文引用: 1]

Melting properties are critical for designing novel materials, especially for discovering high-performance, high-melting refractory materials. Experimental measurements of these properties are extremely challenging due to their high melting temperatures. Complementary theoretical predictions are, therefore, indispensable. One of the most accurate approaches for this purpose is the ab initio free-energy approach based on density functional theory (DFT). However, it generally involves expensive thermodynamic integration using ab initio molecular dynamic simulations. The high computational cost makes high-throughput calculations infeasible. Here, we propose a highly efficient DFT-based method aided by a specially designed machine learning potential. As the machine learning potential can closely reproduce the ab initio phase-space distribution, even for multi-component alloys, the costly thermodynamic integration can be fully substituted with more efficient free energy perturbation calculations. The method achieves overall savings of computational resources by 80% compared to current alternatives. We apply the method to the high-entropy alloy TaVCrW and calculate its melting properties, including the melting temperature, entropy and enthalpy of fusion, and volume change at the melting point. Additionally, the heat capacities of solid and liquid TaVCrW are calculated. The results agree reasonably with the CALPHAD extrapolated values.

Song Y M, Xian J W, Xu Y J, et al.

Machine learning accelerated study on temperature dependent elastic properties of Ti-based refractory high entropy alloys

[J]. Mater. Today Commun., 2025, 42: 111559

[本文引用: 1]

Liu M F, Wang J T, Hu J W, et al.

Layer-by-layer phase transformation in Ti3O5 revealed by machine-learning molecular dynamics simulations

[J]. Nat. Commun., 2024, 15: 3079

DOI      [本文引用: 9]

Reconstructive phase transitions involving breaking and reconstruction of primary chemical bonds are ubiquitous and important for many technological applications. In contrast to displacive phase transitions, the dynamics of reconstructive phase transitions are usually slow due to the large energy barrier. Nevertheless, the reconstructive phase transformation from β- to λ-Ti3O5 exhibits an ultrafast and reversible behavior. Despite extensive studies, the underlying microscopic mechanism remains unclear. Here, we discover a kinetically favorable in-plane nucleated layer-by-layer transformation mechanism through metadynamics and large-scale molecular dynamics simulations. This is enabled by developing an efficient machine learning potential with near first-principles accuracy through an on-the-fly active learning method and an advanced sampling technique. Our results reveal that the β−λ phase transformation initiates with the formation of two-dimensional nuclei in the a\n b-plane and then proceeds layer-by-layer through a multistep barrier-lowering kinetic process via intermediate metastable phases. Our work not only provides important insight into the ultrafast and reversible nature of the β−λ transition, but also presents useful strategies and methods for tackling other complex structural phase transitions.

Liu M F, Wang J T, Liu P T, et al.

Diverse polymorphs and phase transitions in van der Waals In2Se3

[J]. 2025, arXiv: 2506. 21248v1, 2025

[本文引用: 5]

Wang J T, Zhang Y C, Liu Y, et al.

Atomistic mechanisms of phase transitions in all-temperature barocaloric material KPF6

[J]. J. Mater. Sci. Technol., 2026, 263: 211

DOI      URL     [本文引用: 5]

Jung J H, Forslund A, Srinivasan P, et al.

Dynamically stabilized phases with full ab initio accuracy: Thermodynamics of Ti, Zr, Hf with a focus on the hcp-bcc transition

[J]. Phys. Rev., 2023, 108B: 184107

[本文引用: 2]

Zhao X T, Zhang Z, Hattori T, et al.

All-temperature barocaloric effects at pressure-induced phase transitions

[J]. Nat. Commun., 2025, 16: 7713

DOI      [本文引用: 1]

Chen B W J, Zhang X L, Zhang J.

Accelerating explicit solvent models of heterogeneous catalysts with machine learning interatomic potentials

[J]. Chem. Sci., 2023, 14: 8338

DOI      PMID      [本文引用: 1]

Realistically modelling how solvents affect catalytic reactions is a longstanding challenge due to its prohibitive computational cost. Typically, an explicit atomistic treatment of the solvent molecules is needed together with molecular dynamics (MD) simulations and enhanced sampling methods. Here, we demonstrate the utility of machine learning interatomic potentials (MLIPs), coupled with active learning, to enable fast and accurate explicit solvent modelling of adsorption and reactions on heterogeneous catalysts. MLIPs trained on-the-fly were able to accelerate MD simulations by up to 4 orders of magnitude while reproducing with high fidelity the geometrical features of water in the bulk and at metal-water interfaces. Using these ML-accelerated simulations, we accurately predicted key catalytic quantities such as the adsorption energies of CO*, OH*, COH*, HCO*, and OCCHO* on Cu surfaces and the free energy barriers of C-H scission of ethylene glycol over Cu and Pd surfaces, as validated with calculations. We envision that such simulations will pave the way towards detailed and realistic studies of solvated catalysts at large time- and length-scales.This journal is © The Royal Society of Chemistry.

Klimanova O, Rybin N, Shapeev A.

Accelerating the global search of adsorbate molecule positions using machine-learning interatomic potentials with active learning

[J]. Phys. Chem. Chem. Phys., 2025, 27: 9201

DOI      URL     [本文引用: 1]

We accelerate the global search of adsorbate molecule positions using machine-learning interatomic potentials with active learning.

Feibelman P J, Hammer B, Nørskov J K, et al.

The CO/Pt(111) puzzle

[J]. J. Phys. Chem., 2001, 105B: 4018

[本文引用: 1]

Grinberg I, Yourdshahyan Y, Rappe A M.

CO on Pt(111) puzzle: A possible solution

[J]. J. Chem. Phys., 2002, 117: 2264

DOI      URL     [本文引用: 1]

CO adsorption on the Pt(111) surface is studied using first-principles methods. As found in a recent study [Feibelman et al., J. Phys. Chem. B 105, 4018 (2001)], we find the preferred adsorption site within density functional theory to be the hollow site, whereas experimentally it is found that the top site is preferred. The influence of pseudopotential and exchange-correlation functional error on the CO binding energy and site preference is carefully investigated. We also compare the site preference energy of CO on Pt(111) with the reaction energy of formaldehyde formation from H2 and CO. We show that the discrepancies between the experimental and theoretical results are due to the generalized gradient approximation (GGA) treating different bond orders with varying accuracy. We can therefore expect that GGA results will contain significant error whenever bonds of different bond order are broken and formed.

Ren X G, Rinke P, Joas C, et al.

Random-phase approximation and its applications in computational chemistry and materials science

[J]. J. Mater. Sci., 2012, 47: 7447

DOI      URL     [本文引用: 1]

Liu P T, Wang J T, Avargues N, et al.

Combining machine learning and many-body calculations: Coverage-dependent adsorption of CO on Rh(111)

[J]. Phys. Rev. Lett., 2023, 130: 078001

[本文引用: 5]

Cao Y, Wang J T, Liu M F, et al.

Quantum delocalization enables water dissociation on Ru(0001)

[J]. Phys. Rev. Lett., 2025, 134: 178001

DOI      URL     [本文引用: 5]

Ou Y L, Ikeda Y, Scholz L, et al.

Atomistic modeling of bulk and grain boundary diffusion in solid electrolyte Li6PS5Cl using machine-learning interatomic potentials

[J]. Phys. Rev. Mater., 2024, 8: 115407

[本文引用: 1]

Kim J H, Jun B, Jang Y J, et al.

Highly reliable and large-scale simulations of promising argyrodite solid-state electrolytes using a machine-learned moment tensor potential

[J]. Nano Energy, 2024, 124: 109436

DOI      URL     [本文引用: 1]

Holekevi Chandrappa M L, Qi J, Chen C, et al.

Thermodynamics and kinetics of the cathode-electrolyte interface in all-solid-state Li-S batteries

[J]. J. Am. Chem. Soc., 2022, 144: 18009

DOI      URL     [本文引用: 4]

/