ALEXANDRIA, Va., Jan. 13 -- United States Patent no. 12,524,319, issued on Jan. 13, was assigned to Hewlett Packard Enterprise Development LP (Spring, Texas).
"Resilient optimizer states for fully sharded data parallel" was invented by Lianjie Cao (Milpitas, Calif.), Saeed Rashidi (Milpitas, Calif.), Garrett Goon (Spring, Texas), Paolo Faraboschi (Barcelona, Spain) and Puneet Sharma (Milpitas, Calif.).
According to the abstract* released by the U.S. Patent & Trademark Office: "Systems and methods are provided for failure resiliency in distributed training of machine learning (ML) models. Examples include a plurality of compute nodes storing optimizer shards of a plurality of optimizer shards and a first compute node storing a first optimi...