ALEXANDRIA, Va., Nov. 6 -- United States Patent no. 12,462,592, issued on Nov. 4, was assigned to Salesforce Inc. (San Francisco).

"Systems and methods for a vision-language pretraining framework" was invented by Junnan Li (Singapore) and Chu Hong Hoi (Singapore).

According to the abstract* released by the U.S. Patent & Trademark Office: "Embodiments described herein provide a multimodal vision-language model. The multimodal vision-language model contains a Generalist Multimodal Transformer capable of complete multiple tasks using the same set of parameters learning from pre-training. The Generalist Multimodal Transformer allows alignment between frozen, unimodal encoders, such as image encoders and large language models. The Generalist M...