ALEXANDRIA, Va., Nov. 18 -- United States Patent no. 12,475,692, issued on Nov. 18, was assigned to International Business Machines Corp. (Armonk, N.Y.).

"Training multi-modal models on documents using multiple instance learning" was invented by Amit Alfassy (Haifa, Israel), Assaf Arbelle (Lehvot Haviva, Israel) and Leonid Karlinsky (Acton, Mass.).

According to the abstract* released by the U.S. Patent & Trademark Office: "An example system includes a processor to automatically extract text and images from a document. The processor can automatically generate text bags including a number of nearest texts for each of the extracted images. The processor can then train a multi-modal model based on the automatically generated text bags using a...