ALEXANDRIA, Va., Sept. 23 -- United States Patent No. 12,424,201, issued on Sept. 23, was assigned to GOOGLE LLC (Mountain View, Calif.).
"Pre-training a model using unlabeled videos" was invented by Hongsuck Seo (Meylan, France), Arsha Nagrani (Cambridge, Mass.), Anurag Arnab (Grenoble, France) and Cordelia Luise Schmid (Saint-Ismier, France).
According to the abstract* released by the U.S. Patent & Trademark Office: "Systems and methods for performing captioning for image or video data are described herein. The method can include receiving unlabeled multimedia data, and outputting, from a machine learning model, one or more captions for the multimedia data. Training the machine learning model to create these outputs can include i...