ALEXANDRIA, Va., June 18 -- United States Patent No. 12,327,395, issued on June 10, was assigned to Google LLC (Mountain View, Calif.).
"Aggregating nested vision transformers" was invented by Zizhao Zhang (San Jose, Calif.), Han Zhang (Sunnyvale, Calif.), Long Zhao (Mountain View, Calif.) and Tomas Pfister (Foster City, Calif.).
According to the abstract* released by the U.S. Patent & Trademark Office: "A method includes receiving image data including a series of image patches of an image. The method includes generating, using a first set of transformers of a vision transformer (V-T) model, a first set of higher order feature representations based on the series of image patches and aggregating the first set of higher order feature repres...
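As a rough illustration of the first steps the abstract describes, the Python sketch below splits an image into a series of patches, runs a stand-in function over groups of those patches and then aggregates the results. Everything in it is an assumption made for illustration: the function names, the grouping of patches into fixed-size blocks, the simple averaging used in place of a real transformer and the mean-based aggregation are hypothetical and are not taken from the patent, whose abstract is truncated above.

```python
def split_into_patches(image, patch):
    """Split a 2-D image (list of rows) into a row-major series of flattened patches."""
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be multiples of the patch size")
    patches = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            patches.append([image[r][c]
                            for r in range(top, top + patch)
                            for c in range(left, left + patch)])
    return patches


def stand_in_transformer(block):
    """Hypothetical placeholder for a transformer: blend each patch with the block mean."""
    dim = len(block[0])
    mean = [sum(p[i] for p in block) / len(block) for i in range(dim)]
    return [[0.5 * p[i] + 0.5 * mean[i] for i in range(dim)] for p in block]


def aggregate(block_features):
    """Hypothetical aggregation: reduce each block's features to one mean vector."""
    out = []
    for feats in block_features:
        dim = len(feats[0])
        out.append([sum(f[i] for f in feats) / len(feats) for i in range(dim)])
    return out


def first_stage(image, patch, block_size):
    """Patches -> per-block stand-in features -> one aggregated vector per block."""
    patches = split_into_patches(image, patch)
    # Assumption: consecutive patches in the series form one block.
    blocks = [patches[i:i + block_size] for i in range(0, len(patches), block_size)]
    features = [stand_in_transformer(b) for b in blocks]
    return aggregate(features)


# A 4x4 image with pixel values 0..15, 2x2 patches, two patches per block.
image = [[4 * r + c for c in range(4)] for r in range(4)]
print(first_stage(image, patch=2, block_size=2))
```

In a real model the stand-in function would be replaced by trained transformer layers, and the aggregated output would presumably feed a further set of transformers; the sketch only shows the data flow of patching, per-group processing and aggregation.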