MUMBAI, India, May 9 -- Intellectual Property India has published a patent application (202414034246 A) filed by Google Llc, Mountain View, U.S.A., on April 30, 2024, for 'instance level scene recognition with a vision language model.'

Inventor(s) include Kharbanda, Harshit; Bluntschli, Boris; Mahajan, Vibhuti; and Wang, Louis.

The application for the patent was published on May 9, under issue no. 19/2025.

According to the abstract released by the Intellectual Property India: "Systems and methods for image understanding can include one or more object recognition systems and one or more vision language models to generate an augmented language output that can be both scene-aware and object-aware. The systems and methods can process an inpu...