US Patent Issued to Google on Aug. 19 for "Cascaded audiovisual automatic speech recognition models" (New York Inventor)

Posted On: 2025-08-20

ALEXANDRIA, Va., Aug. 20 -- United States Patent No. 12,394,417, issued on Aug. 19, was assigned to Google LLC (Mountain View, Calif.).

"Cascaded audiovisual automatic speech recognition models" was invented by Oscar Chang (New York).

According to the abstract released by the U.S. Patent & Trademark Office: "A method includes receiving a sequence of acoustic frames and generating, by an audio encoder, at each of a plurality of output steps, an acoustic higher-order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. For each acoustic frame in the sequence of acoustic frames paired with a corresponding video frame, the method includes generating, by an audiovisual encoder, an audiovisual higher-or...
