Google’s teaching AI to ‘see’ and ‘hear’ at the same time — here’s why that matters

November 30, 2021

By Tristan Greene

A team of scientists from Google Research, the Alan Turing Institute, and Cambridge University recently unveiled a new state of the art (SOTA) multimodal transformer for AI. In other words, they’re teaching AI how to ‘hear’ and ‘see’ at the same time. Up front: You’ve probably heard about transformer AI systems such as GPT-3. At their core, they process and categorize data from a specific kind of media stream. Under the current SOTA paradigm, if you wanted to parse the data from a video you’d need several AI models running concurrently. You’d need a model that’s been trained on videos…

This story continues at The Next Web

Or just read more coverage about: Google

Source:: The Next Web


No comments

You must be logged in to post a comment.
To ensure attendees get the full benefit of an intimate technology expo,
we are only offering a limited number of passes.
Get My Pass Now!