Gemini on Android becomes more capable and works with Gmail, Messages, YouTube and more


Google’s Gemini on Android, the company’s AI replacement for Google Assistant, will soon take advantage of its ability to integrate deeply with the Android operating system and Google’s apps. At the Google I/O 2024 developer conference on Tuesday, the company announced that users will be able to pull up the Gemini overlay on top of the app they’re using in more ways. It’s also updating Android’s built-in AI model, Gemini Nano.

Soon, Android users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps. Meanwhile, YouTube users will be able to tap “Ask this video” to find specific information from within that YouTube video, Google says. 

Image Credits: TechCrunch

Those who pay for the upgraded Gemini Advanced will also be able to use an “Ask this PDF” option that lets them get answers from a document without having to read through all the pages. Gemini Advanced subscribers pay $19.99 per month for access to the AI and receive 2TB of storage along with other Google One benefits.

Gemini on Android can already generate captions for photos, answer questions about articles you’re reading, and perform other generative AI tasks, much like other AI chatbots. However, OpenAI upstaged Google’s event by announcing a GenAI model, GPT-4o (with the o standing for “omni”), that works with text, speech, and video, including what the phone’s camera is seeing. So despite Gemini’s built-in advantages, it will have some competition on mobile devices.

Google says the latest Gemini on Android features will roll out to hundreds of millions of supported devices over the next few months. Over time, Gemini will evolve to offer other suggestions related to what’s on your screen as well. 

Meanwhile, the on-device foundation model on Android, Gemini Nano, will be upgraded to include multimodality. That means it will be able to process not only text input but also other kinds of information, including sights, sounds, and spoken language.


Miranda Cosgrove

Miranda Cosgrove is an accomplished article writer with a flair for crafting engaging and informative content. With a deep curiosity for various subjects and a dedication to thorough research, Miranda Cosgrove brings a unique blend of creativity and accuracy to every piece.
