Home Internet, Software & Apps News Launch

Google I/O 2024 announcements: Ask Photos, Gmail Summarize, Gemini Live, Veo and more explained

By

-

Published On: May 15, 2024

Google said AI (artificial intelligence) 121 times at its I/O 2024 keynote yesterday. So as expected, we got a lot of Gemini updates and feature announcements. The company has better integrated Gemini into Chrome, Search, Photos, Gmail, and other Workspace apps. It also unveiled Veo, Imagen 3, Project Astra and several new AI-driven products. Let’s have a look!

Ask Photos

Ask Photos Search is a new Google Photos feature that uses Gemini AI to help you find specific photos and videos in your library. You just have to ask it questions about the said photo or video.
You can ask naturally like “find photos from my anniversary party last month” or “show me videos of the hills I saw on my trip last November.”
It will be available in the coming months.

Gmail Summarize, Q&A, Contextual Smart Replies

Gmail is getting upgraded with Gemini 1.5 Pro-powered features like Summarize, Q&A, and Smart Replies. You can access them via a sidebar on the Gmail desktop or by tapping the Gemini button on Gmail mobile.
Tap the “summarize this email” to get a brief of the email or email thread you’ve received. Google is rolling this out to Workspace Labs users this month, and to all Gemini for Workspace customers and Google One AI Premium subscribers starting next month.
Tap the “Enter a prompt here” from the Gemini sidebar or Gemini button on Gmail and ask it for anything related to emails in your inbox. It will give you a relevant answer. Gmail Q&A will be coming to Workspace Labs users in July.
Based on the context from the email thread, Gemini will give you context-aware responses. You just have to use them as it is. It will be released to Workspace Labs users in July.

Veo video generator

Veo is an alternative to OpenAI’s Sora video generator.
It can create over a minute-long 1080p video based on your text, image or video prompts.
Videos can be in different styles like timelapses or aerial shots. You can tweak them with further prompts too.
It’s public availability hasn’t been disclosed yet. Following OpenAI’s steps, Google will be pitching Veo to Hollywood creators.

Introducing Veo: our most capable generative video model. ?

It can create high-quality, 1080p clips that can go beyond 60 seconds.

From photorealism to surrealism and animation, it can tackle a range of cinematic styles. ? #GoogleIO pic.twitter.com/6zEuYRAHpH
— Google DeepMind (@GoogleDeepMind) May 14, 2024

Gemini Live

Gemini Live lets you have a natural chat with Gemini. Just tap the mic button and have a 2-way conversation with it.
It will be available for Gemini Advanced subscribers in the coming months.
Later this year, you will be able to use the camera feed to show Gemini what you are seeing and have contextual conversations related to what you are seeing.

And you’ll also be able to ask questions with video, right in Search. Coming soon. #GoogleIO pic.twitter.com/zFVu8yOWI1
— Google (@Google) May 14, 2024

Gems custom chatbot

Gems is like OpenAI’s GPTs for ChatGPT. In other words, Gems are custom chatbots that you can create using Gemini.
You can inject personality and get it to do a particular task repeatedly. For example, it can act like a gym buddy, shrink, or motivational coach.
It will soon be available to Gemini Advanced subscribers.

Whether you need a yoga bestie or calculus tutor, in the coming months you’ll be able to customize Gemini, saving time when you have specific ways you interact with Gemini again and again. We’re calling these Gems. #GoogleIO pic.twitter.com/YQOHsUbMWE
— Google (@Google) May 14, 2024

Project Astra

This isn’t technically ready for the public, but it is where Google is leading many of its AI efforts.
Project Astra is like a real-time, multimodal universal AI assistant that can see the world and is contextually aware of your reality. So, it should be able to answer all your queries and help you do things.
Here’s a demo, which Google claims it hasn’t doctored in any way:

Other Google AI announcements at I/O 2024

Search Generative Experience is now named AI Overview on Google Search. It can understand your Google Search query and provide a specially designed results page with summarised answers from the web. AI Overview will be released to US users this week.
Circle to Search on Android can now break down math problems into steps so that you can understand and get to the answer.
Gemini on Android will gain more multimodal capabilities soon. It will be available to Gemini Advanced users in the coming months.
Google Chrome will get Gemini Nano-based AI assistant starting Chrome 126.
Later this year, Google will start rolling out Gemini Nano-based AI scam detection on Android.
Gemini Advanced users can get Gemini to scan a PDF and give responses to their queries based on the information in that PDF.
Gemini 1.5 Pro sidebar is now part of apps like Gmail, Drive, Docs, Sheets, and Slides for users of Workspace Labs and Gemini for Workspace Alpha. Starting next month, it will roll out on desktops for businesses and consumers through Gemini for Workspace add-ons and the Google One AI Premium plan.
Gemini 1.5 Pro’s context window has been increased from 1 million to 2 million tokens. This refers to how much context/knowledge related to something it can carry in its mind if you will.
Gemini 1.5 Flash is the new multimodal AI model that is as capable as Gemini 1.5 Pro but designed to work for “narrow, high-frequency, low-latency tasks.” Differently put, it can respond faster.

Also Read

Google Pixel 8a review: AI goodness in a compact form factor

Google Pixel 8a review: AI goodness in a compact form factor

How to use Gemini (ex-Google Bard): step-by-step guide

How to use Gemini (ex-Google Bard): step-by-step guide

Google Pixel 9 series leak shows live photos, reveals 4 models including new XL variants

Google Pixel 9 series leak shows live photos, reveals 4 models including new XL variants

Google claims Imagen 3 can now create photo-realistic images with less distortion and clearer text. You can sign up to use it today.
Google has also announced Gemini can act as an AI coworker. You can name it whatever you want. In the demo, Google called it Chip. This Gemini AI coworker can be assigned a specific task. It can lurk in your workspace and nudge you if it finds some red flags. You can ask it queries related to the project you both are part of.
SynthID is Google’s AI watermark tool that will be integrated into Veo and other AI-generated content using Google tools. SynthID can also identify other AI-generated content.
The Audio Overview feature lets you feed your study documents into Google’s NotebookLM tool, which then generates an audio conversation between AI characters about the content.
Gemma 2 has been announced with 27 billion parameters support.

Related Articles

No posts to display

MORE STORIES