James Phang

Google I/O 2024: Start of the Gemini Era

Google I/O was packed with AI announcements, with a heavy focus on Google’s Gemini AI model and its integration across the Google ecosystem. Here are some of the announcements from this year’s Google I/O.

Ask Photos

Google Photos will launch an experimental feature called Ask Photos, powered by Google’s Gemini AI model. The feature will let users search their Google Photos collection with natural language queries, using AI to understand the content of their photos. Ask Photos should make finding the right content more intuitive and less of a manual search process.

Google Lens: Search By Recording a Video

Google Lens currently allows users to search for something based on an image. Google has taken this a step further with the ability to search with a video. Users can record a video of something and ask a question while recording, and Google’s AI will pull up relevant answers from the web.

Detecting Scams During Calls

Android will help users avoid scam calls by watching for red flags, such as conversation patterns common to scammers, and showing a real-time warning when a call looks like a potential scam.

Veo

To compete against OpenAI’s Sora, Google has unveiled Veo, a new generative AI model that can output 1080p video from text, image, and video prompts. Videos can be produced in a variety of styles and refined with further prompts.

Gemini

Google’s Bard has been transformed into Gemini. Here are a few of the announcements made for Google’s latest AI iteration.

Gemini 1.5 Pro

Gemini has been upgraded and can now analyse longer documents, codebases, videos and audio recordings than before. Gemini 1.5 Pro can take in up to 2 million tokens, double the previous maximum, which means it supports the largest input of any commercially available model.

Gemini Nano

Google is also building Gemini Nano, the smallest of its AI models, directly into the Chrome desktop client. This will enable developers to use the on-device model to power their own AI features.

Gemini in Gmail

Gmail users will be able to search, summarize, and draft their emails using Gemini. Gemini will also handle more complex tasks, such as helping users process an e-commerce return by searching their inbox, finding the receipt, and filling out an online form.

Gemini on Android

Gemini will soon replace Google Assistant and integrate with the Android operating system and Google apps.

Video: Google I/O ‘24 in Under 10 Minutes by Google