AI and neural networks

Google improves Gemini Live: AI assistant gets even smarter

Google improves Gemini Live: AI assistant gets even smarter

Communicating with artificial intelligence once seemed like something out of science fiction, but today it’s an everyday reality thanks to tools like Gemini Live. These AI assistants are becoming more and more comfortable and natural at communicating, and Google continues to improve their capabilities.

Gemini Live update: what’s new?

.

Google has sent out an email to users announcing a major update to Gemini Live. The new AI model makes the assistant even smarter, improving its ability to understand different languages, accents and dialects. There are also significant improvements to the translation features.

Google sent out an email to users announcing a major update to Gemini Live.

Another major new feature is support for screen sharing and live video streaming. For these features to work properly, Google will start saving audio, video, and screen streaming data in the Gemini Apps activity log (if enabled). Right now, only text transcripts of conversations are saved.

Google Gemini Live

Gemini 2.0: a new era of AI

With the release of Gemini 2.0 late last year, Google introduced the Multimodal Live API, which allows developers to process text, audio, and video input and produce text or voice responses. In all likelihood, it’s this API that is at the heart of how the updated Gemini Live works.

So it’s likely that this API is the basis for the updated Gemini Live.

Google calls Gemini 2 the beginning of the «agent era» (Agent Era). This AI is on the level of OpenAI o1, but with additional capabilities: it can natively generate images, speech, text and other elements. The first model in this line is Gemini 2.0 Flash, still in the status of «experimental». According to Google, it is twice as fast as its predecessor, Gemini Pro 1.5, and outperforms it on key performance metrics.

When Gemini 1.0 was released, AI assistants were primarily used for content creation and socializing – it was the «chatbot stage». Then, with the arrival of OpenAI o1, the «reasoning era» began as AI became better at analyzing information and understanding logic. Now we are entering the «agent era» where AI is not just answering queries, but performing complex tasks on its own.

And now we are entering the «agent era» where AI is not just answering queries, but performing complex tasks on its own.

Google clearly intends to make Gemini Live a more interactive and useful tool in users’ daily lives.

The story Google improves Gemini Live: AI assistant gets even smarter was first published on ITZine.ru.

The story ITZine.ru.

Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments

You may also like