Gemini's artificial intelligence is equipped to recognize and hear sound

Google's new update for artificial intelligence Gemini 1.5 Pro includes new features such as hearing sound. Thanks to the new Google update, Gemini 1.5 Pro AI can listen to uploaded audio files and extract information from things like incoming calls or video audio without the need for text commands.


At the Cloud Next event, Google announced that it is making the large Gemini 1.5 Pro language model available to the public, including developers, for the first time through the Vertex AI tool.


Currently, the new version of Gemini Pro has made significant progress in terms of performance, relying on one of the largest and most powerful Jumna models called Ultra 1.0. Google claims that Gemini1.5 Pro is capable of understanding more complex instructions.


But unlike this, Gemini 1.5 Pro cannot be used by people who do not have access to Vertex AI facilities. Also, while Jumna Ultra significantly improves the power of Jumna Chatbot, this big and powerful language model can understand long text commands, but it is not as fast as Gemini 1.5 Pro.


But in addition to Gemini 1.5 Pro, the only major Google language model to be updated is Imagen 2. Imagen 2 is a tool that helps empower Gemini's image generation capabilities, and in its new update, it allows users to add or remove subjects from the image. Also, Google has made available its SynthID digital watermark feature on all images created through Imagen models.


Google is currently testing the AI answer feature on the main search results page publicly, so that users can see the AI answers without having to be a member of Search Labs.”