Google has quietly updated its AI Studio platform with a new feature built on real-time image and video processing. The update is tied to Gemini’s multimodal processing and may lead to an upgrade from Flash 2.0 to 2.5, a change that could significantly improve the model’s ability to analyze and generate media.
The hidden feature sits in AI Studio’s Stream Realtime section and redirects users to a new interface, which suggests Google is expanding its real-time media analysis and generation capabilities. The update may be part of a broader push toward end-to-end input-output pipelines, in which visual input could produce spoken responses.
In addition, a backend tweak spotted by early users enables simultaneous web search and code execution, so a single session can look up information and run code without the developer switching tools. Google is also exploring the integration of an engineering agent into AI Studio, similar to OpenAI’s Codex agent in ChatGPT. Such an agent could automate more software development tasks.
The extent of these updates has not been confirmed, but Google’s efforts to pair model capabilities with deployment via Cloud Run point toward a single flow from coding to deployment. The significance of these changes should become clearer at the upcoming Google I/O conference, where broader model advancements are expected.
Source: https://www.testingcatalog.com/google-readies-upgrade-to-stream-realtime-feature-in-ai-studio