- With Gemini Pro, developers are now able to achieve their desired results much faster through more straightforward methods.
- Gemini Pro will let developers build new and differentiated agents that can process information across text, code, images, and video.
- Google has also introduced Imagen 2, its most advanced text-to-image technology.
A week after Google unveiled Gemini, it did not get the reception it was hoping for. While Gemini was indeed revolutionary and seemingly capable of giving ChatGPT a run for its money, a demo video it showcased during a media presentation and later uploaded on YouTube seemed to be gathering more attention.
The excitement was not just because of the actual capabilities of Gemini but because many felt that Google had faked Gemini’s capabilities, as the effects in the video were not generated in real time. While Google was quick to acknowledge this, it remained confident that Gemini was still capable of what it showcased in the video.
Belief of course means absolutely nothing to the business community without backup.
So, following the incident (and no apparent consequences for what could arguably be described as fraudulent representation), Google organized another media session, which aimed to showcase more capabilities of Gemini Pro, the version that is intended for enterprises. This time, there were no video demos but a live demo instead with Thomas Kurian, the CEO of Google Cloud himself, sharing the latest innovations.
While the demos themselves have been tested many times before by the developers, the capabilities of Google Gemini Pro seemed to be taking developers to the next level of coding. Put simply, with Gemini Pro, developers are now able to achieve their desired results at a much faster pace, with more straightforward methods.
The Gemini Pro API is now available to customers in AI Studio & Vertex AI.
Among the highlights of the media session was the introduction of Google AI Studio. Google AI Studio is a free web-based developer tool that lets developers quickly develop prompts and then get an API key to use in their app development.
According to Kurian, all developers need to do is sign in to the Google AI Studio with their Google account and take advantage of the free quota. Once ready, developers just need to click “Get Code” to transfer their work to their IDE of choice. They can even use the quickstart templates from Android Studio, Colab, or Project IDX.
“With Google AI Studio, a developer can get an API toolkit and get going with Gemini Pro. Everything happens in the browser. They just need to log in with a Google account. Developers can even take the API key from AI Studio to Vertex AI,” said Kurian.
Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app development. (Source – Google).
Gemini Pro on Vertex AI and Duet AI
Vertex AI offers everything developers need to build and use generative AI, including building AI solutions, from Search and Conversation to 130 over foundation models, to a unified AI platform. As an open and integrated AI platform, data scientists can move faster with the Vertex AI platform’s tools for training, tuning, and deploying ML models.
Now, Gemini Pro will also be available in preview on Vertex AI. Gemini Pro will empower developers to build new and differentiated agents that can process information across text, code, images, and video. It will help developers deploy and manage agents to production, automatically evaluate the quality and trustworthiness of agent responses, as well as monitor and manage them.
Apart from Gemini Pro on Vertex AI, it will also be incorporated across Duet AI. This includes Duet AI for Developers and Duet AI in Security Operations.
“Duet AI for Developers helps users code faster with AI code completion, code generation, and chat in multiple integrated development environments (IDEs). It streamlines repetitive developer tasks and processes with shortcuts for common tasks, including unit test generation and code explanation, speeds troubleshooting and issue remediation, and helps reduce context-switching. Duet AI also expedites skills-based learning by giving users the ability to ask questions using natural language chat,” said Kurian.
Meanwhile, Duet AI in Security Operations makes Google among the first major cloud providers to make generative AI generally available in a unified SecOps platform. Security teams will be able to elevate their skills and boost productivity by accelerating threat detection, investigation, and response using the power of generative AI.
With Duet AI in Security Operations, Google is offering AI assistance first in Chronicle, where users can:
- Search vast amounts of data in seconds with custom queries generated from natural language.
- Reduce time-consuming manual reviews and quickly surface critical context by leveraging automatic summaries of case data and alerts.
- Improve response time using recommendations for next steps to support incident remediation.
“Security operations can now be supercharged with Duet AI across multiple dimensions – from streamlining threat detection and response to allowing a broader set of technology professionals, such as developers, to more easily comprehend and respond to threats, giving the security experts time to focus on higher order issue resolution. With Duet AI in Security Operations, defenders can search event data and query across many log types without needing to know the specialized syntax, saving valuable time,” said Brad Calder, vice president and GM for Google Cloud Platform.
Kurian also mentioned that Gemini Pro will be part of Google Workspace in the early part of 2024. In Google Workspace and Duet AI for Workspace, Gemini will work to enable collaborations with humans in real-time. For example, a user can be writing and email and request Google AI to help them with it (Think predictive text, with contextual understanding). Another example would be if a user is late for a meeting, Google AI will transcribe and summarize the meeting so far, to get them up to speed. This will make Duet AI function as a meeting assistant for users in the future.
Some of the capabilities of Imagen 2. (Source – Google).
Imagen 2
Given the developments and enhancements in text-to-image technology by other tech companies, Google is also ensuring it remains part of the competition. While users are still in awe of the new capabilities of DALL.E-3, Google has introduced Imagen 2, its most advanced text-to-image technology.
Generally available to Vertex AI customers on an allowlist, Imagen 2 on Vertex AI lets users customize and deploy Imagen 2 with intuitive tooling, fully managed infrastructure, and built-in privacy and safety features. Developed using Google DeepMind technology, Imagen 2 delivers significantly improved image quality and a host of features that enable developers to create images for their specific use case, including:
- Generating high-quality, photorealistic, high-resolution, aesthetically pleasing images from natural language prompts
- Text rendering in multiple languages to create images with accurate text overlays
- Logo generation to create company or product logos and overlay them in images
- Visual questions and answering for generating captions from images, and for getting informative text responses to questions about image details
“Importantly, Vertex AI’s indemnification commitment now covers Imagen on Vertex AI, which includes Imagen 2 and future generally available upgrades of the model powering the service. We employ an industry-first, two-pronged copyright indemnification approach that can give customers peace of mind when using our generative AI products,” commented Vishy Tirumalashetty, head of product, generative media at cloud AI at Google.
Aaron Raj
Aaron enjoys writing about enterprise technology in the region. He has attended and covered many local and international tech expos, events and forums, speaking to some of the biggest tech personalities in the industry. With over a decade of experience in the media, Aaron previously worked on politics, business, sports and entertainment news.