Opera Brings Google Gemini and Image Understanding and Generation to Aria AI

Opera announced it is partnering with Google Cloud to bring the Gemini AI models, along with image understanding and image generation capabilities, to the Aria AI in its flagship web browser.

“Aria is unique because it doesn’t just utilize one provider or LLM,” Opera’s Patrick Curtin writes. “Powered by our very own Composer AI engine, Aria can plug into over 150 local LLM variants from around 50 families of models. That way, you can decide for yourself what you like and what best suits your needs.”

With the addition of Google Gemini, which Opera calls “a modern, powerful, and user-friendly LLM,” Aria can provide the most current information with exceptionally high performance. This support is now available in the latest Opera AI Feature Drop along with another set of Google Cloud-based capabilities: image understanding and improved image generation.

These features work together. You can show Aria an image and have the AI generate a different version. This works similarly to the Image Creator-based Cocreator demo during last week’s Microsoft Copilot+ PC launch event, where the presenter drew a simplistic image in Paint and the feature created a more impressive image.

“Aria uses Image Understanding to interpret what you want it to create and then uses Image Generation to bring it to life,” Opera’s Santiago Benavides García explains. “The nice part about this feature is that it allows you to create images without necessarily having a long text prompt describing what you want. Instead, you can use a rough sketch and a short text prompt to get the image you want.”

To check out these new features, you need to install Opera Developer, which is where Opera tests new AI and other experimental features via its AI Feature Drops.

Thurrott