Opera Brings Google Gemini and Image Understanding and Generation to Aria AI

Paul Thurrott
May 28, 2024

Opera announced it is partnering with Google Cloud to bring the Gemini AI models and image understanding and generative capabilities to the Aria AI in its flagship web browser.

“Aria is unique because it doesn’t just utilize one provider or LLM,” Opera’s Patrick Curtin writes. “Powered by our very own Composer AI engine, Aria can plug into over 150 local LLM variants from around 50 families of models. That way, you can decide for yourself what you like and what best suits your needs.”

With the addition of Google Gemini (“a modern, powerful, and user-friendly LLM,” Opera says), Aria can provide the most current information with exceptionally high performance. This support is now available in the latest Opera AI Feature Drop, along with another set of Google Cloud-based capabilities: image understanding and improved image generation.

These features work together. You can show Aria an image and have the AI generate a different version. This works similarly to the Image Creator-based Cocreator demo during last week’s Microsoft Copilot+ PC launch event, where the presenter drew a simplistic image in Paint and the feature created a more impressive image.

“Aria uses Image Understanding to interpret what you want it to create and then uses Image Generation to bring it to life,” Opera’s Santiago Benavides García explains. “The nice part about this feature is that it allows you to create images without necessarily having a long text prompt describing what you want. Instead, you can use a rough sketch and a short text prompt to get the image you want.”

To check out these new features, you need to install Opera Developer, which is where Opera tests new AI and other experimental features via its AI Feature Drops.

Tagged with

Aria
Opera

About author

Paul Thurrott

Paul Thurrott is an award-winning technology journalist and blogger with 30 years of industry experience and the author of 30 books. He is the owner of Thurrott.com and the host of three tech podcasts: Windows Weekly with Leo Laporte and Richard Campbell, Hands-On Windows, and First Ring Daily with Brad Sams. He was formerly the senior technology analyst at Windows IT Pro and the creator of the SuperSite for Windows from 1999 to 2014 and the Major Domo of Thurrott.com while at BWW Media Group from 2015 to 2023. You can reach Paul via email, Twitter or Mastodon.
