
This week, Opera brought several AI features that it had previously tested in Opera Developer to Opera GX, its gaming web browser.
“New features are here to significantly buff browser capabilities, and give you more ways to interact with it,” Opera’s Santiago Benavides García writes in the announcement post. “We’re talking about Image Generation capabilities, Voice Output, and even Image Understanding. We’re also bringing improvements to the chat itself with an option to summarize entire conversations. And if you’re the kind of person who likes to do deep dives, Aria now provides links to the sources of information regarding the conversation’s topic.”
New Opera GX features include:
Image generation improvements. Like other generative AI tools, Aria now supports image generation from text prompts in Opera GX. It uses Google's Imagen 2 model, and it supports conversation-style refinements, so you can keep creating different images without starting over. You can also click a “Regenerate” button to create variations from the same prompt.
Image understanding. Aria can now understand images and answer questions about them. Just click “Upload image” to learn more about any image. This feature can also be used to understand math problems and even answer some basic programming questions.
Aria voice output. Aria can now read answers out loud using text-to-speech based on Google’s WaveNet model. Just click the little speaker icon to switch from text-only to voice output.
Chat summary and source linking. Aria can now summarize an entire chat, and it provides links to sources while you’re chatting.
These features are available in the stable version of Opera GX for Windows (and, I assume, Mac and Linux).