Google simply launched Gemini 2.5 Flash Picture, which is nicknamed “Nano Banana,” and a few are calling it probably the most superior AI picture editor accessible at the moment.
Constructed instantly into the Gemini app and accessible by way of the Gemini API and Google AI Studio, Nano Banana isn’t simply one other picture device. It provides a seamless mixture of picture era, enhancing, and understanding, all accessible by pure language prompts.
I unpacked what the device can do with SmarterX and Advertising and marketing AI Institute founder and CEO Paul Roetzer on Episode 165 of The Synthetic Intelligence Present.
An Picture Editor That Understands the Actual World
Nano Banana allows customers to:
Preserve character consistency throughout a number of photos
Carry out complicated native edits (like eradicating a stain or altering a pose)
Fuse a number of photos right into a photorealistic scene
Recolor or restyle images with a sentence
Perceive diagrams and floor edits in world information
Whereas different instruments is perhaps nice at aesthetics, Nano Banana goes additional. It understands context. Which means your canine received’t all of a sudden change breeds mid-edit, and your face stays your face, even if you happen to swap the background from a kitchen to the floor of Mars.
That’s drawing numerous consideration on-line, says Roetzer, as customers are discovering the device to be glorious at performing detailed edits utilizing simply pure language prompts.
Immediate-Based mostly Modifying Meets Multimodal Intelligence
What makes Nano Banana so disruptive is how pure it feels to make use of. You’ll be able to say issues like, “Put me on a mountaintop at sundown” or “Take away the particular person within the background” and even “Flip this drawing right into a labeled diagram,” and it simply works.
That is all made potential by the mannequin’s native world information and multimodal coaching. They open the door to a spread of latest use circumstances, from model asset creation to interactive academic instruments.
And all photos include Google’s SynthID invisible watermark, so AI edits stay traceable.
So, Is This the Finish of Imagen?
One of many first questions we had concerning the functionality: Is that this simply Google’s Imagen 4 picture era mannequin underneath a brand new title?
The reply: Not fairly, no less than in keeping with the solutions we bought by asking Google Gemini.
Based on Gemini, Imagen 4 remains to be round, but it surely performs a distinct position. It is a specialised diffusion mannequin designed purely for photorealistic picture era from textual content prompts. Nano Banana, alternatively, is a local multimodal mannequin that understands each photos and textual content. When it must generate a picture from scratch, it may name upon Imagen 4 as an underlying engine.
Consider Nano Banana because the director. Imagen 4 is the cinematographer referred to as in when wanted.
Need to Attempt It? Simply Search for the Banana
Google even embraced their playful aspect with this launch. Within the Gemini app, picture enhancing is now symbolized by a banana emoji, a nod to the Nano Banana nickname.
It is a small contact, but it surely indicators that Google is now not afraid to have enjoyable with its AI releases.
Need to discover what it may do? Ask Gemini: “Give me a immediate to check the complete capabilities of two.5 Flash Picture.” You’ll get wealthy, detailed prompts to kickstart your experimentation. Or add a picture and ask for options.