OpenAI made its image generation options more precise and consistent in its latest update to ChatGPT Images, as more enterprises and brands use AI image generation to help with design visualization.
The updates will roll out to all ChatGPT users and to the API as GPT Image 1.5. The company said it is powered by GPT-5.2, which many early users found to be a strong update for enterprise use cases.
“Many people’s first experience with ChatGPT involves turning a text prompt into a picture,” said Fidji Simo, OpenAI CEO of Applications, in a Substack post. “It’s a magical way to see what this technology can do, but the chat interface wasn't originally designed for this. Creating and editing images is a different kind of task and deserves a space built for visuals.”
Enterprise-friendly updates in precise editing and instruction following
One of the biggest updates to ChatGPT Images is more targeted editing, even when the image is generated on the chat platform rather than through the API. Image generation models such as ChatGPT Images, Google’s Nano Banana, and Stable Diffusion tout prompt-based tweaks to AI-made images, where the user can pinpoint specific parts of the image to change. But these features can sometimes be hit-and-miss.
With the update, OpenAI said the model better adheres to what the user wants “while keeping elements like lighting, composition, and people’s appearances consistent across inputs, outputs and subsequent edits.”
Users can instruct the model to do most types of image editing, such as adding or removing an element, combining, blending, and transposing.
OpenAI said that this model “follows instructions more reliably” than previous versions. It’s also able to render text better and generate precise, readable letters, even when these are denser or smaller. OpenAI also updated the model to render small faces better in photos featuring a large group of people.
“These transformations work for both simple and more intricate concepts, and are easy to try using preset styles and ideas in the new ChatGPT Images feature,” with no written prompt required, according to OpenAI.
Battle of the image generators
OpenAI’s image model update comes after Google’s much-lauded Nano Banana Pro image model, which drew praise from the developer community.
The company must compete with other ever-growing, continually improving image-generation models that aim to attract more enterprise users. And it isn’t just Google that OpenAI has to contend with. In August, Alibaba announced that Qwen-Image can render readable text in both Chinese and English. Black Forest Labs released Flux.2, which also offers a powerful, open-source image model.

