OpenAI has launched in-painting features in GPT-4o, enabling users to create visuals within the AI model itself. This tool, launched on Tuesday, can be employed to develop diverse images, including infographics, memes, menus, comic strips, signboards, and others.

Users can fine-tune and edit produced images through follow-up prompts, giving more control.The feature is already rolled out to Free, Plus, Pro, and Team plan subscribers, while Enterprise and Edu plans will follow soon. Also, API integration will be rolled out during the following weeks.

No Longer DALL-E Limited

In contrast to earlier models, GPT-4o can create images on its own from its internal knowledge base and not through diffusion models like OpenAI’s DALL-E.Nevertheless, users can still access DALL-E for generating images when needed. OpenAI elaborated, “Creating and tailoring images is as easy as conversing with GPT‑4o – just tell us what you want, including any details like aspect ratio, precise colors in terms of hex codes, or a transparent background.”

Social Media Responses

GPT-4o’s new capabilities were tested by users straight away, with responses posted online. Shopify CEO Tobias Lütke was stunned when the model accurately described the anatomy of a foreign creature on his son’s t-shirt, answering with, “How is this even real?”

Indian users also experimented with the potential of the model by creating images, converting existing pictures into various styles, and even creating user interfaces from text inputs.

Take a look:

OpenAI vs. Google in AI Image Generation

OpenAI’s announcement follows Google’s introduction of native image generation in its Gemini 2.0 Flash AI model. Initially tested by select users in December, the feature is now available in all supported regions via Google AI Studio.

Google stated, “Developers can now test this new capability using an experimental version of Gemini 2.0 Flash (gemini-2.0-flash-exp) in Google AI Studio and via the Gemini API.”