GPT Image is an advanced multimodal model that transforms text and image inputs into high-quality, customizable visuals for creative and professional use.

A powerful multimodal image generation model that accepts textual instructions (and optionally image inputs) to generate, edit, and customise high-quality visuals — ideal for creators, developers and teams looking to bring ideas to life.
Enter a descriptive prompt and GPT Image will generate a brand-new image from scratch, with high fidelity to your instructions.
Provide an existing image and ask the model to edit, restyle or create variations of it — enabling iterative creative workflows.
Specify background transparency, aspect ratio, style, colour scheme or composition to tailor output precisely to your needs.
A step-by-step workflow to get started and integrate the model into your toolchain.
Use the prompt guide above to describe the desired image in detail.
Select image size (e.g., 1024×1024, 1536×1024, 1024×1536), background setting (opaque/transparent/auto) and quality level.
If you are editing or creating variations, upload your base image and optionally supply a mask indicating the area to modify.
Submit your prompt and settings, review the output, and select variants or refine further.
Download the final image and iterate on your prompt if needed to achieve your desired result.
When editing existing images, some constraints may remain. For best results, clearly describe the desired changes in your prompt.
Explore how creators and teams are leveraging this model across creative and business workflows.
Generate hero images, product mock-ups, social media visuals and campaign assets with consistent style and customisation.
Develop concept art, storyboards, character designs or editorial illustrations quickly from descriptive prompts.
Produce visuals for blog posts, educational materials, infographics, slide decks or interactive experiences.
Restyle, refine or create multiple variations of an existing image to test ideas, A/B concepts or brand variants.
Answers to common questions about GPT Image.
The model accepts textual prompts and optionally image inputs (for editing or variation workflows).
Yes — you can choose from standard sizes (such as 1024×1024, 1536×1024 or 1024×1536) and specify formats like PNG, JPEG or WebP.
Yes — the model accepts a background setting ("opaque", "transparent" or "auto") though for best results it's recommended to also mention "transparent background" in your prompt.
Although very capable, the model may struggle with perfect pixel-level structural fidelity (e.g., precise object alignment or small text rendering) and may interpret ambiguous prompts unpredictably.
Generate stunning visuals, edit existing images and iterate on creative ideas in minutes.
No prior design skills required — just describe your vision and let the model bring it to life.
Explora más modelos de IA del mismo proveedor
Sora 2 transforma tu imaginación en realidad creando videos impresionantes y fotorrealistas con audio sincronizado a partir de simples descripciones de texto. Experimenta el futuro de la creación de video con el modelo de IA más avanzado de OpenAI, que presenta una simulación de física innovadora, capacidades multi-toma e incluso la capacidad de protagonizar tus propios videos generados por IA con Cameo.
Personaliza, controla e implementa modelos GPT con una flexibilidad sin igual.