Gen4 Image solves a specific frustration: standard text-to-image generation gives you something close to what you want, but rarely exact. If you need a particular face, product, outfit, or object to appear in the output, uploading up to three reference photos changes the equation. The model reads those references alongside your text prompt and builds an image that reflects both. The reference tagging system is the core mechanic. You assign a short alphanumeric tag to each uploaded photo, then call those tags by name directly in your prompt using @tag notation. This lets you mix references naturally, like placing a person from one photo into a scene described in another. Outputs arrive at 1080p by default across six aspect ratios, from portrait 9:16 to cinema-wide 21:9. Product teams use it to shoot virtual catalog images without a studio. Brand designers use it to produce consistent visuals across a whole campaign. Concept artists use it to quickly block out scenes with real reference material. Whatever the project, the workflow is the same: upload your references, write a clear prompt, and get a clean 1080p image ready to use.
Gen4 Image is a text-to-image model that accepts up to 3 reference photos alongside your text prompt, giving you precise control over what ends up in the final frame. Most AI image tools let you describe what you want and hope for the best. Gen4 Image lets you show it. On Picasso IA, you upload your references, tag each one, and call them by name directly inside your prompt, so the output matches your actual intent rather than an interpretation of it. Whether you are recreating a product from multiple angles or building a scene around a specific face or object, the guesswork disappears.
Do I need programming skills or technical knowledge to use this? No, just open Gen4 Image on Picasso IA, adjust the settings you want, and hit generate.
Is it free to try? Yes, you can run the model directly in your browser without installing anything or writing a single line of code.
How long does it take to get results? Most generations finish in a few seconds. Resolution and scene complexity can affect the time slightly, but you will not be waiting long.
Can I use reference images of real people or products? Yes. The reference system is built for exactly that: anchor the output to a specific face, product, outfit, or object by uploading a photo and tagging it in your prompt.
What aspect ratios can I generate in? The model supports 16:9, 9:16, 4:3, 3:4, 1:1, and 21:9, covering everything from social media posts to cinematic widescreen frames.
What if only some of my references are showing up correctly? Check that each tag appears explicitly in the prompt text and that the reference image is clearly focused on one subject. Cropping tightly around the object or person before uploading usually improves accuracy.
Where can I use the images I generate? The output is a standard image file you can download immediately and drop into any project, presentation, or publishing workflow without restriction.
Everything this model can do for you
Use up to 3 photos to anchor the output to a specific person, object, or visual style.
Label each reference with a short tag and call it by name directly inside your text prompt.
Images generate at full 1080p resolution, sharp enough for print, web, or client delivery.
Choose from 16:9, 9:16, 4:3, 3:4, 1:1, or 21:9 to fit any platform or layout.
Set a seed value to recreate the same output exactly across multiple runs.
Enter a prompt, upload references, and hit generate directly in the browser.