LTX Video is a text-to-video model that generates 24 FPS clips at 768x512 resolution, producing footage faster than real-time playback speed. If you need video content for a project and you don't have footage or time to shoot it, this model turns a descriptive text prompt into a finished, watchable clip in seconds. You can describe a scene in detail and get back a clip with consistent motion, accurate framing, and a clear visual style. Drop in a reference image to anchor the first frame, and the model animates forward from that exact starting point. The model supports 13 aspect ratios from portrait 9:16 to ultrawide 21:9, so your output already matches your target format without any extra cropping. For content teams working on social video, marketers building quick campaign previews, or anyone who needs b-roll in a hurry, LTX Video fits directly into a production workflow. Write a detailed prompt, set your clip length and aspect ratio, and the video is ready sooner than you expect. The longer and more specific your prompt, the closer the output gets to what you had in mind.
LTX Video is a text-to-video model that generates 24 FPS clips at 768x512 resolution faster than real-time playback. Describe a scene, pick your aspect ratio, and a finished video is ready in seconds on Picasso IA. It accepts both plain text prompts and an optional input image as the starting frame, giving you a concrete visual anchor for the generated motion. The model responds best to long, detailed descriptions, so the more specific your prompt, the sharper and more intentional the result.
Do I need programming skills or technical knowledge to use this? No. Just open LTX Video on Picasso IA, adjust the settings you want, and hit generate.
Is it free to try? Yes, you can run the model without a paid account to start. Check the model page on Picasso IA for current free generation limits and available credits.
How long does it take to get results? LTX Video generates faster than real-time, so a 97-frame clip is usually ready in a few seconds. Longer clips and higher target resolutions add a bit more processing time.
What aspect ratios does it support? The model supports 13 aspect ratios, including 9:16 for vertical mobile content, 16:9 for standard video, 4:3 for traditional formats, and 21:9 for ultrawide cinematic output.
Can I use my own image as the starting frame? Yes. Upload any image through the image input field and the model uses it as the first frame, then generates the video motion from there. Use the image noise scale setting to control how closely the video stays tied to your original image.
What happens if the result doesn't look right? Rewrite your prompt to be longer and more specific, then try again. If the motion is too loose, increase the guidance scale. Changing the seed generates an entirely different output without touching any other setting.
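For readers who like to keep track of their settings between attempts, the retry advice above can be sketched in a few lines of Python. This is only an illustration, not the Picasso IA API: the class name, the field names, and the default numbers are all assumptions made up for this example, and the real controls live on the model page.

```python
from dataclasses import dataclass, replace
from typing import Optional

# The two clip lengths mentioned on this page.
ALLOWED_LENGTHS = (97, 257)


@dataclass(frozen=True)
class GenerationSettings:
    """Hypothetical bundle of the settings this page mentions."""
    prompt: str
    negative_prompt: str = ""            # what to exclude from the video
    aspect_ratio: str = "16:9"
    length: int = 97                     # number of frames
    seed: Optional[int] = None           # None means "pick one at random"
    guidance_scale: float = 3.0          # placeholder value, not a documented default
    image_noise_scale: float = 0.15      # placeholder value, not a documented default
    image_path: Optional[str] = None     # optional starting frame

    def __post_init__(self) -> None:
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")
        if self.length not in ALLOWED_LENGTHS:
            raise ValueError(f"length must be one of {ALLOWED_LENGTHS}")


def with_new_seed(settings: GenerationSettings, seed: int) -> GenerationSettings:
    """Change only the seed; every other setting is copied unchanged."""
    return replace(settings, seed=seed)


def with_stronger_guidance(settings: GenerationSettings, step: float = 0.5) -> GenerationSettings:
    """Raise the guidance scale by a small step, for when motion is too loose."""
    return replace(settings, guidance_scale=settings.guidance_scale + step)


if __name__ == "__main__":
    first_try = GenerationSettings(prompt="A chef prepares food in a professional kitchen.", seed=1)
    second_try = with_new_seed(first_try, seed=2)
    third_try = with_stronger_guidance(second_try)
    for attempt in (first_try, second_try, third_try):
        print(attempt.seed, attempt.guidance_scale)
```

Changing one value at a time, as the helpers do here, makes it easier to see which adjustment actually improved the result.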
Everything this model can do for you
Produces 24 FPS video clips faster than they can be played back.
Upload a reference image to anchor the first frame and animate forward from there.
Output in any format from 9:16 portrait to 21:9 ultrawide without extra cropping.
Choose between 97 and 257 frames to control the duration of your output.
Write detailed descriptions to shape camera movement, lighting, and scene content.
Specify what to exclude from the video to keep outputs clean and on target.
Set a seed to regenerate the exact same video whenever you need it.
Generate high-resolution outputs up to 1024px.
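To see what the two frame counts mean in seconds, divide the frame count by the 24 FPS frame rate. The short Python sketch below does that arithmetic; the function names are made up for this example, and the "faster than real-time" check simply compares a generation time you measured yourself against the clip's playback length.

```python
FPS = 24  # frame rate stated on this page


def clip_duration_seconds(frames: int, fps: int = FPS) -> float:
    """Playback length of a clip with the given number of frames."""
    return frames / fps


def is_faster_than_real_time(generation_seconds: float, frames: int, fps: int = FPS) -> bool:
    """True when generating the clip took less time than playing it back."""
    return generation_seconds < clip_duration_seconds(frames, fps)


if __name__ == "__main__":
    for frames in (97, 257):
        print(f"{frames} frames -> {clip_duration_seconds(frames):.2f} s")
```

At 24 FPS, a 97-frame clip plays for about 4 seconds and a 257-frame clip for about 10.7 seconds.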
A man in a blue plaid shirt turns his head to look at a woman walking past in a red top. His expression is curious as he glances away from his companion in a light blue sleeveless top. The scene takes place on a busy city street with buildings and pedestrians visible in the background. The natural lighting suggests it's daytime. The camera captures the scene from street level, with the woman in red slightly out of focus in the foreground.
A chef prepares food in a professional kitchen. He has olive skin and dark, close-cropped hair, wearing traditional white chef's attire. Steam rises from the pan he's working with. The kitchen's stainless steel surfaces reflect the bright overhead lighting. The camera follows his hands as he cooks. The scene appears to be real-life footage.
A woman with long brown hair and light skin smiles at another woman with long blonde hair. The woman with brown hair wears a black jacket and has a small, barely noticeable mole on her right cheek. The camera angle is a close-up, focused on the woman with brown hair's face. The lighting is warm and natural, likely from the setting sun, casting a soft glow on the scene. The scene appears to be real-life footage.