• Picasso AI Logo
    Logo Picasso IA
  • Home
  • AI Image
    Nano Banana 2
  • AI Video
    Veo 3.1 Lite
  • AI Chat
    Gemini 3 Pro
  • Edit Images
  • Upscale Image
  • Remove Background
  • Text to Speech
  • Effects
    NEW
  • Generations
  • Billing
  • Support
  • Account
  1. Collection
  2. Text to Video
  3. Ltx 2 Distilled

Generate Videos from Text with LTX 2 Distilled

LTX 2 Distilled is a text-to-video AI model that takes a written prompt or an input image and produces a short video clip without any setup. Creators and marketers who need quick video drafts no longer have to spend hours in editing software or wait for large production runs. You type what you want to see, adjust the aspect ratio, and get a generated video ready for review. The model supports six aspect ratios including 16:9 for widescreen content and 9:16 for social media reels, giving you direct control over how the final clip is framed. Built-in prompt expansion takes a rough idea like "a rainy street at night" and automatically broadens it into a richer scene description before generation begins. You can also supply a reference image and control how closely the video follows it, making it straightforward to animate product photos or reference shots. LTX 2 Distilled fits naturally into content pipelines where speed matters. Iteration is fast: adjust a word in the prompt, tweak the image strength, and run again to refine the clip. It is available directly on Picasso IA with no account required to start testing.

Official

Lightricks

14.2k runs

Ltx 2 Distilled

2026-01-07

Commercial Use

Table of contents

  • Overview
  • How It Works
  • Frequently Asked Questions
  • Credit Cost
  • Features
  • Use Cases
  • Examples
Get Nano Banana Pro

Overview

LTX 2 Distilled is an open-source text-to-video model that converts a written prompt, or an uploaded image, into a playable video clip within seconds. Available on Picasso IA, it is built for creators, marketers, and content teams who need video drafts without editing software or a full production setup. A typical use case looks like this: you type a scene description, pick 16:9 for a widescreen format, and receive a generated clip in under a minute. The model also handles image-to-video, so you can start from a still photograph and describe the motion you want to see in the output.

How It Works

  • Write a text prompt that describes the scene, action, or mood you want in the video. Short prompts work, but more detail gives the model more to work with.
  • Optionally upload a reference image if you want the video to follow a specific visual starting point, such as a product photo or a location shot.
  • Choose an aspect ratio from six options (16:9, 9:16, 4:3, 3:4, 1:1, 21:9) and pick the number of frames to control how long the clip runs.
  • Toggle prompt expansion on if you want the model to automatically broaden your short idea into a fuller scene description before generation begins.
  • Submit the request and download the generated video file when processing finishes. The output is a standard video file ready for editing or direct use.

Frequently Asked Questions

Do I need programming skills or technical knowledge to use this? No, just open LTX 2 Distilled on Picasso IA, adjust the settings you want, and hit generate. The interface is point-and-click with no code required.

Is it free to try? Yes, you can run the model on Picasso IA without paying upfront. Some usage tiers may apply depending on how many clips you generate in a session.

How long does it take to get results? Most clips are ready in under a minute. Generation time scales with the number of frames you request and current server load, so shorter clips are faster.

What output formats are supported? The model outputs a standard video file you can download and open in any common video editor or upload directly to social media platforms.

Can I customize the output quality or style? Yes. You adjust the prompt wording, aspect ratio, frame count, and image strength slider to shape each result. If you supply a reference image, the image strength setting controls how much the video deviates from that source.

What happens if I'm not happy with the result? Revise your prompt or change one setting and run the model again. Toggling prompt expansion or setting a different seed typically produces noticeably different output in the next run.

Credit Cost

Each generation consumes 15 credits

15 credits

or 75 credits for 5 generations

Features

Everything this model can do for you

Text-to-video

Type a prompt and receive a generated video clip without uploading any source media.

Image-to-video

Upload a reference image and animate it into a short clip with adjustable image influence.

Six aspect ratios

Choose from 16:9, 9:16, 4:3, 3:4, 1:1, or 21:9 to match your publishing format.

Prompt expansion

Toggle on automatic scene expansion to turn a short idea into a richer description before generation.

Reproducible results

Set a seed value to get the same output across runs for consistent testing and iteration.

Frame control

Pick the number of frames from the available options to set clip length and pacing.

No watermarks

Download generated video clips as clean files ready for direct use or further editing.

Use Cases

Write a short description of a scene and receive a generated video clip in under a minute, ready for social media drafts

Upload a product photo and a text prompt to animate it into a short video showing the item in motion

Type a brief narrative for a brand teaser and export a 16:9 video clip to use as a background loop

Describe a concept scene for a storyboard and get a rough animated version to review with a client before full production

Generate a vertical 9:16 video from a text prompt for short-form social content without any video editing software

Enable auto-prompt expansion to fill out a one-sentence idea into a detailed scene and compare the result against a manual prompt

Set a fixed seed to reproduce the same video result across multiple runs for consistent batch output

Examples

16:9
35.5s
Num Frames: 241
Enhance Prompt: No
Image Strength: 1

The shot opens on a news reporter standing in front of a row of cordoned-off cars, yellow caution tape fluttering behind him. The light is warm, early sun reflecting off the camera lens. The faint hum of chatter and distant drilling fills the air. The reporter, composed but visibly excited, looks directly into the camera, microphone in hand with the letters "R8". Reporter (live): “Thank you, Sylvia. And yes this is a sentence I never thought I’d say on live television but as of today, you can now run LTX 2 distilled on Replicate”

16:9
19.8s
Num Frames: 121
Enhance Prompt: No
Image Strength: 1

A cinematic close-up of Wednesday Addams frozen mid-dance on a dark, blue-lit ballroom floor as students move indistinctly behind her, their footsteps and muffled music reduced to a distant, underwater thrum; the audio foregrounds her steady breathing and the faint rustle of fabric as she slowly raises one arm, never breaking eye contact with the camera, then after a deliberately long silence she speaks in a flat, dry, perfectly controlled voice, “LTX 2 distilled is now on Replicate,” each word crisp and unemotional, followed by an abrupt cutoff of her voice as the background sound swells slightly, reinforcing the deadpan humor, with precise lip sync, minimal facial movement, stark gothic lighting, and cinematic realism.

Switch Category

Effects

Text To Image

Text To Image

Text To Video

Large Language Models

Large Language Models

Text To Speech

Text To Speech

Super Resolution

Super Resolution

Lipsync

AI Music Generation

AI Music Generation

Video Editing

Speech To Text

Speech To Text

AI Enhance Videos

Remove Backgrounds

Remove Backgrounds