• Picasso AI Logo
    Logo Picasso IA
  • Home
  • AI Image
    Nano Banana 2
  • AI Video
    Veo 3.1 Lite
  • AI Chat
    Gemini 3 Pro
  • Edit Images
  • Upscale Image
  • Remove Background
  • Text to Speech
  • Effects
    NEW
  • Generations
  • Billing
  • Support
  • Account
  1. Collection
  2. Text to Video
  3. Kling V2.0

Turn Text into 720p Video with Kling v2.0

Kling v2.0 takes a written description and turns it into a short video clip, either 5 or 10 seconds long, at 720p resolution. If you need a product teaser, a social media clip, or a motion asset for a presentation without the budget or schedule for traditional production, this is the gap it fills. You can set the aspect ratio to 16:9 for landscape content, 9:16 for vertical social formats, or 1:1 for square feeds and ads. A cfg_scale slider controls how closely the output follows your prompt: dial it higher for literal results, lower for more visual range. You can also supply a start image to fix the first frame, which helps when the video needs to begin with a specific scene or character. The whole process runs in a browser without installation or a credit card on first use. Write your prompt, pick a duration and aspect ratio, and download the finished clip. Most runs return a ready-to-use video file in under a minute.

Official

Kwaivgi

73.9k runs

Kling V2.0

2025-04-25

Commercial Use

Turn Text into 720p Video with Kling v2.0

Table of contents

  • Overview
  • How It Works
  • Frequently Asked Questions
  • Credit Cost
  • Features
  • Use Cases
  • Examples
Get Nano Banana Pro

Overview

Kling v2.0 is a text-to-video model that turns written prompts into short video clips at 720p resolution. It solves a common production bottleneck: getting a motion asset quickly without a film crew, stock footage license, or editing timeline. On Picasso IA, you describe the scene you want, pick a duration, and get a downloadable clip. It fits into any workflow where original motion content is needed fast and built from scratch.

How It Works

  • Write a text prompt describing the scene, subject, mood, and any visual elements you want in the video.
  • Choose a duration of 5 or 10 seconds depending on how much motion your content needs.
  • Select an aspect ratio: 16:9 for widescreen, 9:16 for vertical formats, or 1:1 for square outputs.
  • Optionally upload a start image to fix the exact first frame, or add a negative prompt to block unwanted visual elements.
  • Hit generate and download the 720p video file when the run finishes.

Frequently Asked Questions

Do I need programming skills or technical knowledge to use this? No, just open Kling v2.0 on Picasso IA, adjust the settings you want, and hit generate.

Is it free to try? Yes. You can run Kling v2.0 without a paid subscription. Check the current pricing page for credit details.

How long does it take to get results? Most video generations finish in under a minute. A 10-second clip may take slightly longer than a 5-second one due to the additional frames being rendered.

What output formats are supported? The model returns a standard video file you can download directly and import into any video editor or upload to social platforms.

Can I customize the output quality or style? Yes. The cfg_scale slider controls how literally the model interprets your prompt. A higher value sticks closely to the text; a lower value gives more visual flexibility. A negative prompt lets you describe what to exclude.

Where can I use the outputs? The video clips you generate are yours to use in social media posts, presentations, websites, ads, or any other project.

Credit Cost

Each generation consumes 6 credits

6 credits

or 30 credits for 5 generations

Features

Everything this model can do for you

720p video output

Produces crisp 720p resolution clips ready for web publishing without post-processing.

Flexible duration

Choose between 5-second quick cuts or 10-second clips to match your content format.

Three aspect ratios

Output in 16:9, 9:16, or 1:1 to fit YouTube, vertical social, or square formats.

Prompt fidelity control

Adjust cfg_scale to balance strict prompt adherence against creative visual range.

Start image anchor

Supply a reference image to lock in the first frame and control where the video begins.

Negative prompt support

Describe what to exclude from the scene to keep the output focused and clean.

Browser-based access

Run entirely online with no software to install and no local hardware required.

Ideal for marketing, social, and creative projects

Use Cases

Generate a 5-second product teaser by describing the item, its color, and the background you want in the video

Create a vertical 9:16 clip for short-form social by typing your scene description and selecting portrait format

Produce a slow cinematic scene for a website header by prompting a landscape setting at 16:9

Animate a concept scene for a pitch deck by describing the setting and mood, then downloading the clip

Start a video from a specific photo by uploading a start image and letting the model extend the scene from that frame

Generate a 10-second B-roll clip for a YouTube video by describing the location and on-screen action in plain text

Refine unwanted visual elements by adding a negative prompt that excludes specific objects, colors, or styles

Examples

16:9
10s
15m 57s
Cfg Scale: 0.5

A woman is talking directly to the camera, twitch live stream, pink and gold uplighting behind

16:9
10s
13m 27s
Cfg Scale: 0.5

she lets down her hair and then puts on a baseball cap

16:9
5s
5m 18s
Cfg Scale: 0.5

Aerial view of the city with flying vehicles swiftly navigating through the neon-lit skyscrapers. the camera follows the subject moving

16:9
5s
5m 9s
Cfg Scale: 0.5

A cute hamster lies leisurely on a lifebuoy, wearing fashionable sunglasses, and drifts with the gentle waves on the shimmering sea surface. The hamster reclines comfortably, enjoying a peaceful and pleasant time. Cartoon style, the camera follows the subject moving, with a heartwarming and high picture quality.

Switch Category

Effects

Text To Image

Text To Image

Text To Video

Large Language Models

Large Language Models

Text To Speech

Text To Speech

Super Resolution

Super Resolution

Lipsync

AI Music Generation

AI Music Generation

Video Editing

Speech To Text

Speech To Text

AI Enhance Videos

Remove Backgrounds

Remove Backgrounds