• Picasso AI Logo
    Logo Picasso IA
  • Home
  • AI Image
    Nano Banana 2
  • AI Video
    Veo 3.1 Lite
  • AI Chat
    Gemini 3 Pro
  • Edit Images
  • Upscale Image
  • Remove Background
  • Text to Speech
  • Effects
    NEW
  • Generations
  • Billing
  • Support
  • Account
  1. Collection
  2. Text to Video
  3. Avatar Iv

Create Talking Avatar Videos with Avatar IV

Avatar IV is a text-to-video model that generates realistic talking avatar videos directly from a written script. If you need presenter-led video content but have no camera, no actor, and no production budget, it fills that gap. You write the words, pick a digital avatar and voice, and the model produces the video. The model gives you direct control over how the avatar looks and sounds. You can adjust voice speed from half to one-and-a-half times normal, pick from voice emotion options including Soothing, Friendly, Excited, and Broadcaster, and choose between a full-frame, close-up, or circle overlay display style. Add auto-generated captions in one toggle, and the video is ready to use without any post-production. Avatar IV fits naturally into any workflow where video content needs to be produced repeatedly without filming. Marketing teams use it for product updates, trainers use it for onboarding clips, and content creators use it for scripted social posts. Type a script, generate a clip, and iterate from there.

Official

Heygen

134 runs

Avatar Iv

2026-03-03

Commercial Use

Table of contents

  • Overview
  • How It Works
  • Frequently Asked Questions
  • Credit Cost
  • Features
  • Use Cases
  • Examples
Get Nano Banana Pro

Overview

Avatar IV is a text-to-video model that turns a written script into a presenter-led video featuring a photorealistic digital human. No camera, no actor, and no recording studio required. You type the narration, choose a digital avatar and a voice, and the model produces the video. On Picasso IA, the whole process runs in your browser with nothing to install. It is a practical answer for anyone who needs consistent, presenter-led video output without the overhead of traditional production.

How It Works

  • Write your script in the text field. The model accepts up to 5000 characters per video, so most explainer scripts, product walkthroughs, and short presentations fit in a single run.
  • Choose an avatar ID to select the digital presenter you want on screen.
  • Pick a voice ID and set the emotion (Excited, Friendly, Serious, Soothing, or Broadcaster) to match the tone of your content.
  • Set the voice speed between 0.5x and 1.5x, and select the display style: full frame, close-up, or circle overlay.
  • Toggle captions on if you want subtitles baked into the video, then hit generate and download the finished file.

Frequently Asked Questions

Do I need programming skills or technical knowledge to use this? No, just open Avatar IV on Picasso IA, adjust the settings you want, and hit generate.

Is it free to try? Avatar IV is available to try on Picasso IA. Check the pricing page to see which plan covers how many video generations.

How long does it take to get results? Most videos are ready within a few minutes. Shorter scripts tend to process faster, and the model runs in the cloud so your own hardware is not a factor.

What output formats are supported? The model outputs a video file at 1920x1080 resolution by default. You can change the width and height before generating to match a different aspect ratio.

Can I customize the output quality or style? Yes. You control the avatar style, voice emotion, voice speed, caption overlay, and video dimensions. Adjusting these before each run lets you shape the output to fit the exact brief you have.

Where can I use the outputs? The generated videos carry no watermarks and can be used in presentations, websites, internal communications, social media posts, or any other channel that accepts video.

What if the avatar or voice doesn't match what I need? Try a different avatar ID or voice ID, adjust the emotion setting, or rewrite sections of the script and regenerate. Each run is fast enough that several iterations in one session are practical.

Credit Cost

Each generation consumes 50 credits

50 credits

or 250 credits for 5 generations

Features

Everything this model can do for you

Photorealistic avatars

Choose from lifelike digital presenters that move and speak naturally on screen.

Emotion-aware voices

Select from five voice emotion settings, including Excited, Soothing, and Broadcaster, to match your message tone.

Adjustable voice speed

Set the speaking rate anywhere from 0.5x to 1.5x to control delivery pace.

Multiple avatar styles

Display the avatar in full frame, close-up, or circle overlay to fit your video format.

Auto-generated captions

Enable subtitles with a single toggle so your video works without sound.

HD video output

Export at 1920x1080 resolution, ready to publish without additional rendering.

Long-form text support

Feed up to 5000 characters of script in a single generation run.

Use Cases

Record a product walkthrough video without appearing on camera by typing your narration and selecting a photorealistic digital avatar.

Produce onboarding clips for new team members by scripting each step and generating a presenter-led video in minutes.

Build a branded spokesperson video by selecting a professional avatar, entering your sales copy, and downloading the finished file.

Generate social media video content with a talking presenter from a single block of text, no recording equipment needed.

Create multilingual video content by switching the voice ID and text for each language without re-recording anything.

Turn a written FAQ into a series of short presenter-led clips for a help center or product site.

Produce event intro segments with a virtual host reading a prepared script, with no studio booking required.

Examples

1920x1080
4m 12s
Caption: No
Voice Id: KQkyHvsV1wdsvvqTolHK
Avatar Id: Annie_Bar_Standing_Front_2_public
Input Text: We just launched Avatar IV and video agent on Replicate! Cre…
Voice Speed: 1
Avatar Style: normal
Voice Emotion: none
1920x1080
4m 40s
Caption: No
Voice Id: KQkyHvsV1wdsvvqTolHK
Avatar Id: Abigail_standing_office_front
Input Text: We just launched Avatar IV on Replicate! Create custom talki…
Voice Speed: 1
Avatar Style: normal
Voice Emotion: none

Switch Category

Effects

Text To Image

Text To Image

Text To Video

Large Language Models

Large Language Models

Text To Speech

Text To Speech

Super Resolution

Super Resolution

Lipsync

AI Music Generation

AI Music Generation

Video Editing

Speech To Text

Speech To Text

AI Enhance Videos

Remove Backgrounds

Remove Backgrounds