TTS 1.5 Mini: Instant AI Voice in 15 Languages

TTS 1.5 Mini converts written text into natural-sounding speech in roughly 120 milliseconds, making it one of the fastest options available for producing audio at scale. If you record voiceovers manually, you know how long it takes to re-record every time the script changes; with this model you can iterate in seconds instead of hours. It supports 15 languages out of the box, so you can produce audio for international audiences without managing separate workflows.

You can control speaking speed and expressiveness through simple settings, set emotional tone with inline markers, and switch between preset voices to match your content style. Output formats are MP3, WAV, OGG Opus, and FLAC, with sample rates up to 48 kHz for broadcast-quality audio.

Whether you are adding narration to a course, automating a podcast intro, or generating audio for a product demo, TTS 1.5 Mini fits into your production process without any coding: open the model, paste your script, choose a voice, and download the audio file.

Official · Inworld · 15.4k runs · Tts 1.5 Mini · 2026-03-10 · Commercial Use

Table of contents

  • Overview
  • How It Works
  • Frequently Asked Questions
  • Credit Cost
  • Features
  • Use Cases

Overview

TTS 1.5 Mini converts written text into natural-sounding speech in roughly 120 milliseconds, making it one of the fastest synthesis options available. Whether you need a voiceover draft, a product demo narration, or a spoken notification for an app, you paste the text, choose a voice, and get back a clean audio file in seconds. Available on Picasso IA, it covers 15 languages so multilingual projects no longer require separate recording sessions or different tools for each locale. The result is a workflow where you can iterate through multiple takes in the time it once took to prepare a single recording.

How It Works

  • Paste up to 2,000 characters of text into the input field. You can include break tags for timed pauses, emotion markers like [happy] or [sad], and non-verbal sounds like [laugh] or [sigh] to shape the delivery.
  • Select a voice from the preset list (Ashley, Dennis, Alex, and others) or enter a custom voice ID if you have a cloned voice saved.
  • Choose your audio format: MP3, WAV, OGG Opus, or FLAC. Pick a sample rate from 8,000 Hz up to 48,000 Hz to match the technical spec of your project.
  • Adjust the speaking rate if you need faster or slower delivery, and set the temperature to control how expressive or neutral the voice sounds.
  • Turn text normalization on, off, or leave it on auto so numbers, dates, and abbreviations are read out naturally.
  • Click generate. TTS 1.5 Mini processes the input and returns your audio file in around 120 milliseconds.
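
The settings in the steps above can be checked before you paste them into the form. The following is a minimal sketch in Python, not an official client: the class name `TtsSettings`, the format identifiers, and the field names are hypothetical, and only the limits stated on this page (2,000 characters, four formats, 8,000 to 48,000 Hz, three normalization modes) are enforced. Speaking rate and temperature are left out because their valid ranges are not given here.

```python
from dataclasses import dataclass

MAX_CHARS = 2000  # input limit stated in the steps above
# Hypothetical identifiers for the four listed formats (MP3, WAV, OGG Opus, FLAC).
FORMATS = {"mp3", "wav", "ogg_opus", "flac"}
NORMALIZATION_MODES = {"on", "off", "auto"}
MIN_RATE_HZ, MAX_RATE_HZ = 8_000, 48_000  # only the range is documented, not each option


@dataclass
class TtsSettings:
    """Illustrative container for one generation's settings (not a real API)."""
    text: str
    voice: str = "Ashley"          # one of the preset names mentioned above
    audio_format: str = "mp3"
    sample_rate_hz: int = 48_000
    normalization: str = "auto"

    def problems(self) -> list[str]:
        """Return a list of human-readable issues; an empty list means OK."""
        issues = []
        if not self.text.strip():
            issues.append("text is empty")
        if len(self.text) > MAX_CHARS:
            issues.append(f"text has {len(self.text)} characters; limit is {MAX_CHARS}")
        if self.audio_format not in FORMATS:
            issues.append(f"unknown format: {self.audio_format}")
        if not MIN_RATE_HZ <= self.sample_rate_hz <= MAX_RATE_HZ:
            issues.append(f"sample rate {self.sample_rate_hz} Hz is outside 8,000-48,000 Hz")
        if self.normalization not in NORMALIZATION_MODES:
            issues.append(f"unknown normalization mode: {self.normalization}")
        return issues


if __name__ == "__main__":
    print(TtsSettings(text="Hello and welcome.").problems())
```

Returning a list of issues rather than raising on the first one lets you see every setting that needs attention in a single pass.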

Frequently Asked Questions

Do I need programming skills or technical knowledge to use this? No. Just open TTS 1.5 Mini on Picasso IA, adjust the settings you want, and hit generate.

Is it free to try? Yes, you can start using TTS 1.5 Mini without setting up an account or paying. Submit your text, pick a voice, and download the file.

How long does it take to get results? The model targets around 120 milliseconds of latency from request to audio output. For most inputs, the file is ready almost as soon as you click generate.

What output formats are supported? TTS 1.5 Mini exports audio in MP3, WAV, OGG Opus, and FLAC. You can also select from seven sample rate options, from 8,000 Hz to 48,000 Hz, to match the technical requirements of your platform.

Can I customize the voice or speaking style? Yes. Pick from preset voice names or supply a custom cloned voice ID. The temperature parameter controls expressiveness: lower values give a consistent, neutral tone; higher values add more variation. The speaking rate slider lets you slow down or speed up the narration.

What languages does TTS 1.5 Mini support? It supports 15 languages, so you can produce multilingual audio content from a single tool without switching between services.

Where can I use the audio files I download? The output files are clean with no added watermarks, so you can drop them directly into video edits, podcasts, mobile apps, e-learning modules, or any project that needs spoken audio.
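
To make the sample-rate choice from the FAQ above concrete, here is a rough size estimate for uncompressed WAV output. It is a sketch under stated assumptions: 16-bit mono PCM with a minimal 44-byte header, none of which this page confirms for TTS 1.5 Mini. MP3, OGG Opus, and FLAC files will be smaller because they are compressed.

```python
def pcm_wav_bytes(seconds: float, sample_rate_hz: int,
                  bits_per_sample: int = 16, channels: int = 1) -> int:
    """Estimate the size of an uncompressed PCM WAV file.

    Assumes 16-bit mono audio by default; the actual bit depth and channel
    count of the generated files are not stated on this page.
    """
    header_bytes = 44  # size of a minimal canonical WAV header
    samples = int(seconds * sample_rate_hz)
    return header_bytes + samples * channels * (bits_per_sample // 8)


if __name__ == "__main__":
    for rate in (8_000, 48_000):
        size = pcm_wav_bytes(60, rate)
        print(f"{rate} Hz: {size / 1_000_000:.2f} MB per minute")
    # 8000 Hz: 0.96 MB per minute
    # 48000 Hz: 5.76 MB per minute
```

Under these assumptions, going from the lowest to the highest sample rate multiplies the file size by six, which is the trade-off to weigh against audio quality.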

Credit Cost

Each generation consumes 1 credit, so 5 generations cost 5 credits.
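
As a quick worked example of the credit arithmetic, the sketch below estimates the minimum credits for a long script. It assumes a script longer than the 2,000-character input limit is split into separate generations, which this page does not describe; the function name is illustrative, and the result is a lower bound because splitting at sentence boundaries can produce more pieces.

```python
import math

CREDITS_PER_GENERATION = 1       # stated in this section
MAX_CHARS_PER_GENERATION = 2000  # input limit from "How It Works"


def min_credits(script_chars: int) -> int:
    """Lower bound on credits for a script of the given length in characters."""
    if script_chars <= 0:
        return 0
    generations = math.ceil(script_chars / MAX_CHARS_PER_GENERATION)
    return generations * CREDITS_PER_GENERATION


if __name__ == "__main__":
    print(min_credits(9500))  # 5
```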

Features

Everything this model can do for you

120 ms latency

Receive generated audio in around 120 milliseconds, fast enough for real-time production use.

15-language support

Synthesize speech in 15 different languages from a single text input without extra configuration.

Preset voices

Choose from a set of named voices to match different content styles and audience expectations.

Emotion markup

Use inline tags to add happiness, sadness, laughter, and other emotional cues directly in the script.

Multiple output formats

Download audio as MP3, WAV, OGG Opus, or FLAC for any platform or publishing workflow.

Flexible sample rates

Select from 8 kHz to 48 kHz to balance file size against audio quality for your specific use case.

Speaking rate control

Adjust playback speed without distorting pitch or losing clarity in the synthesized voice.

SSML break support

Insert timed pauses anywhere in the script using standard markup tags for precise pacing control.
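
The emotion markup and break features above can be combined when preparing a script. This is a small sketch with hypothetical helper names. The bracketed markers `[happy]` and `[sigh]` come from this page, but the exact break syntax the model accepts is not shown here, so the standard SSML form `<break time="500ms"/>` is an assumption, as is the placement of markers relative to the text.

```python
MAX_CHARS = 2000  # input limit from "How It Works"


def pause(milliseconds: int) -> str:
    """Return an SSML-style break tag (assumed syntax, not confirmed by this page)."""
    if milliseconds <= 0:
        raise ValueError("pause length must be positive")
    return f'<break time="{milliseconds}ms"/>'


def build_script(parts: list[str]) -> str:
    """Join script fragments with single spaces and check the length limit."""
    script = " ".join(part.strip() for part in parts if part.strip())
    if len(script) > MAX_CHARS:
        raise ValueError(f"script has {len(script)} characters; limit is {MAX_CHARS}")
    return script


script = build_script([
    "[happy] Welcome back to the course.",  # emotion marker before the sentence
    pause(500),                             # half-second pause
    "[sigh]",                               # non-verbal sound
    "Let's review last week's quiz.",
])

if __name__ == "__main__":
    print(script)
```

Markup counts toward the character total in this sketch, so a heavily tagged script leaves less room for spoken text; whether the service counts tags the same way is not stated.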

Use Cases

  • Generate narration for an online course by pasting the lesson script and selecting a preset voice that fits your brand tone.
  • Add a voiceover to a product demo video by converting your script to MP3 and syncing it in your video editor.
  • Produce podcast intros or outros in multiple languages without re-recording each version manually.
  • Create audio versions of blog posts so readers can listen instead of reading, using auto text normalization to handle numbers and dates correctly.
  • Build a voice response for a chatbot or app by feeding the reply text into the model and downloading the audio clip in seconds.
  • Test different voice styles for an audiobook by switching between preset voices and adjusting the speaking rate until the pacing feels right.
  • Generate multilingual customer support audio in 15 languages from a single script without hiring separate voice talent for each market.
  • Add emotion-marked speech to explainer videos using built-in emotion markup tags for more expressive, natural-sounding delivery.
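
Several of the use cases above, such as blog posts and audiobooks, involve scripts longer than the 2,000-character input limit. One way to prepare them is to split the text at sentence boundaries and generate each piece separately. The function below is a minimal sketch of that idea, not a feature of the tool: the name `split_script` is hypothetical, and the sentence detection is a simple punctuation rule that will not handle every abbreviation or language.

```python
import re


def split_script(text: str, limit: int = 2000) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Chunks end at sentence boundaries where possible; a single sentence
    longer than the limit is cut at the limit as a last resort.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-split any sentence that cannot fit in one chunk on its own.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    article = "This is a sentence. " * 300
    for number, chunk in enumerate(split_script(article), start=1):
        print(number, len(chunk))
```

Each chunk can then be pasted in as its own generation, and the resulting audio files joined in an editor.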
