Voxtral Mini TTS is Mistral's text-to-speech model featuring zero-shot voice cloning and multilingual support. It converts text input into natural-sounding audio output.
Modalities
Price
$16per 1M characters
Context
4K
Weekly Tokens
234K
Released
Apr 19, 2026
Sample code and API for Voxtral Mini TTS
OpenRouter normalizes requests and responses across providers for you.
1
Get your API key
Create an API key from your OpenRouter dashboard and set it as an environment variable:
2
Make your first request
Use mistralai/voxtral-mini-tts-2603 with the OpenRouter API:
OpenRouter provides a text-to-speech API that converts text into natural-sounding audio. Send text and a voice selection, and receive raw audio bytes in your chosen format.
The response is a raw audio stream (not JSON). The generation ID is returned in the X-Generation-Id response header for tracking.
Using third-party SDKs
For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.
Endpoint
POSThttps://openrouter.ai/api/v1/audio/speech
AuthorizationBearer $OPENROUTER_API_KEY
Content-Typeapplication/json
HTTP-Refereroptional — your site URL, for rankings
X-Titleoptional — your site name, for rankings
Modelmistralai/voxtral-mini-tts-2603
Parameters
Name
Type
Default
Description
max_tokens
integer
—
This sets the upper limit for the number of tokens the model can generate in response.
temperature
float
1
This setting influences the variety in the model's responses.
top_p
float
1
This setting limits the model's choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P.
stop
array
—
Stop generation immediately if the model encounter any token specified in the stop array.
frequency_penalty
float
0
This setting aims to control the repetition of tokens based on how often they appear in the input.
presence_penalty
float
0
Adjusts how often the model repeats specific tokens already used in the input.
seed
integer
—
If specified, the inferencing will sample deterministically, such that repeated requests with the same seed and parameters should return the same result.
response_format
map
—
Forces the model to produce specific output format.
structured_outputs
boolean
—
If the model can return structured outputs using response_format json_schema.