Generates audio using the selected model.
Music generation: elevenlabs-music — creates music from a text description.
Key parameter: music_length_ms (milliseconds, e.g. 30000 = 30 s). Billed per output minute.
Workflow
GET /hub/v1/models?capability=audio — browse available modelsGET /hub/v1/models/:model — copy the example as your inputPOST /hub/v1/audio ← you are hereGET /hub/v1/tasks/:task_id — poll until ready=trueTip: Click Try it out → select a model from the dropdown below → the parameter schema auto-populates with an example.
elevenlabs-music — ElevenLabs Music (ElevenLabs) · music-generateElevenLabs Music: AI music generation from text description.
| Parameter | Type | Req | Default | Values / Range | Description |
|---|---|---|---|---|---|
prompt | string | – | – | The text prompt describing the music to generate. Use this for simple text-to-music generation. Mutually exclusive with composition_plan. | |
output_format | string | mp3_44100_128 | mp3_22050_32 mp3_44100_32 mp3_44100_64 mp3_44100_96 mp3_44100_128 mp3_44100_192 pcm_8000 pcm_16000 pcm_22050 pcm_24000 pcm_44100 pcm_48000 ulaw_8000 alaw_8000 opus_48000_32 opus_48000_64 opus_48000_96 opus_48000_128 opus_48000_192 | Output audio format. Encoded as codec_sampleRate_bitrate (e.g. mp3_44100_128 = MP3 at 44.1kHz / 128kbps). Note: mp3_44100_192 requires Creator tier; pcm_44100 requires Pro tier. | |
music_length_ms | integer | ✓ | – | 3000–600000 | Duration of the generated music in milliseconds. Required for billing. Range: 3000ms (3s) to 600000ms (10min). Use with prompt only; when using composition_plan, total duration is determined by the sum of section duration_ms values. |
composition_plan | object | – | – | Advanced: structured composition plan with sections, styles and lyrics. Each section requires section_name, positive_local_styles[], negative_local_styles[], duration_ms (3000-120000ms), and lines[]. Also requires positive_global_styles[] and negative_global_styles[] at the top level. Mutually exclusive with prompt. | |
force_instrumental | boolean | – | – | If true, guarantees the generated song is instrumental (no vocals). Can only be used with prompt. | |
respect_sections_durations | boolean | true | – | Controls how strictly section durations in the composition_plan are enforced. Only effective with composition_plan. When true, each section’s duration_ms is precisely respected; when false, the model may adjust durations for better quality while preserving total song length. |
{
"model": "elevenlabs-music",
"input": {
"prompt": "Mysterious original soundtrack, themes of jungle, rainforest, nature, woodwinds, busy rhythmic tribal percussion.",
"output_format": "mp3_44100_128",
"music_length_ms": 60000
}
}
Documentation Index
Fetch the complete documentation index at: https://docs.mountsea.ai/llms.txt
Use this file to discover all available pages before exploring further.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Unique task ID — use this to poll GET /hub/v1/tasks/:task_id
"hub-xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
Task status at creation time (usually pending)
"pending"
Capability: image | video | audio | transcribe
"video"
Model ID
"veo-3.1-fast"
Model vendor
"Google"
Generation mode (e.g. text-to-video, image-to-image)
"text-to-video"
ISO 8601 creation timestamp
"2026-05-18T09:00:00.000Z"