11 — AI Audio & Music Generation

11 — AI Audio & Music Generation

AI Audio Generation lets you create original background music, instrumentals, and sound effects using AI models. Generated audio can be downloaded or added directly to your timeline as an audio clip.

Access this feature under AI Generate → Audio.

A Replicate API key is required for all models.


Available Models

Stable Audio 2.5

Best for: instrumentals, sound effects, loops

Setting Description
Prompt Describe the sound or music style
Duration 1–190 seconds
Steps Generation quality (4–8)
CFG Scale Prompt adherence (1–25)
Seed Optional, for reproducibility

Example prompt: "Upbeat corporate background music, piano and strings, 120 BPM"


Google Lyria 2

Best for: high-quality 48kHz stereo music

Setting Description
Prompt Music description
Negative Prompt What to exclude (e.g., “vocals, drums”)
Seed Optional

MiniMax Music 1.5

Best for: full vocal songs with lyrics

Setting Description
Prompt Genre, mood, and style description
Lyrics Full song lyrics with structural tags (see below)

Lyrics tag format:

[verse]
Your verse lyrics here

[chorus]
Your chorus lyrics here

[bridge]
Optional bridge lyrics

ElevenLabs Music

Best for: studio-quality music production

Setting Description
Prompt Music description
Duration 5,000–300,000 milliseconds (5 seconds to 5 minutes)
Force Instrumental Toggle to prevent any vocal generation

ACE-Step

Best for: open-source full songs with custom lyrics

Setting Description
Tags Style descriptors (comma-separated genre/mood tags instead of a prompt sentence)
Lyrics Full lyrics text
Duration Clip length in seconds
Guidance Scale Prompt adherence
Number of Steps Generation quality
Seed Optional

Example tags: "pop, upbeat, female vocals, guitar, 120bpm"


Meta MusicGen

Best for: melody-conditioned generation — provide a reference melody

Setting Description
Prompt Description of the desired music
Duration Clip length
Temperature Creativity (higher = more variation)
Top K Sampling diversity
Input Audio Optional reference audio to condition the melody
Guidance Scale Prompt adherence
Seed Optional

Generating Audio

  1. Select your model.
  2. Fill in the prompt and any model-specific fields.
  3. Click Generate.
  4. The job runs in the background via the Job Queue.
  5. When complete, the result appears in the Results tab with an audio player.

Using Generated Audio

From the Results tab:
– Click Play to preview the audio in the browser
– Click Download to save the MP3 file
– Click Add to Timeline to insert the clip into the audio track at the current playhead position


Previous: Talking Head / Lip-Sync | Next: Long-Form Video Pipeline →