Text-to-music
describe a song, get audio
// official site: github.com
MusicGen is Meta AI's text-to-music model, released as part of AudioCraft. It generates 30+ seconds of stereo music from a text prompt, optionally conditioned on a reference melody. Three model sizes (300M / 1.5B / 3.3B params) let you trade VRAM for quality. Genre, instrumentation, mood, and BPM are all steered through the wording of the prompt.
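To make the prompt-driven control concrete, here is a minimal Python sketch, not an official recipe. `TrackSpec` and `build_prompt` are hypothetical helpers invented for illustration, and the prompt wording is just one plausible phrasing. The checkpoint names are the ones published on the Hugging Face Hub for the three sizes. The `generate` function assumes the `audiocraft` package is installed and a GPU is available, and imports it lazily so the prompt helper can be used on its own.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Hub checkpoint names for the three model sizes.
CHECKPOINTS = {
    "small": "facebook/musicgen-small",    # ~300M params
    "medium": "facebook/musicgen-medium",  # ~1.5B params
    "large": "facebook/musicgen-large",    # ~3.3B params
}


@dataclass
class TrackSpec:
    """Hypothetical container for the attributes a prompt can steer."""
    genre: str
    instruments: Tuple[str, ...] = ()
    mood: Optional[str] = None
    bpm: Optional[int] = None


def build_prompt(spec: TrackSpec) -> str:
    """Turn a TrackSpec into a plain-text prompt (illustrative wording only)."""
    parts = []
    if spec.mood:
        parts.append(spec.mood)
    parts.append(spec.genre)
    text = " ".join(parts) + " track"
    if spec.instruments:
        text += " with " + ", ".join(spec.instruments)
    if spec.bpm is not None:
        text += f", {spec.bpm} BPM"
    return text


def generate(spec: TrackSpec, size: str = "small",
             seconds: float = 8.0, out_stem: str = "track") -> None:
    """Generate one clip and write it to '<out_stem>.wav'.

    Assumes audiocraft is installed; imported here so the helpers above
    work without it.
    """
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    model = MusicGen.get_pretrained(CHECKPOINTS[size])
    model.set_generation_params(duration=seconds)
    wav = model.generate([build_prompt(spec)])  # tensor: [batch, channels, samples]
    audio_write(out_stem, wav[0].cpu(), model.sample_rate, strategy="loudness")
```

A call such as `generate(TrackSpec(genre="jazz", mood="mellow", bpm=90), size="small")` would download the small checkpoint on first use. The prompt is a hint to the model rather than a hard constraint, so attributes such as tempo are followed only approximately.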
The open answer to Suno / Udio for music generation, with full control over the entire pipeline.
Concrete scenarios where teams pick MusicGen over the SaaS alternative.
describe a song, get audio
hum a tune, get a full instrumental
for video, podcast, game, app
orchestral, electronic, jazz, folk, classical, hip-hop
via prompts
(with appropriate license use)
If your team profile matches one of these, MusicGen is a strong fit out of the box.
generating royalty-free soundtracks
producing intro/outro themes
prototyping music for levels
producing campaign-specific audio
generating ambient/UX audio
selling a music-generation tier
When evaluating self-hosted options for this category, here are the dimensions on which MusicGen consistently lands above the alternatives.
The stack you'll plug MusicGen into — services, protocols, and adjacent apps in the BluixApps catalog.
facebookresearch/audiocraft repo
/root/bluixapps/musicgen.txt
bluixapps_ensure_nvidia_runtime
Operational guidance from running this in production: what to lock down, what surprises people.