This page shows inference results for real‑world musical audio samples.
The model was trained on additional data beyond Slakh, MUSDB, and MoisesDB.
Limitations:
Unless otherwise specified, we use 250 diffusion steps and a classifier-free guidance scale of 10.0 for source extraction.
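To make the guidance scale concrete, here is a minimal sketch of one common formulation of classifier-free guidance, in which the guided prediction extrapolates from the unconditional prediction toward the conditional one. This page does not state which formulation the model uses, so treat this as an illustration. The names `NUM_DIFFUSION_STEPS`, `GUIDANCE_SCALE`, and `apply_cfg` are hypothetical and not taken from the model's code.

```python
# Hypothetical constants mirroring the settings stated above.
NUM_DIFFUSION_STEPS = 250
GUIDANCE_SCALE = 10.0


def apply_cfg(eps_cond, eps_uncond, scale=GUIDANCE_SCALE):
    """Combine two noise predictions element-wise.

    Assumed formulation: guided = uncond + scale * (cond - uncond).
    With scale = 1.0 this returns the conditional prediction unchanged.
    """
    return [u + scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Toy values standing in for the model outputs at a single denoising step.
guided = apply_cfg([0.5, -1.0], [0.25, -0.5])
print(guided)
```

Under this formulation, a scale of 10.0 multiplies the difference between the conditional and unconditional predictions by ten. In a full sampler this combination would be applied once per denoising step, so 250 times per sample at the settings above.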