Tag: Stable Audio Open

  • How to use Stable Audio Open 1.0

    Stable Audio Open 1.0 is a model that generates variable-length stereo audio (up to 47 seconds) at 44.1 kHz from text prompts. It consists of three key components: an autoencoder that compresses waveforms into manageable sequence lengths, a T5-based text embedding for text conditioning, and a transformer-based diffusion (DiT) model that operates in the autoencoder’s…

  • Stable Audio Open Paper

    Stability AI is excited to announce the release of the research paper for Stable Audio Open! This open-weight text-to-audio model generates high-quality stereo audio at 44.1 kHz from text prompts. Well suited to synthesizing realistic sounds and field recordings, it runs on consumer-grade GPUs, making it accessible for academic and artistic use. Details: The research paper on Stable Audio…