Tag: audio
Mamba #1
2026-04-27 Mamba AIAYN audio model-intro text State-space models represent a thread of statistical modelling other than attention, often used for continuous domains like audio. This paper introduces Mamba, a model architecture where attention is replaced by state-space layers in a model aimed at language. Access: $ Basic
Softmax and sine shaping
2026-04-13 audio basics math Softmax is a basic operation used in machine learning models to find the maximum of a set of numbers in a way that works well with training algorithms. It also happens to be almost exactly the operating principle of the Gilbert sine shaper, used in synthesizer electronics. Access: Free account (logged in)
Making music with Moûsai
2025-10-20 applications model-intro audio diffusion The latent diffusion concept applied to music generation: a transformer-type text model generates embeddings from a prompt, which guide a diffusion model to create encoded spectrograms in a latent space, which are translated by another diffusion model into audio waveforms. Access: $$$ Pro