In the late 1980s, a set of military ears listening for submarines instead picked up something far stranger: a solitary, ...
Abstract: Masked Autoencoders (MAEs) trained on audio spectrogram patches have emerged as a prominent approach for learning self-supervised audio representations. While several recent papers have ...