whisper-at

0.5
15.23k

Joint speech recognition and audio tagging model.