Advanced
Katsuya@kn
8/6/2023

Anyone know open source models or papers for splitting audio (that contains human speech and other sounds) into two separate audio files, one containing human speech only and one containing other sounds?

In reply to @kn
Kiren Srinivasan@srinitude.eth
8/6/2023

I’ve been using Spleeter in my workflow for my own music! Not sure about other audio source separation models https://github.com/deezer/spleeter

In reply to @kn
Joshua Fisher@joshuafisher.eth
8/6/2023

Have you tried Audio Shake? https://www.audioshake.ai

In reply to @kn
Giuliano Giacaglia@giu
8/6/2023

I’ve tried to find a good tool for this to help with speech synthesis to get the human speech track but didn’t find anything that worked well

In reply to @kn
Cylo@cylo
8/7/2023

Demucs v4 from facebook research was the open source state of the art at this (last I checked).