Any bilingual or multilingual people here who want to dub YouTube videos and give us feedback on the dubbing quality? Follow me and I'll DM the link 🙏
I feel like I'm spending most of my time resolving version conflicts and env dependencies when working in Python. Just me?
Despite the belief that AI models will quickly get commoditized, Midjourney's quality feels levels ahead of other image models.
Did you know you can get a Mystery Seafood Box from Amazon in Japan? https://twitter.com/smallworld_en/status/1693654123365744664
Looks interesting! Seems great for side projects. But I’m looking for something more modern, a Vercel of sorts.
What are modern serverless solutions for running long Python functions asynchronously? (Need to be able to install system packages like ffmpeg.) I used to use AWS Lambda.
Has anyone tried the Vercel Postgres or Blob products? Should I use them for a new project, e.g. vs. Supabase?
Is this the best open-source speaker diarization model? https://huggingface.co/pyannote/speaker-diarization
Yeah, I don’t have a good intuition for the dataset size required, but my guess is a lot less than TTS, maybe similar to ASR given the one-to-many problem. So I was thinking there are enough public speech datasets (>10k hrs), plus non-speech datasets, that can be mixed together to synthesize training data.
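For what it's worth, here's a rough sketch of the mixing idea, assuming plain additive mixing at a target signal-to-noise ratio. The function names and the SNR-based recipe are my own assumptions for illustration, not something the thread specifies, and a real pipeline would load audio from files rather than use toy lists.

```python
import math
import random


def rms(samples):
    """Root-mean-square level of a non-empty list of float samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech so the speech-to-noise ratio is snr_db.

    Hypothetical helper: returns (mixture, scaled_noise), so the clean
    speech can serve as the training target for the mixture.
    """
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")
    # Loop the noise clip if it is shorter than the speech, then trim.
    repeats = len(speech) // len(noise) + 1
    noise = (list(noise) * repeats)[: len(speech)]
    noise_rms = rms(noise)
    if noise_rms == 0.0:
        # Silent noise clip: the mixture is just the speech.
        return list(speech), [0.0] * len(speech)
    # SNR in dB is 20 * log10(speech_rms / noise_rms); solve for the gain.
    target_noise_rms = rms(speech) / (10 ** (snr_db / 20))
    gain = target_noise_rms / noise_rms
    scaled = [gain * n for n in noise]
    mixture = [s + n for s, n in zip(speech, scaled)]
    return mixture, scaled


# Toy stand-ins for real clips: a sine tone as "speech", random noise.
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 220 * t / 16000) for t in range(1600)]
noise = [rng.uniform(-1.0, 1.0) for _ in range(500)]
mixture, scaled_noise = mix_at_snr(speech, noise, snr_db=10)
```

Sampling the SNR and the noise clip at random per example would let one clean corpus yield many distinct training pairs.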