- cross-posted to:
- opensource@lemmy.ml
Very cool tool. I tried out the medium-size model on a Russian video, and the English subtitles that it generated were much more accurate than YouTube's autotranslated captions.
Very cool tool. I tried out the medium-size model on a Russian video, and the English subtitles that it generated were much more accurate than YouTube's autotranslated captions.
Gotcha gotcha, I haven't used Windows since the mid 00s so I won't be the most helpful, but it looks like you'll need to do the following:
If you aren't using a package manager, install Chocolatey (or maybe Scoop? I'm not familiar with that one - maybe some Windows comrades can chime in on which would be better for you)
Install Python 3 and Pip if you don't have them installed
Run the commands in the Setup part of that doc:
pip install git+https://github.com/openai/whisper.git choco install ffmpeg # assuming you are using Chocolatey and not Scoop
Assuming everything installs properly, you can use the examples from the Command-line usage section as a starting point. I'm running
whisper my-audio-file.mp3 --language Korean --task translate
to translate an audio file from Korean to English.Thank you, I'll try this later!
No problem, good luck! :stalin-heart: