pull down to refresh
Thanks for the tutorial.
Didn't come out too well... humor can be very subtle, human translators still have some edge.
Pretty cool though.
reply
We can only learn what to improve through finding what doesn't work :-)
reply
Note: the last line should read
ffmpeg -i 6ZWf4Jfd1sM.mp4 -i out_en.wav -c:v copy -c:a aac -map 0:v -map 1:a output.mp4 not sure how i messed that up, but i did.
reply
# clone the repo git clone https://github.com/kyutai-labs/hibiki.git # use the rust version cd hibiki/hibiki-rs # do questionable things to fetch the video, for science - don't try this at home yt-dlp -t mp4 "https://www.youtube.com/watch?v=6ZWf4Jfd1sM" -o 6ZWf4Jfd1sM.mp4 # demux the audio (as mp3 encoded and mp3 container) ffmpeg -i 6ZWf4Jfd1sM.mp4 -c:v none -c:a libmp3lame 6ZWf4Jfd1sM.mp3 # do the magic translation # note: i used this on a mac, use --features cuda to run on an nvidia gpu instead cargo run --features metal -r -- gen 6ZWf4Jfd1sM.mp3 out_en.wav # remux the english audio in (as aac encoded) ffmpeg -i 6ZWf4Jfd1sM.mp4 -i out_en.wav -c:v copy aac -map 0:v -map 1:a output.mp4