FFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decoding (www.phoronix.com)

submitted 2 months ago by cm0002@piefed.world to c/linux@programming.dev

13 comments fedilink hide all child comments

https://www.phoronix.com/news/FFmpeg-Vulkan-AV1-Encoding

https://www.phoronix.com/news/FFmpeg-Lands-Whisper

you are viewing a single comment's thread
view the rest of the comments

[-] cupcakezealot@piefed.blahaj.zone 5 points 2 months ago

ugh so what's the alternative package to ffmpeg?

[-] data1701d@startrek.website 18 points 2 months ago

No need to panic in this case. While I hate OpenAI, there's two things to note here:

Whisper is an open source library for speech recognition rather than generative AI, run entirely locally. It's just using ML to do something we could already do with computers (speech recognition), but better.
They aren't even directly using the OpenAI version - they're using whisper.cpp, a port of the model.

[-] LunaChocken@programming.dev 16 points 2 months ago

Good luck with that.. ffmpeg is the de facto standard.

[-] TonyOstrich@lemmy.world 8 points 2 months ago

This is one of the actually decent uses of this model. I have used Whisper to transcribe to phone calls, and just the other week I had to export the audio from a video I was working on to run whisper to get subtitles for the video. It's still not a set it and forget it solution, but correcting it's small mistakes here and there is so much faster than manually transcribing the audio.

Given how modular ffmpeg is with the way the switches work a user never has to interact with that portion of the application. I can technically use ffmpeg to trsnscode an mp3 without ever using the video components.

this post was submitted on 13 Aug 2025

62 points (100.0% liked)

Linux

9810 readers

676 users here now

A community for everything relating to the GNU/Linux operating system (except the memes!)

Also, check out:

Original icon base courtesy of lewing@isc.tamu.edu and The GIMP

founded 2 years ago

MODERATORS

Ategon@programming.dev

anzo@programming.dev

dwraf_of_ignorance@programming.dev