Could we add a possibility of generating subtitles for the recordings or even for the live programs on the fly?
An eighth gen quad core i5 can achieve 5x speed when using tiny.en model in whisper:
Just like with hardware transcoding, having a GPU would speed things up quite a bit and allow for the larger and more accurate models to be used.
If somebody is interested in playing with it, just install it via PIP and follow the Command-line usage
section
EDIT: There is also a standalone executable