Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WhisperX missing pieces of transcript compared to Whisper API #931

Open
tomhayw opened this issue Nov 25, 2024 · 2 comments
Open

WhisperX missing pieces of transcript compared to Whisper API #931

tomhayw opened this issue Nov 25, 2024 · 2 comments

Comments

@tomhayw
Copy link

tomhayw commented Nov 25, 2024

I've been using WhisperX but I keep coming across issues whereby parts of the transcript are just missing entirely (i.e. half of sentences). I have ran the same audio file through OpenAI's Whisper API and it works perfectly fine.

Has anyone else had this issue and if so, how did you remediate it?

Thanks.

@sulutian
Copy link

You can try lowering --vad_onset 0.1 --vad_offset 0.1

@klausackermann
Copy link

I also obtained this and it is much better switching back to model large_v2 instead of model large_v3. The large_v3 also shows halo in the middle of transcribed, which large_v2 does not.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants