Whisper's output for Hindi audio leads to a transcript in Arabic or Persian script #1662
Replies: 3 comments 4 replies
-
Don't bother using Whisper the accuracy rate is pretty low compared to letting GPT3.5 do the translation. |
Beta Was this translation helpful? Give feedback.
1 reply
-
@abhilashgupta Did you come across any solution/new insights for this? |
Beta Was this translation helpful? Give feedback.
2 replies
-
I am facing the same issue. When I call model.transcribe('filename', language='hi') the returned text is in urdu/persian. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I tried transcribing a Hindi (non standard dialect) audio file.
The API call went through fine and returned the transcription in the Devnagiri script.
However, when I run whisper transcribe on it locally, it recognises that the audio is in Hindi, but the transcription is in either Arabic or Persian script (or maybe Urdu, not sure).
Same when I force it to transcribe in Hindi, Nepali or Marathi language (almost same script as Hindi, one extra character).
Writing here because I couldn't see an option to raise an issue.
Steps to reproduce:
I cannot attach a
.wav
file here, so here is the link to the video -> https://www.youtube.com/watch?v=x2CnZZFf8JgTo use API, call the API and get back response in correct script.
To run locally, just run
whisper downloaded_audio.wav --language hi --model base
orwhisper downloaded_audio.wav --model base
There's a conversation about how Whisper's auto detection detects
Hindi
audio to beUrdu
(which makes sense given their similarity) at #118 but the API gives the response in the correct script and the toolkit outputs it in the wrong script even if I explicitly mention the languageBeta Was this translation helpful? Give feedback.
All reactions