How to convert .wav audio to text using Python, with a timestamp for each word
I want the speech recognition output to include a timestamp for each word. How can I achieve that? https://code.sololearn.com/cG0j0Ik4qxoz/?ref=app
2 Answers
The best approach seems to be to recognize reasonably short snippets of the audio, so that you know roughly when things were said. For example, you can pass each 10-second interval to the speech recognition function, and you'll know that whatever was recognized was said somewhere in that interval. This gives a timestamp per snippet rather than per word; shorter snippets narrow the range.
The following article shows that you can specify duration=10 when recording from a microphone:
https://stackabuse.com/introduction-to-speech-recognition-with-python/
You could chop up a WAV audio file into short segments in a similar way.
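Here is a rough sketch of that chopping step, using only Python's standard wave module. The names split_wav and transcribe_with_timestamps are made up for this example, and recognize is a placeholder for whatever speech recognition call you use (it is assumed to take the raw frame bytes of one segment and return text). Treat it as a starting point, not a finished solution.

```python
import wave


def split_wav(path, chunk_seconds=10):
    """Yield (start_sec, end_sec, frame_bytes) for consecutive chunks of a WAV file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        total_frames = wav.getnframes()
        frames_per_chunk = int(rate * chunk_seconds)
        start = 0
        while start < total_frames:
            # The last chunk may be shorter than chunk_seconds.
            count = min(frames_per_chunk, total_frames - start)
            frames = wav.readframes(count)
            yield start / rate, (start + count) / rate, frames
            start += count


def transcribe_with_timestamps(path, recognize, chunk_seconds=10):
    """Run a recognizer on each chunk and keep the time range it came from.

    `recognize` is a hypothetical callback: it receives the raw frame bytes
    of one chunk and returns the recognized text (or an empty string).
    """
    results = []
    for start, end, frames in split_wav(path, chunk_seconds):
        text = recognize(frames)
        if text:
            results.append((start, end, text))
    return results
```

Each result tells you that the text was spoken somewhere between start and end seconds. If you use the SpeechRecognition package, I believe Recognizer.record also accepts offset and duration arguments when reading from an audio file, so you may be able to skip the manual splitting, but check its documentation to be sure. Also keep in mind that a word falling on a chunk boundary can get cut in half, so overlapping the chunks slightly might help.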
Thanks a lot, Josh Greig