Android Question Video voice to text

jkhazraji

Active Member
Licensed User
Longtime User
Hi everyone,
I am trying to develop an app to extract text from voice in a video stream online or as a file, for scientific purposes. What is the best way to do that. I know that speech can be converted to text but how to get the voice audio from the video is my problem? Any suggestions will be highly appreciated.
Have a good day
 

jkhazraji

Active Member
Licensed User
Longtime User
Is this thread where I begin??
 
Upvote 0

JohnC

Expert
Licensed User
Longtime User
I don't think there is currently a "free" way for us developers to use Speech-to-text on an audio stream - only google apps can do that.

As far as I know, we need to use a paid web service to do that (but maybe these web services will convert a few examples for free).
 
Last edited:
Upvote 0

moster67

Expert
Licensed User
Longtime User
as to your first question about how to extract audio from a video file, then you can use FFMpeg as long it is not in real-time.
You could use my FFmpeg-encoder library.
 
Upvote 0

PenguinHero

Member
Licensed User
Longtime User
It probably doesn't help much but Microsoft Teams has a real-time transcription feature for video calls.
So its not just Google apps that have the feature.

Of course how they do it I have no idea, but in the past I have used Python libraries to transcribe video files from online workshops that I ran.
 
Upvote 0
Cookies are required to use this site. You must accept them to continue using the site. Learn more…