Hello,
I currently have an app that works using two phones. Person one presses a button, speaks into phone one, the speech is recognized and displayed as text, person one then presses a Send button and the text is sent to the second phone.
I use this as an aid when speaking to the Deaf.
The ultimate solution would be to dispense with the Send button and allow continuous, real-time speech to be recognized and transmitted.
I think this should be possible. Has anyone done this or can anyone point me in the right direction?
I envision the following software algorithm scenario.
It is working well. I'll have to investigate if it can be used in this continuous, real-time mode.
Thanks,
Barry.
I currently have an app that works using two phones. Person one presses a button, speaks into phone one, the speech is recognized and displayed as text, person one then presses a Send button and the text is sent to the second phone.
I use this as an aid when speaking to the Deaf.
The ultimate solution would be to dispense with the Send button and allow continuous, real-time speech to be recognized and transmitted.
I think this should be possible. Has anyone done this or can anyone point me in the right direction?
I envision the following software algorithm scenario.
- The person using phone 1 presses a Start analog capture button to begin capturing buffers of analog speech data.
- I think Android returns interim buffers, say a few seconds worth of analog data along with the current amplitude as one of its callback events, while continuing the analog capture.
- Each interim analog data buffer would be sent off to Google for speech recognition.
- The second analog data buffer would probably be available before the first results are returned, but it is also sent off to Google.
- As the results are received from Google they are displayed as text and sent to the second phone.
- This continues until person one presses the Stop analog capture button.
It is working well. I'll have to investigate if it can be used in this continuous, real-time mode.
Thanks,
Barry.
Last edited: