How can I browse the VoiceRecognition object?

Beja

Expert
Licensed User
Longtime User
Hello friends!

In Erel's TTS example, the VoiceRecognition object is mentioned, but I can't see it in the libs list. Is it included in the core library?
I am mainly interested in the object's events, specifically a WordRecognized or PhraseRecognized event, if either exists.

Thanks in advance
 

Beja

Thanks Klaus, thanks Erel,

I was thinking of designing an MS Agent-style character and animating its mouth in sync with the spoken words, but the TTS doesn't have end-of-word events as the Windows-based TTS SDK does, so I will drop this idea (no event is raised except at the end of the whole spoken text). Also, I didn't see any speech formatting tags, only the pitch and one other method: nothing for a falling word-end, a rising word-end (for questions, etc.) or the rest of the MS SAPI tags. If such tags were available in B4A, one could make the device sing the text instead of reading it in a monotone.
On the other hand, the Voice Reco is for dictation only, no command and control.
 

mjtaryan

Active Member
Licensed User
Longtime User
On the other hand, the Voice Reco is for dictation only, no command and control.

I'm not certain, but I don't think that is necessarily true. I've downloaded and tried two or three VR control apps from the Play Store that sort of worked (they were early versions of the apps). Given that the mic input is converted to text, it should be possible to associate words/phrases with commands to the system or an app, provided one can a) build and use a custom verbal/text dictionary and b) capture or read the textual commands in the system or an app.

Ex. 1. Say "Home Screen" --> presses the Home button on the device. On the Home screen, say "Open folder 'Productivity'" --> opens a folder named "Productivity", if it exists. In the "Productivity" folder, say "Open Calendar" --> starts the calendar app.

Ex. 2. In a word processor (such as OfficeSuite), say "Menu" --> presses the Menu button, and the menu opens with various text options. Say "Format" --> the Format menu opens. Say "Font" --> the Font dialog box opens. Etc.

I have, naturally, made some assumptions in the above examples, one of which is that commands that can be executed by tapping the screen can also be translated or captured as text.
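To make the idea a bit more concrete, here is a rough sketch (in Python rather than B4A, just to show the logic) of how recognized text could be looked up in a custom command dictionary. Everything in it is made up for illustration: the command table, the action strings such as "go_home", and the function names are hypothetical, and it assumes the recognizer hands back one or more candidate text strings. Actually pressing Home or opening an app would still have to be done by whatever the platform allows.

```python
import re

def normalize(text):
    """Lowercase the phrase, drop punctuation, and collapse extra spaces."""
    cleaned = re.sub(r"[^a-z0-9\s]", "", text.lower())
    return " ".join(cleaned.split())

# Hypothetical command dictionary: normalized phrase -> action name.
# The action names are placeholders, not real system calls.
COMMANDS = {
    "home screen": "go_home",
    "open calendar": "launch:calendar",
    "menu": "press_menu",
}

def dispatch(recognized_text, commands=COMMANDS):
    """Return the action for a recognized phrase, or None if it is not a command."""
    return commands.get(normalize(recognized_text))

def dispatch_first(candidates, commands=COMMANDS):
    """Try each candidate transcription in order and return the first action found."""
    for candidate in candidates:
        action = dispatch(candidate, commands)
        if action is not None:
            return action
    return None  # nothing matched, so treat the input as plain dictation
```

With this table, dispatch("Open Calendar.") returns "launch:calendar", while an ordinary sentence returns None and can be handled as dictation. Exact matching keeps the sketch short; a real app would probably need fuzzier matching and parameters (such as the folder name in Ex. 1).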

I haven't played around with this yet, but theoretically device, system and app control should be quite possible. Take, for example, the ability to dial the phone by voice. Mine allows me to say "Call Sue" and the phone will dial Sue's number. It also allows me to say "Dial" (or "Call") followed by the number, and the phone will call that number.

My two cents' worth -- I make no guarantee that my comments are worth even that much :)
 