Android Question: Synchronize text and audio together


I use this example to convert speech to text:

In the audio, the speaker may pause and then continue speaking. How can each pause be compensated for in the text by inserting spaces (as many as match the time that has elapsed) between the spoken phrases?

For example, this is what I would like to get:
i think >_______________<we all have passions>_______________<and you don't get to choose
But the conversion to text gives:
i think we all have passions and you don't get to choose

Is it possible to sample the sound wave, detect when it is silent (the speaker has stopped talking), and insert a space, "_", "*" or "+" at that point for the purpose of synchronization?
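
To make the idea concrete, here is a rough sketch in Python (not B4A code, and not taken from the example mentioned above). It assumes the audio is available as a list of 16-bit PCM samples and treats any stretch whose RMS level stays below a threshold as a pause. The function names, the threshold of 500, the 20 ms frame size, the 300 ms minimum pause and the 10 fill characters per second are all made-up values for illustration. It also assumes the recognized text is already split into phrases and that the detected pauses line up one-to-one, in order, with the gaps between those phrases, which real recordings will not always satisfy.

```python
import math

def frame_rms(samples, frame_len):
    """Return the RMS amplitude of each consecutive frame of frame_len samples."""
    rms = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms.append(math.sqrt(sum(s * s for s in frame) / frame_len))
    return rms

def find_pauses(samples, sample_rate, frame_ms=20, threshold=500.0, min_pause_ms=300):
    """Return (start_sec, end_sec) for each run of quiet frames lasting at least min_pause_ms."""
    frame_len = sample_rate * frame_ms // 1000
    levels = frame_rms(samples, frame_len)
    pauses = []
    run_start = None
    # The infinite sentinel at the end closes a quiet run that reaches the end of the audio.
    for i, level in enumerate(levels + [float("inf")]):
        if level < threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            if (i - run_start) * frame_ms >= min_pause_ms:
                pauses.append((run_start * frame_ms / 1000, i * frame_ms / 1000))
            run_start = None
    return pauses

def pad_transcript(phrases, pauses, chars_per_sec=10, fill="_"):
    """Join phrases, inserting fill characters in proportion to each pause length.

    Assumption: pauses[i] is the gap between phrases[i] and phrases[i + 1].
    """
    out = []
    for i, phrase in enumerate(phrases):
        out.append(phrase)
        if i < len(phrases) - 1 and i < len(pauses):
            start, end = pauses[i]
            out.append(">" + fill * round((end - start) * chars_per_sec) + "<")
    return "".join(out)
```

With something like this, `find_pauses(samples, 8000)` would give the pause positions in seconds, and `pad_transcript(phrases, pauses)` would build the padded line. In practice the threshold would probably need tuning against the background noise of the recording.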