Here is a prototype of the short time average zero crossing rate. Note that the zero crossing rate is high near the beginning of the phrase, where the average magnitude is low. Together, the zero crossing rate and the average magnitude can be used in an algorithm to classify components of speech.
Average zero crossing [...]
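As a rough illustration of the measurement described above, here is a minimal Python sketch (not the Octave prototype behind the plot). It assumes a rectangular window, counts a zero sample as positive, and normalizes by the number of adjacent sample pairs in the frame; those choices, the function names, and the default `window` and `hop` values are assumptions made for this example.

```python
def sign(x):
    # Assumption: treat zero as positive so every sample has a definite sign.
    return 1 if x >= 0 else -1


def zero_crossing_rate(samples, window=320, hop=80):
    """Return the fraction of adjacent sample pairs that change sign, per frame."""
    if window < 2:
        raise ValueError("window must contain at least two samples")
    rates = []
    # Slide a rectangular window over the signal, advancing `hop` samples each time.
    for start in range(0, len(samples) - window + 1, hop):
        frame = samples[start:start + window]
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if sign(a) != sign(b))
        rates.append(crossings / (window - 1))
    return rates
```

A noise-like (unvoiced) frame changes sign often and gives a rate closer to 1, while a low-frequency (voiced) frame gives a rate closer to 0.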
The short time average zero crossing rate of a speech signal can be used in conjunction with the short time average energy (or magnitude) to discriminate between voiced speech, unvoiced speech, and silence. The short time average zero crossing rate of a digitally sampled speech signal is defined in Digital Processing of Speech Signals (Rabiner & [...]
My Beagle Board arrived today from Texas Instruments. I just ordered a power supply and assorted cables for it from Digi-Key, so it will still be a few days before I try to bring up Linux on it.
Here is a quick prototype of the short time energy function in GNU Octave for the speech sample “Mister Meryk”. The plot below shows the average magnitude of the phrase using a window size of 320 samples, calculated every 80 samples.
Average magnitude function.
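For readers who want to experiment with the same framing, the following is a small Python sketch of an average magnitude function using a 320-sample window evaluated every 80 samples. It is not the Octave code used for the plot; the rectangular window and the name `average_magnitude` are assumptions for illustration.

```python
def average_magnitude(samples, window=320, hop=80):
    """Return the mean absolute sample value of each frame."""
    values = []
    # One frame every `hop` samples; frames that would run past the end are skipped.
    for start in range(0, len(samples) - window + 1, hop):
        frame = samples[start:start + window]
        values.append(sum(abs(s) for s in frame) / window)
    return values
```

With these defaults, a signal of 480 samples yields three values, for the frames starting at samples 0, 80, and 160.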
Here is the code that I used to generate the [...]
The short time energy measurement of a speech signal can be used to determine voiced vs. unvoiced speech. Short time energy can also be used to detect the transition from unvoiced to voiced speech and vice versa. The energy of voiced speech is much greater than the energy of unvoiced speech.
Equation 1 Short time [...]
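To make the voiced versus unvoiced contrast concrete, here is a minimal Python sketch of a short time energy measurement, taken here as the sum of squared samples in each frame. The rectangular window, the function name, and the default parameters are assumptions for this example rather than details taken from the post.

```python
def short_time_energy(samples, window=320, hop=80):
    """Return the sum of squared samples in each frame."""
    energies = []
    # Rectangular window assumed; a tapered window would weight each sample first.
    for start in range(0, len(samples) - window + 1, hop):
        frame = samples[start:start + window]
        energies.append(sum(s * s for s in frame))
    return energies
```

Because the samples are squared, energy emphasizes loud (typically voiced) frames more strongly than the average magnitude does, so a jump in the returned values can mark an unvoiced to voiced transition.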
My project, Speed Reader, has been approved for the BeagleBoard Sponsored Projects Program. Now I’ll receive a BeagleBoard to prototype a Speech Timescale Modification application for playing audio books. I’m one step closer to building a real application!