Unsupervised speaker indexing sequentially detects points where a speaker identity changes in a multi-speaker audio stream, and categorizes each speaker segment, without any prior knowledge about the speakers.[Read More]
Speech recognition is an essential component of any human computer interaction (HCI) scheme, which aspires to be natural.
Speech recognition is an essential component of any Human Computer Interaction (HCI) scheme, which aspires to be natural. Thus, high accuracy speech recognition is of critical importance in making natural man-machine interfaces.[Read More]
Emotions (anger, happiness, sadness, etc.) are inseparable components of the natural human speech. Because of that, the level of human speech can only be achieved with the ability to synthesize emotions.
Spoken language adds naturalness and efficiency to human-machine interactions with both children and adults.
One of the goals of this project is to develop methods for compressing speech signals for a distributed speech recognition task.[Read More]
This research aims at investigating several feature sets such as acoustic, lexical, and discourse features, and classification algorithms for classifying spoken utterances based on the emotional state of the speaker.[Read More]