Speech signal processing
Encyclopedia
Speech signal processing refers to the acquisition, manipulation, storage, transfer and output of vocal utterances by a computer. The main applications are the recognition, synthesis and compression of human speech:
  • Speech recognition
    Speech recognition
    Speech recognition converts spoken words to text. The term "voice recognition" is sometimes used to refer to recognition systems that must be trained to a particular speaker—as is the case for most desktop recognition software...

     (also called voice recognition) focuses on capturing the human voice as a digital sound wave and converting it into a computer-readable format.

  • Speech synthesis
    Speech synthesis
    Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware...

     is the reverse process of speech recognition. Advances in this area improve the computer's usability for the visually impaired.

  • Speech compression
    Speech compression
    Speech compression may mean different things:*Speech encoding refers to compression for transmission or storage, possibly to an unintelligible state, with decompression used prior to playback....

     is important in the telecommunications area for increasing the amount of information which can be transferred, stored, or heard, for a given set of time and space constraints.

  • Speaker diarization is the process of determining who spoke when in a signal.


.
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK