List of speech recognition software
The following list presents notable speech recognition
software with a brief synopsis of characteristics.
Open Source
- CMU Sphinx – open source under a BSD license.
- Julius – Japanese language only; BSD-style license.
- simon – GPL; uses Julius and HTK.
- iATROS – released under a GPL license.
- RWTH ASR – QPL-style license.
Macintosh
- Dragon Dictate for Mac – From Nuance Communications, released as a new version of MacSpeech Dictate in 2010.
- iListen – Product from MacSpeech, developed and supported for PowerPC-based Macintosh until ca. 2009.
- MacSpeech Dictate – By Nuance Communications. Dictation product for Intel-based Macintosh. Renamed and upgraded as "Dragon Dictate for Mac" in 2010.
- MacSpeech Dictate Medical – Dictation product for Intel-based Macintosh with included vocabularies for 54 medical and dental specialties. Developed by MacSpeech; acquired by Nuance Communications in 2010.
- MacSpeech Dictate Legal – Dictation product for Intel-based Macintosh with a vocabulary of legal terms. Developed by MacSpeech; acquired by Nuance Communications in 2010.
- MacSpeech Scribe – By Nuance Communications. Transcription product for automatically transcribing recorded dictation into text.
- Speakable items – Included with Mac OS and Mac OS X. Apple's speech synthesis and recognition technology is collectively called PlainTalk.
- ViaVoice – Product from IBM, developed and supported until ca. 2007.
- Voice Navigator – First voice control system for a graphical user interface, released by Articulate Systems in 1989.
Mobile Devices / Smartphones
Many cell phone handsets have basic dial-by-voice features built in. Smartphones such as the iPhone or BlackBerry also support this. A number of third-party apps have implemented natural-language speech recognition support, including:
- Dragon Dictation
- Dragon Search
- Google Voice Search
- Bing voice search
- Siri Personal Assistant
- Shoutout
- DriveSafe.ly Speech Recognition
- Vlingo
- Jeannie (Voice Actions) by Pannous for Android
Windows 7 built-in speech recognition
Windows Speech Recognition by Microsoft is the speech recognition system that comes built into Windows Vista and Windows 7. Windows Vista and Windows 7 include version 8.0 of the Microsoft speech recognition engine.
Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese.
Add-ons for Windows 7 speech recognition
- VoiceAttack – used primarily by the gaming community to allow hands-free keyboard and mouse input in Windows 7, Windows Vista and Windows XP. Its popularity lies mainly in its ease of use and extended feature set, which includes the ability to create multi-threaded macros.
- Voice Finger – software for Windows Vista and Windows 7 that improves the Windows Speech Recognition system by adding several extensions to accelerate and improve mouse and keyboard control.
- WSRToolkit – adds dictionaries, macros and other features similar to Dragon
- Trigamtech – adds features for medical users similar to Dragon
- Vocola – a macro language
Windows Speech Recognition is available only when the language of the operating system matches the language of Windows Speech Recognition. In Windows Vista Ultimate, the operating system language can be changed by installing a language pack from Windows Update. If the installed language pack is for a supported Windows Speech Recognition language, Windows Speech Recognition can be used in that language, provided it is also the language of the operating system.
See the Microsoft support article "The Windows Speech Recognition language must be the same as the operating system language in Windows Vista": http://support.microsoft.com/kb/934377
See also "How do I get additional language files?": http://windows.microsoft.com/en-us/windows-vista/How-do-I-get-additional-language-files
Note: Multilingual User Interface (MUI) packs require a license and are available only with Windows Vista Ultimate and Windows Vista Enterprise. Windows Vista Ultimate users can download MUI packs through Windows Update; Windows Vista Enterprise users should contact their system administrator about installing additional languages. Windows Speech Recognition cannot switch between languages.
Source: http://answers.microsoft.com/en-us/windows/forum/windows_vista-windows_programs/set-up-windows-speech-recognition-in-french/6b8f29c0-301b-488e-8fa5-e4ed560b75a5
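The language-matching rule described above can be illustrated with a minimal Python sketch. The function name, the language labels and the set of languages are hypothetical illustrations built from the supported-language list given in this article; none of this is a Microsoft API, and a real system would query the operating system for its display language.

```python
# Languages this article lists as supported by Windows Speech Recognition.
# The labels are illustrative strings, not official locale identifiers.
SUPPORTED_LANGUAGES = frozenset({
    "English", "French", "Spanish", "German",
    "Japanese", "Simplified Chinese", "Traditional Chinese",
})


def speech_recognition_available(os_language: str, speech_language: str) -> bool:
    """Return True when the rule described in the text would be satisfied.

    Hypothetical helper: the two conditions come from the text,
    the names and data structure do not.
    """
    # The requested speech language must be one of the supported languages.
    if speech_language not in SUPPORTED_LANGUAGES:
        return False
    # The operating system language must match the speech language.
    return os_language == speech_language


if __name__ == "__main__":
    print(speech_recognition_available("French", "French"))
    print(speech_recognition_available("English", "French"))
    print(speech_recognition_available("Italian", "Italian"))
```

Under these assumptions, a French operating system can use French speech recognition, while an English operating system cannot use it in French until the operating system language itself is changed.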
Windows 7 third-party speech recognition
- Dragon NaturallySpeaking from Nuance Communications – Successor to the older DragonDictate product. Focus on dictation. 64-bit Windows support since version 10.1.
- SpeechMagic – Developed by Philips Speech Recognition Systems, which was acquired by Nuance Communications. Medical industry focus according to Frost & Sullivan. Standalone or embedded.
- VoxCommando – Controls many media programs on Windows Vista and Windows 7, including XBMC, iTunes, MediaMonkey, Windows Media Center, Skype and many more through EventGhost. Spoken commands can be customized, and the software can create macros, launch applications and perform web searches. It scans the media library for music (and, with XBMC, TV and movie titles), so media can be requested by full or partial name. Free to try; the full version costs $25. The demo mode is fully functional but must be closed and restarted after a fixed number of commands are issued.
- Tazti – Multi-function software with versions for Windows 7, Windows Vista and Windows XP. Supports controlling almost any desktop application through the tazti speech recognition API; creating custom speech commands; playing PC games by voice; voice search; voice bookmark management; internet navigation; and voice control of the iTunes music player. Includes a lite dictation capability.
Windows XP or 2000 only
- e-Speaking – software for Windows XP that facilitates use of the Microsoft Speech API by adding the ability to create commands that perform custom actions.
- Microsoft Speech API – Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface and is thus unsuitable for end users.
Programs for controlling a computer's screens and desktop applications with claps or words
- Clap Commander – Human-computer interface based on hand-clap recognition, for controlling a computer from another part of the room by clapping. Runs on Windows XP, Windows Vista and Windows 7.
- Tazti – Multi-function software with versions for Windows 7, Windows Vista and Windows XP. Supports controlling almost any desktop application through the tazti speech recognition API; creating custom speech commands; playing PC games by voice; voice search; voice bookmark management; internet navigation; and voice control of the iTunes music player. Includes a lite dictation capability.
Interactive voice response
The following are interactive voice response (IVR) systems:
- AT&T Watson
- CSLU Toolkit
- HTK – copyrighted by Microsoft, but altering the software for the licensee's internal use is allowed.
- iSpeech ASR API
- Loquendo ASR
- Nuance Recognizer ASR
- Proteus Conversational Interface
- Simmortel Voice
- Tellme Networks (acquired by Microsoft)
- Parlance nameConnector
Unix-like x86 and x86_64 Speech Transcription Software
- Vocapia Research's VoxSigma software suite
Discontinued software
- SpeechWorks – from Nuance Communications.
- Quack.com (acquired by AOL) – The name has since been reused for an iPad search app.
- IBM ViaVoice – Embedded version still maintained by IBM. No longer supported for versions above Windows Vista. Untested above Mac OS X 10.4 or on Macintoshes with an Intel chipset.
- Game Commander 2 by Mindmaker – Gaming-oriented voice recognition. Voice commands can be assigned to issue keystrokes and key combinations. Computer Gaming World reviewed it in its March 2001 issue, giving it a 5/5 score.