A Brief Introduction to Speech Technology

Speech technology has been an active research topic for more than five decades. It aims to reproduce human speech and to respond to it, which makes it a valuable tool for both human-to-human and human-to-machine communication. It has grown increasingly popular in recent years, and almost everyone now benefits from its advances. This series of articles discusses various aspects of the technology: its past, its current state and its future.

Speech technology can be divided into these categories:

• Speech synthesis, which is the artificial production of human speech

• Speech recognition, which converts spoken words into text

• Speaker recognition, which identifies who is speaking

• Speaker verification, which confirms that a speaker is who they claim to be

• Multimodal interaction, which offers alternative ways of interfacing with devices

Digital assistant systems like Apple Siri, Google Now, Microsoft Cortana and Amazon Echo use a combination of all of the above technologies. Before 2010, speech technology was not widely known. It was quite limited and was mostly used in voice dictation software such as Nuance’s Dragon NaturallySpeaking. Apple Siri was a game changer in 2011 and re-introduced the world to the potential of speech technology. Since then, other tech giants such as Google, Microsoft and Amazon have introduced their own speech-based products. Some of these products are no longer just an app on a cell phone, as Apple Siri and Google Now are. Microsoft has announced that Cortana, which was originally developed as an app for Windows Phone, will be a part of Windows 10. Amazon has introduced Amazon Echo, a voice-controlled hardware device. Another interesting use of speech technology is Microsoft Skype’s translation system, which translates voice to voice in real time.

Speech technology is not limited to voice dictation software or personal digital assistant applications. It has many other potential uses, for example in the health sector. Upcoming articles will cover other applications of this technology.

