Speech Recognition
Speech recognition is the ability of a machine to identify words and phrases in spoken language and convert them to a machine-readable format.
How Speech Recognition Works
Speech recognition is very complex, and many mathematical equations are involved. Let’s break it down into simple steps:
- First, the machine records the audio file.
- Then, it breaks down the audio to extract consonants and vowels (the building blocks of a text). After this process, we get a list of consonants and vowels.
- Using the word database of the language, the machine tries to identify words from the list and then make sentences, thus converting the speech into text.
Accessing Speech Recognition in Block Coding
Following is the process to add Speech Recognition capability to the PictoBlox Project.
- Open PictoBlox and create a new file.
- Select the coding environment as Block Coding.
- Next, click on the Add Extension button and add the Speech Recognition extension.
- You can find the Speech Recognition blocks available in the project.
Accessing Speech Recognition in Python Coding
Following is the process to add Speech Recognition capability to the PictoBlox Project.
- Open PictoBlox and create a new file.
- Select the coding environment as Python Coding.
- Next, click on the Add Modules/Libraries button and add the Speech Recognition extension.
- To access the library functions, you have to add the object declaration.
sr = SpeechRecognition()