SpeechSuper English Speech to Text API Supports Inverse Text Normalization

SpeechSuper released a new speech-to-text (speech recognition) API feature: inverse text normalization.


What is inverse text normalization? It converts the words to numerical or scientific expressions for better readability and understanding. For example, the recognized words 'seventeen dollars' can be converted to '$17' via inverse text normalization.


We now support inverse text normalization in the following domains.

1. Cardinal number and currency

 


SpeechSuper's English speech-to-text API can support number and currency conversion. For example, if the recognized words are 'It costs three hundred and one dollars.', it will be converted to 'It costs $301.'

2. Date


SpeechSuper's English speech-to-text API can support date conversion. For example, if the recognized words are 'I was born on November first nineteen ninety-seven.', it will be converted to 'I was born on November 1, 1997'.

3. Decimal number


SpeechSuper's English speech-to-text API can support decimal conversion. For example, if the recognized words are 'The investment is five point two million dollars.", it will be converted to 'The investment is $5.2 million."

4. Phone number


SpeechSuper's English speech-to-text API can support spelled-out numbers. For example, if the recognized words are 'My telephone number is six two one four zero two five zero.", it will be converted to 'My number is 62140250."

5. Scientific units


SpeechSuper's English speech-to-text API can support scientific units. For example, if the recognized words are 'I weigh fifty-seven kilograms", it will be converted to "I weigh 57 kg."

SpeechSuper's English speech-to-text API can support time expressions. For example, if the recognized words are "It's half past nine p.m.", it will be converted to "It's 09:30 p.m."

SpeechSuper English speech-to-text API has been evolving. You can try the demo here. If you’re interested, please contact us on the website.

----------------------------------------------------------

ABOUT US

Qiusi is a product manager in China’s EdTech industry focusing on language learning and AI. She enjoys writing stories. You can reach her at qiusi.dong@speechsuper.com

SpeechSuper provides cutting-edge AI speech assessment (a.k.a pronunciation assessment or pronunciation score) APIs for language learning products. Comprehensive feedback covers pronunciation score, fluency, completeness, rhythm, stress, liaison, etc. Languages supported include English, Mandarin Chinese, French, German, Korean, Japanese, Russian, Spanish, and more.

*Prior written consent is needed for any form of republication, modification, repost, or distribution of the contents.


Comments

Popular posts from this blog

How Accurate is Pronunciation Assessment?

DON’T Use Speech Recognition in Language Learning Apps