SpeechSuper English Speech to Text API Supports Inverse Text Normalization

September 27, 2022

SpeechSuper released a new speech-to-text (speech recognition) API feature: inverse text normalization.

What is inverse text normalization? It converts the words to numerical or scientific expressions for better readability and understanding. For example, the recognized words 'seventeen dollars' can be converted to '$17' via inverse text normalization.

We now support inverse text normalization in the following domains.

1. Cardinal number and currency

SpeechSuper's English speech-to-text API can support number and currency conversion. For example, if the recognized words are 'It costs three hundred and one dollars.', it will be converted to 'It costs $301.'

2. Date

SpeechSuper's English speech-to-text API can support date conversion. For example, if the recognized words are 'I was born on November first nineteen ninety-seven.', it will be converted to 'I was born on November 1, 1997'.

3. Decimal number

SpeechSuper's English speech-to-text API can support decimal conversion. For example, if the recognized words are 'The investment is five point two million dollars.", it will be converted to 'The investment is $5.2 million."

4. Phone number

SpeechSuper's English speech-to-text API can support spelled-out numbers. For example, if the recognized words are 'My telephone number is six two one four zero two five zero.", it will be converted to 'My number is 62140250."

5. Scientific units

SpeechSuper's English speech-to-text API can support scientific units. For example, if the recognized words are 'I weigh fifty-seven kilograms", it will be converted to "I weigh 57 kg."

SpeechSuper's English speech-to-text API can support time expressions. For example, if the recognized words are "It's half past nine p.m.", it will be converted to "It's 09:30 p.m."

SpeechSuper English speech-to-text API has been evolving. You can try the demo here. If you’re interested, please contact us on the website.

----------------------------------------------------------

ABOUT US

Qiusi is a product manager in China’s EdTech industry focusing on language learning and AI. She enjoys writing stories. You can reach her at qiusi.dong@speechsuper.com

SpeechSuper provides cutting-edge AI speech assessment (a.k.a pronunciation assessment or pronunciation score) APIs for language learning products. Comprehensive feedback covers pronunciation score, fluency, completeness, rhythm, stress, liaison, etc. Languages supported include English, Mandarin Chinese, French, German, Korean, Japanese, Russian, Spanish, and more.

*Prior written consent is needed for any form of republication, modification, repost, or distribution of the contents.

Comments

andurieMarch 21, 2025 at 2:00 AM
This comment has been removed by the author.
ReplyDelete
Replies
andurieMarch 21, 2025 at 2:01 AM
This text-to-speech software sounds incredibly versatile and well-designed! AI Voice Generator
ReplyDelete
Replies

Add comment

SpeechSuper Blog