AWS updates Amazon Transcribe platform with 100 languages and new features

2023-11-28

AWS has added new languages to its Amazon Transcribe product, providing transcription based on generative AI for 100 languages, as well as a range of new AI capabilities for customers. Announced at the AWS re:Invent event, Amazon Transcribe can now recognize more spoken languages and has launched a call transcription feature. AWS customers use Transcribe to add speech-to-text functionality to their applications on the AWS cloud. According to the company in a blog post, Transcribe has been trained on "millions of hours of unlabeled audio data from over 100 languages" and uses self-supervised algorithms to learn human speech patterns in different languages and accents. AWS ensures that some languages are not overrepresented in the training data to ensure accuracy for less frequently used languages, similar to more commonly used languages. By the end of 2022, Amazon Transcribe will support 79 languages. According to AWS, the accuracy of Amazon Transcribe varies between 20% and 50% for many languages. It also offers automatic punctuation, custom vocabulary, automatic language identification, and custom vocabulary filters. It can recognize speech in audio and video formats, as well as in noisy environments. With improved language recognition, AWS states that the progress of Amazon Transcribe also extends to better accuracy in its call analytics platform, which is frequently used by its contact center customers. Amazon Transcribe Call Analytics, now also powered by generative AI models, can summarize interactions between agents and customers. AWS says this reduces post-call work of creating reports, and managers can quickly review information without having to go through the entire transcript. Of course, AWS is not the only company offering AI transcription services. Otter has been providing AI transcription for consumers and businesses and released a summarization tool in June. While not exactly the same, Meta announced that it is developing a generative AI-based translation model that can recognize nearly 100 spoken languages. AWS has also added additional capabilities to its Amazon Personalization product, which allows customers to provide personalized product recommendations or displays to customers, similar to how streaming services suggest new shows based on previous activity. AWS has added content generation, which will write headlines or email subject lines to connect with recommended lists.