Real-time Container

High Level Summary

This release includes internal security fixes that affect the Real-time Container. Customers should upgrade to this version wherever possible.

Important Notices

It is now necessary to use processors that support Advanced Vector Extensions 2 (AVX2) when running the container in order to take advantage of the latest performance optimisations.

When using the enhanced model, it is also recommended to use hardware that supports the AVX512_VNNI flag for optimal processing performance. For more information, see the quick start guide.
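
The CPU requirements above can be checked on a Linux host before deploying. The following is a minimal sketch, not part of the product: it assumes the host exposes CPU flags in `/proc/cpuinfo` under the names `avx2` and `avx512_vnni`, and the function names are hypothetical.

```python
# Sketch only: assumes Linux-style /proc/cpuinfo text with a "flags" line.
REQUIRED_FLAG = "avx2"             # needed to run the container
RECOMMENDED_FLAG = "avx512_vnni"   # recommended for the enhanced model


def parse_cpu_flags(cpuinfo_text):
    """Return the set of CPU flags found in /proc/cpuinfo-style text."""
    flags = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            flags.update(value.split())
    return flags


def check_cpu_support(cpuinfo_text):
    """Report whether the required and recommended flags are present."""
    flags = parse_cpu_flags(cpuinfo_text)
    return {
        REQUIRED_FLAG: REQUIRED_FLAG in flags,
        RECOMMENDED_FLAG: RECOMMENDED_FLAG in flags,
    }


def check_local_host(path="/proc/cpuinfo"):
    """Read the local CPU information (Linux only) and check it."""
    with open(path) as handle:
        return check_cpu_support(handle.read())
```

Calling `check_local_host()` on the Docker host returns a dictionary with one boolean per flag. A `False` value for `avx2` suggests the host does not meet the requirement stated above.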

What's New

1.4.1

Bug fixes to report the correct usage data required for processing in the Usage Container
Internal security fixes

1.4.0

New, improved language packs for all 31 languages. By default, each language pack contains both a standard and an enhanced model. The standard model is used by default, with no user change required. To use the enhanced model, refer to the API guide
General improvements in pop culture terms recognition for the English language pack
Removal of foreign characters from English and German language packs
Profanity tagging in Italian and Spanish
The Chinese Mandarin language pack now supports Traditional as well as Simplified Mandarin. See the API guide for guidance on how to use this

Known Limitations

The following are known issues in this release:

Issue ID	Summary	Detailed Description and Possible Workarounds
REQ-10634	Putting "-" as an item in `additional vocab` configuration will cause the container to fail	Do not enter a "-" on its own in Custom Dictionary, either as an additional vocab item or in the `sounds_like` property. Hyphens are still supported when entered as part of phrases or words
REQ-13240	Chinese (cmn) container crashes occasionally when using certain additional vocabulary	Do not use whitespace characters in additional vocabulary `sounds_like`
REQ-16256	Audio swapping between 8kHz and 16kHz causes memory leak	Repeatedly swapping between 8kHz and 16kHz audio files can increase memory usage over very long periods, which can cause the container to crash. If memory usage in this scenario becomes excessive, it is recommended to restart the container
REQ-17771	Wide-space Unicode characters in Custom Dictionary cause jobs to fail	This is now fixed and wide-space characters should be accepted
REQ-20261	The Japanese language pack may output fewer punctuation marks in certain scenarios	In some cases, users may see fewer punctuation marks in the output when transcribing in Japanese. Adjusting punctuation sensitivity settings may improve output
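
The workarounds for REQ-10634 and REQ-13240 can be applied as a check before additional vocabulary is sent to the container. The following is a rough sketch under stated assumptions: it assumes additional vocabulary is held as a list of dictionaries with a `content` string and an optional `sounds_like` list. The `content` key and the function name are illustrative and are not taken from these release notes, so consult the API guide for the actual format.

```python
# Sketch only: the item shape ({"content": ..., "sounds_like": [...]}) is an
# assumption made for illustration; see the API guide for the real format.
def find_vocab_problems(additional_vocab, language):
    """Return a list of messages for items hit by the known limitations."""
    problems = []
    for index, item in enumerate(additional_vocab):
        content = item.get("content", "")
        # REQ-10634: a lone "-" as a vocab item makes the container fail.
        if content.strip() == "-":
            problems.append(f"item {index}: lone hyphen as vocab item (REQ-10634)")
        for alternative in item.get("sounds_like", []):
            # REQ-10634 also applies to sounds_like entries.
            if alternative.strip() == "-":
                problems.append(f"item {index}: lone hyphen in sounds_like (REQ-10634)")
            # REQ-13240: avoid whitespace in sounds_like for Chinese (cmn).
            if language == "cmn" and any(ch.isspace() for ch in alternative):
                problems.append(f"item {index}: whitespace in sounds_like (REQ-13240)")
    return problems
```

An empty list means none of the checked limitations apply. Hyphens inside words or phrases, such as "co-op", are not flagged, in line with the workaround text.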

Resolved Issues

The following issues are resolved in this release:

Issue ID	Summary	Resolution Description
REQ-20470	Silences included in total speech duration for Real-time transcription	Includes only spoken audio duration for Real-time transcription when sending data to the Usage Container

Supported Languages

These are the General Availability (GA) release notes for the Real-time ASR container images. The following languages are supported:

English (en)
German (de)
Spanish (es)
French (fr)
Portuguese (pt)
Japanese (ja)
Korean (ko)
Dutch (nl)
Italian (it)
Swedish (sv)
Danish (da)
Polish (pl)
Catalan (ca)
Hindi (hi)
Russian (ru)
Mandarin (cmn)
Norwegian (no)
Arabic (ar)
Bulgarian (bg)
Czech (cs)
Greek (el)
Finnish (fi)
Hungarian (hu)
Croatian (hr)
Lithuanian (lt)
Latvian (lv)
Romanian (ro)
Slovak (sk)
Slovenian (sl)
Turkish (tr)
Malay (ms)

Container images are labelled using the following scheme, where language codes adhere to the ISO 639 standard:

rt-asr-transcriber-<language>:<version>

For example,

rt-asr-transcriber-en:1.4.1
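
The labelling scheme above can be expressed as a small helper that builds an image label from a language code and a version. This is an illustrative sketch: the function and constant names are hypothetical, and the language codes are copied from the supported languages list in these notes.

```python
# Sketch only: helper names are hypothetical; codes come from the list above.
SUPPORTED_LANGUAGES = {
    "en", "de", "es", "fr", "pt", "ja", "ko", "nl", "it", "sv", "da",
    "pl", "ca", "hi", "ru", "cmn", "no", "ar", "bg", "cs", "el", "fi",
    "hu", "hr", "lt", "lv", "ro", "sk", "sl", "tr", "ms",
}


def image_label(language, version):
    """Build a label of the form rt-asr-transcriber-<language>:<version>."""
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    return f"rt-asr-transcriber-{language}:{version}"
```

For instance, `image_label("en", "1.4.1")` yields the example label shown above, while a code outside the supported list raises `ValueError`.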

Prerequisites

Docker (17.06.0 or above).
Login credentials (URL, username and password) for the Speechmatics Docker registry.
