This is the documentation for a previous version of our product. Click here to see the latest version.

Batch Container

High-Level Summary

This release provides internal bug and security fixes. It is recommended that customers on previous releases upgrade to this version.

The legacy V1 API and related output formats is no longer supported. V1 examples have been removed from all batch container documentation and will no longer work. We recommend use of the V2 API and the config.json for all supported features object as documented in the Speech API guide for 8.1.2.

It is now necessary to use processors that support Advanced Vector Extensions 2 (AVX2) when running the container in order to take advantage of latest performance optimisations.

What's New

8.1.2

Internal bug and security fixes

Issues Fixed

The following issues are addressed since the previous release:

Issue ID	Summary	Resolution Description
REQ-18031	Speaker Diarization Improvements	Speaker Diarization has been improved where there are two speakers in a file

Known Limitations

Issue ID	Summary	Detailed Description and Possible Workarounds
REQ-1409	Proteus HCL with `<unk>` causes out of memory error	A custom dictionary list that contains the word '' causes the worker to crash.
REQ-10160	Advanced punctuation for Spanish (es) does not contain inverted marks.	Inverted marks [ ¿ ¡ ] are not currently available for Spanish advanced punctuation.
REQ-10627	Double full stops when acronym is at the end of the sentence	If there is an acronym at the end of the sentence, then a double full stop will be output, for example: "team G.B.."
REQ-10634	Putting "-" as an item in `additional vocab` configuration will cause the container to fail	Do not enter just a "-" on its own in Custom Dictionary either as an additional vocab item or in the `sounds_like property`. Hyphens are still supported when entered as part of phrases or words

English (en)
German (de)
Spanish (es)
French (fr)
Portuguese (pt)
Japanese (ja)
Korean (ko)
Dutch (nl)
Italian (it)
Swedish (sv)
Danish (da)
Polish (pl)
Catalan (ca)
Hindi (hi)
Russian (ru)
Mandarin (cmn)
Norwegian (no)
Arabic (ar)
Bulgarian (bg)
Czech (cs)
Greek (el)
Finnish (fi)
Hungarian (hu)
Croatian (hr)
Lithuanian (lt)
Latvian (lv)
Romanian (ro)
Slovak (sk)
Slovenian (sl)
Turkish (tr)
Malay (ms)

Container images are labelled using the following scheme, where language codes adhere the ISO-639 standard:

batch-asr-transcriber-<language>:<version>

For example,

batch-asr-transcriber-en:8.1.2

Getting started

Batch Container

Quick Start Guide

Batch Container

High-Level Summary

Important Notices

What's New

8.1.2

Issues Fixed

Known Limitations

Supported Platforms

Installation

Pre-requisites

Related Documentation

Supported Languages