This release adds HTTPS support for the appliance REST APIs, with the ability to upload your own security certificates. TLS 1.2 and 1.3 are supported; certificates and keys that are uploaded should be in the PKCS #8 format.
The following languages now support advanced punctuation: English (en), German (de), Spanish (es), French (fr), Dutch (nl), Danish (da), Turkish (tr) and Malay (ms).
It is recommended that customers on previous releases upgrade to this version.
The legacy V1 API that the Real-time Virtual Appliance currently supports will be discontinued in a future release. We recommend that customers familiarise themselves with the configuration object as described in the Speech API Guide. Future notices will be provided to announce the end of life of the V1 API, and provide detailed instructions on migrations to the V2 API.
The following issues are addressed since the previous release:
Issue ID | Summary | Resolution Description |
---|---|---|
REQ-12489 | The en-GB output locale sometimes outputs en-US spellings | The latest rebuild of English (en) addresses this issue. |
REQ-13430 | Virtual Appliance conflict of internal (docker) and customer subnets | A smaller subnet is now used to reduce the likelihood of conflicts. |
REQ-12405 | English Transcription Includes Russian Words | The latest rebuild of English (en) addresses this issue. |
REQ-12551 | Invalid Character in RTA Partials | Invalid characters were introduced into the English (en) language pack for the previous release; these have now been removed. |
REQ-10688 | Memory leak on client disconnect with pending buffered audio | Clients sending data to a worker faster than it can be processed could cause a memory leak; this has now been addressed. |
REQ-11812 | Transcript and word elements from RT can contain ˻ as a word | A rebuild of the English (en) language model has fixed this issue. |
REQ-11829 | No worker available in RTVA in unclean disconnects | This was caused by a race condition that has now been fixed. |
REQ-12541 | Combo words RTA 3.2.2 which are not present in RTA 3.2.1 | The previous release introduced a problem with the English (en) model where seperate words were sometimes joined together. This has now been fixed. |
REQ-13201 | Transcriptions replace the word "Sigma" with the symbol "σ" | This has now been fixed in the latest English (en) model |
REQ-13690 | Sessions that use large custom dictionary word lists may experience disconnects | Ensure that you use a WebSocket ping/pong timeout of 60 seconds on your client |
The following are known issues in this release:
Issue ID | Summary | Detailed Description and Possible Workarounds |
---|---|---|
REQ-1409 | Proteus HCL with <unk> causes out of memory error | A custom dictionary list that contains the word ' |
REQ-7549 | Memory leak affecting gRPC | There is a small memory leak in the gRPC Python server (https://github.com/grpc/grpc/issues/5913). |
REQ-10160 | Advanced punctuation for Spanish (es) does not contain inverted marks. | Inverted marks [ ¿ ¡ ] are not currently available for Spanish advanced punctuation. |
REQ-10627 | Double full stops when acronym is at the end of the sentence | If there is an acronym at the end of the sentence, then a double full stop will be output, for example: "team G.B.." |
REQ-11087 | Additional white space appearing before the very last end of sentence punctuation character. | Its been observed that additional white space can occur before a full-stop '.' in some transcripts. This can be parsed and removed by a post-processing step if necessary. |
REQ-11135 | 3.2.0 introduced unwanted hesitations in transcripts. | Due to changes in the way that training data is now ingested to improve the accuracy of spontaneous speech for English (en) there is a greater likelihood that hesitations will be included in the output transcrtips. We plan to support a hesitation filtering capability in a future release for customers that do not want to see hesitations on transcripts. |
REQ-11792 | Speaker change token positioning is incorrect | We are aware of a consistent mis-placing of the speaker change |
REQ-12202 | High memory usage when using custom dictionary | It has been observed that when using custom dictionary an additional 800-1700MB of memory is required (depending on the size of the wordlist used). |
Virtual Appliance image (OVA) for installation on:
See the Installation and Admin Guide for details on the minimum specifications for the VM. The maximum number of concurrent jobs (maxworkers) that you can run on a single appliance is 32.
There are four variants of the Real-time Virtual Appliance.
Variant | Image Size | Max. Disk Space | Languages |
---|---|---|---|
nano | 11GB | 40GB | en |
mini | 16GB | 40GB | en, de, es |
midi | 27GB | 60GB | en, de, es, fr, ko, ja, nl, pt |
maxi | 45GB | 80GB | en, de, es, fr, ko, ja, nl, pt, it, da, pl, ca, hi, ru, sv |
plus | 75GB | 80GB | en, cmn, no, ar, bg, cs, el, fi, hu, hr, lt, lv, ro, sk, sl, tr, ms |
Remove the license from your old appliance (see the Admin Guide), then re-import the new OVA and configure networking as per the Installation and Admin guide. You will need to re-apply the license code you have once the OVA has imported.
Upload the OVA to VMWare ESX, VMWare Workstation Player, or VirtualBox. See the Installation and Admin Guide for more information.