Pages

Feb 7, 2020

Plug-in SDKs available for memoQ

This morning I was privileged to see a demonstration of some new search and data mining tools by a local developer, and I thought what a shame it was that the tools needed to implement some of that functionality were not available for memoQ. I "knew" this, because I asked a number of times over the years to have a look at an SDK for terminology plug-ins, for example, only to be told that such a thing was not easily available. That changed at some point, but the software artists formerly known as Kilgray forgot to mention that until I asked again today.

memoQ now has a number of software development kits available for creating plug-ins for a variety of functions. These can be seen and downloaded here: https://www.memoq.com/integrations/sdks.

Plugins must be developed with C# for the .NET Framework version 4.6.1. Each of the SDKs includes an example resource to use as a template as well as development and approval guidelines for distribution or internal use.

Feb 6, 2020

Speech-to-text in language services and learning: an update


On February 26 at 4:00 pm Central European Time (10 am Eastern Standard Time), I'll be presenting an overview of some popular and/or possible platforms for generating text from spoken words for professional work and language learning. As those who have followed this blog for years know, I have written quite a bit about this in the past and done a number of videos for demonstration and instruction using various platforms, but this is a field subject to frequent change and many new developments, so it is difficult sometimes to understand the value of one tool versus another for different applications.

The webinar is available free to anyone interested, and there will be time for questions afterward. We will compare and contrast Dragon NaturallySpeaking, iOS-based applications (including Hey memoQ), Google Chrome and Windows 10 for speech recognition work in translation and transcription, discussing some of the advantages and trade-offs with each platform working in translation environments and text-editing software, and the range of languages covered by each. Join us, and see if there are good fits for your speech recognition needs!

You can register here for the discussion.

Jan 28, 2020

Another look at Windows 10 speech recognition


A few years ago while on "holiday", I returned from dinner to find that my laptop had bluescreened. Panic time! It was Saturday night, and I still had quite a lot of text to translate and deliver on Monday morning. And up on the highest mountain in Portugal, I wasn't sure where I could find a replacement to finish the project, which was, at least, not utterly lost, because I had put it on a memoQ Cloud server for testing. The next day I got lucky: about 50 km away there was a Worten, where I picked up a gamer laptop with lots of RAM and an SSD. Well, not so lucky, as it was a Hewlett Packard Omen, with a fan prone to failure, but that's another story....

This new laptop was my first encounter with Windows 10. I had heard that this operating system offered improved speech recognition capabilities, and since I prefer to dictate my translations and downloading the 3 GB installation file for Dragon NaturallySpeaking (DNS) from my server at the office was going to take forever, I thought I would give Windows 10 speech recognition a try. I hadn't installed my CAT tool of choice yet, so I fired up Microsoft Word and began dictating. "Not bad," I thought. Then I tried it in my translation environment, and the results were a complete disaster. So I put that mess out of my mind.

Since then there have been some notable advances in speech-to-text capabilities on a number of platforms. But the best solution for my languages (German and English) with DNS became increasingly cranky thanks to neglect of the product by Nuance. Every week I read new reports of trouble with DNS in a variety of environments in which it used to perform very well. Apple's iOS 13 was a great leap forward of sorts for speech recognition and voice-controlled editing, but the new features are only available in English, and having Voice Control activated totally screws up my otherwise rather good dictation in German and Portuguese (or any other language). And don't get me started on the crappy vocabulary addition feature, which uses text entry alone with no link to actual pronunciation. Good luck with that garbage. It's not a bad solution in Hey memoQ with the additional command features added, but iOS dictation is not completely up to reasonable professional standards yet.

I probably would have given no further thought to Windows 10's speech-to-text features if it weren't for Anthony Rudd. We've corresponded a bit since I bought his excellent book on regular expressions for translators (and there's another practical guide for us coming soon from him!), and in a recent discussion he alluded to the use of Unicode with regex as a simple way of dealing with some things another colleague was struggling with. I was intrigued by this, and so for about half a day, I ran down a rabbit hole, testing Unicode subscripts and superscripts for a variety of purposes like fixing bad OCR of footnote markers and empirical formulae, autocorrecting common expressions for subscripted variables and chemical terms, including subscripts and superscripts in term bases and much more. Fascinating and useful stuff on the whole, even if some fonts don't support it well.

And of course I looked at using these special Unicode characters in speech-to-text applications. DNS had some funky quirks (not allowing numbers in the "spoken" version of terms, for example), but it worked rather well, so I can now say "calcium nitrate formula" and get Ca(NO₃)₂ without much ado. And for some reason it occurred to me to give Windows 10 speech recognition a try, just because I was curious whether vocabulary could in fact be trained. Indeed it can, and that feature is better than iOS 13 or DNS by far.

But first I had to remember how to activate speech recognition for Windows on my laptop again. When in doubt, type what you're looking for in the search box....

Notice I've pinned Windows Speech Recognition to my taskbar on the right, which is good for quick tasks.

Gesucht, gefunden
. Unlike other speech recognition solutions, the one in Windows 10 works only for the language set for the operating system. And options there are limited to English (United States, United Kingdom, Canada, India, and Australia), French, German, Japanese, Mandarin (Chinese Simplified and Chinese Traditional) and Spanish.

I put on my trusty Plantronics earset (the best microphone I've used for dictation tasks or audio in my occasional webinars in the past year) and began to dictate, first in Microsoft Word, which had shown acceptable results in my tests long ago. I found that adding vocabulary in the Speech Dictionary (accessed via the context menu in the dictation control element shown as a graphic at the top of this post) was dead simple.

The option to record pronunciation enabled me to record non-English names and words in several languages. And sure enough, the Unicode subscripts and superscripts worked, so I can now say CO₂ (I just dictated that) to my heart's content.

I was expecting a mess when I tried to use Windows 10 speech-to-text in a CAT tool, but it was not to be. It was brilliant, actually. I tried it in my copy of SDL Trados Studio, and with the scratchpad disabled so I could dictate directly into the target it worked well. No voice-controlled editing like I'm used to with DNS in memoQ, but that DNS feature does not work in SDL Trados Studio anyway, so this is no worse. But with the scratchpad box enabled (see the screenshot below), I could use voice commands to select and correct text or perform other operations. Brilliant!

After clicking or speaking "Insert", the text will be written to the target field with the proper formatting
So users of SDL Trados Studio who translate to a target language supported by Windows 10 speech recognition are probably better off not giving their money to Nuance, which I'm told can't even be bothered to make a 64-bit version of DNS now (which probably accounts for a lot of the trouble people have with that program.

I tested Wordfast Pro 5, which seems to confuse the speech recognition tool horribly, with source text displayed in the floating bar for some odd reason. But my earlier tests of Wordfast with DNS were equally unhappy, so somehow I'm not surprised. And I didn't test the Memsource desktop editor, which took the price a few years ago for the worst-ever DNS dictation results with a CAT tool. I'll leave that to someone with a much wider masochistic streak.

But what about memoQ, my personal environment of choice for most translation work? Equally brilliant, works just the same as SDL Trados Studio. No voice control for editing without the dictation scratchpad enabled (there, DNS has an advantage in memoQ), but with the scratchpad you can use the voice commands to edit before inserting in the target text field.


Wanna see this in action? Have a look at this short demo video:


I hope that the future will bring us more language support for Windows 10 dictation (Portuguese, Russian and Arabic, please!) and that other providers (like Google, if you're listening, and Apple, which never listens to anyone anymore except to spy on them with Siri) will expand the speech-to-text features offered, particularly to include sound-linked vocabulary training and better adaptation to individual users' speech. Five years ago when I began to investigate alternatives for non-DNS languages, I expected we would have more by now, and we do, but professional needs require all providers to raise their game.

Addendum: Someone asked me if Windows Speech Recognition is a cloud resource or a locally installed one which will work without an Internet connection. It's definitely the latter. So if you have lousy bandwidth or find yourself disconnected from the Internet, you can still use speech-to-text features.

And more: I use a lot of spoken commands for keyboard shortcuts when I work, so I did a little research and testing. It seems that Windows 10 speech recognition gives full access to an application's keyboard shortcuts via voice. So in memoQ, for example, I can dictate the insertion of tags, items from the Translation Results pane and a lot more. Watch out, Nuance. Windows 10 is going to kick your Dragon's scaly butt!

Save the date: IAPTI2020 in Timișoara, Romania

The International Association of Translators and Interpreters (IAPTI) will hold its sixth international conference in Romania this year, on October 3rd and 4th. Previous events in London, Athens, Bordeaux, Buenos Aires, and Valencia all drew an outstanding mix of professionals from around the world, and the four conferences I attended personally were among the best I've attended in two decades. Romania is a fascinating country with vigorous and talented professionals who have always been a pleasure to meet, and the political courage of its people in standing up to corruption in the government has been a source of inspiration in times where other countries continue to disappoint. So I'm looking forward to this event even more than the previous ones. And maybe I'll get a chance to see why SDL chose Romania to maintain its edge over the Clujless competition ;-)

Proposals for presentations are now being accepted until March 31st. Speakers will have 45 minutes followed by 10 minutes of questions and answers. Submissions should be made to the organizing committee at romaniaconference@iapti.org and include an abstract of no more than 200 words, a title and description, along with a short bio (up to 100 words) and a profile pic. Priority is given to topics not presented before at other professional events.

Jan 20, 2020

memoQfest 2020: call for papers

The twelfth memoQfest will be held in Budapest on June 10th to 12th, 2020. Presentation proposals are being accepted until January 30, 2020. A description of the theme for this year and suggested topics can be found here.

This event has always been a good opportunity to meet the development and support teams and exchange ideas with fellow users from around the world. I've always looked forward to this conference and the excellent people I meet there, and every time my usually high expectations have been exceeded.