Jan 22, 2019

The Ultimate Comparative Screwjob Calculator for translation rates

Some years ago I put out a number of little spreadsheet tools to help independent translators and some friends with small agencies to sort out the new concepts of "discount" created by the poisonous and unethical marketing tactics of Trados GmbH in the 1990s and adopted by many others since then. One of these was the Target Price Defense Tool (which I also released in German).

The basic idea behind that spreadsheet was the rate to charge on what looked to be a one-off job with a new client who came out of nowhere proposing some silly scale of rate reductions based on (often bogus and unusable) matches. So, for example, if your usual rate was USD 0.28 per word and that's what you wanted to make after all the "discounts" were applied, you could plug in the figures from the match analysis and determine that the rate to quote should be USD 0.35, for example.

Click on the graphic to view and download the Excel spreadsheet
Fast forward 11 years. Most of the sensible small agencies run by translators who understand the qualities needed for good text translation are gone, their owners retired, dead or hiding somewhere after their businesses were bought up and/or destroyed by unscrupulous and largely incompetent bulk market bog "leaders" with their Walmart-like tactics. Good at sales to C-level folk, with perhaps a few entertaining "inducements" on the side, but good at delivering the promised value? Not so much in cases I hear. And many of the good translators who haven't simply walked away from the bullshit have agreed to some sort of rate scale based on matching (despite the fact that there is no standard whatsoever on how different tools calculate these "matches" and now with various kinds of new and nonsensical "stealth" matches being sneaked in with little or no discussion).

So now, it's not so much whether a translator will deal with a given rate scale for a one-off job, but more often what the response should be to a new and usually more abusive rate scale proposed by some cost- and throat-cutting bogster who really cares enough to shave every cent that an independent translator can be intimidated to yield, thus destroying whatever remaining incentive there might be to go the extra mile in solving the inevitable unexpected problems one might find in many a text to translate.

And this, in fact, was the question I woke up to this morning. I told the friend who asked to go look for my ancient Target Price Defense Tool, but I was told that it wasn't helpful for the case at hand. (It actually was, but because of the different perspective that wasn't immediately obvious.)

Click on the graphic to view and download the Excel spreadsheet
So I built a new calculation tool quickly before breakfast which did the same calculations but in a little different layout with a somewhat different perspective: the Comparative Screwjob Calculator (screenshot above), because really, the point of these match scales is to screw somebody.

Shortly after that, I was asked to include the calculations of "internal matches" from SDL Trados (which are referred to as "homogeneity" in the memoQ world, stuff that is not in the translation memory but where portions of text in the document or collection of documents have some similarity based on their character strings - NOT their linguistic sense). And of course there are other creatively imagined matches in some calculation grids - for subsegments in larger sentences (expect to get screwed if an author writes "for example" a lot) or based on some sort of loser's machine pseudo-translation algorithms that some monolingual algorithm developer has decided without evidence might save the translator a little effort - cut that rate to the bone!). So I expanded the spreadsheet to allow for additional nonsense match rate types ("internal/other") and to compare a third grid which can be used, for example, to develop a counterproposal if you are currently billing based on an agreed rate scale and a new one is proposed (all the time keeping in view how much you are losing versus the full rate which might very well be getting charged to the end customer anyway).

Click on the graphic to view and download the Excel spreadsheet
The result was the Ultimate Comparative Screwjob Calculator (screenshot above). Now that's probably too optimistic a name for it, because surely those who think only of translators as providers of bulk material to be ground up for linguistic sausage have other ways to take their kilos of flesh for the delivery mix.

If this all sounds a bit ludicrous, that's because it is. I am a big fan of well-managed processes myself; I began my career as a research chemist with a knowledge of multivariate statistical optimization of industrial processes and used this knowledge to save - and make - countless millions for my employers or client companies and save hundreds of jobs for ordinary people. I get it that cost can be a variable in the equation, because starting some 34 years ago I began plugging it in to my equations along with resin mix components and whatnot.

But the objective I never lost sight of was to deliver real value. And that included minimizing defects (applying the Taguchi method or some other modeling technique or just bloody common sense). And ensuring that expectations are met, with all stakeholders (don't you hate that word? it reminds me of a Dracula movie in my dreams where I hold the bit of holly wood in my hand as we open the coffin of thebigword's CEO) protected. That is something too few slick salesfolk in the bulk market bog understand. They talk a lot of nonsense about quality (Vashinatto: "doesn't matter"; Bog Diddley: "no complaints from my clients who don't understand the target language", etc.). But they are unwilling to admit the unsustainable nature of their business models and the abusive toll it takes on so many linguistic service providers.

So use these spreadsheets I made - one and all - if you like. But think about the processes with which you are involved and the rates you need to provide the kind of service you can put your name to. The kind where you won't have to say desperately and mendaciously "It wasn't me!" because economic and time pressures meant that you were unable to deliver your best work. That goes as much for respectable translation companies (there are some left) as well as for independent service professionals who want to commit to helping all their clientele receive what they need and deserve for the long run.


Jan 14, 2019

Specialist terminology taxonomies from Cologne Technical University

Click and thou shalt go there!

Early in the last decade when I lived near Düsseldorf and began translating full time, the nearby technical university in Cologne had an excellent terminology studies program run by Prof. Klaus-Dirk Schmitz, who also had a long history in Saarbrücken back in my exchange student days there. I had the pleasure of meeting this gentleman at various professional events for Passolo (before it was swallowed by SDL) or other occasions, and I remain impressed by the professional qualities of some of the colleagues he helped to educate. At some point he or one of his students pointed me to an interesting online collection of specialist terminologies created by students at the university as part of their degree work. While student work must be viewed carefully, on the whole I found these collections to be of better quality than quite a few put together by "professionals", and their structured taxonomies were also interesting to people like me who enjoy such things. And occasionally the terminologies were rather helpful for certain technical topics I translate.

But over the years I simply forgot about them for the most part, and when they did come to mind I assumed that the old MultiTerm engine used to handle the data on the site would no longer work. That latter assumption may be partly correct; I found the collection again, noted that the most recent addition to the term library was a bit over a decade ago and that the search functions don't seem to work with Chrome, though I am able to browse the structured taxonomies without difficulty.


Looking through the list of term collections, I saw one that would be particularly useful for a current personal effort: beekeeping. One of my projects for the year ahead is to add some hives to the garden to see if I can improve some of the vegetable, fruit and nut yields. A local Portuguese beekeeper and I have been trading poultry, and he kindly provided me with a copy of his thesis on apiculture and offered assistance to get me started. So I am reading up on the subject in several languages, thinking to put together a good terminology to make cross-referencing the concepts between English, German and Portuguese a little easier.

One thing I never tried to do before was to extract data from the FH Köln (Cologne Technical University) site into any sort of terminology management tool. I don't think they were ever intended to be used that way, and at the time most of the collections were put together, translation environment tools were much less widely used by professionals and university study programs than they are today. But after a little thought and experimentation, mining the pages proved to be quite simple.

Here's how I did it:
  1. Opened a collection of interest and expanded the folder tree for a particular language completely, then selected and copied all the text in that frame:

  2. Pasted the copied content as plain text (no formatting) into Microsoft Word. The numerical codes were followed directly by the text entries.
  3. Removed parentheses by searching and replacing with nothing.
  4. Inserted a tab between the number codes using search and replace with wildcards (regex of a sort):

  5. Switched to the other language in the term collection and repeated steps 1 through 4.
  6. Transferred the contents to Excel (various ways to do this).
  7. Imported the Excel file with the specialist terms into a term base in my translation environment tool of choice.


Jan 13, 2019

A second look at Wordfast Pro


The generally good impression made by Wordfast Anywhere in my recent tests inspired me to take a new look at the premium environment for freelance translators: Wordfast Pro 5. A lot has changed with Wordfast Pro since its early days, and much of what I found troublesome with early versions has been corrected. A new look has also been on my agenda for a while since I realized that two new formats were introduced (TXLF, an XLIFF format, and GLP, a zipped project package format for Wordfast), which can be handled by my usual translation environment but (currently) with a few extra steps required compared to the old TXML format.

The installation took about a minute and started off with a good impression from the warning about cloud drive synchronization:


I've seen a number of people come to grief with other tools when their projects, translation memories or other resources are stored in Dropbox or similar configurations so they can be shared by installations on different computers, and I appreciate Wordfast's attempt to warn people off from this dodgy practice. If you want to share resources, play it safe and stick them in Wordfast Anywhere.

At first, the program is in demo mode, which limits translation memories to 500 translation units (TUs) and does not allow access to remote resources such as Wordfast Anywhere. Fortunately, there is a fully functional 30-day trial available, and it took all of about two minutes to fill out the simple request form, receive the mail with the trial license key and activate it in Wordfast Pro 5.

I was really enthusiastic about the clean, uncluttered feel of the interface. There's a lot more functionality in SDL Trados Studio or memoQ, but all the myriad features of those environments can be intimidating to some, and even for experienced users navigation can be confusing at times to locate some obscure setting or feature. Not in Wordfast Pro 5: the features mostly aren't there, and what is there can be found without much ado. Given the limited scope of mastery and inclination to learn on the part of many hamsters running on the freelance translation wheel, this can be a definite advantage.

On the Help ribbon I saw a Feedback icon. I don't know why, but this inspired a weird enthusiasm in me, so I clicked it, and when the dialog appeared, I wrote a quick note to the development team to say what a great impression the new user interface was making before I had even started to do anything useful with it. I noticed that the feedback dialog also had options to include files and projects in case of a problem, which I also thought was really cool. Something like that in other tools would be very helpful to their users and probably encourage more suggestions and interaction.

It was really easy to navigate through the ribbon menus and explore the configuration options. I was pleased to see that different sets of keyboard shortcuts were available to make the ergonomics easier for users of some other tools.


But SDLX? Huh? That's kind of Jurassic. No memoQ shortcuts, but no problem. I can customize, right? Yes... but I soon discovered that I apparently had no way to save my customized shortcuts as "memoQ style" or whatever else I might want to call them. And then I noticed that I probably can't save the configuration to move it onto a second computer where the terms of the license agreement allow private individuals to install another copy. And, hmmmm, no option to print a cheat sheet I can refer to as I learn the keyboard shortcuts. memoQ users are kind of spoiled on both counts, I guess.

One thing I was very eager to try was the connection to my translation memories and glossaries in my Wordfast Anywhere account. That proved to be quite straightforward: it worked exactly as the clear instructions of the Wordfast Pro Help described the process.

So I was ready to try out some translation, maybe a little dictation with Dragon NaturallySpeaking. I imported a little text file to get started:


WTF??? Now I know what the problem is here, but importing the same file to Wordfast Anywhere gives this result:


And in memoQ:


The import with the simple text filter of Wordfast Pro 5 (version 5.7) does not map the characters correctly. I had to change the source text file from ANSI to UTF-8: not a big deal for me, but a lot of translators I know will be over their heads right there.

The choice of import filters available is fairly good as one might expect from most professional translation environments these days, but two important things were missing for me. There seems to be no option to cascade filters, useful for example if you have a Microsoft Excel file containing HTML text to translate, and there is also no facility for configuring custom regex-based text filters or tagging text content which needs protection (such as placeholder text). This won't be an issue for a lot of translators, but for those who deal with challenging, often unexpected formatting issues in customers' files it could be a real pain in the neck.

On to dictation... Dragon NaturallySpeaking (DNS) seemed to perform well. I had to turn off the DNS dictation box by unmarking thew checkbox in its dialog. Text was then transcribed well into the target field, and my spoken keyboard shortcut to confirm a segment and go on to the next one worked perfectly. Then I misspoke and used a spoken editing command to correct my error. Nothing happened. I tried several different spoken selection and editing commands that I use every day in memoQ. Nothing worked. Shit. What we have here is a failure of compatibility. The full potential of Dragon NaturallySpeaking cannot be used in Wordfast Pro 5.

I explored the settings further... quality assurance. That looked pretty good; the options were easy to understand and I could set them as I wanted to check my work. But the QA settings I need vary in many projects, and sometimes I want to do a QA check on just one aspect like tags or maybe terminology. Wordfast Pro 5 offered no facility to save a QA configuration or profile and load it as one might do in SDL Trados Studio or memoQ. This too would be a deal-breaker for me, alas. I depend on a full hand of memoQ quality assurance profiles for selective checking of important quality parameters in my jobs. Toggling settings back and forth in Wordfast would drive me nuts. Still, this wouldn't disturb many CAT tool users who can barely be bothered to run a spelling check on their work, much less run a check or missing or mismatched tags.

In contrast to my conclusions years ago, I can now say that Wordfast Pro is "ready for prime time". It has a nice, clean, easy to navigate interface, and the Help descriptions are clear, if somewhat idiosyncratic in their spelling at times. The options are limited compared to other professional tools I use which have comparable costs of use, but that may be perceived as an advantage by many... until they need what's not there, which is probably inevitable if they work at translation in a full-time freelance capacity. Over the years I have heard many good things about Wordfast support, so I expect that users will at least find help and advice when they need it.

The integration with the online Wordfast Anywhere resources is also simple and good. That's a major point in favor of this tool and should be very helpful for collaboration.

Overall, I think that users who invest in a Wordfast Pro license will get their money's worth. A three-year license costs €400, with three-year renewals costing half the list price after that. If you aren't willing to pay after the three years, your license will stop working (unlike SDL Trados or memoQ, where the current license models allow you to keep working with the software long after your claim to support and upgrades has lapsed - basically "forever" if nothing strange happens with newer operating systems).

The possibilities for collaboration between Wordfast users and those who work with other environments are much better than they used to be, and in just a short time I was able to see how I can prepare projects for a colleague using Wordfast Pro 5. (SDL Trados packages can apparently be handled, though that's not the case for memoQ project packages prepared with the PM Edition - I would have to make MQXLIFF files and export TM and term base resources.) And I hope that this situation will only get better, with more environments offering various kinds of Wordfast resource integration and Wordfast acquiring new capacities to work with other formats and resources.