Translation Tribulations: In HAMPsTr We Trust?

Dec 12, 2013

In HAMPsTr We Trust?

So many times when I hear the bright and happy predictions of commercial interests spouting nonsense about "translation as a utility" and hoping to feast on the roadkill of communication, who claim the highest of motives and show the basest motivations in their real acts, I hear a saxophone in my mind and a strained voice declaring that some day "they may understand our rage".

Machine pseudo-translation (MpT) and human-assisted machine pseudo-translation (HAMPsTr) are big business for the profiteers offering pseudo-solutions which typically start in the low six figures of investment. "Get on the MT boat or drown!" declared one such profiteer, Asia Online CEO Dion Wiggins at his unfortunate keynote presentation at memoQfest 2012 in Budapest.

It seems that each week a new story line to justify the linguistic lemmings' rush over the cliff appears. Recently I heard for the first time how translators suffer from the "blank page syndrome" (note: as of 25 December 2013 the entire blog with that "blank page" link has disappeared) and need machine generated babble for inspiration. I thought perhaps I was just an odd one, usually struggling with many ways to render a text from German into my native language and trying to choose the best, but experienced colleagues I asked about their fear of blank pages all asked me if I was joking.

This morning another colleague sent me a real screamer:

"Smaller language service providers (LSPs) process fewer words than larger ones... [this] puts them at a disadvantage when it comes to leveraging linguistic assets due to the smaller size of their terminology databases and translation memories (TMs). These less comprehensive language resources limit reuse on subsequent projects or for training statistical machine translation (SMT) software."

The author of that particular bucket of bilge is Don DePalma, head of the Common (Non)sense Advisory, an organization rightly seen as incompetent to interpret even third-grade level mathematics in their discredited report of dramatic rate decreases for translations, which turned out to be an artifact of calculations involving mismatched survey populations. In any case, the idea that small translation agencies or individual translators, who are generally more aware of and concerned with their clients' business are at any disadvantage by not being buried under mountains of monkeyfied mumbo-jumbo from bulk trashlation nearly ruined my keyboard as I spit my coffee laughing. Don deserves an extra Christmas bonus for that transcreation of the truth.

But the best was yet to come:

This inspiring graphic accompanied an article on how to motivate those involved in post-editing MpT in the HAMPsTr process promoted by Asia Online and others. There has been some vigorous and interesting speculation on where that arrow is pointing :-) The colleague who sent the link to me commented:

An interesting read from a humanitarian perspective. If they need to go to these lengths to "motivate" people, even those who are otherwise happy to swim in the muddy, toxic pond that these LSPs (your definition of the term) have created, one would have thought that they will understand that there is something wrong with their concept and goals. But why let the facts get in the way, I guess.

Indeed, those swimming in the pond do seem to have some real issues, even in cases frequently quoted as a HAMPsTr success. I long ago lost count of how many MpT advocates have told me of the wonderful words at Microsoft and Symantec, nicely extruded from controlled language sources and lovingly shaped into their final sausage form by happy hamsters. But this TAUS presentation by a Symantec insider tells another story:

And further indications that we are all getting mooned by the MpT Emperor can be heard in the excerpts of this recent GALA presentation in Berlin:

Unlike some of my colleagues, I have no fear of being replaced by Mr. Gurgle or any of his online Asian cousins however well-trained. What provokes some rage in me and more than a little concern is the callous dishonesty of the MpT profiteers and their transparent contempt for truth, the true interests of modern business and the health of those involved in language processes.

I have no little sympathy for the many businesses and individuals struggling to cope with the challenging changes in international business communication in the past 20 years. Nor do I feel that MpT has no role to play in communication processes; colleagues such as Steve Vitek have presented clear cases of value for screening of bulk information in legal discovery to identify documents which may need timely human translation and other applications. Kirti Vashee of Asia Online has commented honestly on numerous occasions on his blog and elsewhere about the functional train wreck of most "automated translation" processes one encounters, but still cannot take proper distance from the distortion and scaremongering practiced by the head of his team and others.

I am particularly concerned by the continued avoidance of the very real psychological dangers of post-editing MpT, which were discussed by Bevan and others in the decades before the lust for quick profits silenced discussions and research into appropriate occupational health measures. If Asia Online and others are truly concerned with developing sustainable HAMPsTr processes, then let them fund graduate research in psychology to understand how to protect the language skills and mental function of those routinely exposed to toxic machine language.

All this disregard for true value and truth reminds me so much of my days as an insider in the Y2K programmers' profit orgy: we all knew it was bullshit, but all the old COBOL programmers wanted to take their last chance to score big before they were swept into the dustbin of history. Some 60 years or so after it began, is machine translation ready to assume its place in that bin? The True Believers and profiteers will loudly say no, but at some point the dust will settle, the damage will be assessed, and we will find that the place of MpT is not at all what many imagine it to be today.

15 comments:

Aurora HumaránDecember 12, 2013 6:16 AM
Great post, Kevin!

The messages the illustration sends are many. The illustration is self-explanatory, I would say.

Why our colleagues continue buying the lie of MpT as an opportunity if it has not one but many negative sides? I fail to understand.

Thank you so very much for voicing so well the concern of many professional translators.

ReplyDelete
Replies
Extra SpeechDecember 13, 2013 3:09 PM
Great article, Kevin.
Very well written.
I can feel your rage.
Olivier den Hartigh
English to French Translator @ www.extraspeech.com
ReplyDelete
Replies
Dan NewlandDecember 14, 2013 9:35 PM
Kevin, just reposted it on my FB wall with the following header:A brilliant assessment--by the prince of translation gab, Kevin Lossner--of the rampant "McCarthyism" that slimy, bottom-feeder, MT hawkers in the translation industry are practicing. The message: Don't believe everything they're telling you. In fact, don't believe ANYTHING they're telling you.
ReplyDelete
Replies
Dan NewlandDecember 15, 2013 8:58 PM
Just to clarify, Kevin, the "don't believe everything" they're telling you I saw as your message. Mine, to the readers of my FB wall, is "don't believe ANYTHING they're telling you," because the wholesaler/commoditizers habitually lie through their teeth.
ReplyDelete
Replies
Kirti VasheeDecember 26, 2013 6:34 PM
Kevin,

While there are some overzealous MT proponents and some who even appear to be deliberately providing misinformation, there are also others who are trying to determine how this technology can be deployed in a productive way. I think it is more useful to be able to tell who is who rather than lump them all together.

For the record, the graphic above which you imply is from Asia Online is ACTUALLY from KantanMT (an instant Do-it-yourself MT option that I often warn users about in my blog) and the generic message that they propagate about "motivating post-editors" is quite different from the practices that Asia Online recommend as best practices for post-editing practices.

The Asia Online views on recommended post-editing practices are more accurately characterized in this post http://kv-emptypages.blogspot.com/2013/04/pemt-case-study-advanced-language.html or this http://www.asiaonline.net/EN/Resources/CaseStudies/AdvancedLanguageTranslation1.aspx . Dion’s statement was also very clearly directed at LSPs and not translators and I understand that you disagree with him as you have made this point repeatedly.

The dialogue (if one is even possible) would be more constructive and your point more credible if the facts you presented were more accurate. It is unfortunate that the state of affairs in much of the translation business is mutual disrespect between translators, LSPs and buyers. While it is important to bring about change and raise awareness about things that do not work in the business, I doubt very much if “rage” or continued disrespect are means to bring about a change that any of us would find desirable.
ReplyDelete
Replies
John MoranDecember 28, 2013 4:43 PM
Great post Kevin. I take your point that prolonged exposure to MT may have an impact on language skills but I fear it may be years before this is proven. Large- scale longitudinal studies are expensive and style is hard to quantify objectively so impact must be hard to measure.

In the meantime we find some translators are faster when they post-edit MT compared to translation from scratch so the perceived utility for these people will probably outweigh the perceived or even proven risk. It is vaguely depressing but I try to remain optimistic. Look at the history of electricity. It is (also?) carcinogenic but useful. Eventually, richer countries learned not to build houses under high-voltage pylons. I am not aware of any translator that only post-edits but I do know that post-editing is a growing niche so agencies may have to be careful not to turn budding enthusiastic translators into dumbed-down "demotivated" post-editors.

Unfortunately, your suggestion of per hour payment has been tried but it doesn't work. In practice no LSP will allow a translator to translate or post-edit one word per hour. A common model is X€'s per hour @ Y words per hour but this is, of course, a poorly disguised per word pricing model. In short, the dog continues to chase his tail.

So let's collaborate and find a solution so the mutt can finally get some rest!

What we see in our data is that except in exceptional circumstances where the MT is super-dooper high quality some translators are faster when they post-edit relative to translation from scratch and some are not. This is always my mental starting point when I think about the pricing problem. It makes sense. Humans come in all shapes and sizes and just like some people are good at sprinting or endurance running I think some people look at a garbled text and see a potential translation while others just see junk, or worse toxic junk.

Training and practice may help those who want to give MT post-editing the old college try for a few months but I suspect in many cases this is a waste of time. Luckily, MT is such a small niche that these translators will always have work so far from dying out like the dinosaurs they are likely to continue to thrive (no matter what the CSA write).

Just as you see these translators who are willing to post-edit as hamsters they may see you, rightly or wrongly, as a lumbering sauropod. As a trainee researcher, I prefer to watch from the sidelines and bean count.

The solution to this problem that is proposed by Asia Online and similar providers is to provide better MT so that even you Kevin might be won over by high quality MT output. It is a seductive argument and mildly effective for some people or for some accounts where there is a large budget but in your case I am going to speculate that better MT is unlikely to win you over, not least because texts that are that easy to translate lend themselves to Dragon Dictate.

Unfortunately, for the moment it is quite cheap to produce MT of quality X but expensive to produce MT of quality X+Y as it normally involves sourcing large quantities of in-domain training data or writing lots of rules. This is how MT companies on the consulting side of the spectrum make a living. Kantan and Microsoft Translator Hub inhabit the opposite end and both niches are valid. Warning people about competitors is a time honored tradition in business so I suspect Bill Gates and Tony O'Dowd will take Kirti's concerns in their stride.

Soooo...getting back to pricing models and conveniently ignoring light-posting (a valid but hard to define niche within a niche) if we assume some kind of QA process after post-editing how do we stop the dog from chasing his tail in the "pay by the word or pay by the hour" debate?

Simples....
ReplyDelete
Replies
John MoranDecember 28, 2013 5:59 PM
Drum roll!

Let translators decide on a job-by-job basis if they prefer to post-edit or translate from scratch. In practical terms this takes the form of a checkbox or radio button on a web portal. These are often bespoke portals developed by larger agencies or off the shelf solutions like XTRF, Plunet or XTM. Small LSP’s can just use a spreadsheet.

If a translator choses to post-edit on a job a discount is applied to the new words in the job. This discount has been negotiated in advance based on his or her perception of their working speed improvement using MT after a period of post-editing at full price.

Badda bing!

This is not a panacea that will protect translators from evil MT as LSP’s will naturally gravitate towards translators that provide the discount. They are cheaper. However, it does at least provide a pushback mechanism should MT quality degrade for any reason, e.g. a new technical writer on the buyer side and it is certainly fairer than current models that are predominantly unilateral.

Also, it is not a one-size-fits-all solution. A low cost agency called Ttranslated.net tried it but it does not suit their long-tail client base (where marketing is done mainly using Google Ad words). It does however work on large regular accounts as is indicated by the success of the MT post-editing program within the agency that pioneered it, Celer Solutions in Madrid.

Beware of vested interests. When I mentioned this model on LinkedIn Dion Wiggens remarked that he didn’t think it would work, as MT clients prefer to have more control of the supply chain. Rubbish! It works for Celer, 30% of their turnover is post-editing in a market where 5% is closer to the average. I suspect, we will never hear this model from companies like TAUS or Asia Online as it does not involve shoving the technology they sell down translators’ throats.

Remember, most MT is produced on the buyer side. This is where the bilingual data needed to train Statistical MT systems is collated and it costs engineering hours. This cost must be recouped so a discount request is inevitable. The problem is that in any industry where you see fair discounts you also see unfair ones. Worse still, what is fair for Mary is not fair for Bob.

In the end, within reason, I feel it should always be up to a translator to choose how to get a job done.

Disclaimer: my implication that MT may or may not be carcinogenic does not represent the view of my employer, www.cngl.ie. In fact, in my own trainee pScientific opinion it is pretty bloody unlikely.

p.s. Katrin Drescher from Symantec (one of our industrial partners) sat directly to my right in the TAUS breakout group she is summarizing so I was privy to the original discussion. I think she was pretty forthright about the fact that pricing models are still evolving and most people I know accept that we are in a period of transition in that regard. In fairness, her summary of the discussion is pretty unbiased and she is a trained translator herself. At least Symantec are trying to examine the issue. Some big buyers that use MT don’t care about anything other than price so quality has to suffer.
ReplyDelete
Replies
Rick WoydeDecember 29, 2013 4:08 PM
While I can certainly understand taking offense to the HAMPsTr designation the fact is MT is here to stay. Today Google Translate is the largest translation service provider in the world with over 200 million monthly users. Google claims they "translate more data in one day than all the translators in the world translate in a year". It's also true that many translators use Google Translate to create their first draft when translating. The lines between computer translation and human translation are blurring. Your post reminds of me of the angry comments I heard countless times many years ago when translation memory was first introduced. Translators have never been early adopters of new technologies, they seem to see them as threats to their livelihood. I understand that, disruption and change are never easy. After speaking with countless end clients who are the ones that determine our future I can tell you that translators biggest threat is not MT but their inability to meet these end clients business needs. Translation is viewed as a commodity because too often clients perceive little difference in translation quality from one supplier to the next. And that's one of the reasons why MT has flourished. While there's no doubt MT produces repetitive incomprehensible translation segments and that some MT suppliers oversell how to use MT, humans introduce variable translation errors of their own and often times oversell their own capabilities.

Recently Google has begun partnering with more LSP's and have incorporated human translation services into their offerings like Youtube, etc... The near future looks pretty clear to me and it includes both MT and human translation. Having said that there will always be room for the handmade craftsman who delivers value that clients can actually appreciate.
ReplyDelete
Replies