Search me!

May 20, 2023

Taking the trash out of trashlation in translation

Let’s be real, people. We need to start using the word trashlation to describe what too often passes for professional activity in the language services sector these days.

I’m looking at all of you DeepL fans among others. 🧐  And especially the LSPs who try to sell that garbage or other post-edited spew as translation. Such nonsense is seldom fit for purpose, and when I am not bent over the Porcelain God in “prayer”, I am always reminded of Mark Twain’s comment that the difference between something “close” to the right word and the right word itself is akin to the difference between a lightning bug and lightning. I would suggest that in some cases the consequential difference could be compared to being strapped to the electric chair for execution as opposed to watching the Natural magnificence of a lightning storm.
Recently, a German publisher sent me a very interesting file containing a table with the original German text in the first column, a post-edited DeepL result by a German who has lived abroad for some decades in the second column, in a third the post-edited result from an American living in Germany, and in the fourth column the results of some beta program called DeepL Write, yet another iteration of all of this AI trash the fanboys and -girls all declare will replace human writing efforts. Although I prefer to be left out of any discussions whatsoever involving the linguistic quality of MT output, the submitter seemed pleased by what he saw and insisted on having my opinion. So I gave it to him. Unfiltered. 😃  I suggested leaving me alone with that nonsense, and said I considered all of it to be hopeless trash. Actually, I might have been harsher than that, as I don't consider that sort of “writing“ acceptable or in any way fit for purpose, any purpose with which I'll be associated. It is physically painful for me to look at verbal garbage like that. I would rather stick my nose in a garbage can full of rotting meat and inhale deeply. The original German text was not bad, and it deserved real translation like someone I recommended could provide for that subject matter.

In a long consulting gig in the second half of 2022, which ended only in April of this year, I had a very close look at how badly the commercial models of translation service offered by most agencies are broken, badly, badly broken. Years of social engineering propaganda by unscrupulous promotors of machine translation and artificial "intelligence" have skewed expectations badly so that if the buyer is lucky the "best" service might be "good enough", though barely.

Project managers at agencies tell me of their soul-crushing duty to force ever lower rates on those external providers of translation services who typically do the real work sold as the agency's deceptive product. All the while, charts and graphs and other "quality metrics" tell the fairy story of superior delivery.

It's time for individuals providing independent services to take a different approach. Let the linguistic sausage providers (aka LSPs) eat their own product. Take the trash out of trashlation.

Of course there has been a lot of talk for years about the need to find more direct clients. Most conferences for translators have at least one presentation on this topic. But many of the recommended practices are dated or have become less effective as LSPs providing trashlation have increasingly gamed the search algorithms to make their pages appear to be those of independent individual providers, and then once there the buyer is treated to lies and distortions suggesting that they may be better off with the superior "full service" of the agency. Very few of the claims on such pages reflect actual practice, something I know very well as an insider providing technical assistance to companies for a very long time.

I have also read suggestions recently that search engine optimization (SEO) strategies may soon be a dead letter. Why? Companies developing search engines are rushing to implement large language model (LLM) functionality, such as that found in ChatGPT, any many expect that to blow the algorithm gaming strategy to Hell, negating much of the investments companies and individuals have made to increase the visibility of their web sites and the services offered there.

I don't know, really, what disintermediation strategies might be most effective these days, but at the very least individual traders should examine alternative representation strategies. I saw an interesting one recently on Fiverr, a platform I remembered only for the idiotic idea that every service should cost $5. Well, that has apparently changed.

Although their presentations were often far from a perfect, translators offering services on those platforms are able to structure the "gigs" in such a way that buyers can easily specify relevant conditions, such as project scope, urgency, etc. And service providers can avoid overbooking by applying various kinds of throttles based on order volume. Extra services, such as multiple revisions, project glossaries and many other extras I have marketing over the years are a snap to set up.

In fact, I was so impressed by what I saw that I am considering to create special services for some of the routine technical services I provide for translation workflow training videos, custom import filters, regex tools to translation and text QA, etc. Frequently, more of my time is spent gathering information on a client's requirements than I actually spend providing the implementable result. The flexible FAQ functions, intake questionnaire, portfolio and communication tools look like they can be a huge time-saver.

Compared to what I saw on Fiverr and a few similar, huge volume platforms, the structure of platforms like ProZ-dot-com and Translator's Café definitely look "last century". The times they are a-changin' and maybe we should be too in the platforms we use to promote services that our potential clients need.

Mar 2, 2023

memoQ Regex Assistant workshops re-run

The series of three workshops on the use of regex resources in memoQ, with a particular emphasis on the integrated Regex Assistant library, has been updated and will be offered again on March 9, 16 and 23 from 3:00 pm to 4:30 Lisbon time (4:00 pm to 5:30 pm CET, 10:00 am-11:30 am EST).

You can register here to attend any or all of the three sessions:
https://us02web.zoom.us/meeting/register/tZEpde-sqTkvGtCdMsrBl825tFrpDQ98FkAI

This is an evolving course, with the content continuously adapted in response to new questions, workflow challenges and process research as well as interoperability studies with other tools. Participants in the last series asked quite a number of interesting things during and after the talks, and their questions provided excellent material for new examples and approaches, and I hope for the same experience in this round.

The memoQ Regex Assistant is a unique library tool introduced in its current form in memoQ version 9.9. The little bit of public discussion there has been about this tool is quite misleading. Contrary to the "pitch" from memoQ employees and nerdy fans in the user base, this isn't really a tool for learning regular expressions. There are much better means for doing that. And I have strong personal objections to the idiotic statements I hear so often that "everyone should learn some regex". What utter nonsense.

What everyone should do is take advantage of the power regular expressions offer to simplify time-consuming tasks of translation, review, quality assurance and more to ensure accuracy and consistency in language resources and translations. The Regex Assistant helps with this by providing a platform where useful "expressions" can be collected and organized with readable names, labels and descriptions in any language. These libraries can be sorted, exchange with other users and applied for filtering, find and replace operations, QA checks, segmentation improvements, structured translation of dates, currency expressions, bibliographic information, legal citations and more or exported and converted to formats for easy use in other tools such as Trados Studio, Phrase/Memsource, Transtools+ and more. All without the need to learn any regular expression syntax!

HTML created from a memoQ Regex Assistant library export
An exported Regex Assistant library converted to a readable format by XSLT

My objective is not to teach regex syntax. It is to empower users to take more control of their work environment and save time and frustration for their teams and enjoy more life beyond the wordface. To help with that, I provide some usable examples in a follow-up mail after each sessions: resources that you can use in your own work and share freely with colleagues. 

And in this next round of workshops, available for purchase, there will be some additional high value resources to help achieve better outcomes for work in particular language pairs and particular specialties, such as financial translations. These complex resources were developed over a period of years, sometimes at great cost. In the last session I'll be getting "down and dirty and a little nerdy" to show you my way of maintaining complex resources like these auto-translation rules and others in a very effective, sustainable way that enables you to adapt quickly to changing requirements and style guides.

Sign up free to join the fun here.

Jan 12, 2023

memoQ&A: The Regex Assistant in Practice

Note: there will be another series of workshops for this subject matter in March. Details are HERE. Register once to attend any or all of the sessions.

Users of memoQ version 9.9 or later have a powerful library tool available with which they can organize solutions or solution elements that use regular expressions and apply these without the complications of learning regex syntax. In each session, we'll look at different ways in which these portable libraries can be used, with a particular emphasis on solving common problems faced by translators and reviewers. Materials will also be made available to participants for later study and practice.

There will be three sessions of 90 minutes each on three consecutive Thursdays: January 19, January 26 and February 2 at 11:00 a.m. Lisbon time (i.e. noon CET). The first session will introduce the Regex Assistant library and its basic functions for organizing and exchanging information and then move on to specific examples of using the library to deal with common problems encountered in translation and review work. Particular emphasis in the first session will be on filtering and Find/Replace operations. 

The two later sessions will continue to explore filtering and options for making changes to texts and tags, and we will also take a tour of possibilities for using regular expression resources (from the library!) in other parts of memoQ such as the Regex Tagger, QA checks or auto-translation rules. As time permits, examples or requests from participants can also be explored. 

Those interested in joining the free sessions can register here.

Update: a recording of the first session is available here: https://youtu.be/KKR5aH5oGH8

To get a little taste of what's to come, have a look at this video created by a colleague last year: