tag:blogger.com,1999:blog-20155610.post3405097000292675792..comments2024-03-06T02:46:19.929+00:00Comments on Translation Tribulations: A quick trip to LiveDocs for EUR-Lex bilingual textsKevin Lossnerhttp://www.blogger.com/profile/14727800526216764023noreply@blogger.comBlogger8125tag:blogger.com,1999:blog-20155610.post-86001089133447215742015-09-18T17:58:50.176+01:002015-09-18T17:58:50.176+01:00@Charles: Doesn't LogiTerm have the same Align...@Charles: Doesn't LogiTerm have the same Alignment Editor as AlignFactory though?Michael Beijerhttps://www.blogger.com/profile/12826804655385764008noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-64355083967607128942015-09-18T17:57:44.444+01:002015-09-18T17:57:44.444+01:00See also:
https://dl.dropboxusercontent.com/u/68...See also: <br /><br />https://dl.dropboxusercontent.com/u/6802597/Screenshots/LF-aligner-auto-downloading-and-aligning-with-CELEX-numbers.png<br />+<br />https://dl.dropboxusercontent.com/u/6802597/Screenshots/LF-aligner-CELEX-continued.png<br /><br />It seems there might not be any need to go through Excel, etc.Michael Beijerhttps://www.blogger.com/profile/12826804655385764008noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-61376582462673502932015-09-16T14:46:02.488+01:002015-09-16T14:46:02.488+01:00Good information, Charles, thank you. I don't ...Good information, Charles, thank you. I don't know why my colleague is generating HTML bitexts if TMX is available. If you are a memoQ user you are better off to load the TMX from the alignment in LiveDocs rather than sacrifice the context by feeding it to a TM as I pointed out in another blog post recently. Kevin Lossnerhttps://www.blogger.com/profile/14727800526216764023noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-87418401836909320222015-09-16T14:30:47.528+01:002015-09-16T14:30:47.528+01:00Kevin,
I was nice meeting you in Bordeaux at the ...Kevin,<br /><br />I was nice meeting you in Bordeaux at the IAPTI conference.<br /><br />I have AlignFactory Light and it does make TMs in XML and TMX formats. By the way, you can do the same with Logiterm, which is sold by the same company: Terminotix.<br /><br />I have been using Logiterm and AlignFactory Light to create and manage my TMs for years. If the source and target documents are in the same format, the alignment (which takes a few seconds) is more or less perfect. Although there may be problems if the documents are converted PDFs or contain tables, graphics or foot notes, they can easily be dealt with before alignment, for example, by removing unnecessary carriage returns or deleting the offending elements.<br /><br />I do a lot of financial translation and have used AlignFactory Light to align many EU directives and regulations dealing with the financial services industry.<br /><br />The advantage of AlignFactory Light over Logiterm's alignment module are its TM editing functions.<br /> <br />Anonymoushttps://www.blogger.com/profile/10058884254145088478noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-90568753556389059882015-09-16T14:25:46.893+01:002015-09-16T14:25:46.893+01:00That's the approach that most seem to take, th...That's the approach that most seem to take, though as I noted some are unhappy with the alignment quality when they try to build bilingual references. Note Marek's suggestion below for fixing the misalignments in Excel.<br /><br />In general I think approaches like this with more focused, self-selected corpora are more useful than the bucket approach of the Big Data addicts. People seem to forget too often that a BAT tool beats a CAT tool nearly every time, and masses of surplus crap data are not useful for brain-assisted translation.Kevin Lossnerhttps://www.blogger.com/profile/14727800526216764023noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-71090503517139527422015-09-16T12:18:45.462+01:002015-09-16T12:18:45.462+01:00You're right, Marek - Excel is a better enviro...You're right, Marek - Excel is a better environment to fix those misalignments.<br /><br />For the sentence alignment, you might as well just save the two languages as monolingual HTML docs and align them, which is what the users who inspired this have been doing. But here memoQ apparently does an inadequate job much of the time, and AlignFactory was said to be better.<br /><br />I spent some time before I published this looking at aligning the Excel file with itself, but selecting different content ranges. It worked as long as I did not specify structural alignment (which bizzarely ended up aligning one language with itself!) but here I uncovered a real mess of bugs in Kilgray's implementation of the Excel filter for LiveDocs import. You can specify ranges by typing them, for example, but you cannot do so interactively in the Excel file like you can when importing a translation document. That is a serious flaw. In general, my recent work has pointed out a number of areas where LiveDocs filter implementations need a serious upgrade/overhaul. The absence of the multilingual delimited text filter there is puzzling, and preview formation seems to fail almost all the time, much worse than the frequent problems for translation files.Kevin Lossnerhttps://www.blogger.com/profile/14727800526216764023noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-87745917662377031072015-09-16T09:44:34.491+01:002015-09-16T09:44:34.491+01:00Simple and elegant. However, it's usually a go...Simple and elegant. However, it's usually a good idea to check the file in Excel - you can easily fix minor problems with dislocations, which will make life easier after importing to LiveDocs.<br /><br />And if you want to have sentence segmentation, save source and target columns in different files (but in the same columns, e.g. column A for both source and target) and then align with "Structural alignment" on.Wasatyhttps://www.blogger.com/profile/12415028316778775171noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-36654876959481891592015-09-16T06:58:13.630+01:002015-09-16T06:58:13.630+01:00Good tip, thanks! I usually download the html vers...Good tip, thanks! I usually download the html versions of the texts and add these to LiveDocs.Stevennoreply@blogger.com