tag:blogger.com,1999:blog-20155610.post5399640480489236978..comments2024-03-06T02:46:19.929+00:00Comments on Translation Tribulations: The fix is in for PDF chartsKevin Lossnerhttp://www.blogger.com/profile/14727800526216764023noreply@blogger.comBlogger7125tag:blogger.com,1999:blog-20155610.post-23772072132973003952014-02-20T17:51:50.910+00:002014-02-20T17:51:50.910+00:00Vaclav, I have processed a few longer documents, b...Vaclav, I have processed a few longer documents, but in a somewhat unusual way - deleting the text and isolating the graphics. Because of page transition issues, I am rather skeptical of the wisdom of trying to handle longer documents with the main emphasis on text flow. However, I've discovered another good use (well, several related uses) with long documents, which I'll share separately at some point. As you may have noticed, Infix is also good at extracting/inserting pages. You can also select page ranges for export as text or XML. What you may not have noticed, is that the software can extract text from some documents in which text copying is otherwise blocked. Go look up the annual reports of the German Federal Patent Court (BPatG) and try to extract that text, for example. If you want to build a corpus for term extraction or general reference, Infix can be a nice tool to have in such cases.Kevin Lossnerhttps://www.blogger.com/profile/14727800526216764023noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-76897553369578252172014-02-18T13:57:39.893+00:002014-02-18T13:57:39.893+00:00Kevin, thanks for sharing this, it opens incredibl...Kevin, thanks for sharing this, it opens incredible new opportunities for us - LSPs who have been struggling with PDFs sent by customers which had to be OCRed (a cost no customer would ever be willing to pay for because they don't understand WHY you need to play with their files in such a way, why you just don't TRANSLATE them :-)).<br />I have done a little bit of testing on some short files - forms, for example, and it worked excellent. Have you got experience with processing longer and more complicated files using infix?Vaclav Balacekhttps://www.blogger.com/profile/17916409335092866014noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-33321324421210129982014-02-10T07:32:57.213+00:002014-02-10T07:32:57.213+00:00Indeed there is the icon, and they even name the s...Indeed there is the icon, and they even name the system requirements for Macintosh. But there is no Mac download, even not when accessing the page with a Mac. Really, really strange, though you can find the software on cnet.com.Torstenhttps://www.blogger.com/profile/11115731755158723704noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-50160260132190806802014-02-03T10:07:43.143+00:002014-02-03T10:07:43.143+00:00What's the problem? Look at the little icons a...What's the problem? Look at the little icons at the left of the product box picture. I believe one of them indicates that there is a Mac version :-) This is by no means a Windows-specific post.Kevin Lossnerhttps://www.blogger.com/profile/14727800526216764023noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-68320176050678665802014-02-03T08:01:29.452+00:002014-02-03T08:01:29.452+00:00Sounds good, though I am looking for a Mac alterna...Sounds good, though I am looking for a Mac alternative for this. Recently I translated a landscape PDF after opening and saving it as LibreOffice (or OpenOffice) Draw (the memoQ ODF filter converts this format without any problem). It worked rather good and might be an alternative for smaller projects (there were some serious, but resolvable issues with carriage returns). Torstenhttps://www.blogger.com/profile/11115731755158723704noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-91269546871695811362014-02-02T13:42:02.922+00:002014-02-02T13:42:02.922+00:00"... more form than content" - that is m..."... more form than content" - that is more or less what another colleague remarked on his experience with it. Thanks for the correction/addendum on filter adaptation. For te graphics I was translating in this case, the exported XML required no adaptation whatsoever, but had I tried to deal with body text, I probably would have found the same thing. Do you know which tags had to be redefined as inline in your work?<br /><br />The fonts are indeed a headache. If the client is cooperative enough to provide them, however, that should not be the case.<br /><br />When our Brazilian colleague (who I think was involved in the inspiration of many of the translation features) began to push this tool many years ago, I thought the idea of translating a PDF directly and returning it in a PDF layout was rather idiotic, though I refrained from saying much about that in public. At the time I was doing mostly manuals and documents with a <i>lot</i> of text, and I think it is not suited for that. But for some of the flyers and other graphic-heavy documents I've seen (form over content) or cases like these charts, I have not seen a better tool.Kevin Lossnerhttps://www.blogger.com/profile/14727800526216764023noreply@blogger.comtag:blogger.com,1999:blog-20155610.post-27813386403615549222014-02-02T08:33:03.733+00:002014-02-02T08:33:03.733+00:00I'm using Infix from time to time for over two...I'm using Infix from time to time for over two years now and it's excellent in cases when there's much more form than content, e.g. ads, folders, complicated tables, stuff like that. Unfortunately when working with XML export usually I have to modify the import settings, because some tags for different inline formatting are by default treated as external, which generates wrong segmentation. But other than that it's great.<br />Of course usually, if a client want's to receive a PDF file back, there's a problem with fonts, because usually folders use proprietary fonts I don't have and/or without full UTF support. But that's another story.Wasatyhttps://www.blogger.com/profile/12415028316778775171noreply@blogger.com