Showing posts with label PowerPoint. Show all posts
Showing posts with label PowerPoint. Show all posts

May 28, 2022

Filtering formatted text in Microsoft Office files

 Recently, I shared an approach to selecting text in a Microsoft Word file with editing restricted to certain paragraphs. This feature of Microsoft Word is, alas, not supported by any translation tool filters of which I am aware, so to import only the text designated for editing it is necessary to go inside the DOCX file (which is just a ZIP archive with the extension changed) and use the XML file which contains the document text with all its format markers.

This approach is generally valid for all formats applied to Microsoft Office files since Office 2007, such as DOCX from Word or PPTX from PowerPoint. I have prepared a video to show how the process of extracting the content and importing it for translation can work:

 After translation, the relevant XML file is exported and the original XML is replaced with the translated file inside the archive. If the DOCX or PPTX file was unpacked to get at the XML, the folder structure can then be re-zipped and the extension changed to its original form to create the deliverable translated file.

What I do not show in the video is that the content can also be extracted by other means, such as convenient memoQ project templates using filters with masks to extract directly using various ZIP filter options. But the lower tech approach shown in the video is one that should be accessible to any professional with access to modern translation environment tools which permit filter customization with regular expressions.

Once a filter has been created for a particular format such as red text, adapting it to extract only green highlighted text or text in italics or some other format takes less than a minute in an editor. Different filters are necessary for the same formats in DOCX and PPTX, because unfortunately Microsoft's markup for yellow highlighting, for example, differs between Word and PowerPoint in the versions I tested.

Although this is a bit of a nerdy hack, it's probably easier for most people than various macro solutions to hide and unhide text. And it takes far less time and is more accurate than copying text to another file.

In cases where it is important to see the original context of the text being translated, this can be done, for example, using memoQ's PDF Preview Tool, a viewer available in recent versions which will track the imported text in a PDF made from the original file. This can be done using the PDF Save options available in Microsoft applications.


Feb 21, 2018

Double Vision with MS PowerPoint to Check Translations

I learned an interesting thing tonight thanks to David Hardisty, who teaches translation technical skills at Universidade Nova in Lisbon and translates quite a bit from European Portuguese to English. For all the years - decades? - that I have used Microsoft PowerPoint, I was never aware that it is possible to run two different presentations - as presentations - at the same time.

Why is this useful? If you want to do a detailed, screen-by-screen comparison of two different presentations - say a German original and its Arabic translation - and ensure that everything looks OK with the with animations, effects, text sizing in fields, etc. this can enable more focused, accurate comparisons. Just clicking through the slides in two windows in edit mode might miss something that depends on dynamic elements in the application.

Here's how you do this:

1. Open the first PowerPoint file.

2. Go to Slide Show... Set Up Slide Show... choose Browsed by an individual (window)

3. Click OK. Then press F5. Resize the presentation window as you like and put it wherever you want on one of your screens.
Then repeat Steps 1 through 3 for the second presentation. And the third if you like. And....



Aug 6, 2013

Translating presentations in memoQ: PowerPoint vs. OpenOffice Impress

Microsoft PowerPoint files can be a real nuisance to translate. One of the biggest challenges with these files is the haphazard formatting that many authors apply when working in that medium: line breaks and paragraph breaks in the most inconvenient places, which can cause some stress when working with many translation environment tools.

The current status of the PowerPoint filters in memoQ (version 6.5 build 10) is not as well developed as the filters for Microsoft Word and Excel files; in particular the inability to configure the handling of "soft breaks" (line feeds) causes me no little grief. However, I can at least join segments to get complete sentences where I want them. That's something you can't do in SDL Trados Studio, though that tool at least represent the breaks as inline tags. Sometimes I prepare my PowerPoint files in Trados Studio and then translate the SDLXLIFF file in memoQ if there are a lot of breaks in the sentences. But then I miss the preview.

Recently I had occasion to look at a presentation created with OpenOffice Impress, a rather nice alternative to PowerPoint. Given the confusion over Microsoft's new licensing practices for MS Office 2013, I would not be surprised if more of my corporate clients begin to use the clever free alternative.

However, when I tried to import the Impress (ODP) files to memoQ, I found that the files were not recognized as a translatable format. However, that problem was quickly solved, and the technique for translating ODP files in the current and older versions of memoQ is shown in the video below. One could, of course, convert these to PowerPoint formats, but you might not want to. With ODP files, it is possible to have breaks treated as inline tags.


Time Description
0:33
  Importing the PowerPoint file to memoQ with options
1:10  Examining the segments of the imported PowerPoint file
1:35  Joining segments for "broken sentences" in the imported PowerPoint file
1:43  The presentation as an OpenOffice Impress (ODP) file
2:07  Importing the ODP file to memoQ
2:39  Setting the filter for the "unknown" file type
3:04  Configuring "soft" breaks as inline tags
3:34  Examining the segments of the imported ODP file
I hope to see a few more refinements of the PowerPoint and OpenOffice filters in future builds of memoQ!