Showing posts with label views. Show all posts
Showing posts with label views. Show all posts

Mar 17, 2016

Dynamic filtering with regular expressions in memoQ


Regular expressions (aka regex) are not a tool for everyone, though this is something that the nerdily inclined often fail to appreciate. For average users, a plain language query interface, perhaps with more limited options, is generally more accessible and used. However, sometimes it's nice to have such "shortcuts" available to select particular structures in a text for translation or editing, and the many people who complained for years that Kilgray did not provide a dynamic regex filter for the working translation grid - a feature of SDL Trados Studio for quite a while now - did have a point worth addressing in development. Now that has happened, though still a bit incompletely when considered in the full scope of memoQ's usual features for selecting text.

memoQ uses regex in a number of its modules, and Kilgray has several webinars which describe these applications, though they require some stamina to watch, and I expect that most people will become hopelessly confused if they try to take in more than one area of application in a single sitting. The uses of regex for segmentation rules, tagging, autotranslatables and text filtering on document import (with the Regex Text Filter) are very different in their approach, even though the underlying syntax of the regex is the same. However, all of these applications allow the configured rules to be saved and re-used, so one could ask an expert to create the settings needed and provide these in a resource file, and many users do exactly that. Thus as long as one understand that regex can be used for a particular problem, the details can be hired out.

This new application of regex for dynamically filtering, introduced in recent builds of memoQ 2015, is a little different (at present). Although the Find/Replace dialog will "remember" regex syntax in its dropdown menu of recent expressions, there is no way to store these expressions, and they must be entered manually to use them. This means that, for now, the average user will have to collect useful expressions like a tourist might scribble phrases in a notebook to use on holiday in a foreign country, and those with a little more sense of adventure might find themselves with a hovercraft full of eels and wonder why.

One such phrase might be the example in the screenshot above. I was translating some financial statements with several formats present for digits in account numbers, dates and monetary expressions. In order to work more systematically with these various formats, I used several different regex expressions to sort and separate them. In the example I was looking for instances where at least four digits were written together in a source segment. That isn't terribly selective, but most of these occurrences in my documents were account numbers, and this helpfully cleaned up the text a lot and allowed me to work a little faster. Other expressions were used to QA date formats and monetary expressions more specifically.

In the working grid for translation and editing, regular expressions can be used in one or both of the fields for the source and target text when the checkbox in the toolbar at the right is marked. Or the regular expressions option in the Find/Replace dialog can be used.


It is somewhat disappointing that regex cannot be used to create static views at the present time. While marking can be used in the Find dialog to enable one to go back and forth between the filter criteria and other configurations of the working grid, there is no way to make a permanent "record" of the filtered segments. For quite a few years, I have wished for the possibility to save the results of my filtering in the working grid in some sort of view, but I was always able at least to recreate the filtering criteria in the dialog to create a memoQ View, which could then be opened at any time or exported in various formats for clients and project collaborators. However, at the moment that is not possible with regex filtering. (There are workarounds involving a change in segment status, but these are often inconvenient in a project in progress.)

The addition of regex filtering to the working grid in memoQ is a welcome feature for many, which I hope will be expanded by Kilgray in the future to achieve more of its potential. But to take advantage of this potential in any way, the average user will indeed need a "phrase book" of sorts, and an efficient way of managing useful collected regex snippets (and naming them for easier re-use in searches and filtering) would be very desirable. If these "regex phrase books" for dynamic filtering and view creation were able to be saved as shareable light resources, it would be possible to build many useful collections to help users at all levels in the translation, editing and quality assurance tasks.

Nov 2, 2013

Games freelancers translate

ames are no longer a big part of my world despite years spent collecting, playing and developing them ages ago. In the world of translation, I am an interested observer, fascinated a little by the technical peculiarities I hear of in that domain as well as what appears to be a diversity of opinion and working methods even greater than one finds in my familiar areas of work.

I always enjoy a close look at the working processes of colleagues and clients; often I learn new things from the observation, and I like to ask myself as I see each stage what approach I might take or whether there are changes in the available tools which might make a process more efficient.

An Italian freelance team (leader?) put together a series of seven YouTube videos showing how jobs are prepared and distributed, as well as some particulars of their translation process and QA. The main working tool is Kilgray's memoQ - one of the 6-series versions it seems - as well as the Italian version of Dragon Naturally Speaking and Apsic Xbench, which also make a brief appearance. Altogether 22 minutes of show and tell, which I find mostly interesting and recommend as a nice little process overview.

I've made a YouTube playlist compilation here so it is easier to view the clips in sequence, since I had a little trouble navigating them myself in the somewhat random YouTube suggestion menus. I'm not embedding them here, because the interface for navigating a playlist is much easier to cope with on YouTube itself.

I wish there were more overviews like this available for common translation workflows in other areas as well, such as patent translation, financial report translation in the midst of the "silly season", web site translation or just about anything else. It's doubtful that any of these would betray great trade secrets, but they might offer clients and prospects a little more realistic view of what some might think involves little more than "retyping in another language".

Some content notes on the individual videos of the playlist:

#1 Discusses background research and style guides in the team's approach

#2 Covers the import of the source files (Excel) and the selection of ranges

#3 Term extraction

#4 Statistics, handoff packages and sending out the jobs with the project management system

#5 Creating views of multiple files, voice recognition in Italian, concordances and term lookups

#6 Receiving translated project packages; text to speech reviewing!

#7 QA in memoQ, export to XLIFF for final QA in Apsic Xbench

Sep 28, 2013

memoQ filters for static and dynamic views and navigation

The filtering functions for translation documents in memoQ are really cool. I'm not talking about the import filters for different types of documents, though most of these are rather good, and improvements are being made all the time. I mean the ways in which you can use filters to look at the content you are translating and sort and navigate it in different ways.

The filters can be used to create static views from one or more translation documents selected on the Documents tab of the Translations window of Project home using the Create view command.

Creating a view in memoQ

I use this function a lot to create comment and feedback lists for clients or select some particular part of my content that I want to save and work on or share separate from the rest. Exported as bilingual RTF tables, the content can be corrected or questions answered in the comments column, and all the changes and commentary can be re-imported to update your project.

What I use even more often are the dynamic filters in the translation window. There are three main types: sorting filters in a dropdown list, source and target text filters in the fields above the related columns, and the dilog filter with its many options, which is invoked with the funnel icon (2):


Source and target text filtering of the segments can be made case-sensitive by marking the icon (1). Any number of filtering operations can be applied cumulatively in sequence, and filters applied with the source and target text fields can be cleared with the red X icon (4). To clear a sorting filter, you must select "No sorting" from the dropdown menu.

These view filters can be very helpful for translation and quality assurance. But what many do not realize is that memoQ also allows you to navigate through translation segments using filter criteria. This is done with Edit > Goto Next (Ctrl+G). The filter criteria to apply for navigation are chosen under Edit > Goto Next Settings (Ctrl+Shift+G).

This often has the advantage over view filtering that all the segments remain visible and you can see the context better. Examples of this are shown for navigating to commented segments and navigating through the many footnotes in a document to check their formatting in the short (3.5 minute) video tutorial below. It was prepared with the most recent build of memoQ 2013 (6.5.15) and shows the new "golden" bubble icon for commented segments. The video demonstrates (with footnotes) how tag type can be used as a filtering or segment navigation criterion. This might come in handy for an academic thesis or a legal document with many footnotes to check.


Use the icon at the bottom right of the video to toggle full screen mode for viewing; this makes it much easier to see the details of this somewhat fast-paced clip.
0:17  Static views on the View tab of the Translations window
0:30  Dynamic source and target text field filtering
0:40  Dialog filtering with the funnel icon (3rd cumulative filter)
0:52  Using filters to navigate: Goto Next (rationale & contrast with view filter)
1:20  Goto Next settings for navigating commented segments
1:42  Navigating footnotes in a translation document with Goto Next
1:55  Setting the navigation filter for a tag type
2:30  Getting rid of a static view to allow segment joining
3:23  Oops! I join something I shouldn't and split the segment again, hoping nobody will notice.
Subscribe to my free YouTube channel and I think you'll receive updates of new video tutorials I add (or at least it'll be easier to find them). I would also like to thank Ulrich Scheffler of LSP.net for providing me with a Camtasia license recently to support my teaching - it's much better than the free Open Source CamStudio I started working with months ago, though I can definitely recommend CamStudio as a good tool to get started with making demonstration or teaching videos.

Jun 22, 2012

memoQuickie: customizing project, file and view lists

Many are not aware of this, but three of the important working lists in memoQ - the project list on the Dashboard and the Documents and Views lists on a project's Translation page - are customizable.

Right-clicking the header bar of the list opens a context menu where columns to be displayed are selected:

Project list context menu on the memoQ Dashboard
 
Documents list context menu in a project
Views list context menu in a project

Customizing the column display is particularly helpful in the Documents list when using memoQ versioning. If Document Version is marked in the columns choices, the major and minor version will be shown for each document. (The major or source version is the number before the decimal, the minor version - the target version for that source version - the number after it.) If versioning is not active for a document, the column displays "n/a".