About this blog

Translator's Shack is a collection of links, news, reviews and opinions about translation technologies. It's edited and updated by Roberto Savelli, an English to Italian translator, project manager and company owner of Albatros Soluzioni Linguistiche, a team of English-Italian translators, which hosts and supports this blog.

The Life as a PM category, managed by Gabriella Ascari, contains topics that are less technical in nature, but which we're sure will be appreciated by owners of small translation businesses and freelancers.

Here are links to my pages on some social networks:

Highly recommended:


memoQ masterclass by Angelika Zerfass, part 2

CSV import of a TM

Use Olifant to open a TMX file, copy all rows (this is actually a great tip for converting a TMX file to a columnar format with just a couple of clicks) and paste to Excel. You can change column headings in Excel. From Excel you have to export as unicode .txt file if you want the procedure to work.

Advanced HTML import procedures

Balázs Kis from Kilgray explains how the standard HTML filter in memoQ will not make some HTML attributes editable (e.g. IMG titles). As a workaround, you can use Import file as… and use the XML import file to specify how every single tag should be treated, so any tag attribute can be made editable. Interestingly, the XML files is always used, behind the scenes, when you import an HTML file using the standard settings.

Bilingual formats managed by memoQ

.MBD

.DOC

.TTX (you need to pre-segment TTX files before importing them into memoQ. Unless the file is pre-processed this way, there’s no guarantee that it will work when opening the file in TagEditor). Here’s where the option is located in Trados:

image

.XLIFF

.SDLXLIFF, containing lots of Trados-specific metadata. You can process this type of file in memoQ, but some metadata (like segment status) will not be preserved.

.RTF multi-columnar export, allowing to use other tools or a word processor for reviewing the translation.

.Transit

Handoff packages

This feature allows the project manager to create an offline project and send handoff packages (containing all the resources needed for carrying out the project, i.e. term bases, translation memories, non-translatables, etc.). If you assign different files to different users, memoQ will create as many packages as the number of translators, and include the translator’s name in its file name. The packages can contain TMX files if the translators have to work offline, or a .TMI file, which is a reference to the server TM, for server-based projects.

memoQ masterclass by Angelika Zerfass, part 1

TMX

A short comparison between the contents of TMX data coming from different translation tools (Trados 2007, Studio, memoQ).

TMX can contain tool-specific information (additional fields, segment status, segment context, alignment penalty, etc.) that’s not easily imported into other tools.

If a TMX import does not work, look at the language identifiers first.

A memoQ TMX file contains TM-level (project. client, etc.) and segment-level (project etc., but also changeID, client, corrected, aligned, context)metadata.

The information entered into the User and meta-information fields is case-sensitive, so this can lead to data duplication.

For the moment, the meta-information fields for each new project is limited to username, project ID, domain, client, subject.

Since the user ID is overwritten when a new user changes a field, a workaround is to use the “subject” (or another) field to specify the name of the original translator.

Trados has both creation ID and change ID fields. memoQ only has the change ID field, so when you import from Trados, the change ID from Trados will be imported and the creation ID will be lost.

Trados Studio includes context information in its TMX exports, but these are in form of hashes (a long string of digits), and do not contain the actual context strings like memoQ does. As a consequence, context information cannot be imported into memoQ from Trados.

The memoQ TM import settings

The field Process TRADOS TMX for best results in memoQ should be used if both the TM and the translatable files are in Trados format.

Import <ut> as memoQ tag: this allows support for legacy TMs. “UT” means “Unknown tag” here.

For Trados versions up to 2007, it’s very important to apply the penalties. Otherwise the statistics will treat segments that are almost identical except for punctuation, tags, and formatting, as identical.

If Use context is selected, you should not use Allow multiple translations. Two identical segments with different context will both be saved to the TM.

Olifant

Angelika showed us some quick methods for doing maintenance on TMX files using Olifant. Delete duplicate and inconsistent segments.

It’s probably best to re-create a TM from scratch after the edits, rather to import back into the original memoQ TM.

Import of a Trados TM into memoQ

In order to map the Trados fields to the corresponding memoQ fields, a search & replace is done in the TMX file, using a text editor. The values contained between quotes in the <prop type> fields are replaced by the corresponding hardcoded field names for memoQ.

If the search&replace is too complex, it’s probably better to export from CSV in Olifant and import the CSV into memoQ, where fields mappings can be set up during the pre-import procedure.

Based on the participants’ inputs, here are some of the issues that users face when migrating their resources to memoQ from other tools.

  • Loss of metadata
  • Loss of tags
  • TM files too big for import
  • Moving TMs from one server to another

Moving term bases