About this blog

Translator's Shack is a collection of links, news, reviews and opinions about translation technologies. It's edited and updated by Roberto Savelli, an English to Italian translator, project manager and company owner of Albatros Soluzioni Linguistiche, a team of English-Italian translators, which hosts and supports this blog.


The Life as a PM category, managed by Gabriella Ascari, contains topics that are less technical in nature, but which we're sure will be appreciated by owners of small translation businesses and freelancers.


Here are links to my pages on some social networks:


Highly recommended:

memoQfest 2010 - TM Repository

Moravia are the first adopters of the TM Repository.

The main problem with the TM repository so far was: “what is it, and what is it intended for?”

According to statistics, 80% of new software projects fail if there is no clear vision about the project. The TM Repository project has been put on hold for a certain period, but the company has resumed development now that there is such a vision.

Problem statements

TMs can be scattered in hundreds of tiny files.

Or, in a “big mama” type of approach, the TMs are managed by using filters and metadata. A project TM is a TM that only contains the segments (e.g. from a “big mama”) that are relevant to a specific project.

Including the reviewing process into the technology is essential. Some reviewers will refuse to use a translation environment to review a text (this has been solved by memoQ’s RTF columnar export).

TM contents need regular cleanup. What needs to be deleted/fixed: outdated entries, bad translations, “other” garbage.

In technical documentation, there is a lot of terminology that refers to concepts that get deprecated.

Cross-tool TMX transfer does not give good enough leverage. Attributes specified in one tool will often not be transferred to a different tool.

Context information: how can we transfer the context information (saved as hashes in Trados Studio TMs and as clear text in memoQ)? In short, we can’t. The hashing algorithm used in Idiom is patented. In other cases, it’s kept secret.

Features and workflows

  • Single database with TMX import/export. At its current stage TM Repository can only receive TMX files.

  • It accepts any sort of metadata (client, project number, dates, etc.), no matter which tool was used to create the TMX.

  • The TM is extracted from TM Repository and can be modified in any translation tool. The TM then needs to get back into the TM Repository, which offers complex features for comparing the new TM to the current contents of the repository.

  • You can import TMs containing one language pair and merge them into a multi-pair TM, then split it again in sub-pairs.

  • Version control and rollbacks: entry history is included for every translation unit. If you make a mistake, you can always roll back to the previous version

  • TM Repository is tool-agnostic. Every attribute is converted to an attribute system that is specified by the user for the TM Repository. Preconfigured mapping templates are available.

  • Full-text (not fuzzy!) concordance search

  • Search and replace in the repository

Next steps

  • A beta version is available now
  • A connector to memoQ Server is in the works. It will make import/exports from/to memoQ transparent through an integration API.

Specifications

Back-end database application with no direct connection to translation management systems (for now). Requires a dedicated server, as it’s resource-intensive.

In its current state, TM Repository will not allow you to upload a translatable project, run a fuzzy-match analysis, and extract the relevant translation units from the repository, regardless of the metadata. From my personal point of view this is a real show-stopper. I cannot see real potential in this product for a small LSP if it does not offer this kind of functionality.

TMbuilder (translation memory export creator)

image TMbuilder is a small tool that makes building up TM export/import files as straight-forward as possible. You can use it to batch-import several files in Excel (2003 or 2007) or tab-delimited format and build a Trados-compatible or TMX 1.4b TMX file with a couple of mouse clicks. Here are some more details about the features:

- Accepts two input formats: tab-delimited text files and MS Excel spreadsheets
– Creates output files in two file formats: Translator’s Workbench 7.x/8.x (TXT) and the Translation Memory eXchange (TMX)
– Works on multiple input files and offers a merging feature – there might be just one import file
– Allows the user to specify standard TM fields, like: source and target ISO flags, segment descriptions and author name
– Removes additional quotes often created by MS Excel when saving the file to the Text form
– Works with standard encodings: Unicode and UTF-8
– Rapid file creation: milliseconds for .txt and seconds for .xls input files

The application is free for non-commercial use and can be distributed as a standalone executable program. It requires Microsoft .NET Framework 3.5.

TMbuilder – the easiest Translation Memory export creator

Some news from Terminotix

Terminotix



Yesterday I received the latest edition of the LogiTerm newsletter. You can download it here. There are some interesting announcements:

  • Agreement with SYSTRAN:
    Terminotix has entered into an agreement with SYSTRAN to  add  machine  translation  solutions  to  the  Terminotix product  line.
  • YouAlign completely free:
    YouAlign, the   text   alignment   website   launched   by Terminotix in August 2009, was supposed to be free for a limited time only, but is now completely free. YouAlign lets you  quickly  and  easily  create  HTML  bitext  and  TMX translation memory files from pairs of input files. Bitext and translation  memory  files  generated  by  YouAlign  can  be downloaded for use with bilingual full-text search engines and translation memory systems. No software to install — everything  is  done  through  your  web  browser.
  • SynchroTerm 2010 released:
    The 2010 release of SynchroTerm, the powerful bilingual term extraction program, is now available. Enhancements include  optimized  memory  use  for  handling  larger  files; support  for  Greek,  Dutch,  Hungarian,  Norwegian,  Polish and Turkish;

Two interesting (and perhaps antagonistic) blog posts on translation quality

First there’s 90% of in-country reviews are a waste of time, from the Medical Translation Blog where, after a somehow provocative title, the author explains the difference between theoretical, ideal situations and the hard facts of in-country reviews, which are often marred by the following problems:

  • Lack of information sharing (e.g., no reference materials)
  • Lack of understanding regarding brand
  • Review schedules that are “black holes”
  • Clarity of review changes is lacking (ever try reading a French doctor’s handwriting?)
  • Mechanics fail (file exchanges don’t work, changes are entered inconsistently)
  • Quality of review changes (linguistic, technical errors are introduced)

Then there’s Quality translation dictates a collaborative effort, from the Translation and Software Localization Blog, which can be considered as some sort of retort to the previous post. The author adopts concepts from control theory to explain how in-country review is in fact an essential step of the translation process.

I think that the two articles complement each other and really support the idea that quality control, when done properly, can make a huge difference for the final quality of any translated or localized product. In conclusion, two interesting reads.

Alchemy launches PUBLISHER 2.0

Yesterday Alchemy announced PUBLISHER 2.0, its advanced translation memory solution. The product contains three modules:

  • Analysis Expert: allows to re-use previously translated content by analyzing different types of translation memory formats (Catalyst TTK; Wordfast, SDL Trados, SDL TM Server, SDL Idiom TM, and TMX)
  • Translate Expert: matches previously translated content to new content, in order to (guess what?) reduce the number of words being sent to translators
  • Clean Up Expert: creates the localized version of the translated content and updates the translation memories

PUBLISHER 2.0 supports several source formats, among which FrameMaker, Word, XPress, InDesign, several Help authoring systems and Web and “tagged” formats (HTML, XML, PHP, JSP, etc.)

This is one of the major announcements from Alchemy since its acquisition by Translations.com last spring.

For the full announcement text and trial download:

Alchemy Publisher 2.0