Yesterday Alchemy announced PUBLISHER 2.0, its advanced translation memory solution. The product contains three modules:
- Analysis Expert: allows to re-use previously translated content by analyzing different types of translation memory formats (Catalyst TTK; Wordfast, SDL Trados, SDL TM Server, SDL Idiom TM, and TMX)
- Translate Expert: matches previously translated content to new content, in order to (guess what?) reduce the number of words being sent to translators
- Clean Up Expert: creates the localized version of the translated content and updates the translation memories
PUBLISHER 2.0 supports several source formats, among which FrameMaker, Word, XPress, InDesign, several Help authoring systems and Web and “tagged” formats (HTML, XML, PHP, JSP, etc.)
This is one of the major announcements from Alchemy since its acquisition by Translations.com last spring.
For the full announcement text and trial download:
Alchemy Publisher 2.0
Anaphraseus (formerly OpenWordfast) is a CAT (Computer Aided Translation) tool, OpenOffice.org 2 macro set similar to Wordfast. It works with Wordfast Translation Memory format (*.TXT). Supports text segmentation. Features: term recognition, fuzzy search, Unicode support.
Installation is pretty easy. We tried it with OpenOffice.org 3.0 Beta:
- Download the Anaphraseus extension here
- Launch OpenOffice Writer and select Tools > Extension Manager
- Click on Add… and navigate to the extension you just downloaded (Anaphraseus-1.21-beta.oxt in our case)
- Click on Open, accept the License Agreement and check that Anaphraseus is installed under My Extensions
- Restart OpenOffice Writer for the changes to take effect
- The Anaphraseus menu should now be available in Writer
- To enable the Anaphraseus toolbar, choose View > Toolbars > Add-On 1
Here is the Anaphraseus toolbar with the familiar functions offered by most TEnTs:
Pressing the Setup button brings up the Setup window, which has very basic functions that allow to:
- Select an existing TM, create a new TM, reorganize a TM, invert the TM source/target and export the TM as ANSI or TMX file
- Select, create, reorganize glossaries and specify the use of glossaries (termbases)
- Specify user IDs, untranslatable characters, sentence delimiters and the fuzzy match threshold
We tried Anaphraseus on a test project and it worked as advertised. However, we found the feature set very limited, especially as far as file format support is concerned. The program can be an acceptable choice for translators who work with Word files or other file formats that can be properly opened and saved by OpenOffice Writer. However, there does not seem to be any support for tagged formats (XML, MIF, HTML, etc.), not even if the tagged files are preformatted with an external tool such as Raibow, S-Tagger or the Trados macros. This makes Anaphraseus much less palatable to agencies or translators who need a slightly more complex workflow than just translating plain RTF or Word files. Hopefully this will change in future versions.
Kilgray have released yet another update for MemoQ. Here is the download link:
— A filtering crash when using the filter on a View;
— Word index entries imported (even if hidden text is not);
— Projects stored on network drives should now work OK (please verify).
Isometry is a free computer-assisted translation tool developed by Toshiya Kazuyoshi. We grabbed version 1.0.6 from the FINITE FIELD website. Apparently the last build dates back to April 2006, so this may a point of concern about future developments of the application.
Apart from the usual features offered in modern CAT tools, there are a couple of highlights and shortcomings we’d like to point out:
- Isometry only supports Unicode text files as the working files for translation. So no support whatsoever is offered for Office documents, let alone more exotic formats such as tagged, XML or XLIFF files.
- Translation memories can be imported as flat TXT tab-separated Unicode files. TMX is not supported.
- Quick access to the TM: Quoting from the site: “Preliminary Translation Memory Search. Because this is done in the background, Isometry searches a great volume of TM without making you wait.”
- According to the author, Isometry may be used under Linux through the Mono Project.
Isometry home page
Open-Tran.eu is a searchable database of strings taken from several open-source applications and operating systems.
Here’s an example of how the system works: go to Open-Tran.eu, choose the source language, the target language and type a string that you would like to look up. Finally, click on “Translate”. In the example below, we are looking for the Italian translation of “taskbar”:
The results page displays all the occurrences of the string with the relevant translation:
By clicking on the icon at the right side of each entry, you can obtain extra information about the project the string belongs to (in this example KDE and OpenSUSE), as well as the source string surrounded by its context:
This is a great idea for translators who are localizing open.source content. The system can also work as a reference for those hard-to-translate software terms, as the sources the strings originate from are pretty reliable.
Total Recall’s Snowball
is a translation environment tool (still in beta) that introduces several interesting concepts. It can import TMX translation memories and it performs a preliminary term analysis before the translator starts the actual work. During our tests, the import invariably got stuck during the database analysis, so we could not form a final opinion of how well this feature works. We will post a full review as soon as we solve the glitches we encountered. Quoting from Total Recall’s website:
The presentation focused on some key features of Snowball, such as the user-definable popup translation window with its list of translations scrolling to track the words the user selects in the source, computer translation suggestions, and Snowball’s amazing ability to work completely in the background, invisibly storing translation memory segments as the user works and popping up to help only when it recognizes something it’s seen before.
Snowball currently costs € 56.25 and requires Microsoft Word.
Snowball presented and demonstrated at University of Copenhagen Language Technology Forum
MemoQ received another update. Here are the download links (with and without .Net Framework) and the changelog:
The build contains RTF and XLS filter fixes, and updated French, German and
Hungarian strings. Preview creation for large bilingual RTF/DOC files has also
become a lot faster.
MemoQ: Translate better, cheaper, faster
OmegaT, the free (GPL) translation memory tool, has received a minor release update. Quoting from the changelog:
The version 1.8 update 2 contains the same changes as the 1.7.3 update 3. In addition, it contains a number of small enhancements and bug fixes specific to 1.8, notably concerning the user interface and spellchecking. The Catalan localization has been updated.
OmegaT 1.8.0 update 2 Changes
1.8.0 update 2 vs. 1.8.0 update 1
1.8.0 update 2
– Spellchecking dialogs now have a title
– Non-breaking spaces are trimmed from translatable text
– Catalan localisation updated to 1.8 (UI)
– Spellchecking dialogs not dismissable with [esc]
– Wrong Import file dialog title
– Hunspell not working on Linux 64-bit
– Non-ascii char badly handled in hunspell lib loading on OS-X
OmegaT – multiplatform CAT tool [from SourceForge.net]
Bitext2tmx is a program written in Java (and therefore cross-platform) that allows to segment and align corresponding translated sentences from plain text files and generate a TMX translation memory for use in computer-assisted translation applications.
The program is quite straightforward: once you have two TXT files (the source document and the corresponding translation) open Bitext2tmx, then choose File > Open, choose the source and target TXT files and set the relevant languages and encodings (UTF-8 is supported).
The text files are opened in the program window, where you can fine-tune the alignment by joining, splitting or deleting rows. There is also a command for deleting empty rows.When the text looks properly aligned, choose File > Save as to save the aligned texts as a TMX (v 1.1) memory file that you can then import into the translation environment of your choice.
Compared to its commercial siblings, like Trados WinAlign, Bitext2tmx is pretty basic and lacks the powerful functions and granularity offered by a full-fledged aligner. However, the program delivered a TMX memory as advertised and, given the price, it is a useful tool for lightweight alignment work.
Bitext2tmx requires a A Java Runtime Environment (JRE) and is released as free software under the GPL license.
bitext2tmx: a bitext aligner/converter
Google Translate can now detect the language of the source text pasted by the user. This service is rapidly growing into a solid option for fast , casual translations where the user needs to grasp the general meaning of a foreign-language text. In general, the translations seem to be better than the ones offered by other machine translation programs and services.
Here are some highlights about the service:
- 22 languages with bi-directional translation in all combinations, for a total of 506 language pairs
- Auto-detection of the pasted source text (first option in the source language drop-down button)
- Google Translate’s APIs allow developers to connect their applications to the service. It will be interesting to see which one of the many TEnT providers first comes up with a good implementation of this service. For instance, when a sentence has no matches in a translation memory, the TEnT could quickly query Google Translation and offer a draft automated translation that the human translator can adapt and proofread
Google Translate [via Google Operating System]