I came across this interesting paper by Ergun Biçici 1 and Marc Dymetman. In short:
The paper proposes to perform this automation in the following way: a phrase-based Statistical Machine Translation (SMT) system (trained on a bilingual corpus in the same domain as the TM) is combined with the TM fuzzy match, by extracting from the fuzzy-match a large (possibly gapped) bi-phrase that is dynamically added to the usual set of “static” bi-phrases used for decoding the source.
Here is the download link
Yesterday I received the latest edition of the LogiTerm newsletter. You can download it here. There are some interesting announcements:
- Agreement with SYSTRAN:
Terminotix has entered into an agreement with SYSTRAN to add machine translation solutions to the Terminotix product line.
- YouAlign completely free:
YouAlign, the text alignment website launched by Terminotix in August 2009, was supposed to be free for a limited time only, but is now completely free. YouAlign lets you quickly and easily create HTML bitext and TMX translation memory files from pairs of input files. Bitext and translation memory files generated by YouAlign can be downloaded for use with bilingual full-text search engines and translation memory systems. No software to install — everything is done through your web browser.
- SynchroTerm 2010 released:
The 2010 release of SynchroTerm, the powerful bilingual term extraction program, is now available. Enhancements include optimized memory use for handling larger files; support for Greek, Dutch, Hungarian, Norwegian, Polish and Turkish;
Jaap van der Meer of TAUS (Translation Automation User Society) says in his recent post about Google Translation Toolkit:
Translators using the Translation Toolkit ‘share’ their translations with Google. If 100,000 translators start using the service, Google will be harvesting 50 billion words of good quality translation data per year to help Google improve their automatic translation engines. In addition translators may be uploading their own (or their customers’) TMs.
Read the ret of the post at the URL below:
Google Translation Toolkit | Technology.
This link came through Twitter this morning. Google has taken one more step towards implementing a web-based translation environment that supports both human and computer-generated translations. Here is a brief description from Google Translator Toolkit Help:
Google Translator Toolkit is part of our effort to make information universally accessible through translation. Google Translator Toolkit helps translators translate better and more quickly through one shared, innovative translation technology.
Here’s what you can do with Google Translator Toolkit:
- Upload Word documents, OpenOffice, RTF, HTML, text, Wikipedia articles and knols.
- Use previous human translations and machine translation to ‘pretranslate’ your uploaded documents.
- Use our simple WYSIWYG editor to improve the pretranslation.
- Invite others (by email) to edit or view your translations.
- Edit documents online with whomever you choose.
- Download documents to your desktop in their native formats — Word, OpenOffice, RTF or HTML.
- Publish your Wikipedia and knol translations back to Wikipedia or Knol.
How is this different from Google Translate? Google Translate provides ‘automatic translations’ produced purely by technology, without intervention from human translators. In contrast, Google Translator Toolkit allows human translators to work faster and more accurately, aided by technologies like Google Translate.
Here’s a 1 1/2 minute YouTube video that illustrates the main features.
Google Translator Toolkit basics
The Microsoft Research Machine Translation (MSR-MT) Team Blog has published some details about a translation plug-in for Microsoft Office.
Now you can translate your Microsoft Office documents with Microsoft Translator – right within Office! You can translate words, phrases, or even your entire document, through the Research task pane. We blogged about setting this up manually for Office 2007 or Office 2003 previously – now it’s really easy!
This works for both Microsoft Office 2003 and 2007. The current default in Microsoft Office is WorldLingo – this installer will update your task pane to use Microsoft Translator as the default translator for the languages we provide.
Download the installer now and let us know what you think over in the Forum!
Via Microsoft Research Machine Translation (MSR-MT) Team Blog
AppTek, a developer of software for human language technology, has completed its hybrid machine translation (HMT) system for both Windows server and 32-bit Windows for PC. AppTek’s TranSphere HMT system is a full integration of statistical and rule-based methodologies.
Via MultiLingual Computing, Inc., News
The Speech Technology Magazine contains an article about AppTek’s hybrid machine translation software. Here’s a brief excerpt:
According to Hassan Sawaf, chief scientist at AppTek, the company’s hybrid model is unlike any other system on the market today—a fact that has lead some universities to attempt to copy the hybrid model.
“Even if companies attempt to hybridize they only do hybridization insofar as that they basically combine translation memory with machine translation,” he says. “Hybridization like we do and a tight integration of rule-based features and statistical-based features are unique.”
AppTek’s HMT solution provides a full integration of both methodologies instead of simply adding rules to the statistical system or a minor statistical module to the rule-based engine.
SpeechTechMag.com: AppTek Launches Hybrid Machine Translation Software
The Global Watchtower blog contains a post about recent developments in rule-based MT.
For a brief overview of rules-based MT as opposed to its competing technology, statistical MT, about the differences between the two approaches and why they might matter to technical translators, see also the following Wikipedia links: More »
The following site offers machine translation of Twitter messages. What’s interesting about this service, at least on paper, is the fact that the user’s Twitter messages can be reviewed and edited by followers.
Worldwide Lexicon Twitter Translator
Translate your Twitter feed! We translate your tweets using machine translation, and your followers can edit or replace these translations to improve them.
via Worldwide Lexicon : Twitter Translator.