|
In this issue…
OSCAR
New SIG to develop Translation Memory Exchange Format Standard
The formation of a new LISA SIG, OSCAR (Open Standards for Container/Content Allowing Re-use), was proposed at the June LISA Forum in Washington DC. OSCAR was the result of a very successful meeting of leading tool developers and some of their clients held immediately before the Forum. Aimed at discussing ways to standardize data exchange between various translation tool systems, the meeting grew out of ILE's Open TAG initiative, launched earlier this year. Franz Rau, Microsoft, who has been appointed the Group's facilitator, explains its main objectives. An ad hoc meeting was held in Alexandria, Virginia, on June 2, 1997 to discuss data exchange standards for translation tools. It was hosted by Trados, chaired by Microsoft (Franz Rau), and stimulated by the OpenTag proposal from ILE. Alan K. Melby of Brigham Young University was selected as Technical Secretary. The participants were drawn from major developers and users of translation tools. The group agreed to cooperate under the umbrella of LISA in the development of an industry standard translation memory exchange format (TMX). Member companies are ACM, AlpNet, IBM, ILE, Indigo, ITP, Logos, Microsoft, Multiling, Star, Systran, and Trados. Once the format is defined, each developer will be able to write a routine that will export to and import from this format. Users can then export translation memory databases to this intermediate format and import the exchange file to another translation memory tool. They will be thus be able in the medium term to port memories between different tools, thus speeding up their work and protecting their own and their clients' investments in both tools and content. A decision was taken that the initial standard should provide a high-level format dealing only with how segments of text are aligned, without specifying segment internals. However, a subsequent standard will address segment-internal aspects of the OpenTag format and other approaches to processing the markup codes inside text segments. The most immediate subtasks in the development of the TMX are to:
Once these five tasks have been completed, the format can be tested on real translation memory data. Successful implementation of the TMX may lead to a source-text markup format that would allow filters to be re-used across various tools. In addition, the group will adopt a standard for termbase sharing between translator workbench tools and machine translation tools. The future cluster of data exchange standards has been given the code name "OSCAR" (Open Standards for Container/Content Allowing Re-use). The following goals, action items and time frames have been agreed on:
In order to ensure maximum input from all LISA members and other relevant parties, and hence produce optimum and widely accepted results, OSCAR has instituted a two-tier input system. While the members of the SIG will be responsible for producing the deliverables mentioned above, all LISA members will have 30 days after publication of the drafts to make technical and other comments. To register as a corresponding member, please following the instructions given in the box below. To add your name to the OSCAR distribution list, please send an e-mail to this effect to LISA@lisa.org The subject line of the e-mail must contain the words "OSCAR distribution list" plus your company name. Please also ensure that you give your full name, address, telephone and fax numbers, and your e-mail address. Please note that only one representative per company is allowed. |
![]() 8-12 December 2008 |
||