|
TMX Special Issue
TMX - a standard ahead of its timeIn 1997, the year LISA's OSCAR special interest group was started, I attended the LISA Workshop on Integrating Advanced Translation Technology (IATT) in Washington D.C. Most of the attendees at the IATT workshop were individuals coming to learn about translation technologies and had little previous experience with translation technologies. A good deal of time in both the IATT workshop and in sessions conducted by vendors in the LISA Forum was spent on simply explaining what Translation Memory is and why it would be useful. Just five years ago many in the localization industry still weren't fully aware of TM and its importance, or needed to be convinced that it would affect them. Times certainly have changed, but it is useful to look at how recently the things we now think we've always known were still new and untested. I would like to suggest that this is one reason for what some perceive to be slow implementation of TMX - OSCAR was ahead of its time in even addressing the problem of translation memory exchange in two ways: First, if the majority of attendees at the IATT workshop had to be convinced that TM was something that they needed to use, it is no wonder that most of them were not looking at exchange issues at that point. Second, TMX was the first publicly-defined and available standard based on XML (itself not finalized when work on TMX began!). TMX anticipated a need that was just then beginning to be perceived as an issue, and it used the latest means to achieve it. This put TMX in the position of being so far ahead of the crowd that there has been a lag between TMX and the progress of translation tools vendors and customers. TMX had to sit, so to speak, until implementations caught up with it and could finally begin to demonstrate where it was successful and where changes were needed. Since 1997 adoption of TM and other technologies has been fairly rapid - by 1999 most attendees at LISA events were using TM and knew the technology in (perhaps painful) detail, or were in the process of implementing TM. However there is a lag between first adoption of TM and the point where TM repositories are large enough where exchange becomes worthwhile - if a company only has 12,000 segments in their TM repository it represents a much smaller investment than if they had 6 million segments, a figure that is not unreasonable today. So TMX has also had to wait for maturation of TM assets, something that takes years to happen. Since 2000 the need for TMX has been more and more acutely felt, and industry adoption of TMX has been quite high (approximately 25% of companies surveyed now use it in some form), but often without any real direction. Some of the issues that have surfaced recently (such as the effects of different segmentation methods on TMX usability) simply could not have been anticipated in 1997 and only now have become apparent. While complaints about TMX's limitations are real, I would say that TMX was remarkably well designed considering few had any experience with translation memory exchange issues at the time. TMX has been remarkably stable since it was first issued to the public. Although now on version 1.4, the changes have been fairly minimal, and credit should be given to the original OSCAR team that succeeded in creating a standard with enough foresight to allow for its use today. Is TMX perfect? No, but OSCAR is working to make it so. Is it usable at present? Yes, and it seems that some companies may already be using it as their main format for Translation Memories (see information below in the TM Survey results). Given current work in OSCAR's Segmentation Working Group it seems that TMX will become even more useful and that the time when TMs could be freely shared between tools with minimal difficulty is on the not-too-distant horizon. Preliminary results of the LISA TM SurveyAs many of our readers are no doubt aware, LISA has been conducting a survey regarding TM usage. Although results are preliminary, a few interesting points have already surfaced:
Figure 1. Years of TM use versus translation volume. Columns show relative numbers of survey respondents in each category.
Those who adopted Translation Memory tools around five years ago today have large and mature TMs. When TMX was first proposed the number of companies with multi-million segment TM repositories was quite small, but their numbers are growing - and the real need for TMX is growing with this. It is possible to discern in the data from this survey a coming swell of companies with large and valuable TMs who may find TMX useful, if not vital.
is LISA Publications Manager. A native of Alaska, he currently resides in Indiana. In addition to working for LISA, he is an emeritus member of the Brigham Young University Translation Research Group (TRG), a Provo, Utah-based translation, theory and technology think-tank directed by Dr. Alan Melby, and has edited a number of books on linguistics. |
![]() 8-12 December 2008 |
|||