Main Content

Understanding How to Use
Globalization Industry Standards

Instructor: Andrzej Zydron

09:00 - 17:00 : Thursday, December 11

Overview

This workshop teaches the key concepts behind Unicode, DITA, SRX, TBX, TMX, XLIFF, Translation Web Services, xml:tm, W3C ITS, GMX-V, GMX-C and GMX-Q. The workshop takes you through each standard in detail providing a form of insight from both the executive and technical point of view that is not available anywhere else in one session.

The workshop concentrates in addition on how the various standards can be made to inter operate between each other, providing for a ‘Lego’ approach to putting together fast and efficient localization architectures.

Put together these standards enable a dramatic improvement in how text for localization is authored, architected exchanged and translated. These standards cover everything from authoring granularity, through standard metrics for volume, quality and complexity, through automated interchange of files, memory and terminology as well as dramatically improving translation memory operations and performance.

Who will benefit from this workshop and why?

Translation Agencies, Translators, Localization Engineers interested in

  • Gain valuable insight into these standards which will affect you in the near future. Customers will increasingly demand support for these standards from you. Understand how these standards will affect you and make your life much easier.

Executives, International Directors, Localization Managers interested in

  • Understand the value that these standards will bring to your organizations and how they will simplify interoperability between customers and suppliers. Understand how they can dramatically improve translation memory operations and how they can establish standard metrics for volume, quality and complexity.

Documentation Content Managers interested in

  • Learn how the authoring and translation processes can be dramatically improved through the implementation of these standards.

Agenda

09:00 - 09:20 Welcome and Introduction

09:20 - 09:45 Why Standards?

This chapter discusses the concept of standards, why we use them, what benefit do they bring as well as the main difference between de facto and open standards. We will discuss the benefits that open standards bring.

09:45 - 10:00 Why XML?

This chapter tells us about the importance that XML has brought to the standards process as well as its strengths and weaknesses.

10:00 - 10:15 Different types of Localization Industry Standards

Here we will define the different types of Localization Industry Standards by role and type.

10:15 - 10:45 Coffee Break

10:45 - 11:30 Encoding Standards

This chapter discusses the main encoding standards and their relationship with Unicode, UTF-8, UTF-16, UTF-32 and the four normalization forms.

11:30 - 12:15 Descriptive Standards

This chapter will discuss the work of the W3C ITS committee and the effects this will have on Localization Industry Standards. A detailed status will be provided on the work in progress of this important committee.

12:15 - 13:30 Lunch

13:30 - 14:30 Exchange Standards

This chapter will provide a detailed description of the Localization Industry Exchange Standards: TMX, SRX, TBX, GMX, XLIFF and OLIF. Detailed explanations of the role of each standard will be provided, along with examples.

14:30 - 15:00 Coffee Break

15:00 - 15:45 Exchange Standards Continued

15:45 - 16:15 Interoperability

This chapter will provide a detailed description of the Translation Web Services Standard, destined to become the ‘glue’ that will bind localization automation over the Internet.

16:15 - 16:45 Reuse

A detailed explanation of two of the most effective ways of reducing localization costs will be provided. Both will have a profound effect on how we author and translate XML based documentation.

16:45 - 17:00 Putting it all together

Having presented all of the various types of standard we will put together an architecture that shows how the various standards interact and cooperate with one another.

Program

Welcome and Introduction

  • Who's who?
  • What do we want from this workshop?

Why Standards?

  • De facto standards
  • Open standards
  • The standards process
  • Main Standards bodies providing Localization Industry Standards
    • W3C
    • ISO
    • Unicode Consortium
    • OASIS
    • LISA OSCAR
    • OLIF

Why XML?

  • The road to XML
  • The effect of XML on standards

Different types of Localization Industry Standards

  • Encoding
  • Descriptive
  • Exchange
  • Interoperability
  • Reuse

Encoding Standards

  • Character sets

    History of character sets ASCII to Windows code page 1252 MBCS: multi-byte character sets Unicode & UTF-8, UTF-16, UTF-32 Unicode normalization forms GB18030 and its implications for doing business in China

  • Locales
    • Definition
    • Usage
    • Locale-dependent functions

Different types of Localization Industry Standards

  • W3C ITS
    • History and evolution
    • Current status

Exchange Standards

  • Introduction to Localization Industry Exchange Standards
    • TMX - Translation Memory eXchange
    • SRX - Segmentation Rules eXchange
    • TBX - TermBase eXchange
    • GMX - Global Information Management eXchange
    • XLIFF - XML Localization Interchange File Format
    • OLIF - Open Lexicon Interchange Format
  • TMX - Translation Memory eXchange
    • History and evolution
    • Current status
    • Document structure
    • Implementation Guidelines
    • Examples
  • SRX - Segmentation Rules eXchange
    • History and evolution
    • Current status
    • Document structure
    • Implementation Guidelines
    • Examples
  • TBX - TermBase eXchange
    • History and evolution
    • Current status
    • Document structure
    • Implementation Guidelines
    • Variants: TBX Lite, TBX eXtra Lite
    • Examples
  • GMX - Global Information Management eXchange
    • History and evolution
    • Current status
      • GMX-V - Volume
      • GMX-C - Complexity
      • GMX-Q - Quality
  • GMX-V - Global Information Management eXchange Volume
    • Document structure
    • Implementation Guidelines
    • Examples
  • XLIFF - XML Localization Interchange File Format
    • History and evolution
    • Current status
    • Document structure
    • Implementation Guidelines
    • Examples
  • OLIF - Open Lexicon Interchange Format
    • History and evolution
    • Current status
    • Document structure
    • Implementation Guidelines
    • Examples

Interoperability

  • Translation Web Services
    • History and evolution
    • Current status
    • Document structure
    • Implementation Guidelines
    • Examples

Reuse

  • DITA - Darwin Information Typing Architecture
  • xml:tm - XML Text Memory

DITA - Darwin Information Typing Architecture

  • History and evolution
  • Current status
  • Document structure
  • Implementation Guidelines
  • Examples

xml:tm - XML Text Memory

  • History and evolution
  • Current status
  • Document structure
  • Implementation Guidelines
  • Examples

Putting it all together

Having presented all of the various types of standard we will put together an architecture that shows how the various standards interact and cooperate with one another.

About Andrzej Zydron

Andrzej Zydroń was born in England. Educated in France he started working in IT in 1976. His experience has covered all aspects of computing, with in depth knowledge of Software Engineering, SGML, XML, encoding methodologies and translation memory. Highlights of his career include:

• The design and architecture of the European Patent Office patent data capture system for Xerox Business Services.

• The design and architecture of the Xerox Language Services XTM translation memory system in 1996.

• Author of XML and SGML filters for SDL International’s SDLX Translation Suite.

• Assisting the Oxford University Press, the British Council and Oxford University in work on the New Dictionary of the National Biography.

Andrzej Zydroń is a member of the LISA OSCAR Steering Committee. He is the technical architect and editor of the GILT Metrics proposed specification suite, as well as editor of the proposed TBX Link specification. Zydroń also sits on the OASIS technical committees for Translation Web Services, XLIFF and XLIFF segmentation. He is also a W3C invited expert sitting on the W3C ITS technical committee.

As CTO for XML-INTL, he is currently developing the next generation of XML-based text memory systems to reduce authoring and translation costs for documentation. Zydroń is fluent in English, Polish and French.

About LISA Workshops

LISA Workshops make use of presenters with real-world experience in the subjects they teach about. Workshops do not pitch any products or services, but instead provide a general overview with candid and practical evaluations of tools and products that impact your business.