Doing Trados-translated rtf cleanup manually in GNU Emacs

translation_articles_icon

ProZ.com Translation Article Knowledgebase

Articles about translation and interpreting
Article Categories
Search Articles


Advanced Search
About the Articles Knowledgebase
ProZ.com has created this section with the goals of:

Further enabling knowledge sharing among professionals
Providing resources for the education of clients and translators
Offering an additional channel for promotion of ProZ.com members (as authors)

We invite your participation and feedback concerning this new resource.

More info and discussion >

Article Options
Your Favorite Articles
You Recently Viewed...
Recommended Articles
  1. ProZ.com overview and action plan (#1 of 8): Sourcing (ie. jobs / directory)
  2. Réalité de la traduction automatique en 2014
  3. Getting the most out of ProZ.com: A guide for translators and interpreters
  4. Does Juliet's Rose, by Any Other Name, Smell as Sweet?
  5. The difference between editing and proofreading
No recommended articles found.

 »  Articles Overview  »  Technology  »  CAT Tools  »  Doing Trados-translated rtf cleanup manually in GNU Emacs

Doing Trados-translated rtf cleanup manually in GNU Emacs

By Maxime Bujakov | Published  09/21/2006 | CAT Tools | Recommendation:RateSecARateSecARateSecARateSecIRateSecI
Contact the author
Quicklink: http://som.proz.com/doc/928
Author:
Maxime Bujakov
France
Faransiis to Ingiriisi translator
 

See this author's ProZ.com profile
As a Trados user, you may want to clean a bilingual Trados-tagged *.rtf file "manually" for some reasons.

One such reason is that Trados, both the version 7 and earlier, may alter file formatting when cleaning the file from the source language segments with the Trados "Clean up" functionality.

(This might be due to Trados interacting with WordPad components. MFC WordPad components do not guarantee 100% preservation of styles/formats when re-saving files.)

The following is a step-by-step instruction on cleaning Trados-tagged *.rtf files in a plain editor with support for macro commands.

You will need to open your *.rtf file in a plain text editor.

(An example of a plain text editor is Notepad, a Windows built-in utility; however, its function may not be enough for the described procedure. I use GNU Emacs.)

An *.rtf file looks intimidating in its plain textual form to an unexperienced user, due to abundance of special tags.

Luckily, the relevant ones have been sorted out for you in the following description.

With a bit of care, you can save hours on erasing fragments "by mouse".

Make sure to leave out a backup copy before you start, in case you are lost in editing.

In the described instance, the source language is Russian (Russia), and the target is English (US).

Examples of Trados tags:

{\cs19\v\cf12\sub \{0>}

93\{>}

{\cs19\v\cf12\sub }

0\{>}

{\cs19\v\cf12\lang1049\langfe1033\sub\langnp1049 } (a 93% match marker)

A divider immediately follows the source closing tag. After the step 2, your cursor should be just before the divider string.

Define a macro to accommodate for the variable length of the divider (one, two or three digits). I recommend relying on the "{>}" ending substring.

5) Delete source (Russian) fragments, beginning with type (b) opening tags

{\cs19\v\cf12\lang1049\langfe1033\sub\langnp1049 \{0>}

and ending with the closing tags

{\cs19\v\cf12\lang1049\langfe1033\sub\langnp1049


Copyright © ProZ.com, 1999-2025. All rights reserved.
Comments on this article

Knowledgebase Contributions Related to this Article
  • No contributions found.
     
Want to contribute to the article knowledgebase? Join ProZ.com.


Articles are copyright © ProZ.com, 1999-2025, except where otherwise indicated. All rights reserved.
Content may not be republished without the consent of ProZ.com.