TM translation memory maintenance
De persoon die dit onderwerp heeft geplaatst: Marcos Zattar
Marcos Zattar
Marcos Zattar
Duitsland
Local time: 00:40
Duits naar Portugees
+ ...
Mar 12, 2011

Hello!

I have 3 TMs of the same client which I imported to memoQ, I am sure there are duplicates in it. I would like to condense all three TMs in one, after cleaning the duplicates. Is it possible to do it inside memoQ?

Also: In past projects there where some source with bad translations which I retranslated. That means that there are also "source text duplicates" with different translations - in these cases, I would need to get rid of the older translation (maybe using
... See more
Hello!

I have 3 TMs of the same client which I imported to memoQ, I am sure there are duplicates in it. I would like to condense all three TMs in one, after cleaning the duplicates. Is it possible to do it inside memoQ?

Also: In past projects there where some source with bad translations which I retranslated. That means that there are also "source text duplicates" with different translations - in these cases, I would need to get rid of the older translation (maybe using the name of the translator to sort out the bad ones??)

In the past I used to export my memories from the Workbench into TMX format, open it in Olifant and copy to an Excel table - Excel can easily filter out duplicates. This method works, but I didn't manage to preserve the fields other than the source and the target text fields, like translator, date, use count, etc.

It seems that all major CATs systems are continually improving, but managing memories still seems to be awkward and lacks precision/straightforwardness.

Does anybody know a good workaround or program?

Thanks!
Collapse


I_CH
 
Epameinondas Soufleros
Epameinondas Soufleros  Identity Verified
Griekenland
Local time: 01:40
Lid 2008
Engels naar Grieks
+ ...
You can add Excel in the process Mar 12, 2011

You can export your 3 TMs as CSV files, combine them into an Excel sheet, and then (using Excel 2007 or 2010), format the range that contains your data as a table, so that you can then go to "Data" in the ribbon and click on "Remove duplicates".

After the above process, you can save the file as Unicode text (Save as... > File type: Unicode text) and import the .txt file into a memoQ TM.

Hope that is clear enough.


 
Marcos Zattar
Marcos Zattar
Duitsland
Local time: 00:40
Duits naar Portugees
+ ...
ONDERWERPSTARTER
No need for Excel Mar 13, 2011

Epameinondas,

thank you for this hint. This is exactly what I tried to do, but the way back to Olifant didn't work.

I found out that the manipulation and changes I need can be done inside Olifant, we don't need Excel. Here is how to remove duplicates:

http://okapi.sourceforge.net/Release/Olifant/Help/howtos.htm

Thanks!


 
Epameinondas Soufleros
Epameinondas Soufleros  Identity Verified
Griekenland
Local time: 01:40
Lid 2008
Engels naar Grieks
+ ...
Good point Mar 13, 2011

Yes, that works fine; I've just downloaded Olifant and tried it. I thought Olifant would be obsolete by now, but they still haven't introduced a replacement for it in the new, Java implementation of OKAPI Apps.

By the way, Checkmate (a component in the Java flavour of OKAPI Apps) can perform a series of useful checks against a TMX (and other file-types). When you see an error you'd like to fix, you open the corresponding file in a text editor, locate the segment based on Checkmate's
... See more
Yes, that works fine; I've just downloaded Olifant and tried it. I thought Olifant would be obsolete by now, but they still haven't introduced a replacement for it in the new, Java implementation of OKAPI Apps.

By the way, Checkmate (a component in the Java flavour of OKAPI Apps) can perform a series of useful checks against a TMX (and other file-types). When you see an error you'd like to fix, you open the corresponding file in a text editor, locate the segment based on Checkmate's info, and do the correction you desire. For example, for a TMX of mine it spotted three instances of typos, where I had typed a word twice in a row (e.g. "της της"). By using the above method, I eliminated those typos easily.
Collapse


 
Marcos Zattar
Marcos Zattar
Duitsland
Local time: 00:40
Duits naar Portugees
+ ...
ONDERWERPSTARTER
Thanks Mar 31, 2016

Thanks for introducing me to Checkmate, I will give it a try. Cheers!

 
I_CH
I_CH  Identity Verified

Local time: 00:40
Duits naar Italiaans
+ ...
Update? Jun 4, 2024

Hi, is there an update to this thread like new more powerful apps?
I also need to clean up rebarbative memoQ TMs
Many thanks in advance


 
CafeTran Trainer
CafeTran Trainer
Nederland
Lid 2006
CafeTran Espresso Jun 4, 2024

I_CH wrote:

Hi, is there an update to this thread like new more powerful apps?


CafeTran Espresso has a TMX Edit mode in which you can perform many useful cleaning tasks.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

TM translation memory maintenance






CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »
Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »