Localizing HTML documents

While continuing to look into internationalizing and localizing the 48 lessons for RUR-PLE, I asked around and got some very useful feedback from my community: recoding HTML pages to insert language tags is a lot of work, and it also makes authoring much more complicated and not something everyone can do. Imagine a teacher wanting to add lesson No. 49: they would have to understand the technique behind the tagging, when simply learning a bit of HTML would probably be a lot easier. So whatever you gain by simplifying the work for translators is lost by making authoring harder.

Well, actually, NO! A big THANK YOU to Elly for pointing out this great Java application to me: OmegaT. OmegaT lets you open XHTML, HTML, Microsoft Office 2007 XML, OpenOffice.org/StarOffice, XLIFF (Okapi), MediaWiki and plain text files, and hides the code from the translator. Translators see only what needs to be translated, can use a translation memory shared per project (you create projects inside the application) so that nothing already translated has to be translated again, and can export in the same format as the original document without touching the markup. I’ve tried it with HTML and presentation files and, wow, it just works as it should. It is a great time-saver: there is no need to internationalize anything for your project (or mine), and anyone can now write new lessons without having to worry about the translators.
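To make the idea of "hiding the code from the translator" a bit more concrete, here is a toy sketch in Python (the language RUR-PLE itself uses). It is not how OmegaT is implemented and is nowhere near as capable; the names `SegmentTranslator` and `translate_html` and the dictionary standing in for a translation memory are all made up for illustration. It only shows the principle: pull the text out of the HTML, look it up, and write the page back with the markup untouched.

```python
from html.parser import HTMLParser


class SegmentTranslator(HTMLParser):
    """Hypothetical helper: re-emit HTML as-is, translating only text nodes."""

    def __init__(self, memory):
        # convert_charrefs=False keeps entities such as &amp; out of handle_data
        super().__init__(convert_charrefs=False)
        self.memory = memory    # dict: source text -> translated text
        self.out = []           # pieces of the rebuilt document
        self.segments = []      # the text a translator would actually see

    def handle_starttag(self, tag, attrs):
        # Copy the tag exactly as written, attributes included
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.segments.append(text)
            # Swap in the translation if we have one, keep surrounding spaces
            data = data.replace(text, self.memory.get(text, text), 1)
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def translate_html(html, memory):
    """Return (translated HTML, list of text segments found)."""
    parser = SegmentTranslator(memory)
    parser.feed(html)
    parser.close()
    return "".join(parser.out), parser.segments


if __name__ == "__main__":
    page = '<p class="intro">Hello <b>world</b></p>'
    memory = {"Hello": "Bonjour", "world": "monde"}  # assumed sample entries
    translated, segments = translate_html(page, memory)
    print(segments)
    print(translated)
```

The lesson author keeps writing ordinary HTML, and the translator only ever deals with the list of segments. A real tool does much more than this (sentence segmentation, fuzzy matching against earlier translations, many file formats), which is exactly why it is nice not to have to write it yourself.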

This is also going to help me in my daily work, where I receive presentation files in different languages all the time and spend hours editing them slide by slide. On top of that, it is Open Source Software (GPL) and cross-platform (thanks to Java). What more could you ask for? I only regret having discovered it so late. 😉