discussion of text alignment tools
-
- Orange Belt
- Posts: 228
- Joined: Sun Feb 26, 2017 4:01 pm
- Languages: English (native); strong reading skills - Russian, Spanish, French, Italian, German, Serbo-Croatian, Macedonian, Bulgarian, Slovene, Farsi; fair reading skills - Polish, Czech, Dutch, Esperanto, Portuguese; beginner/rusty - Swedish, Norwegian, Danish
- x 590
discussion of text alignment tools
http://www.proz.com/forum/translator_re ... ml#2678642 mentions several free tools for sentence-aligning a source text with its translation (e.g., to produce parallel texts as pairs of sentences). LF Aligner might be the most obvious choice for home use but several others are also mentioned.
3 x
-
OnlineIversen
- Black Belt - 4th Dan
- Posts: 4787
- Joined: Sun Jul 19, 2015 7:36 pm
- Location: Denmark
- Languages: Monolingual travels in Danish, English, German, Dutch, Swedish, French, Portuguese, Spanish, Catalan, Italian, Romanian and (part time) Esperanto
Ahem, not yet: Norwegian, Afrikaans, Platt, Scots, Russian, Serbian, Bulgarian, Albanian, Greek, Latin, Irish, Indonesian and a few more... - Language Log: viewtopic.php?f=15&t=1027
- x 15038
Re: discussion of text alignment tools
If you have to align a very long text then it may be relevant to use a special tool, but the bilingual texts I use for intensive are short so I just use MS Word (similar programs like LibraWriter can also be used).
I make a table with two columns, and then I put a page or so of text in the target language to the left and the translation to the left. Then I can align the texts simply by moving the vertical line in the middle to the left or to yhe right right, and I can do fine adjustments by changing the font size (or the font itself), for instance to get the texts to match nicely up at the bottom of each page. The tables don't have to end precisely at a page shift - once you have found a position of the central line that gives good results you can usually do several pages with that setup. If not, then you just start a new table beneath - just as you do if you switch to another source text.
Long ago I experimented with other layouts, including interlinear and side-by-side layouts where the sentences were exactly matched, but I concluded that they weren't more easy to work with than the side-by-side layout you get with the method described above, and they were much more difficult to produce.
If I wanted a bilingual edition of the Lost time of Proust or the collected works of J.K. Rowling then I would still be satisfied with a tool that just produced a simple side-by-side setup. If you really want to read whole books then you must be fairly good at reading already, and then it doesn't matter whether the translations are half a centimeter higher or lower than the original passages.
The big problem is to get translations that follow the original texts so closely that you actually CAN match sentences and paragraphs, Often you'll find that whole sections in humanmade translations are missing or based on another version of the original text or whatever, and I haven't heard about any software that can solve that problem.
I make a table with two columns, and then I put a page or so of text in the target language to the left and the translation to the left. Then I can align the texts simply by moving the vertical line in the middle to the left or to yhe right right, and I can do fine adjustments by changing the font size (or the font itself), for instance to get the texts to match nicely up at the bottom of each page. The tables don't have to end precisely at a page shift - once you have found a position of the central line that gives good results you can usually do several pages with that setup. If not, then you just start a new table beneath - just as you do if you switch to another source text.
Long ago I experimented with other layouts, including interlinear and side-by-side layouts where the sentences were exactly matched, but I concluded that they weren't more easy to work with than the side-by-side layout you get with the method described above, and they were much more difficult to produce.
If I wanted a bilingual edition of the Lost time of Proust or the collected works of J.K. Rowling then I would still be satisfied with a tool that just produced a simple side-by-side setup. If you really want to read whole books then you must be fairly good at reading already, and then it doesn't matter whether the translations are half a centimeter higher or lower than the original passages.
The big problem is to get translations that follow the original texts so closely that you actually CAN match sentences and paragraphs, Often you'll find that whole sections in humanmade translations are missing or based on another version of the original text or whatever, and I haven't heard about any software that can solve that problem.
1 x
-
- Orange Belt
- Posts: 132
- Joined: Sat Jul 29, 2017 6:46 pm
- Languages: Polish (N), English (~B2ish), German (~B2ish)
- Language Log: https://tinyurl.com/yaofno9u
- x 316
Re: discussion of text alignment tools
I use LF Aligner to create a Excel file in which the sentences more or less match. Then I go through it and cut/paste text or delete rows for the text to match exactly. Some examples of how it looks after processing can be found here: https://forum.language-learners.org/viewtopic.php?f=19&t=6438
0 x
-
- Orange Belt
- Posts: 228
- Joined: Sun Feb 26, 2017 4:01 pm
- Languages: English (native); strong reading skills - Russian, Spanish, French, Italian, German, Serbo-Croatian, Macedonian, Bulgarian, Slovene, Farsi; fair reading skills - Polish, Czech, Dutch, Esperanto, Portuguese; beginner/rusty - Swedish, Norwegian, Danish
- x 590
Re: discussion of text alignment tools
Iversen, I was a bit confused by your response, but I think we might be speaking two different languages here... Alignment in this context does not refer to physical page layout, or to whether translations are "half a centimeter higher or lower" than the original.
Alignment by a sentence aligner means that an algorithm is used to determine which sentence(s) in a translation most likely match which sentence(s) in the original text. The sentence pairs, whether in the form of spreadsheet columns or a TMX file or something else, could be reformatted for presentation in various ways, but that has nothing to do with the process of identifying the sentence pairs.
Alignment by a sentence aligner means that an algorithm is used to determine which sentence(s) in a translation most likely match which sentence(s) in the original text. The sentence pairs, whether in the form of spreadsheet columns or a TMX file or something else, could be reformatted for presentation in various ways, but that has nothing to do with the process of identifying the sentence pairs.
2 x
- MorkTheFiddle
- Black Belt - 2nd Dan
- Posts: 2141
- Joined: Sat Jul 18, 2015 8:59 pm
- Location: North Texas USA
- Languages: English (N). Read (only) French and Spanish. Studying Ancient Greek. Studying a bit of Latin. Once studied Old Norse. Dabbled in Catalan, Provençal and Italian.
- Language Log: https://forum.language-learners.org/vie ... 11#p133911
- x 4883
Re: discussion of text alignment tools
Aligned texts get frequent mention hereabouts. I started a different thread, Bilingual Texts Redux, about the location of some parallel texts here https://forum.language-learners.org/vie ... =19&t=6725
0 x
Many things which are false are transmitted from book to book, and gain credit in the world. -- attributed to Samuel Johnson
-
OnlineIversen
- Black Belt - 4th Dan
- Posts: 4787
- Joined: Sun Jul 19, 2015 7:36 pm
- Location: Denmark
- Languages: Monolingual travels in Danish, English, German, Dutch, Swedish, French, Portuguese, Spanish, Catalan, Italian, Romanian and (part time) Esperanto
Ahem, not yet: Norwegian, Afrikaans, Platt, Scots, Russian, Serbian, Bulgarian, Albanian, Greek, Latin, Irish, Indonesian and a few more... - Language Log: viewtopic.php?f=15&t=1027
- x 15038
Re: discussion of text alignment tools
mcthulhu wrote:Alignment by a sentence aligner means that an algorithm is used to determine which sentence(s) in a translation most likely match which sentence(s) in the original text.
Since I don't use sentence aligners I naively used the word alignment in its normal, purely geometrical sense. It is actually a surprise to me that it is used differently in sentence alignment software - I would have expected a word like 'matching' for the task you describe.
0 x
Return to “Language Programs and Resources”
Who is online
Users browsing this forum: No registered users and 2 guests