Does anybody have recommendations for SRT editors that allow users to align two SRT files? I'm in particular interested in editors that allow users to copy the translations over the original text so that both have the same time codes.
So far I've only found one tool, Subs Re-Timer, which comes with sub2srs. However, it doesn't allow users to merge or split source or target segments. (Merging is necessary, if the source SRT file contains one segment and the target SRT file two or more segments; splitting is necessary in the opposite case.)
Here's an example of two German segments that would need to be merged:
Are there any other free tools like Subs Re-Timer that support merging/splitting of source and target segments?
Recommendations for bilingual SRT editors
-
- Green Belt
- Posts: 404
- Joined: Sat Jul 18, 2015 6:21 pm
- Languages: German (N)
- x 807
- rdearman
- Site Admin
- Posts: 7260
- Joined: Thu May 14, 2015 4:18 pm
- Location: United Kingdom
- Languages: English (N)
- Language Log: viewtopic.php?f=15&t=1836
- x 23314
- Contact:
Re: Recommendations for bilingual SRT editors
We have a thread about this a long while ago. I did manage to align SRT files (with help from EMK's instructions) using a free linux tool called Aeneas. I didn't bother with the VM Docker thing, but just installed it directly on my Linux box. You can also look at hunalign.
https://forum.language-learners.org/vie ... =14&t=7314
https://forum.language-learners.org/vie ... =14&t=7314
2 x
: Read 150 books in 2024
My YouTube Channel
The Autodidactic Podcast
My Author's Newsletter
I post on this forum with mobile devices, so excuse short msgs and typos.
My YouTube Channel
The Autodidactic Podcast
My Author's Newsletter
I post on this forum with mobile devices, so excuse short msgs and typos.
- Adrianslont
- Blue Belt
- Posts: 827
- Joined: Sun Aug 16, 2015 10:39 am
- Location: Australia
- Languages: English (N), Learning Indonesian and French
- x 1936
Re: Recommendations for bilingual SRT editors
Have you looked at the free program SubtitleEdit. It’s for windows.
I never align two languages, I just use it’s built in Google translate function and get a very literal translation BUT as far as I can see, you can align.
It will split and merge a line, easily.
I never align two languages, I just use it’s built in Google translate function and get a very literal translation BUT as far as I can see, you can align.
It will split and merge a line, easily.
2 x
-
- Orange Belt
- Posts: 242
- Joined: Wed Mar 21, 2018 6:54 pm
- Languages: English, Portuguese, Spanish, Catalan, French, Persian, Arabic, Mandarin, Japanese.
- x 444
Re: Recommendations for bilingual SRT editors
I wrote this in Python it should copy the timings and merge... but it will make a mess when split would be required...
needs: https://github.com/tkarabela/pysubs2
needs: https://github.com/tkarabela/pysubs2
Code: Select all
import pysubs2
engSub = pysubs2.load(r'timingSub.ass')
japSub = pysubs2.load(r'textSub.ass')
lastTrans = (0.0, 0.0)
def getTranslation(line, sub):
global lastTrans
return u'\\N'.join([a.text for a in sub if (a.start + a.end)/2 >= line.start and \
(a.start + a.end)/2 <= line.end and \
(a.start, a.end) != lastTrans])
lastTrans = (a.start, a.end)
for line in engSub:
line.text = getTranslation(line, japSub)
engSub.save("result.srt")
1 x
-
- Black Belt - 2nd Dan
- Posts: 2622
- Joined: Mon Aug 07, 2017 10:37 pm
- Languages: German (N)
French (C)
English (C)
Spanish (A2)
Lithuanian - x 3226
Re: Recommendations for bilingual SRT editors
https://github.com/bonigarcia/dualsub/
i don't know if Google Translate now works for Dualsub, but the two subs should be aligned matchingly.
i don't know if Google Translate now works for Dualsub, but the two subs should be aligned matchingly.
1 x
- iguanamon
- Black Belt - 2nd Dan
- Posts: 2363
- Joined: Sat Jul 18, 2015 11:14 am
- Location: Virgin Islands
- Languages: Speaks: English (Native); Spanish (C2); Portuguese (C2); Haitian Creole (C1); Ladino/Djudeo-espanyol (C1); Lesser Antilles French Creole (B2)
Studies: Catalan (B2) - Language Log: viewtopic.php?t=797
- x 14269
Re: Recommendations for bilingual SRT editors
For those of us who do not have programming skills, there's another way to align subs. It can be done manually. Once a learner gets to the point where they can benefit from subs, they can be downloaded into an srt file and opened in microsoft word pad.
Then open a word/openoffice document. Insert a two column one row table. Insert TL subs on the left. Then insert L1 subs on the right and line them up. The time stamps almost never match. sometimes there may be extraneous data- like written signs being translated that are not in the original audio; stage comments: "she coughs"; etc. These superfluous phrases can be easily omitted manually.
It's funny, many learners here rave about how useful Assimil is. An Assimil course is a parallel text and annotated graded reading (with audio). While it isn't easy to replicate the graded reading from online sources or the annotations, the parallel text aspect is easy to replicate, even if one doesn't have the computer skills necessary to do it. I believe the reason more learners do not take advantage of parallel texts is that they are not easily available out of the box, "plug and chug" ready.
Even if a learner has little knowledge of TL, clues to help with manual alignment are present with proper names, international vocabulary, etc. I can align a 22 minute dubbed US show in about 5-10 minutes work in a Latin script IE language. I can print it to pdf, add notes and definitions, put it on my phone or tablet and read it while I listen on another device... if I have audio. If my pronunciation is good enough, I don'e even need the audio.
While I know that it is much easier and desirable to let the computer do the work for you, and I know we have a large number of members who know how to program, there are also a lot of us who simply do not have these skills and wouldn't know a Python script from Devanagari script. Learning from parallel texts, whether they be from subtitle files or short articles, is so highly useful and very often underutilized by many learners. It is a wasted opportunity for many learners who could gain a lot of benefit from using them in their learning. Integrating parallel texts in learning languages can add a major boost in learning a language, bridging the gap in reading when a learner doesn't yet know a large percentage of vocabulary to make reading less frustrating.
I wouldn't do a manual alignment of a book, that would take a lot of time to do manually, but a half an hour or an hour television show, a short 3-5 page article, a book of the Bible, yes. That's easy to do, even if I can't program my way out of a wet paper sack.
Then open a word/openoffice document. Insert a two column one row table. Insert TL subs on the left. Then insert L1 subs on the right and line them up. The time stamps almost never match. sometimes there may be extraneous data- like written signs being translated that are not in the original audio; stage comments: "she coughs"; etc. These superfluous phrases can be easily omitted manually.
It's funny, many learners here rave about how useful Assimil is. An Assimil course is a parallel text and annotated graded reading (with audio). While it isn't easy to replicate the graded reading from online sources or the annotations, the parallel text aspect is easy to replicate, even if one doesn't have the computer skills necessary to do it. I believe the reason more learners do not take advantage of parallel texts is that they are not easily available out of the box, "plug and chug" ready.
Even if a learner has little knowledge of TL, clues to help with manual alignment are present with proper names, international vocabulary, etc. I can align a 22 minute dubbed US show in about 5-10 minutes work in a Latin script IE language. I can print it to pdf, add notes and definitions, put it on my phone or tablet and read it while I listen on another device... if I have audio. If my pronunciation is good enough, I don'e even need the audio.
While I know that it is much easier and desirable to let the computer do the work for you, and I know we have a large number of members who know how to program, there are also a lot of us who simply do not have these skills and wouldn't know a Python script from Devanagari script. Learning from parallel texts, whether they be from subtitle files or short articles, is so highly useful and very often underutilized by many learners. It is a wasted opportunity for many learners who could gain a lot of benefit from using them in their learning. Integrating parallel texts in learning languages can add a major boost in learning a language, bridging the gap in reading when a learner doesn't yet know a large percentage of vocabulary to make reading less frustrating.
I wouldn't do a manual alignment of a book, that would take a lot of time to do manually, but a half an hour or an hour television show, a short 3-5 page article, a book of the Bible, yes. That's easy to do, even if I can't program my way out of a wet paper sack.
Last edited by iguanamon on Sun May 05, 2019 1:32 pm, edited 1 time in total.
2 x
- rdearman
- Site Admin
- Posts: 7260
- Joined: Thu May 14, 2015 4:18 pm
- Location: United Kingdom
- Languages: English (N)
- Language Log: viewtopic.php?f=15&t=1836
- x 23314
- Contact:
Re: Recommendations for bilingual SRT editors
Good point. I did actually align an entire book once. I didn't use word but a linux editor but it worked out the same. I didn't use tables but page breaks so that when printed I had english on odd pages and french on even pages.
EDIT: I thought I would show some photos. Admittedly the OP wanted electronic files, but you can generate dead tree materials too. It took me about 3-4 hours if I remember correctly, but afterwards it was relativity easy to produce a PDF, print, glue, and trim into a nice book. Which reminds me, I should dig it out and read it. I actually made about 4 of them and gave 3 away to friends who were learning French.
EDIT: I thought I would show some photos. Admittedly the OP wanted electronic files, but you can generate dead tree materials too. It took me about 3-4 hours if I remember correctly, but afterwards it was relativity easy to produce a PDF, print, glue, and trim into a nice book. Which reminds me, I should dig it out and read it. I actually made about 4 of them and gave 3 away to friends who were learning French.
You do not have the required permissions to view the files attached to this post.
0 x
: Read 150 books in 2024
My YouTube Channel
The Autodidactic Podcast
My Author's Newsletter
I post on this forum with mobile devices, so excuse short msgs and typos.
My YouTube Channel
The Autodidactic Podcast
My Author's Newsletter
I post on this forum with mobile devices, so excuse short msgs and typos.
-
- Black Belt - 2nd Dan
- Posts: 2622
- Joined: Mon Aug 07, 2017 10:37 pm
- Languages: German (N)
French (C)
English (C)
Spanish (A2)
Lithuanian - x 3226
Re: Recommendations for bilingual SRT editors
There is a problem with professionally done translations that can be seen in the first line of the example above
Three words don't appear in the German translation at all: "me", "countless" and "suffered"; "at the hands of" is reduced to the single word "seitens".
Comparing this may be interesting to a student of stylistics, but not to a learner.
I also do the bi-directional method, but always look for an equivalent as long as it doesn't hurt grammar (and style) too much.
It can also be done here:
Er erzählte mir von zahllosen Beleidigungen , die er von Seiten ... erlitten hatte.
He told me of countless insults suffered at the hands of
Er erzählte von Beleidigungen seitens sein ....
Three words don't appear in the German translation at all: "me", "countless" and "suffered"; "at the hands of" is reduced to the single word "seitens".
Comparing this may be interesting to a student of stylistics, but not to a learner.
I also do the bi-directional method, but always look for an equivalent as long as it doesn't hurt grammar (and style) too much.
It can also be done here:
Er erzählte mir von zahllosen Beleidigungen , die er von Seiten ... erlitten hatte.
1 x
-
- Green Belt
- Posts: 404
- Joined: Sat Jul 18, 2015 6:21 pm
- Languages: German (N)
- x 807
Re: Recommendations for bilingual SRT editors
Subtitlers need to observe line llimits specified by movie production companies and/or distributors. For example, Netflix recommends around 42 Latin characters per line. And since German words tend to be longer than English words, subtitlers are often forced to omit some words in order not to exceed these limits.Kraut wrote:Three words don't appear in the German translation at all: "me", "countless" and "suffered"; "at the hands of" is reduced to the single word "seitens".
1 x
-
- Orange Belt
- Posts: 228
- Joined: Sun Feb 26, 2017 4:01 pm
- Languages: English (native); strong reading skills - Russian, Spanish, French, Italian, German, Serbo-Croatian, Macedonian, Bulgarian, Slovene, Farsi; fair reading skills - Polish, Czech, Dutch, Esperanto, Portuguese; beginner/rusty - Swedish, Norwegian, Danish
- x 590
Re: Recommendations for bilingual SRT editors
There are a number of sentence alignment tools available, used for producing bilingual translation memories (databases of aligned sentences), for instance, which allow splitting and merging segments. This task seems similar enough. LF Aligner (free) at https://sourceforge.net/projects/aligner/ would probably be my first recommendation. It has a graphical interface for correcting alignments. For a commercial tool, see https://youalign.com/.
Once you have a .tmx file, a TMX editor like https://sourceforge.net/projects/tmxeditor/ might also be useful for correcting it.
Obviously you can do anything you want manually, if you have enough time, but it's a lot more work than using a tool designed for that purpose.
Once you have a .tmx file, a TMX editor like https://sourceforge.net/projects/tmxeditor/ might also be useful for correcting it.
Obviously you can do anything you want manually, if you have enough time, but it's a lot more work than using a tool designed for that purpose.
1 x
Return to “Practical Questions and Advice”
Who is online
Users browsing this forum: No registered users and 2 guests