Help me create a Subs2SRS course?

General discussion about learning languages
rdearman
Site Admin
Posts: 7231
Joined: Thu May 14, 2015 4:18 pm
Location: United Kingdom
Languages: English (N)
Language Log: viewtopic.php?f=15&t=1836

Re: Help me create a Subs2SRS course?

Postby rdearman » Wed Dec 06, 2017 1:58 pm

rdearman wrote:Cool, it works. I had some issues with the text files because messing around with them on Windows seems to have confused the file encoding. So I'll start again, download the text to my Linux box, do all the hunalign, split, etc. there, and then run the forced alignment tool.

I played it in Subtitle Edit and it was perfectly aligned for the 3 different srt files I tried. So, fix the encoding and it is a done deal. I had to compile hunalign, but since I'm modifying the file to have one sentence per line, it did pretty well on the FR to EN alignment as well. Actually it makes for a good LR system. You can Listen & Read all day long. :)
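The one-sentence-per-line step mentioned above could look roughly like the sketch below. It is written in Python rather than Perl, and it is not the script from this thread: the function name and the splitting rule are assumptions, and the rule is deliberately naive (abbreviations such as "M." or "Mr." will be split incorrectly).

```python
import re

# Naive rule (an assumption): a sentence ends at ., !, ? or the ellipsis
# character when that mark is followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?…])\s+")


def one_sentence_per_line(text: str) -> str:
    """Return text with hard wraps removed and one sentence per line."""
    # Collapse existing line breaks so a sentence is never cut mid-line.
    flat = " ".join(text.split())
    sentences = [s for s in _SENTENCE_END.split(flat) if s]
    return "\n".join(sentences)
```

Because hunalign aligns line by line, feeding it one sentence per line in both languages gives it sentence-sized units to pair up.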

OK, Windows wasn't the culprit for the encoding problem. It was the dipstick in charge of the programming (me), who forgot that Perl needs to be told that it is getting UTF-8 input and that you want UTF-8 output. Having fixed that by adding a couple of lines to the Perl script, it now writes both the text files and the srt files correctly.
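The exact Perl lines are not shown here; in Perl this kind of fix is commonly done with `binmode` or the `open` pragma. The same idea, sketched in Python with a hypothetical helper name, is to state the encoding on both the read and the write instead of relying on the platform default:

```python
from pathlib import Path


def copy_as_utf8(src: str, dst: str) -> None:
    """Read src as UTF-8 and write it back out as UTF-8."""
    # Decode explicitly as UTF-8 rather than the platform default
    # (which on Windows is often a legacy code page).
    text = Path(src).read_text(encoding="utf-8")
    # Encode explicitly on output too; newline="\n" keeps Unix line
    # endings even when the script runs on Windows.
    with open(dst, "w", encoding="utf-8", newline="\n") as out:
        out.write(text)
```

If the input is not actually UTF-8, the read raises `UnicodeDecodeError` instead of silently producing garbled accents, which makes this sort of problem easier to spot.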

Now for me, I'm actually cool with just having subtitle files in French, and I think I could work with this as a complete French card. However, the sentences are really long. I had to look at them in Subtitle Edit because even VLC will not display subtitles for an mp3 by default (and I thought it did everything!). But a little digging showed that if you do the following:
Enable a visualization (Audio > Visualizations > Spectrometer) => the srt subtitles are autodetected and play fine

Then it will show the subtitles, and even the really long sentences. So this is great for LR. I can listen and read along in French. Sweet.
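For anyone wanting to produce srt files like these, here is a minimal sketch of the srt layout in Python. It assumes the aligner's output has already been parsed into (start, end, text) tuples with times in seconds; that input shape and both function names are assumptions, not the output format of any particular forced alignment tool.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as HH:MM:SS,mmm (srt uses a comma)."""
    ms_total = round(seconds * 1000)
    hours, rem = divmod(ms_total, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues) -> str:
    """Build srt text from an iterable of (start, end, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        # Each cue: a counter line, a timing line, then the subtitle text.
        timing = f"{srt_timestamp(start)} --> {srt_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

Writing the result with an explicit UTF-8 encoding keeps the French accents intact in the player.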

I will align the English and generate a deck at some point to share on AnkiWeb. But even just for LR, this will be great for me.

I post on this forum with mobile devices, so excuse short msgs and typos.

smallwhite
Black Belt - 2nd Dan
Posts: 2386
Joined: Mon Jul 06, 2015 6:55 am
Location: Hong Kong
Languages: Native: Cantonese;
Good: English, French, Spanish, Italian;
Mediocre: Mandarin, German, Swedish, Dutch.

Re: Help me create a Subs2SRS course?

Postby smallwhite » Wed Dec 06, 2017 2:25 pm

rdearman wrote:So I downloaded Around the World in 80 Days, English and French, both text and audio.
I wrote ... some Perl scripts to create one line per sentence.

I thought very hard about your project yesterday. So hard I dreamt about it last night. In the dream, a man kept talking and talking, and I was to split his speech into sentences, and to sort the sentences into different plastic trays based on some criteria. No idea how I did it, but the sentences in the trays were tangible. Like those long bars with company names that you slide into building directory signs.
Dialang or it didn't happen.


Re: Help me create a Subs2SRS course?

Postby rdearman » Wed Dec 06, 2017 4:11 pm

smallwhite wrote:
rdearman wrote:So I downloaded Around the World in 80 Days, English and French, both text and audio.
I wrote ... some Perl scripts to create one line per sentence.

I thought very hard about your project yesterday. So hard I dreamt about it last night. In the dream, a man kept talking and talking, and I was to split his speech into sentences, and to sort the sentences into different plastic trays based on some criteria. No idea how I did it, but the sentences in the trays were tangible. Like those long bars with company names that you slide into building directory signs.

I think you should cut down on the caffeine and/or alcohol before bedtime. :D
