Problem downloading DLI, checking links (size).

Ask specific questions about your target languages. Beginner questions welcome!
User avatar
astromule
Green Belt
Posts: 434
Joined: Tue Jul 21, 2015 12:51 am
Location: Argentina
Languages: Spanish (N), English (C2), French, Portuguese, Italian, Norwegian, Swedish, Danish, German, Russian
Language Log: viewtopic.php?f=15&t=794&start=240
x 281

Problem downloading DLI, checking links (size).

Postby astromule » Sat Aug 22, 2015 2:56 am

Hi! I've had a problem downloading all courses from https://www.livelingua.com/dli-language-courses.php, using "DownThemAll!", as for some languages the addon interprets the links as repetitions and don't download them. The problem is that sometimes seem are to be identical copies and sometimes only the name is the same. I've downloaded all languages, but I believe some of them were interpreted as "identical" when they actually weren't, for the following languages: arabic, polish, russian, portuguese, thai.

With DownThemAll you can export your links to a simple text file. Is there some way to check the links with the downloaded content? I'll need to check both size and name. The download managers that I've tried (jdownloader, mipony, freedownloadmanager, downthemall) only offer the option to check by name only and they ask for each individual file each time. That means that if you're downloading 800 files they're going to ask you the same question 800 times.

Thanks in advance.
0 x

User avatar
rdearman
Site Admin
Posts: 7259
Joined: Thu May 14, 2015 4:18 pm
Location: United Kingdom
Languages: English (N)
Language Log: viewtopic.php?f=15&t=1836
x 23303
Contact:

Re: Problem downloading DLI, checking links (size).

Postby rdearman » Sat Aug 22, 2015 10:27 am

You could use wget to mirror the entire website down to your harddrive. I think that is what you want, basically download everything regardless and don't prompt.

wget -mkEpnp http://example.org

Because this will hammer the crap out of the website you might want to look at the various wget options for limiting the bandwidth used.
3 x
: 26 / 150 Read 150 books in 2024

My YouTube Channel
The Autodidactic Podcast
My Author's Newsletter

I post on this forum with mobile devices, so excuse short msgs and typos.

User avatar
astromule
Green Belt
Posts: 434
Joined: Tue Jul 21, 2015 12:51 am
Location: Argentina
Languages: Spanish (N), English (C2), French, Portuguese, Italian, Norwegian, Swedish, Danish, German, Russian
Language Log: viewtopic.php?f=15&t=794&start=240
x 281

Re: Problem downloading DLI, checking links (size).

Postby astromule » Sat Aug 22, 2015 6:02 pm

Thanks! I didn't know about wget, so I tried to use it using this guide https://builtvisible.com/download-your- ... with-wget/ I couldn't make it work.
In the past to download full websites I've used HTTrack Website Copier and WebSuction.
But isn't a way to check just for the missing files? DownThemAll, for example, compares what has been downloaded and if it finds it there, it doesn't download it. As I said before, the problem with this is that several files from DLI have the same name but different content. I'd like if possible to not have to download everything again, but as I already have the links, perhaps that would be the easiest option, just to go language by language and create another folder when the files in the DTA list appear in red. That's what I did for Thai, when I detected the problem.

rdearman wrote:You could use wget to mirror the entire website down to your harddrive. I think that is what you want, basically download everything regardless and don't prompt.

wget -mkEpnp http://example.org

Because this will hammer the crap out of the website you might want to look at the various wget options for limiting the bandwidth used.
0 x

User avatar
daegga
Blue Belt
Posts: 565
Joined: Thu Jul 09, 2015 12:00 am
Location: Upper Austria
Languages: Bavarian (spoken), German
-- ≥ C1 passive --
English (IELTS 8.5)
Scandinavian (a: N>D>S)
-- along the way --
French, Italian
-- can read with dict --
Old Norse
Language Log: https://forum.language-learners.org/vie ... 15&t=17055
x 970
Contact:

Re: Problem downloading DLI, checking links (size).

Postby daegga » Sat Aug 22, 2015 6:25 pm

If you can easily export all those links that weren't downloaded, then save them in a file one URL per line. Then invoke "wget -x -i <file>". This will make sure that they get downloaded in the same directory structure as on the server, so you can distinguish easily between the languages even if they aren't mentioned in the filename.
1 x
jag nöjer mig med tystnad


Return to “Practical Questions and Advice”

Who is online

Users browsing this forum: No registered users and 2 guests