LemmaGen's web service at http://lemmatise.ijs.si/Services handles lemmatization for 12 European languages. I like its user interface: the output shows the inflected form crossed out in red, with the dictionary form of the word in green just above it. This seems more readable to me than the inflection/lemma format. I'm not sure about LemmaGen's maximum capacity, but it lemmatized a whole chapter of a German book almost instantly. A couple of words looked strange, but overall it seems to do a very good job.
I took a look at the HTML and the individual words have markup like <div class="lw" title="begann >> beginnen">, which seems pretty easy to parse.
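For anyone who wants to try parsing that markup, here is a rough Python sketch using only the standard library. It assumes every word is wrapped in a div whose class includes "lw" and whose title has the form "inflected >> lemma", as in the example above; I haven't checked the rest of the page structure, and the names LemmaExtractor and extract_lemmas are just my own.

```python
from html.parser import HTMLParser


class LemmaExtractor(HTMLParser):
    """Collect (inflected form, lemma) pairs from <div class="lw" title="form >> lemma"> tags."""

    def __init__(self):
        super().__init__()
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        title = attrs.get("title") or ""
        # Assumption: the title always separates form and lemma with ">>".
        if tag == "div" and "lw" in classes and ">>" in title:
            form, lemma = (part.strip() for part in title.split(">>", 1))
            self.pairs.append((form, lemma))


def extract_lemmas(html_text):
    """Return a list of (inflected form, lemma) tuples found in html_text."""
    parser = LemmaExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.pairs


if __name__ == "__main__":
    sample = '<div class="lw" title="begann >> beginnen">begann</div>'
    print(extract_lemmas(sample))
```

You would save the results page from your browser and pass its contents to extract_lemmas. The parser also unescapes entities in attribute values, so a title written as "begann &amp;gt;&amp;gt; beginnen" in the page source should come out the same way.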