low-frequency words that are unexpectedly frequent
-
- Orange Belt
- Posts: 242
- Joined: Wed Mar 21, 2018 6:54 pm
- Languages: English, Portuguese, Spanish, Catalan, French, Persian, Arabic, Mandarin, Japanese.
- x 444
Re: low-frequency words that are unexpectedly frequent
General (balanced) frequency lists are of limited usage other than for selecting a core (about 3000?) of commonly used words, because the frequency distribution of words will vary greatly from author or genre to genre. If you want to use frequency lists to guide your vocabulary learning, you shoud have the frequency counted from a corpora that reflects rather closely the gerna and registers you are getting exposed to, lest you will be missing frequent words and learning a lot of useless words.
0 x
-
- Brown Belt
- Posts: 1035
- Joined: Mon Jul 23, 2018 3:30 am
- Languages: English (n)
Italian - x 3289
Re: low-frequency words that are unexpectedly frequent
白田龍 wrote:General (balanced) frequency lists are of limited usage other than for selecting a core (about 3000?) of commonly used words, because the frequency distribution of words will vary greatly from author or genre to genre. If you want to use frequency lists to guide your vocabulary learning, you shoud have the frequency counted from a corpora that reflects rather closely the gerna and registers you are getting exposed to, lest you will be missing frequent words and learning a lot of useless words.
I can't tell if this is directed at me or someone else...if it was a response meant for me, then it doesn't apply because I already mentioned that I don't use frequency lists, I decide for myself which words are likely frequent enough to be worth focusing on.
0 x
Season 4 Lucifer Italian transcripts I created: https://learnanylanguage.fandom.com/wik ... ranscripts
- Iversen
- Black Belt - 4th Dan
- Posts: 4787
- Joined: Sun Jul 19, 2015 7:36 pm
- Location: Denmark
- Languages: Monolingual travels in Danish, English, German, Dutch, Swedish, French, Portuguese, Spanish, Catalan, Italian, Romanian and (part time) Esperanto
Ahem, not yet: Norwegian, Afrikaans, Platt, Scots, Russian, Serbian, Bulgarian, Albanian, Greek, Latin, Irish, Indonesian and a few more... - Language Log: viewtopic.php?f=15&t=1027
- x 15050
Re: low-frequency words that are unexpectedly frequent
People who define "headache", "boyfriend", "airport" etc. as two-words combinations may have inhaled too much coca or something ... and their results couldn't be taken seriously if that's how they do their counting. There are combinations which definitely are two words - like "port authority". But then their parts are mostly pronounced as two words, with a weak stress on each word. But even allowing for a grey zone the words on Hashimi's list aren't inside it. I seriously hope that trippingly is right, i.e. that the words are accepted on the frequency list, which would indicate that they are seen as single words.
Apart from that: in my world there is a lot of scientific terms which never would reach the 25.000 word treshold in a general corpus, but that's because I read too much about science...
Apart from that: in my world there is a lot of scientific terms which never would reach the 25.000 word treshold in a general corpus, but that's because I read too much about science...
1 x