Bex wrote:
Interesting to think that HP1 in English has 5,687 unique words and that would be classed as C2 on the CEFR scale if you knew every word or do the two not correlate like that?
1. "5,687 unique words and that would be classed as C2 on the CEFR scale" is not true. That's not what the table represents. The document says "Milton and Meara (2003) tested students taking and passing Cambridge exams at every level of the CEFR and estimated their vocabulary sizes using the XLex tests". That is, the students who passed C2 happened to have vocabularies of ~5000 words. They could just as well happen to be nurses or happen to like Metallica, but none of that is what makes them C2.
2. 5,687 unique words likely means every inflected form is counted separately, so comería, comerías, comeríamos, comeríais, comerían get counted as 5 unique words rather than as one. In the document they likely count differently.
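To make the counting difference concrete, here is a rough sketch, not how either the HP1 figure or the Milton and Meara figure was actually produced. The sample uses the present-tense forms of comer, and LEMMAS is a hypothetical hand-made lookup standing in for whatever headword grouping a real count would use.

```python
# Toy illustration: the same tokens counted two ways.
# LEMMAS is a hypothetical hand-made mapping, not a real lemmatizer.
LEMMAS = {
    "como": "comer",
    "comes": "comer",
    "come": "comer",
    "comemos": "comer",
    "coméis": "comer",
    "comen": "comer",
}

def count_surface_forms(tokens):
    """Count distinct spellings, the way a naive 'unique words' count does."""
    return len(set(tokens))

def count_headwords(tokens):
    """Count distinct headwords; unknown tokens fall back to themselves."""
    return len({LEMMAS.get(token, token) for token in tokens})

tokens = "como comes come comemos coméis comen".split()
surface = count_surface_forms(tokens)
headwords = count_headwords(tokens)
print(f"surface forms: {surface}, headwords: {headwords}")
```

The same six tokens give very different totals depending on which unit is counted, which is why the two figures can't be compared directly.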
Bex wrote:
I would like to know 90% which I believe is 5,118.3 words (90% of 5,687)
1. 90% wouldn't be 90% x 5,687 because some words appear more often than others. Easy words like "que" appear many times. Knowing the 5,118 most common words would most likely produce a known-words figure higher than 90%.
2. HTLAL thread "Experimenting with French word frequency" by emk, Message 48 of 55.
The pic is missing but it says 90% coverage = 4117 words. The definition of "word" can likely be found earlier in that thread.
3. I don't have data for HP1 at 90%. I started reading later. Some of my data here: Spanish Group, page 23.
You will know far more English-Spanish cognates than I do.
4. For Greek, which has far fewer cognates with English, after studying 4305 flashcards, I knew 88.9% of the words of my first translated crime fiction for adults. My flashcards had headwords (comer rather than comeríamos). The words were roughly 50% from courses and shared decks, 25% from LingQ, 25% extracted from reading non-fiction. I know more than what's in my flashcards.
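Point 1 above can be checked on any text you have in electronic form. A minimal sketch, assuming a made-up toy sentence rather than real HP1 data: coverage is measured over running words, so knowing the most frequent half of the unique words covers far more than half of the text.

```python
from collections import Counter

def coverage(tokens, known):
    """Share of running words (tokens) that belong to the known set."""
    return sum(1 for token in tokens if token in known) / len(tokens)

def top_words(tokens, n):
    """The n most frequent distinct words in the token list."""
    return {word for word, _ in Counter(tokens).most_common(n)}

# Hypothetical toy text: 14 running words, 6 unique words.
tokens = "que que que que que que el el el gato gato come pescado hoy".split()

known = top_words(tokens, 3)                # que, el, gato
type_share = len(known) / len(set(tokens))  # share of unique words known
token_coverage = coverage(tokens, known)    # share of running words known

print(f"unique words known: {type_share:.0%}")
print(f"text covered:       {token_coverage:.0%}")
```

Here half of the unique words cover 11 of the 14 running words. Real texts are skewed in the same direction, though the exact numbers depend on the book and on how "word" is defined.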
Bex wrote:
Clozemaster...
I would like to get through the 3000 top words and then I hope my reading comprehension will be better.
I am not sure why but I find it so much more enjoyable than actual reading. I am viewing it like a graded reader...
Please bear in mind that, the way Clozemaster is designed, when you do the 3000 top words you are only ever exposed to those 3000 words, never the 3001st or the 5999th.