HSK5 is a level of Mandarin Chinese that is supposed to contain about 2500 words, and HSK6 about 5000 words.
It’s obvious that in Clozemaster, the HSK5 and HSK6 collections are far from containing every words of these levels, because the number of sentences within these collections are relatively low. (I think someone had stated in an other thread that for one of the two it was close to 50% of words contained…)
I was curious as to why this is the case, when “HSK?” category has such a huge number of words. I find it hard to believe that none of the missing HSK5 and HSK6 words appear in “HSK?”.
My hypothesis was that the condition of having >75% of words appearing in the sentence from that HSK level or below is too restrictive; and thus some words are missing. Of course sentences respecting that condition are ideal and very pleasant to learn; but I would rather have a sentence for each words, even if they don’t respect this condition, instead of half the words of the category missing.