I received 150+ update notifications for user reports in the last 30 days, and I found fundamental technical issues in approx. 20% of those updates that may also affect other language courses. Over the past six months, updates have been coming faster, but their quality has been declining. I wish the user report handlers would pause updates for a while and spend more time reviewing their process.
Summary of Issues:
- False notification – The email notification says that the English translation was updated, and the general search returns the updated translation. However, none of the collections is actually updated.
- Mismatch with Tatoeba – In some cases, Tatoeba had already updated the sentence in the target language (TL) to reconcile a mismatch between the TL and the English. However, the Clozemaster admin ignored that fix and updated the English translation instead. As a result, the sentence pair is unsearchable on Tatoeba.
- No update info in the email – Some email notifications don’t say which part Clozemaster updated, although most normal ones show both the “pre-” and “post-” update versions. Without that comparison, I cannot see which part(s) changed: the TL or the translation.
- Ignoring word frequency – Some updates to TL sentences ignore the word frequency list and switch the {{cloze-words}} to super easy ones.
Here are some screenshots for your reference.
Issue #1: False notification
- ID: Dia menjadi kegirangan setelah mendapatkan SIM.
- EN (old): He is bent on getting the driving license.
- EN (new): He became overjoyed after getting his driver’s license.
- The pair currently belongs to 50K Most Common, Random Collection (RC), and Fast Fluency Track (FFT)
Image 1: Email notification for the update on the translation
Image 2: Looked it up in the general sentence search
Image 3: After tapping each of the three collections
Image 4: Another similar glitch
- ID: Tambahkan gula dan vanili pada krim kemudian kocok sampai krimnya mengental.
- EN (old): Add the sugar and vanilla to the cream and beat vigorously until the cream thickens.
- EN (new): Add the sugar and vanilla to the cream then beat until the cream thickens.
- The pair belongs to 50K and FFT.
Issue #2: Mismatch with Tatoeba
Image 1: “going through Boston”
- The original author on Tatoeba fixed the error in the target language (TL) on April 4, 2022.
- I shared the update via the Clozemaster sentence discussion and filed the error report on June 20, 2022.
- Clozemaster ignored it and changed the English translation instead on September 25, 2022.
- ID (Tatoeba version): Tom dengan mobilnya akan pergi melewati Boston dalam perjalanan mengunjungi Mary.
- ID (Clozemaster version): Tom akan melewati Boston dalam perjalanan mengunjungi Mary.
- EN (Tatoeba version): Tom will be driving through Boston on his way to visit Mary.
- EN (Clozemaster version): Tom was going through Boston on his way to visit Mary.
Image 2: “looks and money”
- The original author on Tatoeba fixed the error in TL on April 4, 2022.
- I shared the update with Clozemaster via the error report.
- Clozemaster ignored it and changed the English translation instead on September 25, 2022.
- ID (Tatoeba version): Dunia itu hanyalah tentang paras dan materi.
- ID (Clozemaster version): Dunia itu hanyalah tentang uang dan kehormatan.
- EN (Tatoeba version): Only looks and money count in this world.
- EN (Clozemaster version): The world is all about money and honor.
Issue #3: No update info in the email
Image 1
Image 2
Issue #4: Ignoring word frequency
Image 1: The cloze-word for 50K Most Common Collection is now “mother” (ibu)
==> See the sentence discussion.
Image 2: The cloze-word for 50K Most Common Collection is now “chair” (kursi)
==> See the sentence discussion.
Image 3: The cloze-word for 50K Most Common Collection is now “am/are/is” (adalah)
==> See the sentence discussion.
Re: #2 (mismatch with Tatoeba), this is particularly inconvenient for mobile app users. The iOS app, at least, doesn’t provide a hyperlink to the source (Tatoeba) in the sentence search, so mobile users have no choice but to look the sentence up directly on Tatoeba. As far as I know, however, Tatoeba doesn’t let us look up old versions of sentences. So the new sentence pair is not searchable.
This is also technically a copyright infringement. All sentences on Tatoeba are published under the CC BY 2.0 license, which requires external users to display a hyperlink to fulfill the attribution right of the original author (BY). I raised this issue five months ago, and @mike said that it was a “work in progress”. Five months is probably longer than a standard “grace period”.
Re: #4 (word frequency), the more sentences CM updates, the more disorganized the Most Common Word Collections become. The same issue was reported by @zeiphon (Tagalog learner) two months ago, and @mike promised that the team would work on a fix. However, I guess the issue was not shared with the user report handlers, and they keep spreading the same problem to other languages.
Furthermore, such random changes in cloze-word selection break users’ own edits. For example, some PRO users added hints for the original cloze-words, and those hints no longer make sense with the new cloze-words.
These are just the tip of the iceberg.