Greetings from a world where… we get high on SuperCLUE, not superglue …As always, the searchable archive of all past issues is here. Please please subscribe here to support ChinAI under a Guardian/Wikipedia-style tipping model (everyone gets the same content but those who can pay support access for all AND compensation for awesome ChinAI contributors).
Human performance in SuperCLUE (96.5%!!!) was unreasonably high compared to similar benchmarks - SuperGLUE has an estimated human performance of 88% in their paper. Even Winograd Schemas only have human performance of ~92%. The Github page for SuperCLUE notes that this was because it was based on 3 college/grad students with access to the internet. Still, they must have been very talented and motivated individuals, because every single one of them got a score of 100% on the Classical Chinese section, a notoriously difficult subject. Do they even teach that in Chinese Colleges to non-majors?
Human performance in SuperCLUE (96.5%!!!) was unreasonably high compared to similar benchmarks - SuperGLUE has an estimated human performance of 88% in their paper. Even Winograd Schemas only have human performance of ~92%. The Github page for SuperCLUE notes that this was because it was based on 3 college/grad students with access to the internet. Still, they must have been very talented and motivated individuals, because every single one of them got a score of 100% on the Classical Chinese section, a notoriously difficult subject. Do they even teach that in Chinese Colleges to non-majors?