From my site TrackingAI.org, I've noticed it misunderstands a relatively high proportion of questions (compared to other AIs.) Obviously that's not a reason *not* to rank it, but it's why I deprioritized it here for the sake of time. But in the future, I'll make it more comprehensive.
Any particular reason you didn't test "Le Chat Mistral" (https://mistral.ai/)?
It solves #2 (2/2) and fails on #27 (0/2) (using the instructions you provided).
From my site TrackingAI.org, I've noticed it misunderstands a relatively high proportion of questions (compared to other AIs.) Obviously that's not a reason *not* to rank it, but it's why I deprioritized it here for the sake of time. But in the future, I'll make it more comprehensive.
Maybe it'd do better in French?
Happy to share the full verbalized set of questions with you, or anyone, if you'd like to replicate it.
That'd be great!