Discussion about this post

User's avatar
Emerald Fleur's avatar

I’m daily driving Gemini right now, because it’s so cheap with Google One. I really miss ChatGPT, because even 4o beat the () out of Gemini Flash for me, (regardless of whatever MMR it has) but with Flash Thinking Google has finally made a model that feels smart enough for me to not say, “Oh, to () with this, I’m going back to ChatGPT.”

I still can’t figure out why ChatGPT 4o felt like it never got anything wrong and Gemini Flash gets it so blaringly obviously wrong so much. Maybe OpenAI’s userbase gives them that much of an edge from people thumbing up and down responses that even a higher MMR doesn’t make for a better daily driver experience.

Expand full comment
Daniel Reeves's avatar

Ha, my prediction about Grok 3 immediately came true, with Claude Sonnet 3.7 out today.

Expand full comment
1 more comment...

No posts