Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Did AI write the post?

First section says "The models that passed the car wash test: ...Gemini 2.0 Flash Lite..."

A section or 2 down it says: "Single-Run Results by Model Family: Gemini 3 models nailed it, all 2.x failed"

In the section below that about 10 runs it says: 10/10 — The Only Reliable AI Models ... Gemini 2.0 Flash Lite ..."

So which it is? Gemini 2.x failed (2nd section) or it succeeded (1st and 3rd) section. Or am I mis-understanding

 help



Flash lite succeeded in every test, smth got lost in editing, just updated it. thx!



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: