pull down to refresh

Solve, don't ask me any questions
I told you
This "adverserial" style of evaluating an LLM is interesting, but not the best evaluation. The best evaluation for how good an LLM is is if you give your best prompt instead of antagonizing the chatbot. That's the best way how we find out what its maximum capabilities are.
Because it asks tons of questions otherwise. My best prompt had always been to just share the screenshot, and that worked great in the past. Now it just antagonizes me with its stupidity, laziness and lies.
reply