Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Opus 4.6 with thinking. Result was near-instant:

“Drive. You need the car at the car wash.”



Changed 50 meters to 43 meters with Opus 4.6:

“Walk. 43 meters is basically crossing a parking lot. ”


lol, are AI companies patching this answer in real time. I thought it took months long effort for a training run. How would they make changes in such a short period?


The companies aren’t changing anything. LLM outputs are just more random than people realize. Run the same prompt 10 times if you really want to know how well they can answer.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: