Hacker News

Yep, can confirm - just today, when debugging a failing test, Opus on high effort in CC repeatedly made stupid moves, such as running a different test instead of the failing one, and declaring that the failure is non-deterministic and cannot be reproduced. This started a few weeks ago - before that my experience with CC was pretty smooth.

