A couple drawbacks so far via our scenario-based tests:
1. You can't ask the model to "think hard" about something anymore - model decides
2. Reasoning traces are no longer true to the thinking – vs opus 4.6, they really are summaries now
3. Reasoning is no longer consciously visible to the agent
They claim the personality is less warm, but I haven't experienced that yet with the prompts we have – seems just as warm, just disconnected from its own thought processes. Would be great for our application if they could improve on the above!
1. You can't ask the model to "think hard" about something anymore - model decides 2. Reasoning traces are no longer true to the thinking – vs opus 4.6, they really are summaries now 3. Reasoning is no longer consciously visible to the agent
They claim the personality is less warm, but I haven't experienced that yet with the prompts we have – seems just as warm, just disconnected from its own thought processes. Would be great for our application if they could improve on the above!