thebeas's comments

thebeas · 2026-03-13T20:22:59 1773433379

We do both:

We compress tool outputs at each step, so the cache isn't broken during the run. Once we hit 85% of the context window, we preemptively trigger a summarization step and load that summary when the context window fills up.
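
For anyone trying to picture the two mechanisms together, here is a rough sketch, not our actual implementation. All names (`Context`, `compress`, `summarize`, `add_tool_output`) are made up for illustration, token counting is a crude word count, and `summarize` is a placeholder where a model call would go. Only the 85% threshold comes from the comment above.

```python
from dataclasses import dataclass, field
from typing import List, Optional

SUMMARY_THRESHOLD = 0.85  # from the comment: start summarizing at 85% of the window


def count_tokens(text: str) -> int:
    # Assumption: whitespace word count as a stand-in for a real tokenizer.
    return len(text.split())


def compress(output: str, max_tokens: int = 50) -> str:
    # Hypothetical compression: keep the first max_tokens words only.
    words = output.split()
    if len(words) <= max_tokens:
        return output
    return " ".join(words[:max_tokens]) + " [truncated]"


def summarize(messages: List[str]) -> str:
    # Placeholder: a real system would call a model here.
    return f"[summary of {len(messages)} messages]"


@dataclass
class Context:
    window_tokens: int
    messages: List[str] = field(default_factory=list)
    pending_summary: Optional[str] = None
    summarized_count: int = 0  # how many messages the pending summary covers

    def used(self) -> int:
        return sum(count_tokens(m) for m in self.messages)


def add_tool_output(ctx: Context, output: str) -> None:
    # Append-only: earlier messages are never rewritten, so a prefix
    # cache over them stays valid during the run.
    ctx.messages.append(compress(output))

    # Prepare the summary early, once usage crosses the threshold.
    if ctx.pending_summary is None and ctx.used() >= SUMMARY_THRESHOLD * ctx.window_tokens:
        ctx.pending_summary = summarize(ctx.messages)
        ctx.summarized_count = len(ctx.messages)

    # Swap the summary in only when the window is actually full,
    # keeping any messages that arrived after the summary was made.
    if ctx.pending_summary is not None and ctx.used() >= ctx.window_tokens:
        tail = ctx.messages[ctx.summarized_count:]
        ctx.messages = [ctx.pending_summary] + tail
        ctx.pending_summary = None
        ctx.summarized_count = 0
```

The point of the split is that the summary is ready before it is needed, so the swap at the full-window mark doesn't have to wait on a summarization call.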

thebeas · 2026-03-13T20:18:37 1773433117

That's why we give the model the chance to call expand() in case it needs more context. We know it's counterintuitive, so we will add the benchmarks to the repo soon.

From what we've observed, performance depends on the task and the model itself, and the difference is most visible on long-running tasks.
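
A minimal sketch of how an expand()-style tool could work, under my own assumptions: the `OutputStore` class, the reference format, and the hint text are all hypothetical, and the real expand() signature may differ. The idea is that the compressed output carries a short reference, and the full text is kept on the side so the model can ask for it.

```python
import hashlib


class OutputStore:
    """Keeps full tool outputs so a truncated preview can be expanded later."""

    def __init__(self, preview_chars: int = 80):
        self.preview_chars = preview_chars
        self._full = {}  # ref -> original, uncompressed output

    def compress(self, output: str) -> str:
        # Short outputs pass through unchanged.
        if len(output) <= self.preview_chars:
            return output
        # Assumption: a short content hash serves as the reference.
        ref = hashlib.sha256(output.encode("utf-8")).hexdigest()[:8]
        self._full[ref] = output
        preview = output[: self.preview_chars]
        return f"{preview}... [truncated, call expand('{ref}') for the full output]"

    def expand(self, ref: str) -> str:
        # Exposed to the model as a tool; returns the original text.
        return self._full.get(ref, f"unknown ref: {ref}")
```

In this version the truncation marker in the preview is what signals to the model that more is available.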

fcarraldo · 2026-03-13T20:33:30 1773434010

How does the model know it needs more context?

kingo55 · 2026-03-13T21:20:57 1773436857

Presumably in much the same way it knows it needs to use tool calls to reach its objective.