
KaiLetov · 2026-04-12T13:34:04 1776000844

It probably wouldn't hurt to set up Microsoft Clarity - it's a free solution that lets you see how people actually use your product and identify bottlenecks and pain points. It also has a “copilot ai” feature that can offer some suggestions (though it's just a copilot, so don't expect too much). But I’m sure that as soon as you see exactly what users are doing, you’ll be able to spot the problem areas.

oyaa52 · 2026-04-13T09:40:15 1776073215

Thanks for the recommendation! Never heard of this one but looks good

KaiLetov · 2026-04-12T13:29:17 1776000557

I think we should start with an idea—do you have any ideas for practical applications?

KaiLetov · 2026-04-11T05:36:06 1775885766

So you end up with two parallel permission systems that contradict each other, and the Settings UI only controls one of them. It's not a bug, it's architectural debt that they've decided is cheaper to leave than to fix.

KaiLetov · 2026-04-11T05:34:44 1775885684

The policy makes sense as a liability shield, but it doesn't address the actual problem, which is review bandwidth. A human signs off on AI-generated code they don't fully understand, the patch looks fine, it gets merged. Six months later someone finds a subtle bug in an edge case no reviewer would've caught because the code was "too clean."

ugh123 · 2026-04-11T06:21:21 1775888481

> they don't fully understand, the patch looks fine

I don't get this part. Why is the reviewer signing off on it? AI code should be fully documented (probably more so than a human could) and require new tests. Code review gates should not change.

altmanaltman · 2026-04-11T08:09:50 1775894990

I mean, the same can happen with human-written code, no? Reviewer signs off on it and there's a subtle bug in an edge case no one saw?

Or do you mean the velocity of commits will be so high that reviewers will start making more mistakes?

KaiLetov · 2026-04-11T05:33:45 1775885625

The extensions marketplace is designed like a trust-based system where trust has a known expiration date. We keep acting surprised when it expires.

KaiLetov · 2026-04-11T05:25:21 1775885121

But "teammate" is a stretch. The failure mode is different from a human -- a person will tell you "I don't know how to do this," an agent will confidently do it wrong and you won't notice until something breaks in production. The supervision cost doesn't go away, it just changes shape.

KaiLetov · 2026-04-11T05:24:04 1775885044

The fact that OpenAI's pipeline had no minimumReleaseAge configured is surprising though. That's basically saying "run whatever npm published 5 minutes ago in a context that has access to my signing keys." For a company that size, with that attack surface, feels like this should've been caught in a security review.
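To make the idea concrete, here is a rough sketch of what a minimum release age gate does in principle. It is not the config format of any particular package manager or bot; the function name, version numbers, timestamps, and the 3-day threshold are all made up for illustration.

```python
from datetime import datetime, timedelta, timezone


def eligible_versions(published, min_age, now):
    """Return versions published at least `min_age` before `now`.

    `published` maps a version string to its publish time.
    All names here are hypothetical, not a real tool's API.
    """
    return [version for version, ts in published.items() if now - ts >= min_age]


# Fixed "now" so the example is deterministic.
now = datetime(2026, 4, 11, 5, 0, tzinfo=timezone.utc)

# Hypothetical registry data: one old release, one published 5 minutes ago.
published = {
    "1.4.0": now - timedelta(days=30),
    "1.4.1": now - timedelta(minutes=5),
}

# With a 3-day cooldown (an arbitrary choice), the fresh release is skipped.
allowed = eligible_versions(published, timedelta(days=3), now)
print(allowed)
```

The cooldown doesn't prove a release is safe. It only buys time for a compromised version to be noticed and pulled before a pipeline with access to secrets installs it.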

KaiLetov · 2026-04-07T04:46:24 1775537184

I've been on Max20 for quite a while now, and I remember the transition well. Now even Max20 isn't enough, and I'm thinking about buying a second account. I can't say the problem is with Anthropic, because I really am using the service more and more. On the Pro subscription, I couldn't afford to run two agents in separate terminals that restart each other for hours on end, or to run research with 10–15 agents simultaneously. But working this way boosts efficiency several times over, so yes, a second account is the way to go for me.

KaiLetov · 2026-04-06T14:26:49 1775485609

Borrow checker in a functional concatenative language is a wild combination. I write Rust for real-time audio and Elixir for the orchestration layer in the same project, so I deal with both worlds daily. In Rust the borrow checker saves you from data races but fights you on anything concurrent. In Elixir you just don't have shared mutable state at all, problem solved differently. Curious where Slap lands -- does it feel more like Rust's "prove to the compiler you're safe" or more like "the language just doesn't let you do the unsafe thing"?

surprisetalk · 2026-04-06T14:52:42 1775487162

Right now it feels a lot more like Rust, but I'm hoping to make it feel more like that smooth Elixir experience via opinionated APIs.

If we build everything right, only library maintainers should really ever feel the borrow checker.

For example, I've been experimenting with a new primitive that creates a sort of Agent/GoFunc thing:

  'count-channel
    ('sum let 'msg let sum 1 plus) server
    def
  0 count-channel
    1 send
    1 send
    1 send
    recv 3 eq
    free

But I'm really not sure where this whole thing is headed yet :)

KaiLetov · 2026-04-06T14:24:19 1775485459

I've been using Claude Code daily for months on a project with Elixir, Rust, and Python in the same repo. It handles multi-language stuff surprisingly well most of the time. The worst failure mode for me is when it does a replace_all on a string that also appears inside a constant definition -- ended up with GROQ_URL = GROQ_URL instead of the actual URL. Took a second round of review agents to catch it. So yeah, you absolutely can't trust it to self-verify.
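Roughly what happened, as a minimal sketch. The URL is a placeholder (not the real one), and the guarded helper is just one possible way to avoid the problem, not what Claude Code does internally.

```python
import re

# Placeholder source file; the URL is invented for illustration.
source = '''GROQ_URL = "https://api.example.invalid/v1"

def fetch(payload):
    return post("https://api.example.invalid/v1", payload)
'''

literal = '"https://api.example.invalid/v1"'

# Naive replace-all: swaps every occurrence of the literal for the constant
# name, including the one on the right-hand side of the definition itself.
naive = source.replace(literal, "GROQ_URL")


def replace_outside_definition(text, literal, name):
    """Replace `literal` with `name`, skipping lines that assign to `name`."""
    definition = re.compile(rf"^\s*{re.escape(name)}\s*=")
    out = []
    for line in text.splitlines(keepends=True):
        if definition.match(line):
            out.append(line)  # keep the constant's own definition intact
        else:
            out.append(line.replace(literal, name))
    return "".join(out)


guarded = replace_outside_definition(source, literal, "GROQ_URL")

print(naive.splitlines()[0])
print(guarded.splitlines()[0])
```

The naive version produces a self-referential assignment that still looks plausible in a diff, which is why it slipped past the first review.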

StanAngeloff · 2026-04-06T14:31:31 1775485891

You say you've used it for months, I wonder if the example you gave was recent and if you've been noticing an overall degradation in quality or it's been constantly bad for you?