Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> That's much easier to intuit and use to guide experiments than actually build into the bandit algo.

IDK about your intuition, but for most other people, it gets in the way of statistics.

The "loss function" is just as easy to calculate for A/B tests as for multi-armed bandit. The value of user doing A is $X, the value of B is $Y, and the value of C is $Z.



You gave me some reading to do. :) Thank you.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: