Most fine tunes will have much larger datasets (I am under the impression you want 10’s of thousands of examples for most runs).
So I’m similarly impressed 20 examples would make such a big difference.
But also note entity density decreases as example count increases. This is counterintuitive — maybe something else is going on here?