Within minutes of each other, I can read a bunch of comments about how an 8GB AI HAT for the RPi would be grand for LLMs, while another contingent points out that my 96GB M2 Max MacBook is completely useless for LLMs. All I can say is that at least the latter is also a great laptop.
For server use, the Pi is cheaper, better supported, and lower power. I really cannot recommend that anyone spend $2,500+ to run inference on a laptop that is slower than a $300 GPU.