This doesn't look accurate to me. I have an RX9070 and I've been messing around ... | Hacker News

AstroBen 20 hours ago | on: Can I run AI locally?

This doesn't look accurate to me. I have an RX9070 and I've been messing around with Qwen 3.5 35B-A3B. According to this site I can't even run it, yet I'm getting 32 tok/s ^.-

mongrelion 15 hours ago

Which quantization are you running, and at what context size? 32 tok/s for that model on that card sounds pretty good to me!

misnome 19 hours ago

It seems to be missing a whole load of the quantized Qwen models. Qwen3.5:122b works fine on the 96GB GH200 (a machine that is also missing here...).
