Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
AstroBen
20 hours ago
|
parent
|
context
|
favorite
| on:
Can I run AI locally?
This doesn't look accurate to me. I have an RX9070 and I've been messing around with Qwen 3.5 35B-A3B. According to this site I can't even run it, yet I'm getting 32tok/s ^.-
help
mongrelion
15 hours ago
|
next
[–]
Which quantization are you running and what context size? 32tok/s for that model on that card sounds pretty good to me!
reply
misnome
19 hours ago
|
prev
[–]
It seems to be missing a whole load of the quantized Qwen models, Qwen3.5:122b works fine in the 96GB GH200 (a machine that is also missing here....)
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: