Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
gvand
on Nov 13, 2023
|
parent
|
context
|
favorite
| on:
Fast and Portable Llama2 Inference on the Heteroge...
The binary size is not really important in this case, llama.cpp should not be that far from this, what's matter as we all know is how much gpu memory we need.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: