gvand on Nov 13, 2023 | on: Fast and Portable Llama2 Inference on the Heteroge...

The binary size is not really important in this case; llama.cpp should not be that far from this. What matters, as we all know, is how much GPU memory we need.