Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Interesting that they reduced the memory usage by half. This would address what is IMO the biggest problem with local LLMs: the limited number of parameters resulting in answers that are not very good.

Also it's funny that they are saying that Llama 4 Maverick performs about the same as GPT-4.1 Nano.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: