Hacker News | pferdone's comments

I can see that and I don't know your setup, but there are people pushing >70t/s with MTP on a single 3090, and still >50t/s with big contexts. 64k is not a lot for agentic coding, and IIRC 128k with turboquant and the like should be possible for you. r/LocalLLM/ and r/LocalLLaMA/ are worth a visit IMO.

EDIT: just found this recipe repo, may wanna give it a go: https://github.com/noonghunna/club-3090

EDIT-2: this can also shave off a lot of context need for tool calling -> https://github.com/rtk-ai/rtk


I managed to run it with vllm successfully, but it breaks opencode on a simple "what's this repo?" task. On oh-my-pi it won't even execute because omp sends multiple system prompts. I'll try with llama.cpp later and see if it works more reliably.

will give more info in the post

EDIT: thanks for the links!


pi.dev as well

They said in the last paragraph[0]:

"[...] In the coming days, we will also open-source smaller-scale variants, reaffirming our commitment to accessibility and community-driven innovation. [...]"

[0] https://qwen.ai/blog?id=qwen3.6#summary--future-work


> we will also open-source smaller-scale variants

In other words, like GP said, this Qwen3.6-Plus model is not open-weight unlike the other Qwen models.


In a practical sense, I'm primarily interested in small- to medium-sized models being open. I think that might be a common sentiment.

However, my hope is that there will be at least somewhat competitive big and open models as well, from an ethical/ideological perspective. These things were trained on data that was provided by people without their consent, so they should at least be publicly accessible or even public domain.


Qwen3.5-Plus is the largest variant of the open weight Qwen3.5 model, expanded with a 1M context window and fine-tuned on the Qwen-native harness’ specific tools.


> unlike almost all qwen models

Almost all means there have been ones before that were not open. So, no contradiction there.


> unlike the other Qwen models

Please send the download link for qwen 3.5-plus.

Also, who cares? If you have the hardware to run a ~400b model I don’t think you count as a home user anymore.


So the Qwen3.6-Plus model is like the Qwen3.5-Plus model?


It’s mainly due to system requirements that Flux.2-dev doesn’t get the same usage as Z-Image. A 5090 needs about a minute to generate an image with a basic workflow with Flux.2-dev. But prompt adherence and scene/character consistency in edit mode are (way) ahead of Qwen-Edit-2509 if you ask me.


I also have a HUD in my car and I can read it just fine, even in bright sunlight.


I feel tired reading about him and it doesn’t even faze me anymore. It’s just another thing I add to the pile. Maybe it’s part of the plan to make us go numb to everything he does.


It is part of the plan. Search “flood the zone.” They’re intentionally doing so much crap that you can’t keep track.


John Oliver has an episode on Trump 2.0 that shows video evidence of exactly this.


Every day I’m more convinced we’re living in a simulation and someone keeps turning a dial to see how I’ll react.


Me too. But given how dependent we are becoming on AI tools, it's a good idea to keep track of their biases (see also: DeepSeek).


Between the fanboys and those who are consumed by hate, the internet has been really annoying these last weeks.


Correction: fanboys and those "consumed" by patriotism, democracy, and hope, who are aware that democracy is being burned alive.


[flagged]


I still don’t understand what a “leftist” is, exactly. In my politics, I guess I’m conventionally more of a centrist. I’ve never registered as a Democrat. I tend to stay out of “woke” activism. But I’m appalled that a convicted felon and rapist who very visibly tried to steal the last election is now performing a hostile takeover of the federal government together with the richest businessman in the world. Does that make me a “leftist”? Or just, like, a totally fucking normal person?


>I’m appalled that a convicted felon and rapist who very visibly tried to steal the last election is now performing a hostile takeover of the federal government together with the richest businessman in the world. Does that make me a “leftist”?

Sadly, yes. Look at the comment upstream drawing an equivalence between "musk fanboys" and their opponents, whom it labels "people on the internet consumed by hate". That's US polarization at work.

A lot of apolitical people just want to sweep everything under the rug and ignore it. I realized at the beginning of the month that this isn't something to ignore, though.


It's comedic to hear that the left is the party of hate. Yes, up is down, down is up. War is peace. All that. Sure man, sure.

I somewhat agree that there's a totally out-of-touch, disconnected sort that is bothered by being part of the electorate and doesn't take its civic duty seriously: people who would rather not pay attention and who don't like the conflict. I doubt your biased, lopsided anecdata, and I doubt you know many centrists changing their vote. But as the grossly unpopular & despised Project 2025 that Trump disavowed steamrolls this nation, & as algorithmic, AI-run systems perhaps start being used by the state to mechanize programmed bias, well, those folks will be impacted deeply, and saddened, and it's unfortunate they were left slumbering & derelict in their civic duty to pay real attention, or to tune into something besides Fox News or Newsmax earlier.


I mean, they make a bold statement up top just to backpedal a little further down with: "[…] In terms of Chinese and Cantonese recognition, the SenseVoice-Small model has advantages."

It feels dishonest to me.

[0] https://github.com/FunAudioLLM/SenseVoice?tab=readme-ov-file...


only next frame


The Guardian, especially for their podcasts, is the only news website I am paying for and have ever paid for. And I pay more willingly than I would pay any other newspaper for its paywalled content. It’s that valuable to me to support this approach.


Where have you heard that? Because TypeScript is the de facto standard right now. Projects like Bun and Deno cement that status further.

