More

pferdone · 2026-05-07T11:31:24 1778153484

I can see that and I don't know your setup, but there are people pushing >70t/s with MTP on a single 3090, with big contexts still >50t/s. 64k is not a lot for agentic coding, and IIRC 128k with turboquant and the likes should be possible for you. r/LocalLLM/ and r/LocalLLaMA/ are worth a visit IMO.

EDIT: just found this recipe repo, may wanna give it a go: https://github.com/noonghunna/club-3090

EDIT-2: this can also shave off a lot of context need for tool calling -> https://github.com/rtk-ai/rtk

gchamonlive · 2026-05-07T13:50:59 1778161859

I managed to execute with vllm successfully, but it breaks opencode on simple "what's this repo?" task. On oh-my-pi it wont event execute because omp sends multiple system prompts. I'll try with llama.cpp later and see if it works more reliably.

gchamonlive · 2026-05-07T12:04:33 1778155473

will give more info in the post

EDIT: thanks for the links!

pferdone · 2026-04-24T16:44:57 1777049097

pi.dev as well

pferdone · 2026-04-02T14:53:34 1775141614

They said in the last paragraph[0]:

"[...] In the coming days, we will also open-source smaller-scale variants, reaffirming our commitment to accessibility and community-driven innovation. [...]"

[0] https://qwen.ai/blog?id=qwen3.6#summary--future-work

deaux · 2026-04-02T15:00:37 1775142037

> we will also open-source smaller-scale variants

In other words, like GP said, this Qwen3.6-Plus model is not open-weight unlike the other Qwen models.

dgb23 · 2026-04-02T15:18:44 1775143124

In a practical sense, I'm primarily interested in small to medium sized models being open. I think that might be common sentiment.

However, my hope is that there will be at least somewhat competitive big and open models as well, from an ethical/ideological perspective. These things were trained on data that was provided by people without their consent, so they should at least be be publicly accessible or even public domain.

thepasch · 2026-04-02T15:25:19 1775143519

Qwen3.5-Plus is the largest variant of the open weight Qwen3.5 model, expanded with a 1M context window and fine-tuned on the Qwen-native harness’ specific tools.

pferdone · 2026-04-02T15:04:27 1775142267

> unlike almost all qwen models

Almost all means there have been ones before that were not open. So, no contradiction there.

kennywinker · 2026-04-02T15:10:59 1775142659

> unlike the other Qwen models

Please send the download link for qwen 3.5-plus.

Also, who cares? If you have the hardware to run a ~400b model i don’t think you count as a home user anymore.

sroussey · 2026-04-03T07:11:00 1775200260

So the Qwen3.6-Plus model is like the Qwen3.5-Plus model?

pferdone · 2025-12-07T11:57:08 1765108628

It‘s mainly due to system requirements that Flux.2-dev doesn’t get same usage as Z-Image. A 5090 needs about a minute to generate an image with a basic workflow with Flux.2-dev. But prompt adherence and scene/character consistency in edit mode is (way) ahead of Qwen-Edit-2509 if you ask me.

pferdone · 2025-11-08T21:11:27 1762636287

I also have a HUD in my car and I can read it just fine, even in bright sunlight.

pferdone · on Feb 23, 2025

I feel tired reading about him and it doesn’t even phase me anymore. It‘s just another thing I add on to the pile. Maybe it’s part of the plan to go numb to everything he does.

wat10000 · on Feb 23, 2025

It is part of the plan. Search “flood the zone.” They’re intentionally doing so much crap that you can’t keep track.

beretguy · on Feb 24, 2025

John Oliver has episode trump 2.0 that shows video evidence of exactly this.

pupppet · on Feb 23, 2025

Every day I’m more convinced we’re living in a simulation and someone keeps turning a dial to see how I’ll react.

stickfigure · on Feb 23, 2025

Me too. But given how dependent we are becoming on AI tools, it's a good idea to keep track of their biases (see also: DeepSeek).

jisnsm · on Feb 23, 2025

Between the fanboys and those who are consumed by hate the internet has been really annoying these last weeks.

jauntywundrkind · on Feb 23, 2025

Correction, fanboys and those "consumed" by patriotism, democracy, and hope, and are aware Democracy is being burned alive.

lynx97 · on Feb 23, 2025

[flagged]

archagon · on Feb 23, 2025

I still don’t understand what a “leftist” is, exactly. In my politics, I guess I’m conventionally more of a centrist. I’ve never registered as a Democrat. I tend to stay out of “woke” activism. But I’m appalled that a convicted felon and rapist who very visibly tried to steal the last election is now performing a hostile takeover of the federal government together with the richest businessman in the world. Does that make me a “leftist”? Or just, like, a totally fucking normal person?

johnnyanmac · on Feb 24, 2025

>I’m appalled that a convicted felon and rapist who very visibly tried to steal the last election is now performing a hostile takeover of the federal government together with the richest businessman in the world. Does that make me a “leftist”?

sadly, yes. Look at the comment upstream trying to equivalate "musk fanboys" and labeling opponents as "people on the internet consumed by hate". That's the US polarization at work.

lot of apolitical people just want to sweep everything under a rug and ignore it. I realized at the beginning of the month that this isn't something to ignore, though.

jauntywundrkind · on Feb 23, 2025

Its comedic to hear that the left is the party of hate. Yes, up is down, down is up. War is peace. All that. Sure man, sure.

I somewhat agree that there's a totally out of touch disconnected sort that is bothered by being part of the body electorate, that doesn't take seriously a civic duty: would rather not pay attention, who doesn't like the conflict. I doubt your biased lopsided anecdata, doubt know many centrists changing their vote. But as the grossly unpopular & despised Project 2025 that Trump disavowed steamrolls this nation & as algorithmic AI run systems perhaps start being used by the state to mechanize programmed bias, well, those folks will be impacted deeply, and saddened, and it's unfortunate they were left slumbering & derelict from their civic duty to pay real attention, or to tune into something besides Fox News or Newsmax earlier.

pferdone · on Oct 13, 2024

I mean they make a bold statement up top just to paddle back a little bit further down with: "[…] In terms of Chinese and Cantonese recognition, the SenseVoice-Small model has advantages."

It feels dishonest to me.

[0] https://github.com/FunAudioLLM/SenseVoice?tab=readme-ov-file...

pferdone · on Aug 17, 2024

only next frame

pferdone · on Nov 7, 2023

The Guardian, especially for their podcasts, is the only news website I am paying and have ever payed for. And I pay more willingly than any other newspaper would get from me for their paywall stuff. It‘s that valuable for me to support this approach.

pferdone · on Nov 4, 2023

Where have you heard that? Because Typescript is the defacto standard right now. Projects like bun and deno cement that status further.