The winning image entry for "The Yarrctic Circle" by OpenAI 4o doesn't actually ...

danpalmer · 2025-05-20T22:37:49 1747780669

In my own testing between the two this is what I’ve noticed. Imagen will follow the instructions, and 4o will often not, but produces aesthetically more pleasing images.

I don’t know which is more important, but I would say that people mostly won’t pay for fun but disposable images, and I think people will pay for art but there will be an increased emphasis on the human artist. However users might pay for reliable tools that can generate images for a purpose, things like educational illustrations, and those need to be able to follow the spec very well.

fragmede · 2025-05-21T00:19:05 1747786745

People pay for digital sticker packs so their memoji in iMessage are customized. How much money they make on sticker packs is unknown to me, but image generation platform Midjourney seems to be doing alright.

vunderba · 2025-05-21T03:29:04 1747798144

Midjourney got in REALLY early in the GenAI game despite only allowing image generation through Discord for at least a year. I heard that it was one of the largest Discord channels ever having something absurd like 20+ million members.

I'd love to see some financials but I'd tend to agree they're probably doing pretty well.

ilikehurdles · 2025-05-21T13:00:55 1747832455

o4-mini-high I’ve noticed is far better the 4o on prompt adherence in image generation in personal use.

echelon · 2025-05-20T21:36:43 1747777003

Google Flow is remarkable as video editing UX, but Imagen 4 doesn't really stand out amongst its image gen peers.

I want to interrupt all of this hype over Imagen 4 to talk about the totally slept on Tencent Hunyuan Image 2.0 that stealthily launched last Friday. It's absolutely remarkable and features:

- millisecond generation times

- real time image-to-image drawing capabilities

- visual instructivity (eg. you can circle regions, draw arrows, and write prompts addressing them.)

- incredible prompt adherence and quality

Nothing else on the market has these properties in quite this combination, so it's rather unique.

Release Tweet: https://x.com/TencentHunyuan/status/1923263203825549457

Tencent Hunyuan had a bunch of model releases all wrapped up in a product that they call "Hunyuan Game", but the Hunyuan Image 2.0 real time drawing canvas is the real star of it all. It's basically a faster, higher quality Krea: https://x.com/TencentHunyuan/status/1924713242150273424

More real time canvas samples: https://youtu.be/tVgT42iI31c?si=WEuvie-fIDaGk2J6&t=141 (I haven't found any other videos on the internet apart from these two.)

You can see how this is an incredible illustration tool. If they were to open source this, this would immediately become the top image generation model over Flux, Imagen 4, etc. At this point, really only gpt-image-1 stands apart as having godlike instructivity, but it's on the other end of the [real time <--> instructive] spectrum.

A total creative image tool kit might just be gpt-image-1 and Hunyuan Image 2.0. The other models are degenerate cases.

More image samples: https://x.com/Gdgtify/status/1923374102653317545

If anyone from Tencent or the Hunyuan team is reading this: PLEASE, PLEASE, PLEASE OPEN SOURCE THIS. (PLEASE!!)

dheera · 2025-05-20T22:35:57 1747780557

> but Imagen 4 doesn't really stand out amongst its image gen peers.

In this AI rat race, whenever one model gets ahead, they all tend to reach parity within 3-6 months. If you can wait 6 months to create your video I'm sure Imagen 5 will be more than good enough.

It's honestly kind of ridiculous the pace things are moving at these days. 10 years ago waiting a year for something was very normal, nowadays people are judging the model-of-the-week against last week's model-of-the-week but last week's org will probably not sleep and they'll release another one next week.

Narciss · 2025-05-20T22:12:53 1747779173

This is amazing, can’t see how I’ve missed it. Thank you!

echelon · 2025-05-21T12:58:28 1747832308

I've given this some more thought. Even if Imagen 4 isn't that great on its own, all of Google's models and UX products in conjunction (Veo 3, Flow, etc.) are orders of magnitude above the rest of the playing field.

If Tencent wants to keep Google from winning the game, they should open source their models. From my perspective right now, it looks like Google is going to win this entire game, and open source AI might be the only way to stop that from being a runaway victory.

vunderba · 2025-05-21T00:11:08 1747786268

Good catch - that's on me I accidentally uploaded the wrong image for gpt-image-1. Fixed!

NoahZuniga · 2025-05-20T21:16:29 1747775789

I can't find the image you're talking about. Link pls?