r/StableDiffusion – Telegram
Can Hunyuan Video 1.5 actually do more than 5 seconds unlike WAN?

I heard this was the case? does it work? Does it require more vram? i would love some insight as I read a comment about it but i'm a little unsure whether I read correctly. Thanks!

https://redd.it/1p3babt
@rStableDiffusion
What happened to the Tencent's HunyuanImage-3.0 model? seems like Nano banana pro.

HunyuanImage-3.0 by Tencent is a great model, but it needs a lot of VRAM as it is a 13B model, I am sure a lot of you guys have tested it, and in the noscript, I said it seems like nano banana pro because it also does reasoning while making an image as it not only understands prompts but also engages in an intermediate "thinking" phase where it elaborates, conceptualizes, and rewrites user prompts to produce highly context-aware and visually detailed images.


that is why we need a refined version of this model, and I guess later we will get to see its instruct model, which will really blow up the open source community, as it is powerful. However, because this model is so resource-hungry, it is still not known to a lot of people.

Please share your feedback regarding this model.

https://redd.it/1p384ir
@rStableDiffusion
Doubt with comfy

Hello! In the comfyui folder I have two files, run_nvidia_gpu and run_nvidia_gpu_fast_fp16_accumulation, could you tell me what the difference is between these two files or if they are used for different things, currently I only use the second one because it says "fast"... Thank you

https://redd.it/1p3t786
@rStableDiffusion