r/StableDiffusion – Telegram
Is an RTX 5090 necessary for the newest and most advanced AI video models? Is it normal for RTX GPUs to be so expensive in Europe? If video models continue to advance, will more GB of VRAM be needed? What will happen if GPU prices continue to rise? Is AMD behind NVIDIA?

https://redd.it/1oufag3
@rStableDiffusion
ComfyUi on new AMD GPU - today and future

Hi, I want to get more invested in AI generation and also lora training. I have some experience with comfy from work, but would like to dig deeper at home.
Since NVidia GPUs with 24GB are above my budget, I am curious about the AMD Radeon AI PRO R9700.
I know that AMD was said to be no good for comfyui. Has this changed? I read about PyTorch support and things like ROCm etc, but to be honest I don't know how that affects workflows in practical means. Does this mean that I will be able to do everything that I would be able to do with NVidia? I have no background in engineering whatsoever, so I would have a hard time finding workarounds and stuff. But is this even the case with the new GPUs from AMD?

Would be greatful for any help!

https://redd.it/1ouhneo
@rStableDiffusion
Sharing the winners of the first Arca Gidan Prize. All made with open models + most shared the workflows and LoRAs they used. Amazing to see what a solo artist can do in a week (but we'll give more time for the next edition!)

Link here. Congrats to prize recipients and all who participated! I'll share details on the next one here + on our discord if you're interested.

https://redd.it/1oujqlj
@rStableDiffusion
Finally hand without six fingers.
https://redd.it/1ouz7df
@rStableDiffusion
InfinityStar - new model

https://huggingface.co/FoundationVision/InfinityStar

We introduce InfinityStar, a unified spacetime autoregressive framework for high-resolution image and dynamic video synthesis. Building on the recent success of autoregressive modeling in both vision and language, our purely discrete approach jointly captures spatial and temporal dependencies within a single architecture. This unified design naturally supports a variety of generation tasks such as text-to-image, text-to-video, image-to-video, and long-duration video synthesis via straightforward temporal autoregression. Through extensive experiments, InfinityStar scores 83.74 on VBench, outperforming all autoregressive models by large margins, even surpassing diffusion competitors like HunyuanVideo. Without extra optimizations, our model generates a 5s, 720p video approximately 10$\\times$ faster than leading diffusion-based methods. To our knowledge, InfinityStar is the first discrete autoregressive video generator capable of producing industrial-level 720p videos. We release all code and models to foster further research in efficient, high-quality video generation.

weights on HF

https://huggingface.co/FoundationVision/InfinityStar/tree/main

InfinityStarInteract\_24K\_iters

infinitystar\_8b\_480p\_weights

infinitystar\_8b\_720p\_weights

https://redd.it/1ov05oq
@rStableDiffusion