NEW BOT Telegram, page

7 views13:40

Qwen Image LoRA Training Tutorial on RunPod using Diffusion Pipe
https://www.youtube.com/watch?v=hXnFChMvLwg

https://redd.it/1olnwl4
@rStableDiffusion

YouTube

This video takes you through captioning a dataset and training a Qwen Image LoRA on RunPod.

To deploy:
https://get.runpod.io/diffusion-pipe-template

► To Join The Hideout: https://www.hearmemanai.com

Join my Discord server for updates on new Workflows…

6 views14:40

r/StableDiffusion

Wan 2.2 multi-shot scene + character consistency test

The post "Wan 2.2 MULTI-SHOTS (no extras) Consistent Scene + Character" on r/comfyui caught my interest as a way to raise consistency between shots in a scene. The idea is not to create the whole scene in one go, but rather to create 81-frame videos that include multiple shots, to get material for the start/end frames of the actual shots. Because all 81 frames are sampled together, the model keeps consistency at a higher level within that window. It's not perfect, but it gets in the direction of believable.

Here is the test result, which started with one 1080p image generated in Wan 2.2 t2i.

Final result after rife47 frame interpolation + Wan2.2 v2v and SeedVR2 1080p passes.

Unlike the original post, I used Wan 2.2 Fun Control with 5 random Pexels videos showing different poses, cut down to fit into 81 frames.
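To illustrate the "cut down to fit into 81 frames" step: a minimal sketch of how an 81-frame budget could be split across 5 pose clips. The even split and the function name `split_frame_budget` are assumptions for illustration; the post does not say how many frames each clip actually got.

```python
def split_frame_budget(total_frames: int, num_clips: int) -> list[int]:
    """Split a frame budget as evenly as possible across clips (hypothetical helper)."""
    base, extra = divmod(total_frames, num_clips)
    # The first `extra` clips each get one additional frame.
    return [base + 1 if i < extra else base for i in range(num_clips)]


budget = split_frame_budget(81, 5)
print(budget)       # [17, 16, 16, 16, 16]
print(sum(budget))  # 81
```

At 16 fps that is roughly one second of pose video per shot.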

https://reddit.com/link/1oloosp/video/4o4dtwy3hnyf1/player

With the starting t2i image and the poses, Wan 2.2 Fun Control generated the following 81 frames at 720p.

Not sure if it was needed, but I added random shot descriptions to the prompt to describe a simple photo studio scene with a plain gray background.

Wan 2.2 Fun Control 87 frames

It was still a bit rough around the edges, so I did a Wan 2.2 v2v pass to bring it to 1536x864 resolution and sharpen things up.

https://reddit.com/link/1oloosp/video/kn4pnob0inyf1/player

And the top video is after rife47 frame interpolation from 16 to 32 fps and a SeedVR2 upscale to 1080p with batch size 89.
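For a rough sense of the numbers in the interpolation step, here is a small sketch of the frame-count and duration arithmetic for 81 frames going from 16 to 32 fps. It assumes the interpolator only inserts frames between existing neighbors; actual node implementations may differ by a frame or so, and `interpolated_frame_count` is a hypothetical helper, not part of any workflow.

```python
def interpolated_frame_count(frames: int, factor: int) -> int:
    # Assumption: (factor - 1) new frames are inserted between each
    # neighboring pair, and nothing is added after the last frame.
    return (frames - 1) * factor + 1


source_frames, source_fps, factor = 81, 16, 2
out_frames = interpolated_frame_count(source_frames, factor)
out_fps = source_fps * factor

print(out_frames, out_fps)          # 161 32
print(source_frames / source_fps)   # 5.0625 (seconds before interpolation)
print(out_frames / out_fps)         # 5.03125 (seconds after interpolation)
```

So the clip stays at about 5 seconds; only the smoothness changes.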

---------------

My takeaway is that this may help to get believable, somewhat consistent shot frames. More importantly, it can be used to generate material for a character LoRA, since from one high-res start image dozens of shots can be made to get all sorts of expressions and poses with high likeness.

The workflows used are just the default workflows with almost nothing changed other than the resolution and some random messing with sampler values.

https://redd.it/1oloosp
@rStableDiffusion

8 views15:40

r/StableDiffusion

Any way to get consistent face with flymy-ai/qwen-image-realism-lora

https://redd.it/1olpt5t
@rStableDiffusion

8 views16:40

Mario the crazy conspiracy theorist was too much fun not to create! LTX-2

https://redd.it/1olt8jb
@rStableDiffusion

8 views17:40

r/StableDiffusion

👋🏻

https://redd.it/1oltzox
@rStableDiffusion

8 views18:40

r/StableDiffusion

Workflow for Captioning
https://redd.it/1oltsy6
@rStableDiffusion

6 views19:40

r/StableDiffusion

Reporting Pro 6000 Blackwell can handle batch size 8 while training an Illustrious LoRA.
https://redd.it/1olvxy8
@rStableDiffusion

6 views20:40

r/StableDiffusion

FlashVSR_Ultra_Fast vs. Topaz Starlight
https://redd.it/1olznsq
@rStableDiffusion

7 views22:40

r/StableDiffusion

What Illustrious models is everyone using?

I have experimented with many Illustrious models, with WAI, Prefect and JANKU being my favorites, but I am curious what you guys are using! I'd love to find a daily driver as opposed to swapping between models so often.

https://redd.it/1om1e9a
@rStableDiffusion

7 views23:40

r/StableDiffusion