r/StableDiffusion – Telegram
Will Stability ever make a comeback?

I know the family of SD3 models was really not what we had hoped for. But it seemed like they got a decent investment after that. And they've been making a lot of commercial deals (EA and UMG). Do you think they'll ever come back to the open-source space? Or are they just going to go full close and be corporate? Model providers at this point.

I know we have a lot better open models like flux and qwen but for me SDXL is still a GOAT of a model, and I find myself still using it for different specific tasks even though I can run the larger ones.

https://redd.it/1onkffi
@rStableDiffusion
Has anybody managed to get hunyuan 3d to work on GPUs that only have 8GB of VRAM?

I'm a 3D hobbyists looking for a program that can turn images into rough blockouts.

https://redd.it/1ony2nw
@rStableDiffusion
QwenEditUtils2.0 Any Resolution Reference

Hey everyone, I am xiaozhijason aka lrzjason! I'm excited to share my latest custom node collection for Qwen-based image editing workflows.



Comfyui-QwenEditUtils is a comprehensive set of utility nodes that brings advanced text encoding with reference image support for Qwen-based image editing.



Key Features:

\- Multi-Image Support: Incorporate up to 5 reference images into your text-to-image generation workflow

\- Dual Resize Options: Separate resizing controls for VAE encoding (1024px) and VL encoding (384px)

\- Individual Image Outputs: Each processed reference image is provided as a separate output for flexible connections

\- Latent Space Integration: Encode reference images into latent space for efficient processing

\- Qwen Model Compatibility: Specifically designed for Qwen-based image editing models

\- Customizable Templates: Use custom Llama templates for tailored image editing instructions



New in v2.0.0:

\- Added TextEncodeQwenImageEditPlusCustom_lrzjason for highly customized image editing

\- Added QwenEditConfigPreparer, QwenEditConfigJsonParser for creating image configurations

\- Added QwenEditOutputExtractor for extracting outputs from the custom node

\- Added QwenEditListExtractor for extracting items from lists

\- Added CropWithPadInfo for cropping images with pad information



Available Nodes:

\- TextEncodeQwenImageEditPlusCustom: Maximum customization with per-image configurations

\- Helper Nodes: QwenEditConfigPreparer, QwenEditConfigJsonParser, QwenEditOutputExtractor, QwenEditListExtractor, CropWithPadInfo



The package includes complete workflow examples in both simple and advanced configurations. The custom node offers maximum flexibility by allowing per-image configurations for both reference and vision-language processing.



Perfect for users who need fine-grained control over image editing workflows with multiple reference images and customizable processing parameters.



Installation: Manager or Clone/download to your ComfyUI's custom_nodes directory and restart.



Check out the full documentation on GitHub for detailed usage instructions and examples. Looking forward to seeing what you create!

https://preview.redd.it/7j76g2csi7zf1.jpg?width=4344&format=pjpg&auto=webp&s=6e4f39f8da6aabae91c9f9b4f047f4184434a43f

https://preview.redd.it/iseesncsi7zf1.jpg?width=4344&format=pjpg&auto=webp&s=2e2ad72f92e2e3bf74b0396d3ff2dbe99f0532b0

https://preview.redd.it/wd97d3csi7zf1.jpg?width=4344&format=pjpg&auto=webp&s=25cc1724d8397ad214f594886f75816b8086c750




https://redd.it/1oo2u0i
@rStableDiffusion
Open source Model to create posters/educational pictures

I have been trying to create a text to image tool for K-12 students for educational purpose. Outputs along with aesthetic pictures needs to be posters, flash cards etc with text in it.

Problem is stable diffusion models and even flux struggles with text heavily. Flux is somewhat ok sometimes but not reliable enough. I have tried layout parsing over background generated by stable diffusion too, this gives me okayish results if i hard code layouts properly so can't be automated with llm being attached for layouts.

What are my options in terms of open source models or anyone has done any work in this domain before which i can take reference from?


https://redd.it/1oo4w5g
@rStableDiffusion