r/StableDiffusion – Telegram
Ace Step 1.5 could open up a booming market for huge, comprehensive music LoRAs

I'm still settling into my initial Ace Step 1.5 setup, but I'm getting some pretty high-quality sound out of the software as I gain familiarity with the parameters and prompting conventions. All that's missing to bring Ace Step much closer to Udio is a huge database of the music we already like.

Personally, I'm not talking about - nor do I have any interest in - "ethical" databases.

I would be delighted to pay for well-trained LoRAs. I don't know how big Ace Step LoRAs can be or how many songs they can hold, or if they can be combined, but I'm eager to find out more about that stuff. As of yet I'm not even sure how to load/implement LoRAs but I'll figure it out.

It seems that training music LoRAs might be a bit more involved than training AI image LoRAs, so I don't know if I should expect to see a CivitAI-style gallery of frequent releases such as the huge & still growing collections of SDXL models and LoRAs.

Anyway, I'm really looking forward to what the community produces. I haven't been this excited about Music AI since I discovered Udio almost two years ago.

https://redd.it/1qzutkp
@rStableDiffusion
This media is not supported in your browser
VIEW IN TELEGRAM
KaniTTS2 - open-source 400M TTS model with voice cloning, runs in 3GB VRAM. Pretrain code included.

https://redd.it/1r4svm5
@rStableDiffusion
This media is not supported in your browser
VIEW IN TELEGRAM
ACEStep1.5 LoRA + Prompt Blending & Temporal Latent Noise Mask in ComfyUI: Think Daft Punk Chorus and Dr Dre verse

https://redd.it/1r4ops9
@rStableDiffusion
Dear QWEN Team - Happy New Year!

Thank you for all your contributions to the Open Source community over the past year. You guys are awesome!

Please enjoy a blessed new year celebration and we can't wait to see what cool stuff you have in stock for us in the year of the horse!

Have a great time - 新年快樂\~

https://redd.it/1r51lct
@rStableDiffusion
Quantz for RedFire-Image-Edit 1.0 FP8 / NVFP4

https://preview.redd.it/6irwlbb4qhjg1.png?width=1328&format=png&auto=webp&s=d7061447c977b6f11afdcbdca779216037f7d006

I just created quant-models for the new RedFire-Image-Edit 1.0

It works with the qwen-edit workflow, text-encoder and vae.

Here you can download the FP8 and NVFP4 versions.

Happy Prompting!

https://huggingface.co/Starnodes/quants

[https://huggingface.co/FireRedTeam/FireRed-Image-Edit-1.0\]

https://redd.it/1r4pmby
@rStableDiffusion
Training LoRA on 5060 Ti 16GB .. is this the best speed or is there any way to speed up iteration time?
https://redd.it/1r559f0
@rStableDiffusion
SDXL is still the undisputed king of n𝚜fw content

When will this change? Yeah you might get an extra arm and have to regenerate a couple times. But you get what you ask for. I have high hopes for Flux Klein but progress is slow.

https://redd.it/1r55ib0
@rStableDiffusion