PersonaLive: Expressive Portrait Image Animation for Live Streaming
https://redd.it/1pn7hih
@rStableDiffusion
Fun-CosyVoice 3.0 is an advanced text-to-speech (TTS) system
https://redd.it/1pn793c
@rStableDiffusion
I accidentally made a realism LoRA while trying to make a LoRA of myself. Z-Image's potential is huge.
https://redd.it/1png3ef
@rStableDiffusion
This B300 server at my work will be unused until after the holidays. What should I train, boys???
https://redd.it/1pnio1b
@rStableDiffusion
I used Flux-Schnell to generate card art in real time as the player progresses
https://redd.it/1pnlvsk
@rStableDiffusion
Analyse LoRA blocks and choose, in real time, which blocks are used for inference in ComfyUI. Z-Image, Qwen, Wan 2.2, Flux Dev and SDXL supported.
https://www.youtube.com/watch?v=dkEB5i5yBUI
https://redd.it/1pnooaf
@rStableDiffusion
YouTube
Realtime LoRA Analysis and Block editing in-generation in ComfyUI. Z-Image, Flux, Wan 2.2, Qwen
Train, analyze, and selectively load LoRAs block by block inside ComfyUI. Supports Z-Image, Qwen Image, Qwen Image Edit, SDXL, FLUX, Wan 2.2
https://github.com/shootthesound/comfyUI-Realtime-Lora
Version 2 Beta includes Lora Saving: https://www.youtube.com/…
Chatterbox Turbo Released Today
I didn't see another post on this, but the open source TTS was released today.
https://huggingface.co/collections/ResembleAI/chatterbox-turbo
I tested it with a recording of my voice and in 5 seconds it was able to create a pretty decent facsimile of my voice.
https://redd.it/1pnozbo
@rStableDiffusion
huggingface.co
Chatterbox Turbo - a ResembleAI Collection
Ultra-Fast, Open-Source Text-to-Speech for Real-Time Voice AI
My updated 4-stage upscale workflow to squeeze Z-Image and those character LoRAs dry
https://redd.it/1pny489
@rStableDiffusion