Unterschiede

Hier werden die Unterschiede zwischen zwei Versionen angezeigt.

Link zu dieser Vergleichsansicht

Beide Seiten der vorigen RevisionVorhergehende Überarbeitung
Nächste Überarbeitung
Vorhergehende Überarbeitung
ai_ki [2025/08/14 09:34] muonitai_ki [2025/08/18 14:13] (aktuell) – [TTS] muonit
Zeile 32: Zeile 32:
 ===== Lokal ===== ===== Lokal =====
   * https://github.com/google-ai-edge/gallery on cellphone AI models   * https://github.com/google-ai-edge/gallery on cellphone AI models
 +  * https://github.com/ZeyueT/AudioX Anything to Audio
 ==== Chat ==== ==== Chat ====
   * https://github.com/mezbaul-h/june Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit    * https://github.com/mezbaul-h/june Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit 
Zeile 40: Zeile 41:
   * https://bannerify.co/tools/remove-bg   * https://bannerify.co/tools/remove-bg
   * https://github.com/pinokiofactory/RMBG-2-Studio?tab=readme-ov-file   * https://github.com/pinokiofactory/RMBG-2-Studio?tab=readme-ov-file
 +  * https://github.com/azizaydi23/video-bg-remover
 +
 +
 === Bildgenerierung === === Bildgenerierung ===
   * https://ltxv.video A groundbreaking 13B-parameter AI model by Lightricks, on a 4090/5090   * https://ltxv.video A groundbreaking 13B-parameter AI model by Lightricks, on a 4090/5090
Zeile 50: Zeile 54:
   * https://github.com/antgroup/echomimic_v3?tab=readme-ov-file   * https://github.com/antgroup/echomimic_v3?tab=readme-ov-file
   * https://github.com/deepbeepmeep/Wan2GP Open Source Video Generative Models Accessible to the GPU Poor   * https://github.com/deepbeepmeep/Wan2GP Open Source Video Generative Models Accessible to the GPU Poor
 +  * https://github.com/IDEA-Research/DWPose Like OpenPose 
  
 ==== TTS ==== ==== TTS ====
   * https://github.com/fishaudio/fish-speech SOTA Open Source TTS  - OpenAudio   * https://github.com/fishaudio/fish-speech SOTA Open Source TTS  - OpenAudio
 +  * https://github.com/nari-labs/dia
 +  * https://github.com/myshell-ai/OpenVoice Voice Cloning
 +  * https://github.com/yl4579/StyleTTS2
 +  * https://docs.hyprnote.com/owhisper/what-is-this OWhisper SST für Hyprnote
  
 ==== STT ==== ==== STT ====