Unterschiede

Hier werden die Unterschiede zwischen zwei Versionen angezeigt.

Link zu dieser Vergleichsansicht

Beide Seiten der vorigen RevisionVorhergehende Überarbeitung
Nächste Überarbeitung
Vorhergehende Überarbeitung
ai_ki [2025/08/14 09:33] muonitai_ki [2025/08/18 14:13] (aktuell) – [TTS] muonit
Zeile 32: Zeile 32:
 ===== Lokal ===== ===== Lokal =====
   * https://github.com/google-ai-edge/gallery on cellphone AI models   * https://github.com/google-ai-edge/gallery on cellphone AI models
 +  * https://github.com/ZeyueT/AudioX Anything to Audio
 ==== Chat ==== ==== Chat ====
   * https://github.com/mezbaul-h/june Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit    * https://github.com/mezbaul-h/june Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit 
Zeile 40: Zeile 41:
   * https://bannerify.co/tools/remove-bg   * https://bannerify.co/tools/remove-bg
   * https://github.com/pinokiofactory/RMBG-2-Studio?tab=readme-ov-file   * https://github.com/pinokiofactory/RMBG-2-Studio?tab=readme-ov-file
-===== Bildgenerierung =====+  * https://github.com/azizaydi23/video-bg-remover 
 + 
 + 
 +=== Bildgenerierung ===
   * https://ltxv.video A groundbreaking 13B-parameter AI model by Lightricks, on a 4090/5090   * https://ltxv.video A groundbreaking 13B-parameter AI model by Lightricks, on a 4090/5090
   * https://github.com/runew0lf/RuinedFooocus   * https://github.com/runew0lf/RuinedFooocus
Zeile 46: Zeile 50:
   * https://github.com/deepbeepmeep/Hunyuan3D-2GP 3D Model from Image   * https://github.com/deepbeepmeep/Hunyuan3D-2GP 3D Model from Image
  
-===== Clone Movement to Avatar/Videogenerierung =====+=== Clone Movement to Avatar/Videogenerierung ===
   * https://github.com/antgroup/echomimic_v2 Clone Movement to digital Avatar   * https://github.com/antgroup/echomimic_v2 Clone Movement to digital Avatar
   * https://github.com/antgroup/echomimic_v3?tab=readme-ov-file   * https://github.com/antgroup/echomimic_v3?tab=readme-ov-file
   * https://github.com/deepbeepmeep/Wan2GP Open Source Video Generative Models Accessible to the GPU Poor   * https://github.com/deepbeepmeep/Wan2GP Open Source Video Generative Models Accessible to the GPU Poor
 +  * https://github.com/IDEA-Research/DWPose Like OpenPose 
  
-====== TTS ======+==== TTS ====
   * https://github.com/fishaudio/fish-speech SOTA Open Source TTS  - OpenAudio   * https://github.com/fishaudio/fish-speech SOTA Open Source TTS  - OpenAudio
-====== STT ======+  * https://github.com/nari-labs/dia 
 +  * https://github.com/myshell-ai/OpenVoice Voice Cloning 
 +  * https://github.com/yl4579/StyleTTS2 
 +  * https://docs.hyprnote.com/owhisper/what-is-this OWhisper SST für Hyprnote 
 + 
 +==== STT ====
   * https://github.com/fastrepl/hyprnote Local-first AI Notepad for Private Meetings    * https://github.com/fastrepl/hyprnote Local-first AI Notepad for Private Meetings 
   * https://github.com/thewh1teagle/vibe  Transcribe on your own!    * https://github.com/thewh1teagle/vibe  Transcribe on your own! 
   * https://github.com/jhj0517/Whisper-WebUI Subtitle editing   * https://github.com/jhj0517/Whisper-WebUI Subtitle editing
 +  * https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator/ Für M Macs