GitHub repos

lucidrains/audiolm-pytorch
Implementation of AudioLM, a Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #audio_synthesis #deep_learning #transformers
Stars: 121 Issues: 1 Forks: 1
https://github.com/lucidrains/audiolm-pytorch

GitHub

GitHub - lucidrains/audiolm-pytorch: Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google…

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch - lucidrains/audiolm-pytorch

🔥2

2.35K views04:22

GitHub repos

enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language: Python
#audio_lm #pytorch #text_to_speech #tts #vall_e #valle
Stars: 212 Issues: 2 Forks: 32
https://github.com/enhuiz/vall-e

GitHub

GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E

An unofficial PyTorch implementation of the audio LM VALL-E - GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E

👍5😐1

2.66K views11:04

GitHub repos

jafarlihi/sysm
sysm makes your system play custom sounds when any configured system or external event happens
Language: C++
#audio #linux #music #system_monitor #system_monitoring
Stars: 160 Issues: 0 Forks: 6
https://github.com/jafarlihi/sysm

GitHub

GitHub - h2337/sysm: sysm makes your system play custom sounds when any configured system or external event happens

sysm makes your system play custom sounds when any configured system or external event happens - h2337/sysm

👍3😐1

2.6K views11:04

GitHub repos

archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
#artificial_intelligence #audio_generation #machine_learning
Stars: 319 Issues: 0 Forks: 7
https://github.com/archinetai/audio-ai-timeline

GitHub

GitHub - archinetai/audio-ai-timeline: A timeline of the latest AI models for audio generation, starting in 2023!

A timeline of the latest AI models for audio generation, starting in 2023! - archinetai/audio-ai-timeline

🤔6👍5

4.75K views17:04

GitHub repos

samim23/polymath
Convert any music library into a music production sample-library with ML
Language: Python
#audio #machine_learning #ml #music #python
Stars: 725 Issues: 1 Forks: 39
https://github.com/samim23/polymath

GitHub

GitHub - samim23/polymath: Convert any music library into a music production sample-library with ML

Convert any music library into a music production sample-library with ML - samim23/polymath

🤯7❤2🔥2

3.93K views11:05

GitHub repos

StanGirard/quiver
Dump all your files and thoughts into your GenerativeAI brain and chat with it
Language: Python
#audio #chat #chatgpt #csv #embeddings #generativeai #obsidian #pdf #second_brain #vectorstore #whisper
Stars: 185 Issues: 6 Forks: 18
https://github.com/StanGirard/quiver

GitHub

GitHub - QuivrHQ/quivr: Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration…

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstor...

👍1

2.46K views04:10

GitHub repos

lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanism #audio_generation #deep_learning #non_autoregressive #transformers
Stars: 181 Issues: 0 Forks: 6
https://github.com/lucidrains/soundstorm-pytorch

GitHub

GitHub - lucidrains/soundstorm-pytorch: Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind…

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch - lucidrains/soundstorm-pytorch

😁1

2.24K views22:11

GitHub repos

OFA-Sys/ONE-PEACE
A general representation modal across vision, audio, language modalities.
Language: Python
#audio_language #foundation_models #multimodal #representation_learning #vision_language
Stars: 185 Issues: 2 Forks: 5
https://github.com/OFA-Sys/ONE-PEACE

GitHub

GitHub - OFA-Sys/ONE-PEACE: A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring…

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities - OFA-Sys/ONE-PEACE

2.25K views04:11

GitHub repos

VASTDynamics/Vaporizer2
Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation plugin
Language: C++
#aax #audio #audiounit_plugins #cpp #daw #music #plugin #sampler #synthesizer #vst #vst3 #vst3_plugin #wavetable
Stars: 186 Issues: 5 Forks: 9
https://github.com/VASTDynamics/Vaporizer2

GitHub

GitHub - VASTDynamics/Vaporizer2: Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation…

Vaporizer2 hybrid wavetable additive / subtractive VST / AU / AAX synthesizer / sampler workstation plugin - VASTDynamics/Vaporizer2

🔥1

2.24K views22:19

GitHub repos

huggingface/distil-whisper
#audio #speech_recognition #whisper
Stars: 261 Issues: 2 Forks: 9
https://github.com/huggingface/distil-whisper

GitHub

GitHub - huggingface/distil-whisper: Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word…

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. - huggingface/distil-whisper

2.19K views10:20

GitHub repos

ZiqiaoPeng/SyncTalk
This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
#audio_driven_talking_face #talking_face #talking_face_generation #talking_head
Stars: 180 Issues: 5 Forks: 2
https://github.com/ZiqiaoPeng/SyncTalk

GitHub

GitHub - ZiqiaoPeng/SyncTalk: [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization…

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis" - ZiqiaoPeng/SyncTalk

2.08K views05:23

GitHub repos

TuneNN/TuneNN
A transformer-based network model for pitch detection
Language: Python
#audio #machine_learning #music #pitch_detection #pitch_estimation
Stars: 142 Issues: 0 Forks: 3
https://github.com/TuneNN/TuneNN

GitHub

GitHub - TuneNN/TuneNN: A transformer-based network model for pitch detection

A transformer-based network model for pitch detection - TuneNN/TuneNN

👍1

1.95K views23:24

GitHub repos

ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python
#audio_visual_learning #face_animation #talking_head #video_generation
Stars: 217 Issues: 7 Forks: 20
https://github.com/ali-vilab/dreamtalk

GitHub

GitHub - ali-vilab/dreamtalk: Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion…

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models - ali-vilab/dreamtalk

2.3K views17:24

GitHub repos

Lessica/TrollRecorder
WIP: A simple audio recorder for TrollStore.
Language: Objective-C++
#audio_recorder #ios #jailbreak #trollstore #tweak
Stars: 282 Issues: 1 Forks: 10
https://github.com/Lessica/TrollRecorder

GitHub

GitHub - Lessica/TrollRecorder: (i18n/CLI) Not the first, but the best phone call recorder with TrollStore.

(i18n/CLI) Not the first, but the best phone call recorder with TrollStore. - Lessica/TrollRecorder

👍5

3.35K views23:27

GitHub repos