OFA-Sys/ONE-PEACE
A general representation model across vision, audio, and language modalities.
Language: Python
#audio_language #foundation_models #multimodal #representation_learning #vision_language
Stars: 185 Issues: 2 Forks: 5
https://github.com/OFA-Sys/ONE-PEACE
Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
FunAudioLLM/Fun-ASR
Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.
Language: Python
#audio #audio_language_model #audio_understanding #fun_asr #multimodal_large_language_models #pytorch #speaker_diarization #speech_recognition
Stars: 264 Issues: 4 Forks: 8
https://github.com/FunAudioLLM/Fun-ASR