JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
GitHub
GitHub - JonasGeiping/cramming: Cramming the training of a (BERT-type) language model into limited compute.
Cramming the training of a (BERT-type) language model into limited compute. - JonasGeiping/cramming
👍2
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
GitHub
GitHub - BlinkDL/ChatRWKV: ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. - BlinkDL/ChatRWKV
🔥3👍1👏1
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
GitHub
GitHub - NVlabs/prismer: The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". - NVlabs/prismer
🔥3