chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language: Python
#artificial_intelligence #data_science #language_model #natural_language_processing #nlp #open #python #text_mining
Stars: 371 Issues: 2 Forks: 31
https://github.com/chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language: Python
#artificial_intelligence #data_science #language_model #natural_language_processing #nlp #open #python #text_mining
Stars: 371 Issues: 2 Forks: 31
https://github.com/chiphuyen/lazynlp
GitHub
GitHub - chiphuyen/lazynlp: Library to scrape and clean web pages to create massive datasets.
Library to scrape and clean web pages to create massive datasets. - chiphuyen/lazynlp
pingpong-ai/xlnet-pytorch
2019 Google Brain's XLNet Pytorch Implementation
Language: Python
#language_model #pytorch #transformer_xl #xlnet #xlnet_pytorch
Stars: 110 Issues: 2 Forks: 9
https://github.com/pingpong-ai/xlnet-pytorch
2019 Google Brain's XLNet Pytorch Implementation
Language: Python
#language_model #pytorch #transformer_xl #xlnet #xlnet_pytorch
Stars: 110 Issues: 2 Forks: 9
https://github.com/pingpong-ai/xlnet-pytorch
SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
Language: Python
#korean_nlp #language_model
Stars: 132 Issues: 0 Forks: 17
https://github.com/SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
Language: Python
#korean_nlp #language_model
Stars: 132 Issues: 0 Forks: 17
https://github.com/SKTBrain/KoBERT
GitHub
GitHub - SKTBrain/KoBERT: Korean BERT pre-trained cased (KoBERT)
Korean BERT pre-trained cased (KoBERT). Contribute to SKTBrain/KoBERT development by creating an account on GitHub.
maraoz/gpt-scrolls
A collaborative collection of open-source safe GPT-3 prompts that work well
#generator #gpt_3 #language_model #openai #safety #transformer
Stars: 123 Issues: 4 Forks: 7
https://github.com/maraoz/gpt-scrolls
A collaborative collection of open-source safe GPT-3 prompts that work well
#generator #gpt_3 #language_model #openai #safety #transformer
Stars: 123 Issues: 4 Forks: 7
https://github.com/maraoz/gpt-scrolls
GitHub
GitHub - maraoz/gpt-scrolls: A collaborative collection of open-source safe GPT-3 prompts that work well
A collaborative collection of open-source safe GPT-3 prompts that work well - GitHub - maraoz/gpt-scrolls: A collaborative collection of open-source safe GPT-3 prompts that work well
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python
#adapters #artificial_intelligence #deeplearning #dependency_parsing #language_model #lemmatization #machine_learning #morphological_tagging #multilingual #natural_language_processing #nlp #part_of_speech_tagging #pytorch #sentence_segmentation #tokenization #universal_dependencies #xlm_roberta
Stars: 120 Issues: 0 Forks: 8
https://github.com/nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python
#adapters #artificial_intelligence #deeplearning #dependency_parsing #language_model #lemmatization #machine_learning #morphological_tagging #multilingual #natural_language_processing #nlp #part_of_speech_tagging #pytorch #sentence_segmentation #tokenization #universal_dependencies #xlm_roberta
Stars: 120 Issues: 0 Forks: 8
https://github.com/nlp-uoregon/trankit
GitHub
GitHub - nlp-uoregon/trankit: Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing - nlp-uoregon/trankit
will-thompson-k/tldr-transformers
The "tl;dr" on a few notable transformer papers.
#nlp #deep_learning #notes #transformers #attention #transfer_learning #language_models #language_model #bert #open_ai #huggingface #huggingface_transformer #gpt_3
Stars: 90 Issues: 2 Forks: 2
https://github.com/will-thompson-k/tldr-transformers
The "tl;dr" on a few notable transformer papers.
#nlp #deep_learning #notes #transformers #attention #transfer_learning #language_models #language_model #bert #open_ai #huggingface #huggingface_transformer #gpt_3
Stars: 90 Issues: 2 Forks: 2
https://github.com/will-thompson-k/tldr-transformers
GitHub
GitHub - will-thompson-k/tldr-transformers: The "tl;dr" on a few notable transformer papers (pre-2022).
The "tl;dr" on a few notable transformer papers (pre-2022). - will-thompson-k/tldr-transformers
DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C
#asr #c_plus_plus #command_line_tool #debian #debian_package #deep_learning #german_speech_recognition #kenlm #language_model #machine_learning #no_cloud #offline #pretrained_models #private #speech #speech_recognition #speech_to_text #tensorflow #tensorflow_lite #wave
Stars: 250 Issues: 0 Forks: 9
https://github.com/DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C
#asr #c_plus_plus #command_line_tool #debian #debian_package #deep_learning #german_speech_recognition #kenlm #language_model #machine_learning #no_cloud #offline #pretrained_models #private #speech #speech_recognition #speech_to_text #tensorflow #tensorflow_lite #wave
Stars: 250 Issues: 0 Forks: 9
https://github.com/DeutscheKI/tevr-asr-tool
GitHub
GitHub - DeutscheKI/tevr-asr-tool: State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is…
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool. - DeutscheKI/tevr-asr-tool
extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
GitHub
GitHub - extreme-bert/extreme-bert: ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on…
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Custom...
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
GitHub
GitHub - JonasGeiping/cramming: Cramming the training of a (BERT-type) language model into limited compute.
Cramming the training of a (BERT-type) language model into limited compute. - JonasGeiping/cramming
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
GitHub
GitHub - BlinkDL/ChatRWKV: ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. - BlinkDL/ChatRWKV
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
GitHub
GitHub - NVlabs/prismer: The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". - NVlabs/prismer
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python
#deep_learning #instruction_following #language_model
Stars: 2633 Issues: 3 Forks: 169
https://github.com/tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python
#deep_learning #instruction_following #language_model
Stars: 2633 Issues: 3 Forks: 169
https://github.com/tatsu-lab/stanford_alpaca
GitHub
GitHub - tatsu-lab/stanford_alpaca: Code and documentation to train Stanford's Alpaca models, and generate the data.
Code and documentation to train Stanford's Alpaca models, and generate the data. - tatsu-lab/stanford_alpaca
context-labs/autodoc
Experimental toolkit for auto-generating codebase documentation using LLMs
Language: TypeScript
#cli_tool #documentation_generator #language_model #typescript
Stars: 568 Issues: 7 Forks: 18
https://github.com/context-labs/autodoc
Experimental toolkit for auto-generating codebase documentation using LLMs
Language: TypeScript
#cli_tool #documentation_generator #language_model #typescript
Stars: 568 Issues: 7 Forks: 18
https://github.com/context-labs/autodoc
GitHub
GitHub - context-labs/autodoc: Experimental toolkit for auto-generating codebase documentation using LLMs
Experimental toolkit for auto-generating codebase documentation using LLMs - context-labs/autodoc
mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Language: Python
#chatgpt #deep_learning #language_model #llm #tvm #webgpu #webml
Stars: 1009 Issues: 1 Forks: 41
https://github.com/mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Language: Python
#chatgpt #deep_learning #language_model #llm #tvm #webgpu #webml
Stars: 1009 Issues: 1 Forks: 41
https://github.com/mlc-ai/web-llm
GitHub
GitHub - mlc-ai/web-llm: High-performance In-browser LLM Inference Engine
High-performance In-browser LLM Inference Engine . Contribute to mlc-ai/web-llm development by creating an account on GitHub.
xtekky/chatgpt-clone
ChatGPT interface with better UI + running on free gpt api's
Language: JavaScript
#chatgpt #chatgpt_api #chatgpt_app #chatgpt_clone #gpt_4 #gpt_4_api #gpt_interface #gpt3 #gpt4 #gpt4_api #gpt4all #interface #language #language_model #site #ui
Stars: 287 Issues: 4 Forks: 70
https://github.com/xtekky/chatgpt-clone
ChatGPT interface with better UI + running on free gpt api's
Language: JavaScript
#chatgpt #chatgpt_api #chatgpt_app #chatgpt_clone #gpt_4 #gpt_4_api #gpt_interface #gpt3 #gpt4 #gpt4_api #gpt4all #interface #language #language_model #site #ui
Stars: 287 Issues: 4 Forks: 70
https://github.com/xtekky/chatgpt-clone
GitHub
GitHub - xtekky/chatgpt-clone: ChatGPT interface with better UI
ChatGPT interface with better UI . Contribute to xtekky/chatgpt-clone development by creating an account on GitHub.
mlc-ai/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Language: Python
#language_model #llm #machine_learning_compilation #tvm
Stars: 319 Issues: 5 Forks: 15
https://github.com/mlc-ai/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Language: Python
#language_model #llm #machine_learning_compilation #tvm
Stars: 319 Issues: 5 Forks: 15
https://github.com/mlc-ai/mlc-llm
GitHub
GitHub - mlc-ai/mlc-llm: Universal LLM Deployment Engine with ML Compilation
Universal LLM Deployment Engine with ML Compilation - mlc-ai/mlc-llm
salesforce/xgen
Salesforce open-source LLMs with 8k sequence length.
Language: Python
#language_model #large_language_models #llm #nlp
Stars: 357 Issues: 6 Forks: 18
https://github.com/salesforce/xgen
Salesforce open-source LLMs with 8k sequence length.
Language: Python
#language_model #large_language_models #llm #nlp
Stars: 357 Issues: 6 Forks: 18
https://github.com/salesforce/xgen
GitHub
GitHub - salesforce/xgen: Salesforce open-source LLMs with 8k sequence length.
Salesforce open-source LLMs with 8k sequence length. - salesforce/xgen
searchableguy/whiz
A copilot for your terminal
Language: TypeScript
#agent #chat_gpt #chatgpt #cli #copilot #enquirer #language_model #llm #node #openai #transformer #typescript #whiz
Stars: 146 Issues: 4 Forks: 3
https://github.com/searchableguy/whiz
A copilot for your terminal
Language: TypeScript
#agent #chat_gpt #chatgpt #cli #copilot #enquirer #language_model #llm #node #openai #transformer #typescript #whiz
Stars: 146 Issues: 4 Forks: 3
https://github.com/searchableguy/whiz
GitHub
GitHub - searchableguy/whiz: A copilot for your terminal
A copilot for your terminal. Contribute to searchableguy/whiz development by creating an account on GitHub.
elicit/machine-learning-list
#artificial_intelligence #language_model #machine_learning #transformers
Stars: 188 Issues: 0 Forks: 9
https://github.com/elicit/machine-learning-list
#artificial_intelligence #language_model #machine_learning #transformers
Stars: 188 Issues: 0 Forks: 9
https://github.com/elicit/machine-learning-list
GitHub
GitHub - elicit/machine-learning-list: A curriculum for learning about foundation models, from scratch to the frontier
A curriculum for learning about foundation models, from scratch to the frontier - elicit/machine-learning-list
adaline/gateway
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
Language: TypeScript
#ai #ai_agents #anthropic #language_model #llm #llmops #openai #prompt_engineering #togetherai #typescript
Stars: 277 Issues: 0 Forks: 5
https://github.com/adaline/gateway
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
Language: TypeScript
#ai #ai_agents #anthropic #language_model #llm #llmops #openai #prompt_engineering #togetherai #typescript
Stars: 277 Issues: 0 Forks: 5
https://github.com/adaline/gateway
GitHub
GitHub - adaline/gateway: The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface…
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs. - adaline/gateway