sberbank-ai/ru-dalle
Generate images from texts. In Russian
Language: Jupyter Notebook
#dalle #image_generation #openai #python #pytorch #russian #russian_language #text_to_image #transformer
Stars: 245 Issues: 2 Forks: 20
https://github.com/sberbank-ai/ru-dalle
  Generate images from texts. In Russian
Language: Jupyter Notebook
#dalle #image_generation #openai #python #pytorch #russian #russian_language #text_to_image #transformer
Stars: 245 Issues: 2 Forks: 20
https://github.com/sberbank-ai/ru-dalle
sail-sg/poolformer
PoolFormer: MetaFormer is Actually What You Need for Vision
Language: Jupyter Notebook
#image_classification #mlp #pooling #transformer
Stars: 184 Issues: 2 Forks: 7
https://github.com/sail-sg/poolformer
  
  PoolFormer: MetaFormer is Actually What You Need for Vision
Language: Jupyter Notebook
#image_classification #mlp #pooling #transformer
Stars: 184 Issues: 2 Forks: 7
https://github.com/sail-sg/poolformer
GitHub
  
  GitHub - sail-sg/poolformer: PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
  PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral) - sail-sg/poolformer
  wjf5203/VNext
Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext
  
  Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext
GitHub
  
  GitHub - wjf5203/VNext: Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR…
  Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral),  and IDOL(ECCV Oral)) - wjf5203/VNext
👍4
  sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Language: Python
#adan #bert_model #convnext #deep_learning #fairseq #mae #optimizer #resnet #timm #transformer_xl #vit
Stars: 158 Issues: 2 Forks: 5
https://github.com/sail-sg/Adan
  
  Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Language: Python
#adan #bert_model #convnext #deep_learning #fairseq #mae #optimizer #resnet #timm #transformer_xl #vit
Stars: 158 Issues: 2 Forks: 5
https://github.com/sail-sg/Adan
GitHub
  
  GitHub - sail-sg/Adan: Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
  Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models - sail-sg/Adan
🔥2
  extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
  
  ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
GitHub
  
  GitHub - extreme-bert/extreme-bert: ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on…
  ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Custom...
👍3
  open-mmlab/Multimodal-GPT
Multimodal-GPT
Language: Python
#flamingo #gpt #gpt_4 #llama #multimodal #transformer #vision_and_language
Stars: 244 Issues: 1 Forks: 12
https://github.com/open-mmlab/Multimodal-GPT
  
  Multimodal-GPT
Language: Python
#flamingo #gpt #gpt_4 #llama #multimodal #transformer #vision_and_language
Stars: 244 Issues: 1 Forks: 12
https://github.com/open-mmlab/Multimodal-GPT
GitHub
  
  GitHub - open-mmlab/Multimodal-GPT: Multimodal-GPT
  Multimodal-GPT. Contribute to open-mmlab/Multimodal-GPT development by creating an account on GitHub.
👎1
  X-PLUG/mPLUG-Owl
mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
Language: Python
#alpaca #chatbot #chatgpt #computer_vision #damo #gpt #gpt4 #gpt4_api #huggingface #instruction_tuning #large_language_models #llama #mplug #mplug_owl #multimodal #pretraining #pytorch #transformer #visual_reasoning #visual_recognition
Stars: 209 Issues: 1 Forks: 9
https://github.com/X-PLUG/mPLUG-Owl
  
  mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
Language: Python
#alpaca #chatbot #chatgpt #computer_vision #damo #gpt #gpt4 #gpt4_api #huggingface #instruction_tuning #large_language_models #llama #mplug #mplug_owl #multimodal #pretraining #pytorch #transformer #visual_reasoning #visual_recognition
Stars: 209 Issues: 1 Forks: 9
https://github.com/X-PLUG/mPLUG-Owl
GitHub
  
  GitHub - X-PLUG/mPLUG-Owl: mPLUG-Owl: The Powerful Multi-modal Large Language Model  Family
  mPLUG-Owl: The Powerful Multi-modal Large Language Model  Family - X-PLUG/mPLUG-Owl
  kyegomez/LongNet
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Language: Python
#artificial_intelligence #attention #attention_is_all_you_need #attention_mechanisms #chatgpt #context_length #gpt3 #gpt4 #machine_learning #transformer
Stars: 381 Issues: 4 Forks: 55
https://github.com/kyegomez/LongNet
  
  Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Language: Python
#artificial_intelligence #attention #attention_is_all_you_need #attention_mechanisms #chatgpt #context_length #gpt3 #gpt4 #machine_learning #transformer
Stars: 381 Issues: 4 Forks: 55
https://github.com/kyegomez/LongNet
GitHub
  
  GitHub - kyegomez/LongNet: Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
  Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" - kyegomez/LongNet
👌2👍1
  searchableguy/whiz
A copilot for your terminal
Language: TypeScript
#agent #chat_gpt #chatgpt #cli #copilot #enquirer #language_model #llm #node #openai #transformer #typescript #whiz
Stars: 146 Issues: 4 Forks: 3
https://github.com/searchableguy/whiz
  
  A copilot for your terminal
Language: TypeScript
#agent #chat_gpt #chatgpt #cli #copilot #enquirer #language_model #llm #node #openai #transformer #typescript #whiz
Stars: 146 Issues: 4 Forks: 3
https://github.com/searchableguy/whiz
GitHub
  
  GitHub - cloudycotton/whiz: A copilot for your terminal
  A copilot for your terminal. Contribute to cloudycotton/whiz development by creating an account on GitHub.
👍3
  SqueezeAILab/LLMCompiler
LLMCompiler: An LLM Compiler for Parallel Function Calling
Language: Python
#efficient_inference #function_calling #large_language_models #llama #llama2 #llm #llm_agent #llm_agents #llm_framework #llms #natural_language_processing #nlp #parallel_function_call #transformer
Stars: 216 Issues: 0 Forks: 11
https://github.com/SqueezeAILab/LLMCompiler
  
  LLMCompiler: An LLM Compiler for Parallel Function Calling
Language: Python
#efficient_inference #function_calling #large_language_models #llama #llama2 #llm #llm_agent #llm_agents #llm_framework #llms #natural_language_processing #nlp #parallel_function_call #transformer
Stars: 216 Issues: 0 Forks: 11
https://github.com/SqueezeAILab/LLMCompiler
GitHub
  
  GitHub - SqueezeAILab/LLMCompiler: [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
  [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling - SqueezeAILab/LLMCompiler
  kyegomez/MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
Language: Python
#ai #artificial_intelligence #attention_mechanism #machine_learning #mamba #ml #pytorch #ssm #torch #transformer_architecture #transformers #zeta
Stars: 264 Issues: 0 Forks: 9
https://github.com/kyegomez/MultiModalMamba
  
  A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
Language: Python
#ai #artificial_intelligence #attention_mechanism #machine_learning #mamba #ml #pytorch #ssm #torch #transformer_architecture #transformers #zeta
Stars: 264 Issues: 0 Forks: 9
https://github.com/kyegomez/MultiModalMamba
GitHub
  
  GitHub - kyegomez/MultiModalMamba: A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi…
  A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever. - kyegomez/MultiModalMamba
🔥1
  buaacyw/MeshAnything
From anything to mesh like human artists
Language: Python
#3d #generative_ai #generative_model #mesh #mesh_generation #transformer
Stars: 405 Issues: 1 Forks: 12
https://github.com/buaacyw/MeshAnything
  
  From anything to mesh like human artists
Language: Python
#3d #generative_ai #generative_model #mesh #mesh_generation #transformer
Stars: 405 Issues: 1 Forks: 12
https://github.com/buaacyw/MeshAnything
GitHub
  
  GitHub - buaacyw/MeshAnything: [ICLR 2025] From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created…
  [ICLR 2025] From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers" - buaacyw/MeshAnything
🔥4
  InternLM/MindSearch
🔍 a LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT
Language: Python
#ai_search_engine #gpt #llm #llms #multi_agent_systems #perplexity_ai #search #searchgpt #transformer #web_search
Stars: 792 Issues: 9 Forks: 60
https://github.com/InternLM/MindSearch
  
  🔍 a LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT
Language: Python
#ai_search_engine #gpt #llm #llms #multi_agent_systems #perplexity_ai #search #searchgpt #transformer #web_search
Stars: 792 Issues: 9 Forks: 60
https://github.com/InternLM/MindSearch
GitHub
  
  GitHub - InternLM/MindSearch: 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
  🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) - InternLM/MindSearch
  DepthAnything/Video-Depth-Anything
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Language: Python
#depth_estimation #monocular_depth_estimation #transformer #video_depth
Stars: 234 Issues: 2 Forks: 8
https://github.com/DepthAnything/Video-Depth-Anything
  
  Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Language: Python
#depth_estimation #monocular_depth_estimation #transformer #video_depth
Stars: 234 Issues: 2 Forks: 8
https://github.com/DepthAnything/Video-Depth-Anything
GitHub
  
  GitHub - DepthAnything/Video-Depth-Anything: [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super…
  [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos - DepthAnything/Video-Depth-Anything
  MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
  
  MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
GitHub
  
  GitHub - MoonshotAI/MoBA: MoBA: Mixture of Block Attention for Long-Context LLMs
  MoBA: Mixture of Block Attention for Long-Context LLMs - MoonshotAI/MoBA
  therealoliver/Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Language: Jupyter Notebook
#attention #attention_mechanism #gpt #inference #kv_cache #language_model #llama #llm_configuration #llms #mask #multi_head_attention #positional_encoding #residuals #rms #rms_norm #rope #rotary_position_encoding #swiglu #tokenizer #transformer
Stars: 388 Issues: 0 Forks: 28
https://github.com/therealoliver/Deepdive-llama3-from-scratch
  
  Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Language: Jupyter Notebook
#attention #attention_mechanism #gpt #inference #kv_cache #language_model #llama #llm_configuration #llms #mask #multi_head_attention #positional_encoding #residuals #rms #rms_norm #rope #rotary_position_encoding #swiglu #tokenizer #transformer
Stars: 388 Issues: 0 Forks: 28
https://github.com/therealoliver/Deepdive-llama3-from-scratch
GitHub
  
  GitHub - therealoliver/Deepdive-llama3-from-scratch: Achieve the llama3 inference step-by-step, grasp the core concepts, master…
  Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. - therealoliver/Deepdive-llama3-from-scratch
👍1
  yassa9/qwen600
Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
Language: Cuda
#cuda #cuda_programming #gpu #llamacpp #llm #llm_inference #qwen #qwen3 #transformer
Stars: 287 Issues: 1 Forks: 17
https://github.com/yassa9/qwen600
  
  Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
Language: Cuda
#cuda #cuda_programming #gpu #llamacpp #llm #llm_inference #qwen #qwen3 #transformer
Stars: 287 Issues: 1 Forks: 17
https://github.com/yassa9/qwen600
GitHub
  
  GitHub - yassa9/qwen600: Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
  Static suckless single batch CUDA-only qwen3-0.6B mini inference engine - yassa9/qwen600
❤1
  