OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language: Python
#chinese #computer_vision #multi_modal_learning #nlp #pytorch #vision_and_language_pre_training
Stars: 80 Issues: 0 Forks: 7
https://github.com/OFA-Sys/Chinese-CLIP
  
  MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language: Python
#multi_object_tracking_segmentation #multiple_object_tracking #object_tracking #single_object_tracking #video_object_segmentation
Stars: 132 Issues: 1 Forks: 5
https://github.com/MasterBin-IIAU/Unicorn
  
  kubewharf/kubezoo
A lightweight Kubernetes multi-tenancy gateway
Language: Go
#kubernetes #multi_tenancy #serverless
Stars: 136 Issues: 1 Forks: 15
https://github.com/kubewharf/kubezoo
  
  jfversluis/learn-dotnet-maui
A repository filled with resources available to you to start learning or deepen your knowledge about .NET MAUI
#cross_platform #dotnet_for_android #dotnet_for_ios #dotnet_maui #maui #multi_platform_app_ui #xamarin #xamarin_forms
Stars: 135 Issues: 0 Forks: 8
https://github.com/jfversluis/learn-dotnet-maui
  
  NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
  
  kyegomez/Sophia
Effortless plug-and-play optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
Language: Python
#artificial_intelligence #chatgpt #deep_learning #multi_modality #neural_network #optimizer
Stars: 229 Issues: 11 Forks: 16
https://github.com/kyegomez/Sophia
  
  netease-youdao/EmotiVoice
EmotiVoice: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
  
  ixartz/SaaS-Boilerplate
SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing
Language: TypeScript
#authentication #boilerplate #multi_tenancy #nextjs #react #reactjs #saas #saas_app #saas_application #saas_boilerplate #saas_kit #shadcn_ui #stack #starter #starter_kit #starter_project #starter_template #template #template_project #typescript
Stars: 634 Issues: 0 Forks: 75
https://github.com/ixartz/SaaS-Boilerplate
  
  InternLM/MindSearch
An LLM-based Multi-agent Framework of Web Search Engine similar to Perplexity.ai Pro and SearchGPT
Language: Python
#ai_search_engine #gpt #llm #llms #multi_agent_systems #perplexity_ai #search #searchgpt #transformer #web_search
Stars: 792 Issues: 9 Forks: 60
https://github.com/InternLM/MindSearch
  
  cvg/depthsplat
[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
Language: Python
#feed_forward_gaussian_splatting #monocular_depth #multi_view_stereo #view_synthesis
Stars: 318 Issues: 8 Forks: 9
https://github.com/cvg/depthsplat

  HKUDS/VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Language: Python
#large_language_models #llms #long_video_understanding #multi_modal_llms #rag #retrieval_augmented_generation
Stars: 201 Issues: 1 Forks: 14
https://github.com/HKUDS/VideoRAG

  therealoliver/Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Language: Jupyter Notebook
#attention #attention_mechanism #gpt #inference #kv_cache #language_model #llama #llm_configuration #llms #mask #multi_head_attention #positional_encoding #residuals #rms #rms_norm #rope #rotary_position_encoding #swiglu #tokenizer #transformer
Stars: 388 Issues: 0 Forks: 28
https://github.com/therealoliver/Deepdive-llama3-from-scratch
  
  ibelick/zola
Zola is a free, open-source AI chat app with multi-model support.
Language: TypeScript
#ai #chat #multi_model #nextjs #open_source #prompt_kit #shadcn_ui #supabase #typescript
Stars: 262 Issues: 3 Forks: 41
https://github.com/ibelick/zola
  
  ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program
  
  bytedance/deer-flow
DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Language: TypeScript
#agent #agentic #agentic_framework #agentic_workflow #ai #ai_agents #bytedance #deep_research #langchain #langgraph #langmanus #llm #multi_agent #nodejs #podcast #python #typescript
Stars: 661 Issues: 4 Forks: 59
https://github.com/bytedance/deer-flow
  
  strands-agents/sdk-python
A model-driven approach to building AI agents in just a few lines of code.
Language: Python
#agentic #agentic_ai #agents #ai #anthropic #autonomous_agents #genai #litellm #llm #machine_learning #mcp #multi_agent_systems #ollama #opentelemetry #python
Stars: 217 Issues: 9 Forks: 23
https://github.com/strands-agents/sdk-python
  
  NirDiamant/agents-towards-production
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for real-world launches.
Language: Jupyter Notebook
#agent #agent_framework #agents #ai_agents #genai #generative_ai #llm #llms #mlops #multi_agent #production #tool_integration #tutorials
Stars: 1422 Issues: 0 Forks: 141
https://github.com/NirDiamant/agents-towards-production
  
  NVlabs/Long-RL
Long-RL: Scaling RL to Long Sequences
Language: Python
#efficient_ai #large_language_models #long_sequence #multi_modality #reinforcement_learning #sequence_parallelism
Stars: 301 Issues: 2 Forks: 3
https://github.com/NVlabs/Long-RL
  
  office-sec/AionUi
Free, local, open-source GUI app for Gemini CLI: Enhance Chat Experience, Multi-tasking, Code Diff View, File & Project Management, and more | Star if you like it!
Language: TypeScript
#ai #ai_agent #gemini #gemini_ai #gemini_cli #gemini_pro #gui #gui_application #ide #llm #llm_code #multi_agent #nodejs #react #typescript
Stars: 375 Issues: 2 Forks: 30
https://github.com/office-sec/AionUi
  
  showlab/Code2Video
Video generation via code
Language: Python
#coding #multi_agent #video_generation
Stars: 256 Issues: 0 Forks: 31
https://github.com/showlab/Code2Video
  
  