opencv/dldt
Deep Learning Deployment Toolkit
Language: C++
#deep_learning #inference #inference_engine #performance
Stars: 97 Issues: 2 Forks: 13
https://github.com/opencv/dldt
  
GitHub - openvinotoolkit/openvino: OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
  Tencent/Forward
A library for high-performance deep learning inference on NVIDIA GPUs.
Language: C++
#cuda #deep_learning #forward #gpu #inference #inference_engine #keras #neural_network #pytorch #tensorflow #tensorrt
Stars: 102 Issues: 0 Forks: 8
https://github.com/Tencent/Forward
  
  chengzeyi/stable-fast
An ultra-lightweight inference performance optimization library for HuggingFace Diffusers on NVIDIA GPUs.
Language: Python
#cuda #deep_learning #deeplearning #diffusers #inference #inference_engine #performance_optimization #pytorch #stable_diffusion #triton
Stars: 134 Issues: 3 Forks: 5
https://github.com/chengzeyi/stable-fast
  
  zhihu/ZhiLight
A highly optimized inference acceleration engine for Llama and its variants.
Language: C++
#cpm #cuda #gpt #inference_engine #llama #llm #llm_serving #minicpm #pytorch #qwen
Stars: 192 Issues: 1 Forks: 16
https://github.com/zhihu/ZhiLight
  
  