yassa9/qwen600
Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
Language: Cuda
#cuda #cuda_programming #gpu #llamacpp #llm #llm_inference #qwen #qwen3 #transformer
Stars: 287 Issues: 1 Forks: 17
https://github.com/yassa9/qwen600
RunanywhereAI/RCLI
Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG
Language: C++
#ai_assistant #apple_silicon #kitten_tts #kokoro_tts #lfm2 #llama_cpp #llm #local_ai #metal #on_device_ai #parakeet #qwen3 #rag #speech_to_text #text_to_speech #tool_calling #voice_assistant
Stars: 627 Issues: 10 Forks: 19
https://github.com/RunanywhereAI/RCLI