QwenLM/ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
Language: Python
#large_language_models #llm #machine_learning #scaling_law
Stars: 222 Issues: 1 Forks: 9
https://github.com/QwenLM/ParScale
  
  Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
Language: Python
#large_language_models #llm #machine_learning #scaling_law
Stars: 222 Issues: 1 Forks: 9
https://github.com/QwenLM/ParScale
GitHub
  
  GitHub - QwenLM/ParScale: Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
  Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling - QwenLM/ParScale
  codelion/openevolve
Open-source implementation of AlphaEvolve
Language: Python
#alpha_evolve #alphacode #alphaevolve #coding_agent #deepmind #deepmind_lab #discovery #distributed_evolutionary_algorithms #evolutionary_algorithms #evolutionary_computation #genetic_algorithm #genetic_algorithms #iterative_methods #iterative_refinement #llm_engineering #llm_ensemble #llm_inference #openevolve #optimize
Stars: 312 Issues: 1 Forks: 26
https://github.com/codelion/openevolve
  
  Open-source implementation of AlphaEvolve
Language: Python
#alpha_evolve #alphacode #alphaevolve #coding_agent #deepmind #deepmind_lab #discovery #distributed_evolutionary_algorithms #evolutionary_algorithms #evolutionary_computation #genetic_algorithm #genetic_algorithms #iterative_methods #iterative_refinement #llm_engineering #llm_ensemble #llm_inference #openevolve #optimize
Stars: 312 Issues: 1 Forks: 26
https://github.com/codelion/openevolve
GitHub
  
  GitHub - codelion/openevolve: Open-source implementation of AlphaEvolve
  Open-source implementation of AlphaEvolve. Contribute to codelion/openevolve development by creating an account on GitHub.
  Gen-Verse/MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
Language: Python
#diffusion_models #llm_reasoning #unified_multimodal_understanding_and_generation
Stars: 494 Issues: 4 Forks: 13
https://github.com/Gen-Verse/MMaDA
  
  MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
Language: Python
#diffusion_models #llm_reasoning #unified_multimodal_understanding_and_generation
Stars: 494 Issues: 4 Forks: 13
https://github.com/Gen-Verse/MMaDA
GitHub
  
  GitHub - Gen-Verse/MMaDA: [NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
  [NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models - Gen-Verse/MMaDA
  Olow304/memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid
  
  Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid
GitHub
  
  GitHub - Olow304/memvid: Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic…
  Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. - Olow304/memvid
  hexdocom/lemonai
The world's first Full-Stack Open-Source General AI Agent
Language: JavaScript
#agent #agentic_ai #ai #desktop #fullstack #javascript #llm #nodejs #vue3
Stars: 183 Issues: 4 Forks: 11
https://github.com/hexdocom/lemonai
  
  The world's first Full-Stack Open-Source General AI Agent
Language: JavaScript
#agent #agentic_ai #ai #desktop #fullstack #javascript #llm #nodejs #vue3
Stars: 183 Issues: 4 Forks: 11
https://github.com/hexdocom/lemonai
GitHub
  
  GitHub - hexdocom/lemonai: Lemon AI is the first Full-stack, Open-source, Agentic AI framework, offering a fully local alternative…
  Lemon AI is the first Full-stack, Open-source, Agentic AI framework, offering a fully local alternative to platforms like Manus & Genspark AI. It features an integrated Code Interpreter VM ...
  mendableai/firesearch
Language: TypeScript
#firecrawl #langchain #langgraph #llm #research
Stars: 216 Issues: 2 Forks: 35
https://github.com/mendableai/firesearch
  
  Language: TypeScript
#firecrawl #langchain #langgraph #llm #research
Stars: 216 Issues: 2 Forks: 35
https://github.com/mendableai/firesearch
GitHub
  
  GitHub - firecrawl/firesearch: 🔥 AI-powered deep research tool that breaks down complex queries, validates answers, and provides…
  🔥 AI-powered deep research tool that breaks down complex queries, validates answers, and provides cited comprehensive results using Firecrawl and LangGraph - firecrawl/firesearch
  JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
  
  A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
GitHub
  
  GitHub - JAMESYJL/ShapeLLM-Omni: [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding
  [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding - JAMESYJL/ShapeLLM-Omni
  MiniMax-AI/MiniMax-M1
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
Language: Python
#large_language_models #llm #minimax_m1 #reasoning_models
Stars: 328 Issues: 3 Forks: 9
https://github.com/MiniMax-AI/MiniMax-M1
  
  MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
Language: Python
#large_language_models #llm #minimax_m1 #reasoning_models
Stars: 328 Issues: 3 Forks: 9
https://github.com/MiniMax-AI/MiniMax-M1
GitHub
  
  GitHub - MiniMax-AI/MiniMax-M1: MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
  MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. - MiniMax-AI/MiniMax-M1
❤2
  NirDiamant/agents-towards-production
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for real-world launches.
Language: Jupyter Notebook
#agent #agent_framework #agents #ai_agents #genai #generative_ai #llm #llms #mlops #multi_agent #production #tool_integration #tutorials
Stars: 1422 Issues: 0 Forks: 141
https://github.com/NirDiamant/agents-towards-production
  
  This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for real-world launches.
Language: Jupyter Notebook
#agent #agent_framework #agents #ai_agents #genai #generative_ai #llm #llms #mlops #multi_agent #production #tool_integration #tutorials
Stars: 1422 Issues: 0 Forks: 141
https://github.com/NirDiamant/agents-towards-production
GitHub
  
  GitHub - NirDiamant/agents-towards-production: This repository delivers end-to-end, code-first tutorials covering every layer of…
  This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re...
❤3
  getAsterisk/claudia
A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
Language: TypeScript
#anthropic #anthropic_claude #claude #claude_4 #claude_4_opus #claude_4_sonnet #claude_ai #claude_code #claude_code_sdk #cursor #ide #llm #llm_code #rust #tauri
Stars: 859 Issues: 14 Forks: 61
https://github.com/getAsterisk/claudia
  
  A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
Language: TypeScript
#anthropic #anthropic_claude #claude #claude_4 #claude_4_opus #claude_4_sonnet #claude_ai #claude_code #claude_code_sdk #cursor #ide #llm #llm_code #rust #tauri
Stars: 859 Issues: 14 Forks: 61
https://github.com/getAsterisk/claudia
GitHub
  
  GitHub - winfunc/opcode: A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code…
  A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more. - winfunc/opcode
  MemTensor/MemOS
MemOS (Preview) | Intelligence Begins with Memory
Language: Python
#agent #kv_cache #language_model #llm #llm_memory #long_term_memory #lora #memcube #memory #memory_management #memory_operating_system #memory_retrieval #memory_scheduling #memos #neo4j #rag #retrieval_augmented_generation #tree
Stars: 512 Issues: 14 Forks: 35
https://github.com/MemTensor/MemOS
  
  MemOS (Preview) | Intelligence Begins with Memory
Language: Python
#agent #kv_cache #language_model #llm #llm_memory #long_term_memory #lora #memcube #memory #memory_management #memory_operating_system #memory_retrieval #memory_scheduling #memos #neo4j #rag #retrieval_augmented_generation #tree
Stars: 512 Issues: 14 Forks: 35
https://github.com/MemTensor/MemOS
GitHub
  
  GitHub - MemTensor/MemOS: MemOS (Preview) | Intelligence Begins with Memory
  MemOS (Preview) | Intelligence Begins with Memory. Contribute to MemTensor/MemOS development by creating an account on GitHub.
  jtang613/GhidrAssistMCP
An MCP extension for Ghidra
Language: Java
#ghidra #ghidra_extension #ghidra_plugin #llm #mcp #mcp_server #reverse_engineering
Stars: 228 Issues: 0 Forks: 9
https://github.com/jtang613/GhidrAssistMCP
  
  An MCP extension for Ghidra
Language: Java
#ghidra #ghidra_extension #ghidra_plugin #llm #mcp #mcp_server #reverse_engineering
Stars: 228 Issues: 0 Forks: 9
https://github.com/jtang613/GhidrAssistMCP
GitHub
  
  GitHub - jtang613/GhidrAssistMCP: An MCP extension for Ghidra
  An MCP extension for Ghidra. Contribute to jtang613/GhidrAssistMCP development by creating an account on GitHub.
❤2
  bentoml/llm-inference-in-production
Everything you need to know about LLM inference
Language: TypeScript
#llm #llm_inference
Stars: 154 Issues: 3 Forks: 8
https://github.com/bentoml/llm-inference-in-production
  
  Everything you need to know about LLM inference
Language: TypeScript
#llm #llm_inference
Stars: 154 Issues: 3 Forks: 8
https://github.com/bentoml/llm-inference-in-production
GitHub
  
  GitHub - bentoml/llm-inference-handbook: Everything you need to know about LLM inference
  Everything you need to know about LLM inference. Contribute to bentoml/llm-inference-handbook development by creating an account on GitHub.
❤1
  scottvr/wtffmpeg
a toy that has a local llm spit out ffmpeg commands from natural language prompts on the command-line
Language: Python
#ffmpeg #llm
Stars: 250 Issues: 0 Forks: 4
https://github.com/scottvr/wtffmpeg
  
  a toy that has a local llm spit out ffmpeg commands from natural language prompts on the command-line
Language: Python
#ffmpeg #llm
Stars: 250 Issues: 0 Forks: 4
https://github.com/scottvr/wtffmpeg
GitHub
  
  GitHub - scottvr/wtffmpeg: a toy that has a local llm spit out ffmpeg commands from natural language prompts on the command-line
  a toy that has a local llm spit out ffmpeg commands from natural language prompts on the command-line - scottvr/wtffmpeg
  NU-QRG/optiml
Acceleration library for LLM agents.
Language: C++
#llama #llm
Stars: 198 Issues: 7 Forks: 44
https://github.com/NU-QRG/optiml
  Acceleration library for LLM agents.
Language: C++
#llama #llm
Stars: 198 Issues: 7 Forks: 44
https://github.com/NU-QRG/optiml
office-sec/AionUi
Free, local, open-source GUI app for Gemini CLI — Enhance Chat Experience, Multi-tasking, Code Diff View, File & Project Management, and more | 🌟 Star if you like it!
Language: TypeScript
#ai #ai_agent #gemini #gemini_ai #gemini_cli #gemini_pro #gui #gui_application #ide #llm #llm_code #multi_agent #nodejs #react #typescript
Stars: 375 Issues: 2 Forks: 30
https://github.com/office-sec/AionUi
  
  Free, local, open-source GUI app for Gemini CLI — Enhance Chat Experience, Multi-tasking, Code Diff View, File & Project Management, and more | 🌟 Star if you like it!
Language: TypeScript
#ai #ai_agent #gemini #gemini_ai #gemini_cli #gemini_pro #gui #gui_application #ide #llm #llm_code #multi_agent #nodejs #react #typescript
Stars: 375 Issues: 2 Forks: 30
https://github.com/office-sec/AionUi
GitHub
  
  GitHub - iOfficeAI/AionUi: Free, local, open-source GUI app for Gemini CLI — Better Chat UI,  Multi-agent, Multi-LLMs & apikey…
  Free, local, open-source GUI app for Gemini CLI — Better Chat UI,  Multi-agent, Multi-LLMs & apikey polling, Workspace, MCP, Remote WebUi Mode & more | 🌟 Star if you like it! - iOff...
  instructa/browser-echo
⚡ Stream browser logs to terminal, zero setup, perfect for Ai Agents
Language: TypeScript
#ai #browser #claude_code #codex_cli #cursor #gemini_cli #llm #log #logging
Stars: 193 Issues: 2 Forks: 7
https://github.com/instructa/browser-echo
  
  ⚡ Stream browser logs to terminal, zero setup, perfect for Ai Agents
Language: TypeScript
#ai #browser #claude_code #codex_cli #cursor #gemini_cli #llm #log #logging
Stars: 193 Issues: 2 Forks: 7
https://github.com/instructa/browser-echo
GitHub
  
  GitHub - instructa/browser-echo: ⚡ Stream browser logs to terminal, zero setup, perfect for Ai Agents
  ⚡ Stream browser logs to terminal, zero setup, perfect for Ai Agents - instructa/browser-echo
❤2
  Chen-zexi/vllm-cli
A command-line interface tool for serving LLM using vLLM.
Language: Python
#llm #llm_tools #vllm
Stars: 243 Issues: 1 Forks: 5
https://github.com/Chen-zexi/vllm-cli
  
  A command-line interface tool for serving LLM using vLLM.
Language: Python
#llm #llm_tools #vllm
Stars: 243 Issues: 1 Forks: 5
https://github.com/Chen-zexi/vllm-cli
GitHub
  
  GitHub - Chen-zexi/vllm-cli: A command-line interface tool for serving LLM using vLLM.
  A command-line interface tool for serving LLM using vLLM. - Chen-zexi/vllm-cli
  vakovalskii/sgr-deep-research
Schema-Guided Reasoning (SGR) is a technique that guides large language models (LLMs) to produce structured, clear, and predictable outputs by enforcing reasoning through predefined steps. By creating a specific schema (or structured template), you explicitly define:
Language: Python
#agent #llm #sgr #so #structured_output
Stars: 173 Issues: 0 Forks: 35
https://github.com/vakovalskii/sgr-deep-research
  
  Schema-Guided Reasoning (SGR) is a technique that guides large language models (LLMs) to produce structured, clear, and predictable outputs by enforcing reasoning through predefined steps. By creating a specific schema (or structured template), you explicitly define:
Language: Python
#agent #llm #sgr #so #structured_output
Stars: 173 Issues: 0 Forks: 35
https://github.com/vakovalskii/sgr-deep-research
GitHub
  
  GitHub - vamplabAI/sgr-deep-research: redmadrobot.ai | Hybrid Schema-Guided Reasoning (SGR) has agentic system design create by…
  redmadrobot.ai | Hybrid Schema-Guided Reasoning (SGR) has agentic system design create by neuraldeep community - vamplabAI/sgr-deep-research
  yassa9/qwen600
Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
Language: Cuda
#cuda #cuda_programming #gpu #llamacpp #llm #llm_inference #qwen #qwen3 #transformer
Stars: 287 Issues: 1 Forks: 17
https://github.com/yassa9/qwen600
  
  Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
Language: Cuda
#cuda #cuda_programming #gpu #llamacpp #llm #llm_inference #qwen #qwen3 #transformer
Stars: 287 Issues: 1 Forks: 17
https://github.com/yassa9/qwen600
GitHub
  
  GitHub - yassa9/qwen600: Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
  Static suckless single batch CUDA-only qwen3-0.6B mini inference engine - yassa9/qwen600
❤1
  