#python #agent #ai #anthropic #claude_code #compression #context_engineering #context_window #cursor #fastapi #langchain #llm #mcp #openai #prompt_engineering #proxy #python #rag #token_optimization #tokens #typescript
Headroom is a local tool for AI agents that shrinks prompts, logs, files, and chat history before sending them to an LLM, often cutting tokens by 60–95% while keeping the same answer quality. It can work as a library, proxy, MCP server, or agent wrapper, so you can save tokens, speed up workflows, and still recover the original content when needed.
https://github.com/chopratejas/headroom
Headroom is a local tool for AI agents that shrinks prompts, logs, files, and chat history before sending them to an LLM, often cutting tokens by 60–95% while keeping the same answer quality. It can work as a library, proxy, MCP server, or agent wrapper, so you can save tokens, speed up workflows, and still recover the original content when needed.
https://github.com/chopratejas/headroom
GitHub
GitHub - chopratejas/headroom: Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens…
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server. - chopratejas/headroom