AI & ML Papers
32.8K subscribers
7.05K photos
519 videos
24 files
7.71K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
AI & ML Papers
Photo
🔥 Recursive Language Models

💡 The paper introduces Recursive Language Models, a novel approach to enable large language models to process arbitrarily long prompts. The problem addressed is that current language models have limited context windows, which restricts their ability to handle long inputs. The proposed method treats long prompts as part of an external environment and allows the language model to programmatically examine, decompose, and recursively call itself over snippets of the prompt. This approach enables the model to handle inputs that are up to two orders of magnitude beyond the model context window. The results show that Recursive Language Models successfully handle long inputs and outperform base language models and common long-context scaffolds across four diverse long-context tasks, while having comparable or cheaper cost per query. Overall, the paper contributes a general inference strategy that improves the ability of large language models to process long prompts, making them more effective and efficient.


📅 Published on Dec 31, 2025

🔗 Links:
• arXiv: https://arxiv.org/abs/2512.24601
• PDF: https://arxiv.org/pdf/2512.24601
• Project Page: https://alexzhang13.github.io/blog/2025/rlm/
• GitHub: https://github.com/alexzhang13/rlm 4.2k

🤖 Models citing this paper:
https://huggingface.co/mit-oasys/rlm-qwen3-8b-v0.1
https://huggingface.co/nightmedia/Qwen3.5-9B-Claude-4.6-Opus-Deckard-V4.2-Uncensored-Heretic-Thinking-qx86-hi-mlx

🚀 Spaces citing this paper:
https://huggingface.co/spaces/sergiopaniego/repl
https://huggingface.co/spaces/openenv/repl
https://huggingface.co/spaces/sergiopaniego/repl-env

━━━━━━━━━━━━━━━━━━━━━━━━
📢 By: https://xn--r1a.website/PaperNexus

#RecursiveLanguageModels #LargeLanguageModels #LongContextProcessing #LanguageModelArchitectures #NaturalLanguageProcessing
3