AI & ML Papers

✨Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

📝 Summary:
This study optimizes small language models for real-device latency by identifying key architectural factors and efficient operators. It introduces Nemotron-Flash, a new family of hybrid SLMs that significantly improves accuracy, latency, and throughput compared to current models.

🔹 Publication Date: Published on Nov 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2511.18890
• PDF: https://arxiv.org/pdf/2511.18890

🔹 Models citing this paper:
• https://huggingface.co/nvidia/Nemotron-Flash-3B-Instruct
• https://huggingface.co/nvidia/Nemotron-Flash-1B
• https://huggingface.co/nvidia/Nemotron-Flash-3B

==================================

For more data science resources:
✓ https://xn--r1a.website/DataScienceT

#SmallLanguageModels #LatencyOptimization #AI #DeepLearning #NLP

❤1

405 views22:08

✨ Explore Data Science 📝 Write your paper

AI & ML Papers

✨Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

📝 Summary:
Nanbeige4.1-3B is a 3B-parameter model excelling in agentic behavior, code generation, and reasoning. It outperforms larger models through advanced reward modeling and training, demonstrating broad competence for a small language model.

🔹 Publication Date: Published on Feb 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13367
• PDF: https://arxiv.org/pdf/2602.13367
• Project Page: https://huggingface.co/Nanbeige/Nanbeige4.1-3B

🔹 Models citing this paper:
• https://huggingface.co/Nanbeige/Nanbeige4.1-3B

✨ Spaces citing this paper:
• https://huggingface.co/spaces/PioTio/AIMan

==================================

For more data science resources:
✓ https://xn--r1a.website/DataScienceT

#LLM #AI #SmallLanguageModels #AgenticAI #CodeGeneration

❤1

283 views11:05

✨ Explore Data Science 📝 Write your paper

AI & ML Papers

Photo

🔥 VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

💡 The paper introduces VibeThinker-3B, a compact language model with 3 billion parameters, that achieves state-of-the-art performance on verifiable reasoning tasks, challenging the conventional assumption that large models are necessary for such tasks. The model was developed using a specialized training pipeline that includes curriculum-based supervised fine-tuning, multi-domain reinforcement learning, and offline self-distillation. The model was evaluated on several highly demanding verifiable tasks and achieved impressive results, including a score of 94.3 on AIME26, 80.2 Pass@1 on LiveCodeBench v6, and a 96.1 percent acceptance rate on recent unseen LeetCode contests. These results place VibeThinker-3B in the performance band of first-tier reasoning systems, matching or exceeding the performance of much larger models. The paper also shows that the model's performance does not compromise its instruction controllability, with a score of 93.4 on IFEval. The results of this study support the Parametric Compression-Coverage Hypothesis, which suggests that verifiable reasoning can be compressed into compact reasoning cores, while open-domain knowledge and general-purpose competence require larger models with broader parameter coverage. Overall, the paper demonstrates that compact models can be a complementary path to achieving frontier-level performance on verifiable reasoning tasks, and that they are not just efficient substitutes for larger models.

📅 Published on Jun 15

🔗 Links:
• GitHub: https://github.com/huggingface
• arXiv: https://arxiv.org/abs/2606.16140
• PDF: https://arxiv.org/pdf/2606.16140
• Project Page: https://github.com/WeiboAI/VibeThinker

🤖 Models citing this paper:
• https://huggingface.co/WeiboAI/VibeThinker-3B
• https://huggingface.co/KakTakOne/VibeThinker-3B-GGUF
• https://huggingface.co/ffkbblu/pepekberbulu

🚀 Spaces citing this paper:
• https://huggingface.co/spaces/Mike0021/vibethinker-3b-zerogpu
• https://huggingface.co/spaces/ffkbblu/trst

━━━━━━━━━━━━━━━━━━━━━━━━
📢 By: https://xn--r1a.website/PaperNexus

#VerifiableReasoning #SmallLanguageModels #CompactModelArchitecture #ReinforcementLearningForNLP #EfficientLanguageModeling

GitHub

Hugging Face

The AI community building the future. Hugging Face has 438 repositories available. Follow their code on GitHub.

374 views18:21

✨ Join Best TG Channels

👋 Join Our WhatsApp Channel

📝 Contact / Collaborate

About

Blog

Apps

Platform