For Developers
210 subscribers
65 photos
3 videos
1.01K files
998 links
YAC
Download Telegram
#llm #gpt #cost #best_practice #RAG

ROUTELLM: LEARNING TO ROUTE LLMS WITH
PREFERENCE DATA
https://arxiv.org/pdf/2406.18665

Searching for Best Practices in Retrieval-Augmented
Generation
https://arxiv.org/pdf/2407.01219
A Survey on Efficient Inference for Large
Language Models
https://arxiv.org/pdf/2404.14294
#vLLM #vs #deepspeed #overview #survey #inference #optimization