For Developers
YAC
A Survey on Efficient Inference for Large Language Models
https://arxiv.org/pdf/2404.14294
#vLLM #vs #deepspeed #overview #survey #inference #optimization