Medium / Medium.com – Telegram

Medium / Medium.com

1.25K subscribers

106K links

Just main page of medium.com fresh from the oven

Download Telegram

About

Blog

Apps

Platform

Medium / Medium.com

1.25K subscribers

Medium / Medium.com

Orca, the World’s Largest Carbon Processing Facility and Iceland's Carbon Removal Project

#iceland #orca #carbonemissions #carbonremoval #greenenergy #greentech #greenproducts #howtolowercarbonfootprint

https://hackernoon.com/orca-the-worlds-largest-carbon-processing-facility-and-icelands-carbon-removal-project

Orca, the World’s Largest Carbon Processing Facility and Iceland's Carbon Removal Project | HackerNoon

Iceland has the world’s largest carbon processing plant, Orca, which can turn 4,000 metric tons of carbon dioxide into stone annually.

12 views11:15

Medium / Medium.com

Evaluating vLLM With Basic Sampling

#llms #vllm #vllmevaluation #basicsampling #whatisbasicsampling #sharegpt #alpacadataset #orca

https://hackernoon.com/evaluating-vllm-with-basic-sampling

Evaluating vLLM With Basic Sampling

We evaluate the performance of vLLM with basic sampling (one sample per request) on three models and two datasets.

26 views17:30

Medium / Medium.com

How Good Is PagedAttention at Memory Sharing?

#llms #pagedattention #memorysharing #parallelsampling #beamsharing #parallelsequences #orca #orcabaselines

https://hackernoon.com/how-good-is-pagedattention-at-memory-sharing

How Good Is PagedAttention at Memory Sharing?

We evaluate the effectiveness of memory sharing in PagedAttention with two popular sampling methods: parallel sampling and beam search.

24 views01:01

Medium / Medium.com

How We Implemented a Chatbot Into Our LLM

#llms #vllm #orca #sharegpt #opt13b #pagedattention #chatbots #chatbotimplementation

https://hackernoon.com/how-we-implemented-a-chatbot-into-our-llm

How We Implemented a Chatbot Into Our LLM

To implement a chatbot, we let the model generate a response by concatenating the chatting history and the last user query into a prompt.

37 views17:45

Medium / Medium.com

How Effective is vLLM When a Prefix Is Thrown Into the Mix?

#llms #vllm #prefix #vllmeffectiveness #llama13b #orca #multilingualllm #woosukkwon

https://hackernoon.com/how-effective-is-vllm-when-a-prefix-is-thrown-into-the-mix

How Effective is vLLM When a Prefix Is Thrown Into the Mix?

We explore the effectiveness of vLLM for the case a prefix is shared among different input prompts

39 views18:15

Medium / Medium.com

General Model Serving Systems and Memory Optimizations Explained

#llms #vllm #generalmodelserving #memoryoptimization #orca #transformers #alpaserve #gpukernel

https://hackernoon.com/general-model-serving-systems-and-memory-optimizations-explained

General Model Serving Systems and Memory Optimizations Explained

Model serving has been an active area of research in recent years, with numerous systems proposed to tackle diverse aspects of deep learning model deployment.

43 views00:31