LLaVA-Phi: The Training We Put It Through
#llms #llavaphi #clipvitl #llava15 #phi2 #supervisedfinetuning #sharegpt #trainingllavaphi
https://hackernoon.com/llava-phi-the-training-we-put-it-through
Our overall network architecture is similar to LLaVA-1.5. We use the pre-trained CLIP ViT-L/14 with an input resolution of 336x336.
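As a rough sketch of this design, the snippet below wires a LLaVA-1.5-style projector between the frozen CLIP features and the language model's embedding space. The two-layer MLP shape mirrors LLaVA-1.5, and the dimensions (1024 for CLIP ViT-L/14, 2560 for Phi-2) follow those models' public configs; treat this as an illustration, not LLaVA-Phi's actual code.

```python
import torch
import torch.nn as nn

class VisionLanguageConnector(nn.Module):
    """Sketch of a LLaVA-1.5-style projector: CLIP patch features -> LLM embeddings."""
    def __init__(self, vision_dim=1024, llm_dim=2560):
        # 1024 = CLIP ViT-L/14 feature width; 2560 = Phi-2 hidden size.
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(vision_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, patch_features):
        # patch_features: (batch, num_patches, vision_dim) from the frozen CLIP encoder.
        # A 336x336 input at patch size 14 yields 24*24 = 576 patch tokens.
        return self.proj(patch_features)  # (batch, 576, llm_dim)
```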
The Distributed Execution of vLLM
#llms #vllm #megatronlm #memorymanager #spmd #modelparallel #kvcachemanager #kvcache
https://hackernoon.com/the-distributed-execution-of-vllm
vLLM is effective in distributed settings because it supports the widely used Megatron-LM-style tensor model parallelism strategy for Transformers.
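The snippet below is a toy, single-process illustration of the Megatron-LM tensor-parallel scheme that teaser refers to: the first MLP weight is split column-wise and the second row-wise, so each "worker" computes independently and only one all-reduce (here, a plain sum) is needed on the output.

```python
import numpy as np

# Megatron-LM-style tensor parallelism for a Transformer MLP, simulated on one
# process with two "workers". A sketch of the idea, not vLLM code.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))      # (batch, hidden)
A = rng.standard_normal((8, 16))     # first MLP weight
B = rng.standard_normal((16, 8))     # second MLP weight

def gelu(h):
    return 0.5 * h * (1 + np.tanh(np.sqrt(2 / np.pi) * (h + 0.044715 * h**3)))

A1, A2 = np.split(A, 2, axis=1)      # column-parallel shards of A
B1, B2 = np.split(B, 2, axis=0)      # row-parallel shards of B

partial1 = gelu(x @ A1) @ B1         # computed independently on worker 1
partial2 = gelu(x @ A2) @ B2         # computed independently on worker 2
y = partial1 + partial2              # the all-reduce step

assert np.allclose(y, gelu(x @ A) @ B)   # matches the unsharded computation
```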
How vLLM Prioritizes a Subset of Requests
#llms #vllm #pagedattention #gpumemory #cpuram #woosukkwon #zhuohanli #siyuanzhuang
https://hackernoon.com/how-vllm-prioritizes-a-subset-of-requests
In vLLM, we adopt the first-come-first-serve (FCFS) scheduling policy for all requests, ensuring fairness and preventing starvation.
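A toy sketch of that policy (not vLLM's actual scheduler): requests are admitted strictly in arrival order, and under memory pressure the most recently admitted requests are preempted first, so the oldest request never starves. Capacity is modeled as an abstract free-block counter.

```python
from collections import deque

class FCFSScheduler:
    def __init__(self, total_blocks: int):
        self.waiting = deque()           # (req_id, blocks_needed), arrival order
        self.running = []                # admitted requests, oldest first
        self.free_blocks = total_blocks

    def submit(self, req_id, blocks_needed):
        self.waiting.append((req_id, blocks_needed))

    def schedule(self):
        # Admit strictly in arrival order; the head of the queue is never
        # skipped in favor of a smaller, later request (no starvation).
        while self.waiting and self.waiting[0][1] <= self.free_blocks:
            req = self.waiting.popleft()
            self.free_blocks -= req[1]
            self.running.append(req)
        return [req_id for req_id, _ in self.running]

    def preempt_one(self):
        # Under memory pressure, evict the newest admitted request and put it
        # back at the front of the queue so it resumes before later arrivals.
        if self.running:
            req = self.running.pop()
            self.free_blocks += req[1]
            self.waiting.appendleft(req)
```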
LLaVA-Phi: Related Work to Get You Caught Up
#llms #gemini #gemininano #llavaphi #mobilevlm #blipfamily #llavafamily #mideagroup
https://hackernoon.com/llava-phi-related-work-to-get-you-caught-up
The rapid advancements in Large Language Models (LLMs) have significantly propelled the development of vision-language models based on LLMs.
How vLLM Can Be Applied to Other Decoding Scenarios
#llms #vllm #vllmapplications #decodingalgorithm #llmapplications #parallelsampling #osvirtualmemory #machinetranslation
https://hackernoon.com/how-vllm-can-be-applied-to-other-decoding-scenarios
In this section, we show the general applicability of vLLM to other decoding scenarios.
Evaluating vLLM With Basic Sampling
#llms #vllm #vllmevaluation #basicsampling #whatisbasicsampling #sharegpt #alpacadataset #orca
https://hackernoon.com/evaluating-vllm-with-basic-sampling
We evaluate the performance of vLLM with basic sampling (one sample per request) on three models and two datasets.
Evaluating the Performance of vLLM: How Did It Do?
#llms #vllm #vllmevaluation #opt #fastertransformer #sharegpt #alpaca #oracle
https://hackernoon.com/evaluating-the-performance-of-vllm-how-did-it-do
In this section, we evaluate the performance of vLLM under a variety of workloads.
LLaVA-Phi: Limitations and What You Can Expect in the Future
#llms #llavaphi #whatisllavaphi #llavaphilimitations #futureofllms #phi2 #llavaphiarchitecture #mideagroup
https://hackernoon.com/llava-phi-limitations-and-what-you-can-expect-in-the-future
We introduce LLaVA-Phi, a vision language assistant developed using the compact language model Phi-2.
LLaVA-Phi: Qualitative Results - Take a Look at Its Remarkable Generalization Capabilities
#llms #llavaphi #llava15 #scienceqa #whatisllavaphi #llavaphiqualitativeresults #mideagroup #visionlanguageassistant
https://hackernoon.com/llava-phi-qualitative-results-take-a-look-at-its-remarkable-generelization-capabilities
We present several examples that demonstrate the remarkable generalization capabilities of LLaVA-Phi, comparing its outputs with those of the LLaVA-1.5-13B model.
LLaVA-Phi: How We Rigorously Evaluated It Using an Extensive Array of Academic Benchmarks
#llms #llavaphi #whatisllavaphi #llavaphiexperiments #mobilevlm #mmbench #instructblip #vizwizqa
https://hackernoon.com/llava-phi-how-we-rigorously-evaluated-it-using-an-extensive-array-of-academic-benchmarks
We rigorously evaluated LLaVA-Phi using an extensive array of academic benchmarks specifically designed for multi-modal models.
Try Llama 3.1 8B in Your Browser: AQLM.rs Delivers AI at Your Fingertips
#llama318b #aqlm #llms #rust #runningllama3locally #webassembly #llmmodelquantization #pvtuning
https://hackernoon.com/try-llama-31-8b-in-your-browser-aqlmrs-delivers-al-at-your-fingertips
How Good Is PagedAttention at Memory Sharing?
#llms #pagedattention #memorysharing #parallelsampling #beamsharing #parallelsequences #orca #orcabaselines
https://hackernoon.com/how-good-is-pagedattention-at-memory-sharing
We evaluate the effectiveness of memory sharing in PagedAttention with two popular sampling methods: parallel sampling and beam search.
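For intuition, here is a toy model of the copy-on-write block sharing that entry refers to: parallel samples of one request map the same physical KV blocks via reference counts, and a block is duplicated only when a sharer writes to it. Block contents are elided; only the bookkeeping is shown.

```python
# Copy-on-write KV-block sharing for parallel sampling, as a toy sketch.
ref_count = {}            # physical block id -> number of sequences mapping it
next_block = 0

def alloc():
    global next_block
    ref_count[next_block] = 1
    next_block += 1
    return next_block - 1

def fork(table):
    # A new sample reuses the parent's blocks: no copying, just bump refcounts.
    for blk in table:
        ref_count[blk] += 1
    return list(table)

def write_last(table):
    # Copy-on-write: duplicate the last block only if someone else shares it.
    blk = table[-1]
    if ref_count[blk] > 1:
        ref_count[blk] -= 1
        table[-1] = alloc()    # private copy; KV data copy elided in this toy
    return table

seq_a = [alloc(), alloc()]     # sequence A's prompt KV fills blocks 0 and 1
seq_b = fork(seq_a)            # a second parallel sample shares both blocks
write_last(seq_b)              # B's first new token triggers one block copy
print(ref_count)               # {0: 2, 1: 1, 2: 1}
```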
How We Implemented a Chatbot Into Our LLM
#llms #vllm #orca #sharegpt #opt13b #pagedattention #chatbots #chatbotimplementation
https://hackernoon.com/how-we-implemented-a-chatbot-into-our-llm
To implement a chatbot, we let the model generate a response by concatenating the chatting history and the last user query into a prompt.
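A minimal sketch of that prompt construction; the role tags and template below are illustrative, since real deployments use model-specific chat templates.

```python
def build_chat_prompt(history, user_query, system_prompt="You are a helpful assistant."):
    """Concatenate the chat history and the latest user query into one prompt."""
    lines = [system_prompt]
    for role, text in history:            # history: [("user", ...), ("assistant", ...), ...]
        lines.append(f"{role.capitalize()}: {text}")
    lines.append(f"User: {user_query}")
    lines.append("Assistant:")            # the model completes from here
    return "\n".join(lines)

history = [("user", "What is PagedAttention?"),
           ("assistant", "An attention algorithm that pages the KV cache.")]
print(build_chat_prompt(history, "Why does that help throughput?"))
```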
How Effective is vLLM When a Prefix Is Thrown Into the Mix?
#llms #vllm #prefix #vllmeffectiveness #llama13b #orca #multilingualllm #woosukkwon
https://hackernoon.com/how-effective-is-vllm-when-a-prefix-is-thrown-into-the-mix
We explore the effectiveness of vLLM for the case where a prefix is shared among different input prompts.
PagedAttention and vLLM Explained: What Are They?
#llms #vllm #pagedattention #llmservingsystem #decodingalgorithm #attentionalgorithm #virtualmemory #copyonwrite
https://hackernoon.com/pagedattention-and-vllm-explained-what-are-they
This paper proposes PagedAttention, a new attention algorithm that allows attention keys and values to be stored in non-contiguous paged memory.
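To make the idea concrete, here is a toy paged KV cache: each sequence's logical blocks map through a block table to physical blocks that need not be contiguous, much as virtual-memory pages do. The block size of 4 tokens and the dict-based storage are illustrative only.

```python
# Toy paged KV cache in the spirit of PagedAttention.
BLOCK_SIZE = 4

physical_kv = {}                 # physical block id -> list of (key, value) pairs
block_table = {"seq0": []}       # per-sequence logical -> physical mapping
free_blocks = list(range(8))     # pool of physical block ids

def append_token(seq, kv_pair):
    table = block_table[seq]
    last = physical_kv.get(table[-1]) if table else None
    if last is None or len(last) == BLOCK_SIZE:      # current block is full:
        blk = free_blocks.pop()                      # grab any free block --
        table.append(blk)                            # the physical layout is
        physical_kv[blk] = []                        # non-contiguous
    physical_kv[table[-1]].append(kv_pair)

def gather_kv(seq):
    # Attention reads the sequence's KV by walking the block table in
    # logical order, regardless of where the blocks physically live.
    return [kv for blk in block_table[seq] for kv in physical_kv[blk]]

for t in range(10):
    append_token("seq0", (f"k{t}", f"v{t}"))
print(block_table["seq0"], len(gather_kv("seq0")))   # e.g. [7, 6, 5] 10
```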
General Model Serving Systems and Memory Optimizations Explained
#llms #vllm #generalmodelserving #memoryoptimization #orca #transformers #alpaserve #gpukernel
https://hackernoon.com/general-model-serving-systems-and-memory-optimizations-explained
Model serving has been an active area of research in recent years, with numerous systems proposed to tackle diverse aspects of deep learning model deployment.
Applying the Virtual Memory and Paging Technique: A Discussion
#llms #virtualmemory #pagingtechnique #kvcache #vllm #gpuworkload #gpukernels #gpumemory
https://hackernoon.com/applying-the-virtual-memory-and-paging-technique-a-discussion
The idea of virtual memory and paging is effective for managing the KV cache in LLM serving because the workload requires dynamic memory allocation.
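A back-of-the-envelope sketch of why: reserving contiguous max-length KV space per request wastes most of it when output lengths are unpredictable, while allocating fixed-size blocks on demand bounds the waste to less than one block per sequence. All numbers below are illustrative.

```python
# Contiguous max-length pre-allocation vs. on-demand block allocation.
MAX_LEN, BLOCK = 2048, 16
actual_lengths = [37, 512, 91, 1200, 5]           # how long each output really got

reserved = len(actual_lengths) * MAX_LEN          # contiguous pre-allocation
paged = sum(-(-n // BLOCK) * BLOCK for n in actual_lengths)  # ceil to block size

print(f"pre-allocated tokens: {reserved}")        # 10240
print(f"paged tokens:         {paged}")           # 1872
print(f"utilization: {sum(actual_lengths)/reserved:.1%} "
      f"vs {sum(actual_lengths)/paged:.1%}")      # ~18.0% vs ~98.6%
```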
Evaluating vLLM's Design Choices With Ablation Experiments
#llms #vllm #evaluatingvllm #vllmdesign #pagedattention #gpu #sharegpt #microbenchmark
https://hackernoon.com/evaluating-vllms-design-choices-with-ablation-experiments
In this section, we study various aspects of vLLM and evaluate the design choices we make with ablation experiments.
WLTech’s AI Agent Scores Big in $1 Million Challenge
#ai #agi #arc #aiagent #llms #gpt #arcagiprize #goodcompany
https://hackernoon.com/wltechs-ai-agent-scores-big-in-$1-million-challenge
WLTech.AI explores the ARC challenge, an important benchmark in AI research, advancing the quest for artificial general intelligence through generalization.
Learn to Generate Flow Charts With This Simple AI Integration
#llms #softwaredevelopment #generativeai #llmtools #diagrammingsoftware #umldiagrams #sequencediagrams #aiforflowchart
https://hackernoon.com/learn-to-generate-flow-charts-with-this-simple-ai-integration
Integrating Large Language Models with diagramming tools like Mermaid and UML is revolutionizing software development.
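As a hedged sketch of that integration: wrap a plain-text process description in a prompt asking the model for Mermaid flowchart syntax, then render the returned source with any Mermaid-capable tool. The `llm` callable is a placeholder for whatever completion API you use.

```python
def flowchart_prompt(process_description: str) -> str:
    # Ask the model to answer in Mermaid syntax only, so the reply can be
    # rendered directly by a Mermaid-capable diagramming tool.
    return (
        "Convert the following process into a Mermaid flowchart. "
        "Reply with Mermaid source only, no explanation.\n\n"
        f"Process: {process_description}"
    )

prompt = flowchart_prompt(
    "User logs in; if credentials are valid, show the dashboard, "
    "otherwise show an error and let the user retry."
)
# response = llm(prompt)   # placeholder: call your LLM of choice here

# A well-formed response would be Mermaid source along these lines:
#   flowchart TD
#       A[User logs in] --> B{Credentials valid?}
#       B -- Yes --> C[Show dashboard]
#       B -- No --> D[Show error] --> A
```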