#onnx #tensorflow #PyTorch #inference #production #deploy #deployment
https://thenewstack.io/tutorial-import-an-onnx-model-into-tensorflow-for-inference/
The New Stack
Tutorial: Import an ONNX Model into TensorFlow for Inference
This post is the fourth in a series of introductory tutorials on the Open Neural Network Exchange (ONNX), an initiative from AWS, Microsoft, and Facebook to define a standard for interoperability across machine learning platforms.
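The flow the tutorial describes, as a minimal sketch assuming the onnx and onnx-tf packages and a local model.onnx (paths and shapes are placeholders):

```python
import numpy as np
import onnx
from onnx_tf.backend import prepare

# Load the ONNX graph and convert it into a TensorFlow representation
onnx_model = onnx.load("model.onnx")      # file name is a placeholder
tf_rep = prepare(onnx_model)

# Run inference on a dummy input shaped like the model expects
dummy_input = np.random.randn(1, 3, 224, 224).astype(np.float32)
print(tf_rep.run(dummy_input))

# Optionally export as a TensorFlow SavedModel for later serving
tf_rep.export_graph("saved_model")
```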
#pytorch #onnx #tensorRT #inference #ml_engineering #ml #dl
https://www.learnopencv.com/how-to-run-inference-using-tensorrt-c-api/
LearnOpenCV – Learn OpenCV, PyTorch, Keras, Tensorflow with code, & tutorials
How To Run Inference Using TensorRT C++ API
In this post, we continue to look at how to speed up inference quickly and painlessly if we already have a trained model in PyTorch. In the previous post we discussed what ONNX and TensorRT are and why they are needed, and configured the environment for PyTorch…
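Since the post starts from a trained PyTorch model, the usual prerequisite is an ONNX export, roughly as below (a sketch; the model, shapes, and opset are placeholders):

```python
import torch
import torchvision

# Any trained PyTorch model works; an untrained ResNet-50 stands in here
model = torchvision.models.resnet50(weights=None).eval()
dummy = torch.randn(1, 3, 224, 224)

# Export to ONNX so TensorRT (trtexec or the C++ API from the post) can parse it
torch.onnx.export(
    model,
    dummy,
    "resnet50.onnx",
    input_names=["input"],
    output_names=["output"],
    opset_version=13,
)
```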
#onnx #onnx_runtime #inference #tensorRT #deployment #pytorch #tensorflow #ml #dl #demo #asic #fpga #gpu
https://www.youtube.com/watch?v=cK5AyawZSUI&ab_channel=JanakiramMSV
YouTube
Everything You Want to Know About ONNX
Open Neural Network Exchange (ONNX) is an open format built to represent machine learning models. It defines the building blocks of machine learning and deep learning models along with a common file format to enable AI developers to use models with a variety…
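For reference, the runtime side covered in the video usually boils down to an ONNX Runtime session like this (a minimal sketch; the model path, providers, and input shape are assumptions):

```python
import numpy as np
import onnxruntime as ort

# Create an inference session; execution falls back to CPU if no GPU is present
session = ort.InferenceSession(
    "model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

input_name = session.get_inputs()[0].name
dummy = np.random.randn(1, 3, 224, 224).astype(np.float32)
outputs = session.run(None, {input_name: dummy})
print(outputs[0].shape)
```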
#blockchain #inference #smartcontract #ml #ai #fl #federated_learning
https://www.mdpi.com/2076-3417/11/3/1010
MDPI
Towards Blockchain-Based Federated Machine Learning: Smart Contract for Model Inference
Federated learning is a branch of machine learning where a shared model is created in a decentralized and privacy-preserving fashion, but existing approaches using blockchain are limited by tailored models. We consider the possibility of extending a set of supported…
A Survey on Efficient Inference for Large Language Models
https://arxiv.org/pdf/2404.14294
#vLLM #vs #deepspeed #overview #survey #inference #optimization
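The tags set vLLM against DeepSpeed; for orientation, offline generation with vLLM looks roughly like this (a minimal sketch, assuming the vllm package is installed; the model id and prompt are placeholders):

```python
from vllm import LLM, SamplingParams

# Continuous batching + PagedAttention are the throughput tricks the survey highlights
llm = LLM(model="facebook/opt-125m")   # model id is only an example
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["Explain KV-cache paging in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```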
#fingpt #rag #llm #gpt
https://arxiv.org/abs/2310.04027v1
arXiv.org
Enhancing Financial Sentiment Analysis via Retrieval Augmented...
Financial sentiment analysis is critical for valuation and investment decision-making. Traditional NLP models, however, are limited by their parameter size and the scope of their training...
#structured_output #vs #outlines #vs #mirascope #vs #instructor #langchain #guidance
https://simmering.dev/blog/structured_output/
https://simmering.dev/blog/openai_structured_output/
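The posts compare outlines, mirascope, instructor, LangChain, and guidance for getting schema-conforming output. A sketch of the instructor-style approach, assuming instructor 1.x with the OpenAI SDK (the model name, schema, and prompt are placeholders):

```python
from pydantic import BaseModel
import instructor
from openai import OpenAI


class Ticket(BaseModel):
    title: str
    priority: int


# Patch the OpenAI client so the response is parsed and validated against the Pydantic model
client = instructor.from_openai(OpenAI())

ticket = client.chat.completions.create(
    model="gpt-4o-mini",   # example model name
    response_model=Ticket,
    messages=[{"role": "user", "content": "File a ticket: login page is down, urgent."}],
)
print(ticket)
```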
#aws #team #sagemaker #genai #inference #better #autoscale #subminute #metrics #cloudwatch
https://aws.amazon.com/about-aws/whats-new/2024/07/amazon-sagemaker-faster-auto-scaling-generative-ai-models/
https://aws.amazon.com/blogs/machine-learning/amazon-sagemaker-inference-launches-faster-auto-scaling-for-generative-ai-models/
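A sketch of wiring the new sub-minute concurrency metric into a target-tracking policy via Application Auto Scaling, assuming boto3 and an existing endpoint variant; the endpoint name, capacities, target value, and the predefined metric type string (taken from the launch post) should be checked against current docs:

```python
import boto3

aas = boto3.client("application-autoscaling")
resource_id = "endpoint/my-endpoint/variant/AllTraffic"   # placeholder endpoint/variant

# Register the endpoint variant as a scalable target
aas.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=4,
)

# Target-tracking policy on the sub-minute concurrency metric from the announcement
aas.put_scaling_policy(
    PolicyName="genai-concurrency-target-tracking",
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 5.0,   # desired concurrent requests per copy; tune per workload
        "PredefinedMetricSpecification": {
            # Metric type name as given in the launch post; verify against current docs
            "PredefinedMetricType": "SageMakerVariantConcurrentRequestsPerModelHighResolution",
        },
        "ScaleInCooldown": 60,
        "ScaleOutCooldown": 60,
    },
)
```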
For Developers
#stability_ai #team #deepseek #vs #openai #comments #forecast
https://youtu.be/lY8Ja00PCQM?si=aChjauEHB0Qu_41z&t=1277
#cpu #inference #llm #gen_ai
https://techcrunch.com/2025/04/16/microsoft-researchers-say-theyve-developed-a-hyper-efficient-ai-model-that-can-run-on-cpus/
TechCrunch
Microsoft researchers say they've developed a hyper-efficient AI model that can run on CPUs | TechCrunch
Microsoft researchers have developed — and released — a hyper-efficient AI model that can run on CPUs, including Apple's M2.