This media is not supported in your browser
VIEW IN TELEGRAM
GPU by hand βοΈ I drew this to show how a GPU speeds up an array operation of 8 elements in parallel over 4 threads in 2 clock cycles. Read more π
CPU
β’ It has one core.
β’ Its global memory has 120 locations (0-119).
β’ To use the GPU, it needs to copy data from the global memory to the GPU.
β’ After GPU is done, it will copy the results back.
GPU
β’ It has four cores to run four threads (0-3).
β’ It has a register file of 28 locations (0-27)
β’ This register file has four banks (0-3).
β’ All threads share the same register file.
β’ But they must read/write using the four banks.
β’ Each bank allows 2 reads (Read 0, Read 1) and 1 write in a single clock cycle.
βοΈ Our Telegram channels: https://xn--r1a.website/addlist/0f6vfFbEMdAwODBk
CPU
β’ It has one core.
β’ Its global memory has 120 locations (0-119).
β’ To use the GPU, it needs to copy data from the global memory to the GPU.
β’ After GPU is done, it will copy the results back.
GPU
β’ It has four cores to run four threads (0-3).
β’ It has a register file of 28 locations (0-27)
β’ This register file has four banks (0-3).
β’ All threads share the same register file.
β’ But they must read/write using the four banks.
β’ Each bank allows 2 reads (Read 0, Read 1) and 1 write in a single clock cycle.
#AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers
Please open Telegram to view this post
VIEW IN TELEGRAM
π6β€4
What is torch.nn really?
This article explains it quite well.
π Read
βοΈ Our Telegram channels: https://xn--r1a.website/addlist/0f6vfFbEMdAwODBk
When I started working with PyTorch, my biggest question was: "What is torch.nn?".
This article explains it quite well.
π Read
#pytorch #AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers
Please open Telegram to view this post
VIEW IN TELEGRAM
β€5
π€π§ The Little Book of Deep Learning β A Complete Summary and Chapter-Wise Overview
ποΈ 08 Oct 2025
π AI News & Trends
In the ever-evolving world of Artificial Intelligence, deep learning continues to be the driving force behind breakthroughs in computer vision, speech recognition and natural language processing. For those seeking a clear, structured and accessible guide to understanding how deep learning really works, βThe Little Book of Deep Learningβ by FranΓ§ois Fleuret is a gem. This ...
#DeepLearning #ArtificialIntelligence #MachineLearning #NeuralNetworks #AIGuides #FrancoisFleuret
ποΈ 08 Oct 2025
π AI News & Trends
In the ever-evolving world of Artificial Intelligence, deep learning continues to be the driving force behind breakthroughs in computer vision, speech recognition and natural language processing. For those seeking a clear, structured and accessible guide to understanding how deep learning really works, βThe Little Book of Deep Learningβ by FranΓ§ois Fleuret is a gem. This ...
#DeepLearning #ArtificialIntelligence #MachineLearning #NeuralNetworks #AIGuides #FrancoisFleuret
β€6
π€π§ Build a Large Language Model From Scratch: A Step-by-Step Guide to Understanding and Creating LLMs
ποΈ 08 Oct 2025
π AI News & Trends
In recent years, Large Language Models (LLMs) have revolutionized the world of Artificial Intelligence (AI). From ChatGPT and Claude to Llama and Mistral, these models power the conversational systems, copilots, and generative tools that dominate todayβs AI landscape. However, for most developers and learners, the inner workings of these systems remain a mystery until now. ...
#LargeLanguageModels #LLM #ArtificialIntelligence #DeepLearning #MachineLearning #AIGuides
ποΈ 08 Oct 2025
π AI News & Trends
In recent years, Large Language Models (LLMs) have revolutionized the world of Artificial Intelligence (AI). From ChatGPT and Claude to Llama and Mistral, these models power the conversational systems, copilots, and generative tools that dominate todayβs AI landscape. However, for most developers and learners, the inner workings of these systems remain a mystery until now. ...
#LargeLanguageModels #LLM #ArtificialIntelligence #DeepLearning #MachineLearning #AIGuides
β€5
π€π§ Unleashing the Power of AI with Open Agent Builder: A Visual Workflow Tool for AI Agents
ποΈ 19 Oct 2025
π AI News & Trends
In todayβs rapidly advancing technological landscape, artificial intelligence (AI) is not just a buzzword, itβs a transformative force across industries. From automating complex tasks to streamlining operations, AI is revolutionizing workflows. However, designing and deploying AI-driven workflows has traditionally required expert-level programming knowledge. Enter Open Agent Builder, a revolutionary tool that democratizes the creation of ...
#AI #ArtificialIntelligence #OpenAgentBuilder #AIAgents #VisualWorkflow #TechInnovation
ποΈ 19 Oct 2025
π AI News & Trends
In todayβs rapidly advancing technological landscape, artificial intelligence (AI) is not just a buzzword, itβs a transformative force across industries. From automating complex tasks to streamlining operations, AI is revolutionizing workflows. However, designing and deploying AI-driven workflows has traditionally required expert-level programming knowledge. Enter Open Agent Builder, a revolutionary tool that democratizes the creation of ...
#AI #ArtificialIntelligence #OpenAgentBuilder #AIAgents #VisualWorkflow #TechInnovation
β€4π1
π€π§ Wan 2.1: Alibabaβs Open-Source Revolution in Video Generation
ποΈ 21 Oct 2025
π AI News & Trends
The landscape of artificial intelligence has been evolving rapidly, especially in the domain of video generation. Since OpenAI unveiled Sora in 2024, the world has witnessed an explosive surge in research and innovation within generative AI. However, most of these cutting-edge tools remained closed-source limiting transparency and accessibility. Recognizing this gap, Alibaba Group introduced Wan, ...
#Alibaba #Wan2.1 #VideoGeneration #GenerativeAI #OpenSource #ArtificialIntelligence
ποΈ 21 Oct 2025
π AI News & Trends
The landscape of artificial intelligence has been evolving rapidly, especially in the domain of video generation. Since OpenAI unveiled Sora in 2024, the world has witnessed an explosive surge in research and innovation within generative AI. However, most of these cutting-edge tools remained closed-source limiting transparency and accessibility. Recognizing this gap, Alibaba Group introduced Wan, ...
#Alibaba #Wan2.1 #VideoGeneration #GenerativeAI #OpenSource #ArtificialIntelligence
β€2
π€π§ Mastering Large Language Models: Top #1 Complete Guide to Maxime Labonneβs LLM Course
ποΈ 22 Oct 2025
π AI News & Trends
In the rapidly evolving landscape of artificial intelligence, large language models (LLMs) have become the foundation of modern AI innovation powering tools like ChatGPT, Claude, Gemini and countless enterprise AI applications. However, building, fine-tuning and deploying these models require deep technical understanding and hands-on expertise. To bridge this knowledge gap, Maxime Labonne, a leading AI ...
#LLM #ArtificialIntelligence #MachineLearning #DeepLearning #AIEngineering #LargeLanguageModels
ποΈ 22 Oct 2025
π AI News & Trends
In the rapidly evolving landscape of artificial intelligence, large language models (LLMs) have become the foundation of modern AI innovation powering tools like ChatGPT, Claude, Gemini and countless enterprise AI applications. However, building, fine-tuning and deploying these models require deep technical understanding and hands-on expertise. To bridge this knowledge gap, Maxime Labonne, a leading AI ...
#LLM #ArtificialIntelligence #MachineLearning #DeepLearning #AIEngineering #LargeLanguageModels
β€4π1
π€π§ The Ultimate #1 Collection of AI Books In Awesome-AI-Books Repository
ποΈ 22 Oct 2025
π AI News & Trends
Artificial Intelligence (AI) has emerged as one of the most transformative technologies of the 21st century. From powering self-driving cars to enabling advanced conversational AI like ChatGPT, AI is redefining how humans interact with machines. However, mastering AI requires a strong foundation in theory, mathematics, programming and hands-on experimentation. For enthusiasts, students and professionals seeking ...
#ArtificialIntelligence #AIBooks #MachineLearning #DeepLearning #AIResources #TechBooks
ποΈ 22 Oct 2025
π AI News & Trends
Artificial Intelligence (AI) has emerged as one of the most transformative technologies of the 21st century. From powering self-driving cars to enabling advanced conversational AI like ChatGPT, AI is redefining how humans interact with machines. However, mastering AI requires a strong foundation in theory, mathematics, programming and hands-on experimentation. For enthusiasts, students and professionals seeking ...
#ArtificialIntelligence #AIBooks #MachineLearning #DeepLearning #AIResources #TechBooks
β€2π₯1
π€π§ Master Machine Learning: Explore the Ultimate βMachine-Learning-Tutorialsβ Repository
ποΈ 23 Oct 2025
π AI News & Trends
In todayβs data-driven world, Machine Learning (ML) has become the cornerstone of modern technology from intelligent chatbots to predictive analytics and recommendation systems. However, mastering ML isnβt just about coding, it requires a structured understanding of algorithms, statistics, optimization techniques and real-world problem-solving. Thatβs where Ujjwal Karnβs Machine-Learning-Tutorials GitHub repository stands out. This open-source, topic-wise ...
#MachineLearning #MLTutorials #ArtificialIntelligence #DataScience #OpenSource #AIEducation
ποΈ 23 Oct 2025
π AI News & Trends
In todayβs data-driven world, Machine Learning (ML) has become the cornerstone of modern technology from intelligent chatbots to predictive analytics and recommendation systems. However, mastering ML isnβt just about coding, it requires a structured understanding of algorithms, statistics, optimization techniques and real-world problem-solving. Thatβs where Ujjwal Karnβs Machine-Learning-Tutorials GitHub repository stands out. This open-source, topic-wise ...
#MachineLearning #MLTutorials #ArtificialIntelligence #DataScience #OpenSource #AIEducation
β€7π2
π€π§ LangChain: The Ultimate Framework for Building Reliable AI Agents and LLM Applications
ποΈ 24 Oct 2025
π AI News & Trends
As artificial intelligence continues to transform industries, developers are racing to build smarter, more adaptive applications powered by Large Language Models (LLMs). Yet, one major challenge remains how to make these models interact intelligently with real-world data and external systems in a scalable, reliable way. Enter LangChain, an open-source framework designed to make LLM-powered application ...
#LangChain #AI #LLM #ArtificialIntelligence #OpenSource #AIAgents
ποΈ 24 Oct 2025
π AI News & Trends
As artificial intelligence continues to transform industries, developers are racing to build smarter, more adaptive applications powered by Large Language Models (LLMs). Yet, one major challenge remains how to make these models interact intelligently with real-world data and external systems in a scalable, reliable way. Enter LangChain, an open-source framework designed to make LLM-powered application ...
#LangChain #AI #LLM #ArtificialIntelligence #OpenSource #AIAgents
β€5π2
π€π§ AI Projects : A Comprehensive Showcase of Machine Learning, Deep Learning and Generative AI
ποΈ 27 Oct 2025
π AI News & Trends
Artificial Intelligence (AI) is transforming industries across the globe, driving innovation through automation, data-driven insights and intelligent decision-making. Whether itβs predicting house prices, detecting diseases or building conversational chatbots, AI is at the core of modern digital solutions. The AI Project Gallery by Hema Kalyan Murapaka is an exceptional GitHub repository that curates a wide ...
#AI #MachineLearning #DeepLearning #GenerativeAI #ArtificialIntelligence #GitHub
ποΈ 27 Oct 2025
π AI News & Trends
Artificial Intelligence (AI) is transforming industries across the globe, driving innovation through automation, data-driven insights and intelligent decision-making. Whether itβs predicting house prices, detecting diseases or building conversational chatbots, AI is at the core of modern digital solutions. The AI Project Gallery by Hema Kalyan Murapaka is an exceptional GitHub repository that curates a wide ...
#AI #MachineLearning #DeepLearning #GenerativeAI #ArtificialIntelligence #GitHub
β€3π₯1
π€π§ Free for 1 Year: ChatGPT Goβs Big Move in India
ποΈ 28 Oct 2025
π AI News & Trends
On 28 October 2025, OpenAI announced that its mid-tier subscription plan, ChatGPT Go, will be available free for one full year in India starting from 4 November. (www.ndtv.com) What is ChatGPT Go? Whatβs the deal? Why this matters ? Things to check / caveats What should users do? Broader implications This move by OpenAI indicates ...
#ChatGPTGo #OpenAI #India #FreeAccess #ArtificialIntelligence #TechNews
ποΈ 28 Oct 2025
π AI News & Trends
On 28 October 2025, OpenAI announced that its mid-tier subscription plan, ChatGPT Go, will be available free for one full year in India starting from 4 November. (www.ndtv.com) What is ChatGPT Go? Whatβs the deal? Why this matters ? Things to check / caveats What should users do? Broader implications This move by OpenAI indicates ...
#ChatGPTGo #OpenAI #India #FreeAccess #ArtificialIntelligence #TechNews
β€8
Forwarded from Machine Learning
Data leakage is one of the main reasons why ML demos look impressive... and then fail in production. π
The model didn't become smarter.
It just happened to see the correct answers in advance.
In 4 minutes, you'll understand where data leaks hide. π
Let's break it down below: π
1. Data Leakage π³οΈ
Data leakage occurs when information that won't be available at the time of actual prediction is used during the model training process.
Because of this, metrics on the validation stage can look much better than the actual quality of the model on new, previously unseen data.
2. Model Evaluation βοΈ
The test set isn't just "additional data".
It's a simulation of the future.
Only train the model on the information that would have been available to you at the time of prediction.
Evaluate it on examples that the model couldn't have influenced during training.
3. Direct Leakage π¨
This is the most obvious type of leakage.
Examples:
- a field with information from the future;
- an ID that encodes the target variable;
- a variable that appears only after an event has occurred;
- duplicate records in both the training and test sets.
If a feature doesn't exist at the time of inference (prediction), then it's likely a source of data leakage.
4. Indirect Leakage π΅οΈ
This is the type of leakage that most often traps teams.
You perform normalization, imputation, feature selection, outlier removal, or dimensionality reduction before splitting the data into a training and test set.
The model didn't directly see the data from the test set.
But your preprocessing pipeline already saw it.
5. Train/Test Split βοΈ
Wrong:
Right:
The same idea applies to imputers, encoders, feature selection, PCA, and any preprocessing step that is trained on the data.
6. Cross-Validation π
Each fold is a mini-experiment with a training and test set.
Therefore, preprocessing should be performed within each fold.
If you prepared the entire dataset once and then ran cross-validation, each fold would already have had access to its held-out data.
7. Pipelines π οΈ
A pipeline isn't just a way to make the code cleaner.
It's also a defense against data leakage.
Combine preprocessing, feature selection, and the model into a single pipeline, and then pass this pipeline to cross-validation or hyperparameter search (grid search).
8. AI Engineering Version π€
Data leaks also occur in RAG systems and when evaluating LLMs.
Leakage occurs when you tune chunks, prompts, re-rankers, thresholds, or examples on the same evaluation dataset that you later present as "held-out".
As a result, your benchmark turns into training data.
9. Leakage Checklist β
Before trusting the obtained metric, ask yourself:
- Could this feature exist at the time of prediction?
- Was any transformation (transform) step trained (fit) on the test data?
- Did cross-validation include the entire pipeline?
- Were we tuning parameters on the final evaluation dataset?
If the answer is "yes", then the metric likely doesn't reflect the actual quality of the model.
#MachineLearning #DataScience #MLOps #DataLeakage #ArtificialIntelligence #TechTips
β¨ Join Best TG Channels https://xn--r1a.website/addlist/0f6vfFbEMdAwODBk
βοΈ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
The model didn't become smarter.
It just happened to see the correct answers in advance.
In 4 minutes, you'll understand where data leaks hide. π
Let's break it down below: π
1. Data Leakage π³οΈ
Data leakage occurs when information that won't be available at the time of actual prediction is used during the model training process.
Because of this, metrics on the validation stage can look much better than the actual quality of the model on new, previously unseen data.
2. Model Evaluation βοΈ
The test set isn't just "additional data".
It's a simulation of the future.
Only train the model on the information that would have been available to you at the time of prediction.
Evaluate it on examples that the model couldn't have influenced during training.
3. Direct Leakage π¨
This is the most obvious type of leakage.
Examples:
- a field with information from the future;
- an ID that encodes the target variable;
- a variable that appears only after an event has occurred;
- duplicate records in both the training and test sets.
If a feature doesn't exist at the time of inference (prediction), then it's likely a source of data leakage.
4. Indirect Leakage π΅οΈ
This is the type of leakage that most often traps teams.
You perform normalization, imputation, feature selection, outlier removal, or dimensionality reduction before splitting the data into a training and test set.
The model didn't directly see the data from the test set.
But your preprocessing pipeline already saw it.
5. Train/Test Split βοΈ
Wrong:
fit the scaler on all data β split the data β evaluate
Right:
split the data β fit the scaler only on the training set β apply it to both the training and test sets
The same idea applies to imputers, encoders, feature selection, PCA, and any preprocessing step that is trained on the data.
6. Cross-Validation π
Each fold is a mini-experiment with a training and test set.
Therefore, preprocessing should be performed within each fold.
If you prepared the entire dataset once and then ran cross-validation, each fold would already have had access to its held-out data.
7. Pipelines π οΈ
A pipeline isn't just a way to make the code cleaner.
It's also a defense against data leakage.
Combine preprocessing, feature selection, and the model into a single pipeline, and then pass this pipeline to cross-validation or hyperparameter search (grid search).
8. AI Engineering Version π€
Data leaks also occur in RAG systems and when evaluating LLMs.
Leakage occurs when you tune chunks, prompts, re-rankers, thresholds, or examples on the same evaluation dataset that you later present as "held-out".
As a result, your benchmark turns into training data.
9. Leakage Checklist β
Before trusting the obtained metric, ask yourself:
- Could this feature exist at the time of prediction?
- Was any transformation (transform) step trained (fit) on the test data?
- Did cross-validation include the entire pipeline?
- Were we tuning parameters on the final evaluation dataset?
If the answer is "yes", then the metric likely doesn't reflect the actual quality of the model.
#MachineLearning #DataScience #MLOps #DataLeakage #ArtificialIntelligence #TechTips
β¨ Join Best TG Channels https://xn--r1a.website/addlist/0f6vfFbEMdAwODBk
βοΈ Join Our WhatsApp Channel https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Telegram
AI PYTHON π
Youβve been invited to add the folder βAI PYTHON πβ, which includes 14 chats.
β€8π―1