#jupyter_notebook #chatglm #chatglm3 #gemma_2b_it #glm_4 #internlm2 #llama3 #llm #lora #minicpm #q_wen #qwen #qwen1_5 #qwen2
This guide helps beginners set up and use open-source large language models (LLMs) on Linux or cloud platforms like AutoDL, with step-by-step instructions for environment setup, model deployment, and fine-tuning for models such as LLaMA, ChatGLM, and InternLM[2][4][5]. It covers everything from basic installation to advanced techniques like LoRA and distributed fine-tuning, and supports integration with tools like LangChain and online demo deployment. The main benefit is making powerful AI models accessible and easy to use for students, researchers, and anyone interested in experimenting with or customizing LLMs for their own projects[2][4][5].
https://github.com/datawhalechina/self-llm
GitHub - datawhalechina/self-llm: "A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
#jupyter_notebook
MiniCPM is a family of efficient, open-source language models designed to run on ordinary computers and mobile devices, not just powerful servers. The MiniCPM 4 release is described as handling long texts and complex tasks considerably faster than comparably sized models, and it can be used for answering questions, writing summaries, and working with tools or data. MiniCPM supports both English and Chinese, making it useful for bilingual users. The main benefit is strong performance without expensive hardware, which makes it practical for many different applications[1][5].
https://github.com/OpenBMB/MiniCPM
GitHub - OpenBMB/MiniCPM: MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.
#jupyter_notebook #ai #llm #llms #multi_modal #openai #python #rag
Retrieval-Augmented Generation (RAG) is a technique that improves the accuracy of large language models by fetching relevant information from databases or documents and supplying it to the model at query time. Grounding responses in retrieved, up-to-date data reduces errors and "hallucinations", where the model states false information. For users, RAG offers more trustworthy responses because the sources used to generate them can be checked. It also saves resources by avoiding the need to retrain the model whenever the data changes.
https://github.com/FareedKhan-dev/all-rag-techniques
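The retrieval step described in this post can be sketched in a few lines. This is a minimal illustration only, not code from the linked repository: it assumes a plain word-count vector and cosine similarity in place of the learned embeddings a real system would use, and the names `tokenize`, `cosine`, `retrieve`, and `DOCS` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, k=2):
    """Return up to k documents most similar to the query, best first.

    Documents with zero similarity are dropped, so an unrelated query
    yields an empty list instead of arbitrary context.
    """
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(d))), i) for i, d in enumerate(documents)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [documents[i] for score, i in scored[:k] if score > 0]


# Hypothetical toy corpus standing in for a real document store.
DOCS = [
    "Flax is a neural network library for JAX.",
    "YOLOv7 is a real-time object detector.",
    "RAG fetches relevant documents before the model answers.",
]

top = retrieve("Which library builds neural networks with JAX?", DOCS, k=1)
print(top)
```

The retrieved passages would then be placed in the model's prompt alongside the question, which is what lets a reader trace an answer back to its sources.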
#jupyter_notebook #jax
Flax is a library for creating neural networks with JAX. It offers a flexible way to build and analyze these networks. The new Flax NNX API makes it easier to work with neural networks by using regular Python objects, which helps in creating, debugging, and analyzing models more efficiently. This means users can express their models in a more intuitive way, making it simpler to develop and modify neural networks. Flax also provides many tools and examples to help users get started quickly.
https://github.com/google/flax
GitHub - google/flax: Flax is a neural network library for JAX that is designed for flexibility.
#jupyter_notebook #ai #artificial_intelligence #chatgpt #deep_learning #from_scratch #gpt #language_model #large_language_models #llm #machine_learning #python #pytorch #transformer
You can learn how to build your own large language model (LLM) like GPT from scratch with clear, step-by-step guidance, including coding, training, and fine-tuning, all explained with examples and diagrams. This approach mirrors how big models like ChatGPT are made but is designed to run on a regular laptop without special hardware. You also get access to code for loading pretrained models and fine-tuning them for tasks like text classification or instruction following. This helps you deeply understand how LLMs work inside and lets you create your own functional AI assistant, gaining practical skills in AI development[1][2][3][4].
https://github.com/rasbt/LLMs-from-scratch
GitHub - rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
#jupyter_notebook #artificial_intelligence #book #large_language_models #llm #llms #oreilly #oreilly_books
You can learn how to use Large Language Models (LLMs) effectively through the book *Hands-On Large Language Models* by Jay Alammar and Maarten Grootendorst. This book uses nearly 300 custom illustrations to explain key concepts and practical tools for working with LLMs, including tokenization, transformers, prompt engineering, fine-tuning, and advanced text generation. It also provides runnable code examples in Google Colab, making it easy to practice and apply what you learn. This resource helps you understand and build your own LLM applications confidently, saving you time and effort in mastering complex AI technology. It’s highly recommended for anyone wanting hands-on experience with LLMs.
https://github.com/HandsOnLLM/Hands-On-Large-Language-Models
GitHub - HandsOnLLM/Hands-On-Large-Language-Models: Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
#jupyter_notebook
This course guides you through building and deploying your own AI agents using popular tools like OpenAI Agents SDK, CrewAI, LangGraph, AutoGen, and MCP over six weeks. You’ll learn to create agents that can think, act, and work together, with clear setup instructions for Windows, Mac, and Linux, plus support if you get stuck. The benefit is that you gain hands-on experience in the latest AI agent technology, making you ready to develop smart, autonomous systems for real-world tasks, while also connecting with a helpful community and having fun along the way[1][2][3].
https://github.com/ed-donner/agents
GitHub - ed-donner/agents: Repo for the Complete Agentic AI Engineering Course
#jupyter_notebook #darknet #pytorch #scaled_yolov4 #yolor #yolov3 #yolov4 #yolov7
YOLOv7 is a powerful tool for detecting objects in images and videos. It is fast, accurate, and can work well on devices with limited power, making it useful for real-time applications like self-driving cars and surveillance systems. YOLOv7 uses advanced techniques like Feature Pyramid Networks to detect objects of different sizes and can handle complex scenes with overlapping objects. This makes it beneficial for users who need quick and precise object detection in various environments.
https://github.com/WongKinYiu/yolov7
GitHub - WongKinYiu/yolov7: Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
#jupyter_notebook
SAM 2 is an AI model that can quickly and accurately separate objects in both images and videos, even objects it has never seen before. It works in real time, letting you select objects with simple prompts like clicks or boxes and refine the results interactively. This makes tasks like video editing, object tracking, and image annotation much easier and faster. SAM 2's ability to handle complex scenes and track objects smoothly across video frames is useful in many fields, from media production to computer vision research. It is open-source and usable from Python and PyTorch. Note that the repository linked below hosts the original Segment Anything Model (SAM), which targets still images; SAM 2, which adds video support, is published separately by the same group.
https://github.com/facebookresearch/segment-anything
GitHub - facebookresearch/segment-anything: The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
#jupyter_notebook
Retrieval Augmented Generation (RAG) helps large language models (LLMs) answer questions using up-to-date or private information by connecting them to external data sources. Fine-tuning, by contrast, retrains the model on specific data. RAG is useful when you need current, dynamic information without costly retraining, making it a good fit for tasks like customer support or knowledge management. Fine-tuning is better for deep expertise in a specialized field but requires more data and effort. With RAG, the model's language skills are combined with fresh, specific data at query time, which improves the relevance and reliability of its answers.
https://github.com/langchain-ai/rag-from-scratch
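The contrast with fine-tuning can be made concrete with a small sketch: under RAG, new knowledge is added to a document store and shows up in the next prompt, with no change to model weights. This is an illustrative assumption-laden example, not code from the linked repository; `DocumentStore`, `build_prompt`, the source labels, and the word-overlap scoring are all hypothetical, and the call to an actual LLM is left out.

```python
import re


class DocumentStore:
    """Hypothetical in-memory store mapping a source label to its text."""

    def __init__(self):
        self._docs = {}

    def add(self, source, text):
        # Adding a document is the only "update" step; no retraining occurs.
        self._docs[source] = text

    def search(self, query, k=2):
        """Return up to k (source, text) pairs sharing the most words with the query."""
        terms = set(re.findall(r"[a-z0-9]+", query.lower()))
        scored = []
        for source, text in self._docs.items():
            words = set(re.findall(r"[a-z0-9]+", text.lower()))
            overlap = len(terms & words)
            if overlap > 0:
                scored.append((overlap, source, text))
        scored.sort(key=lambda item: (-item[0], item[1]))
        return [(source, text) for _, source, text in scored[:k]]


def build_prompt(question, store, k=2):
    """Assemble a prompt that places retrieved passages ahead of the question."""
    lines = ["Answer using only the context below and cite the source labels."]
    for source, text in store.search(question, k):
        lines.append(f"[{source}] {text}")
    lines.append(f"Question: {question}")
    return "\n".join(lines)


store = DocumentStore()
store.add("faq-2023", "The support desk is open on weekdays.")
question = "When is the support desk open?"

before = build_prompt(question, store)

# New information arrives: it is indexed, not trained into the model.
store.add("notice-2024", "The support desk is now open on weekends too.")
after = build_prompt(question, store)

print(after)
```

The prompt built after the update carries the new notice and its source label, so the same unchanged model can answer with current information.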
GitHub - langchain-ai/rag-from-scratch