GitHub Trends

#jupyter_notebook #chatglm #chatglm3 #gemma_2b_it #glm_4 #internlm2 #llama3 #llm #lora #minicpm #q_wen #qwen #qwen1_5 #qwen2

This guide helps beginners set up and use open-source large language models (LLMs) on Linux or cloud platforms like AutoDL, with step-by-step instructions for environment setup, model deployment, and fine-tuning for models such as LLaMA, ChatGLM, and InternLM[2][4][5]. It covers everything from basic installation to advanced techniques like LoRA and distributed fine-tuning, and supports integration with tools like LangChain and online demo deployment. The main benefit is making powerful AI models accessible and easy to use for students, researchers, and anyone interested in experimenting with or customizing LLMs for their own projects[2][4][5].
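As a rough illustration of why LoRA is cheaper than full-parameter fine-tuning, here is a minimal sketch in plain Python. It is not taken from the self-llm repository. It assumes the usual LoRA formulation, in which a frozen weight matrix W (d_out x d_in) is adjusted by two small trainable matrices B (d_out x r) and A (r x d_in), giving an effective weight of W + (alpha / r) * B A. The function names and the sizes used (4096, rank 8) are illustrative assumptions.

```python
def lora_param_counts(d_out, d_in, rank):
    """Return (full fine-tuning params, LoRA trainable params) for one weight matrix."""
    full = d_out * d_in            # every entry of W is trainable
    lora = rank * (d_out + d_in)   # B is d_out x rank, A is rank x d_in
    return full, lora


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, b, a, alpha):
    """Compute W + (alpha / rank) * B @ A, where rank is the number of rows of A."""
    scale = alpha / len(a)
    delta = matmul(b, a)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


# Illustrative sizes only (assumed, not tied to any specific model).
full, lora = lora_param_counts(d_out=4096, d_in=4096, rank=8)
print(f"full: {full}, LoRA: {lora}, ratio: {lora / full:.4%}")

# Tiny worked example: a 2x2 frozen weight with a rank-1 update.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]   # 2 x 1
A = [[0.5, 0.5]]     # 1 x 2
print(lora_effective_weight(W, B, A, alpha=2.0))
```

Under these assumed sizes, the LoRA matrices hold well under 1% of the parameters of the full matrix, which is the main reason the technique suits limited hardware.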

https://github.com/datawhalechina/self-llm

GitHub - datawhalechina/self-llm: "A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on rapid fine-tuning (full-parameter/LoRA) and deployment of Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment

491 views, 11:30

GitHub Trends

#jupyter_notebook

MiniCPM is a family of highly efficient, open-source AI models designed to run well on regular computers and mobile devices, not just powerful servers. MiniCPM 4, the latest version when this summary was written, handles long texts and complex tasks much faster than similar models, and it can be used for things like answering questions, writing summaries, and working with tools or data. MiniCPM also supports both English and Chinese, making it useful for bilingual users. The main benefit is strong AI performance without expensive hardware, which makes it practical for many different applications[1][5].

https://github.com/OpenBMB/MiniCPM

GitHub - OpenBMB/MiniCPM: MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.

419 views, 12:30

GitHub Trends

#jupyter_notebook #ai #llm #llms #multi_modal #openai #python #rag

Retrieval-Augmented Generation (RAG) improves the accuracy of large language models by fetching relevant information from databases or documents at query time and supplying it to the model. Grounding responses in retrieved data that can be kept current reduces errors and "hallucinations", where the model states false information. For users, RAG offers more trustworthy responses whose sources can be verified. It also saves resources, because new data can be added without retraining the model.
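The retrieval step can be sketched in a few lines of standard-library Python. This is a toy illustration, not code from the linked repository: it ranks documents by bag-of-words cosine similarity, whereas real systems typically use learned embeddings and a vector store. The document names and contents, and the helper names `retrieve` and `build_prompt`, are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical documents keyed by source name, so answers can cite where they came from.
DOCS = {
    "handbook.txt": "Employees accrue 20 vacation days per year.",
    "security.txt": "Passwords must be rotated every 90 days.",
    "cafeteria.txt": "The cafeteria serves lunch from noon until two.",
}


def bag_of_words(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, docs, top_k=1):
    """Return up to top_k (source, text) pairs ranked by similarity to the query."""
    q = bag_of_words(query)
    scored = sorted(
        ((cosine(q, bag_of_words(text)), source, text) for source, text in docs.items()),
        key=lambda item: item[0],
        reverse=True,
    )
    return [(source, text) for score, source, text in scored[:top_k] if score > 0]


def build_prompt(query, passages):
    """Prepend the retrieved passages, tagged with their sources, to the question."""
    context = "\n".join(f"[{source}] {text}" for source, text in passages)
    return f"Answer using only the context below and cite the source.\n{context}\nQuestion: {query}"


question = "How many vacation days do employees get?"
print(build_prompt(question, retrieve(question, DOCS)))
```

Tagging each passage with its source is what lets a reader check the answer against the original document.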

https://github.com/FareedKhan-dev/all-rag-techniques

580 views, 13:00

GitHub Trends

#jupyter_notebook #jax

Flax is a library for creating neural networks with JAX. It offers a flexible way to build and analyze these networks. The new Flax NNX API makes it easier to work with neural networks by using regular Python objects, which helps in creating, debugging, and analyzing models more efficiently. This means users can express their models in a more intuitive way, making it simpler to develop and modify neural networks. Flax also provides many tools and examples to help users get started quickly.

https://github.com/google/flax

GitHub - google/flax: Flax is a neural network library for JAX that is designed for flexibility.

477 views, 12:30

GitHub Trends

#jupyter_notebook #ai #artificial_intelligence #chatgpt #deep_learning #from_scratch #gpt #language_model #large_language_models #llm #machine_learning #python #pytorch #transformer

You can learn how to build your own large language model (LLM) like GPT from scratch with clear, step-by-step guidance, including coding, training, and fine-tuning, all explained with examples and diagrams. This approach mirrors how big models like ChatGPT are made but is designed to run on a regular laptop without special hardware. You also get access to code for loading pretrained models and fine-tuning them for tasks like text classification or instruction following. This helps you deeply understand how LLMs work inside and lets you create your own functional AI assistant, gaining practical skills in AI development[1][2][3][4].
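For a feel of what "from scratch" involves, the sketch below implements causal self-attention, a core building block of GPT-style models, in plain Python. It is a simplified illustration and not code from the repository, which uses PyTorch. As a stated assumption, the token vectors are used directly as queries, keys, and values, with no learned projection matrices and a single head.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(queries, keys, values):
    """Scaled dot-product attention where position i only attends to positions 0..i."""
    d_k = len(keys[0])
    outputs = []
    for i, q in enumerate(queries):
        # Causal mask: only score against the current and earlier positions.
        scores = [
            sum(qc * kc for qc, kc in zip(q, keys[j])) / math.sqrt(d_k)
            for j in range(i + 1)
        ]
        weights = softmax(scores)
        # Each output is a weighted average of the visible value vectors.
        outputs.append([
            sum(w * values[j][c] for j, w in enumerate(weights))
            for c in range(len(values[0]))
        ])
    return outputs


# Three toy 2-dimensional token vectors (assumed values for illustration).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
for row in causal_self_attention(tokens, tokens, tokens):
    print([round(v, 3) for v in row])
```

Because of the mask, changing a later token never alters the outputs at earlier positions, which is what allows such a model to be trained to predict the next token.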

https://github.com/rasbt/LLMs-from-scratch

GitHub - rasbt/LLMs-from-scratch: Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

482 views, 11:30

GitHub Trends

#jupyter_notebook #artificial_intelligence #book #large_language_models #llm #llms #oreilly #oreilly_books

You can learn how to use Large Language Models (LLMs) effectively through the book *Hands-On Large Language Models* by Jay Alammar and Maarten Grootendorst. This book uses nearly 300 custom illustrations to explain key concepts and practical tools for working with LLMs, including tokenization, transformers, prompt engineering, fine-tuning, and advanced text generation. It also provides runnable code examples in Google Colab, making it easy to practice and apply what you learn. This resource helps you understand and build your own LLM applications confidently, saving you time and effort in mastering complex AI technology. It’s highly recommended for anyone wanting hands-on experience with LLMs.

https://github.com/HandsOnLLM/Hands-On-Large-Language-Models

GitHub - HandsOnLLM/Hands-On-Large-Language-Models: Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

429 views, 11:30

GitHub Trends

#jupyter_notebook

This six-week course guides you through building and deploying your own AI agents using popular tools like OpenAI Agents SDK, CrewAI, LangGraph, AutoGen, and MCP. You'll learn to create agents that can think, act, and work together, with clear setup instructions for Windows, Mac, and Linux, plus support if you get stuck. The benefit is hands-on experience with current AI agent technology, preparing you to develop autonomous systems for real-world tasks while connecting with a helpful community[1][2][3].

https://github.com/ed-donner/agents

GitHub - ed-donner/agents: Repo for the Complete Agentic AI Engineering Course

397 views, 14:00

GitHub Trends

#jupyter_notebook #darknet #pytorch #scaled_yolov4 #yolor #yolov3 #yolov4 #yolov7

YOLOv7 is a powerful tool for detecting objects in images and videos. It is fast, accurate, and can work well on devices with limited power, making it useful for real-time applications like self-driving cars and surveillance systems. YOLOv7 uses advanced techniques like Feature Pyramid Networks to detect objects of different sizes and can handle complex scenes with overlapping objects. This makes it beneficial for users who need quick and precise object detection in various environments.
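Handling overlapping detections usually comes down to two small pieces of logic: intersection over union (IoU) and non-maximum suppression (NMS), a post-processing step common to YOLO-family detectors. The sketch below is a generic plain-Python version, not the repository's implementation. The box format (x1, y1, x2, y2), the 0.5 threshold, and the sample detections are assumptions chosen for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(detections, iou_threshold=0.5):
    """Greedy NMS: keep the highest-scoring box, drop boxes that overlap it too much.

    detections is a list of (box, score) pairs; the result is sorted by score.
    """
    kept = []
    for box, score in sorted(detections, key=lambda d: d[1], reverse=True):
        if all(iou(box, kept_box) <= iou_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept


# Two near-duplicate boxes on one object plus one separate object (made-up values).
detections = [
    ((0.0, 0.0, 10.0, 10.0), 0.9),
    ((1.0, 1.0, 11.0, 11.0), 0.8),
    ((20.0, 20.0, 30.0, 30.0), 0.7),
]
for box, score in nms(detections):
    print(box, score)
```

The second box overlaps the first with an IoU of 81/119 (about 0.68), so it is suppressed, while the distant third box is kept.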

https://github.com/WongKinYiu/yolov7

GitHub - WongKinYiu/yolov7: Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

445 views, 00:00

GitHub Trends

#jupyter_notebook

SAM 2 is a powerful AI model that can quickly and accurately separate objects in both images and videos, even objects it has never seen before. It works in real time, letting you select objects with simple prompts like clicks or boxes and refine the results interactively. This makes tasks like video editing, object tracking, and image annotation much easier and faster. SAM 2's ability to handle complex scenes and track objects smoothly across video frames helps improve creativity and productivity in many fields, from media production to computer vision research. It is open-source and easy to use with Python and PyTorch. Note that the link below points to the original Segment Anything (SAM) repository.

https://github.com/facebookresearch/segment-anything

GitHub - facebookresearch/segment-anything: The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

381 views, 12:00

GitHub Trends

#jupyter_notebook

Retrieval Augmented Generation (RAG) helps large language models (LLMs) answer questions using up-to-date or private information by connecting them to external data sources. Fine-tuning, by contrast, retrains the model on specific data. RAG is useful when you need current, dynamic information without costly retraining, making it ideal for tasks like customer support or knowledge management. Fine-tuning is better for deep expertise in a specialized field but requires more data and effort. Using RAG lets you get accurate, relevant answers quickly by combining the model's language skills with fresh, specific data, improving usefulness and reliability.
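The "no retraining" point can be shown with a toy example: updating what the model sees only requires adding a document to an external store. The sketch below is a hypothetical illustration in standard-library Python and is not taken from the linked repository. The `KnowledgeStore` class, its word-overlap scoring, and its rule of preferring newer documents on ties are all assumptions made for brevity.

```python
import re
from typing import List, Optional


def words(text):
    """Lowercased set of word tokens in the text."""
    return set(re.findall(r"\w+", text.lower()))


class KnowledgeStore:
    """Toy external data source; the language model itself is never retrained."""

    def __init__(self) -> None:
        self.docs: List[str] = []

    def add(self, text: str) -> None:
        """Make new information available immediately."""
        self.docs.append(text)

    def search(self, query: str) -> Optional[str]:
        """Return the document sharing the most words with the query.

        Ties go to the most recently added document (an assumed policy).
        """
        best, best_overlap = None, 0
        for doc in self.docs:
            overlap = len(words(query) & words(doc))
            if overlap > 0 and overlap >= best_overlap:
                best, best_overlap = doc, overlap
        return best


def augmented_prompt(store: KnowledgeStore, question: str) -> str:
    """Attach retrieved context, if any, to the question sent to the model."""
    context = store.search(question)
    if context is None:
        return f"Question: {question}"
    return f"Context: {context}\nQuestion: {question}"


store = KnowledgeStore()
store.add("The support line is open 9am to 5pm.")
question = "When is the support line open?"
print(augmented_prompt(store, question))

# New information arrives: add it to the store, with no change to the model.
store.add("Update: the support line is open 24 hours.")
print(augmented_prompt(store, question))
```

With fine-tuning, the same change in opening hours would mean preparing training data and retraining; here it is a single `add` call.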

https://github.com/langchain-ai/rag-from-scratch

GitHub - langchain-ai/rag-from-scratch

485 views, 11:30
