GitHub Trends
10.7K subscribers
15.7K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#python #auto_regressive_model #autoregressive_models #diffusion_models #generative_ai #generative_model #gpt #gpt_2 #image_generation #large_language_models #neurips #transformers #vision_transformer

VAR (Visual Autoregressive Modeling) is a new way to generate images that improves upon existing methods. It uses a "next-scale prediction" approach, which means it generates images from coarse to fine details, unlike the traditional method of predicting pixel by pixel. This makes VAR models better than diffusion models for the first time. You can try VAR on a demo website and generate images interactively, which is fun and easy. VAR also follows power-law scaling laws, making it efficient and scalable. The benefit to you is that you can create high-quality images quickly and easily, and even explore technical details through provided scripts and models.

https://github.com/FoundationVision/VAR
👍1😁1
#python #anonymization #anonymization_service #data_anonymization #data_loss_prevention #data_masking #data_protection #data_scrubbing #de_identification #dlp #microsoft #pii #pii_anonymization #pii_anonymization_service #pii_detection #presidio #privacy #privacy_protection #python #text_anonymization #transformers

Presidio is a tool that helps protect sensitive information like names, credit card numbers, and addresses in text and images. It can quickly identify and hide this private data, making it safer to use. You can customize Presidio to fit your specific needs and use it in various ways, such as with Python, Docker, or Kubernetes. This helps organizations keep their data private and secure, which is important for protecting user information.

https://github.com/microsoft/presidio
#python #gpu #llm #pytorch #transformers

The `ipex-llm` library is a powerful tool for accelerating Large Language Models (LLMs) on Intel GPUs, NPUs, and CPUs. It integrates seamlessly with popular frameworks like HuggingFace transformers, LangChain, LlamaIndex, and more. Here are the key benefits `ipex-llm` optimizes LLM performance with advanced quantization techniques (FP8, FP6, FP4, INT4) and self-speculative decoding, leading to significant speedups.
- **Wide Model Support** It works on various Intel hardware such as Arc GPUs, Core Ultra NPUs, and CPUs, making it versatile for different setups.
- **Easy Integration** Detailed quickstart guides, code examples, and tutorials help users get started quickly.

Overall, `ipex-llm` enhances the performance and usability of LLMs on Intel hardware, making it a valuable tool for developers and researchers.

https://github.com/intel/ipex-llm
#mdx #deep_learning #hacktoberfest #nlp #transformers

The Hugging Face course teaches you how to use Transformers for natural language processing tasks. You'll learn about the Hugging Face ecosystem, including tools like Transformers, Datasets, Tokenizers, and Accelerate, as well as the Hugging Face Hub. This free course helps you understand how to fine-tune models and share your results. It's beneficial because it provides hands-on experience with popular AI libraries and allows you to build and showcase your own projects on the Hugging Face platform.

https://github.com/huggingface/course
1
#typescript #electron #llama #llms #lora #mlx #rlhf #transformers

Transformer Lab is a free, open-source tool that lets you easily work with large language models on your own computer, offering one-click downloads for popular models like Llama3 and Mistral, fine-tuning across different hardware (including Apple Silicon and GPUs), and features like chatting, training, and evaluating models through a simple interface—saving you from complex setups like CUDA or Python version issues[1][2][5].

https://github.com/transformerlab/transformerlab-app
#python #apple_silicon #audio_processing #mlx #multimodal #speech_recognition #speech_synthesis #speech_to_text #text_to_speech #transformers

MLX-Audio is a powerful tool for converting text into speech and speech into new audio. It works well on Apple Silicon devices, like M-series chips, making it fast and efficient. You can choose from different languages and voices, and even adjust how fast the speech is. It also includes a web interface where you can see audio in 3D and play your own files. This tool is helpful for making audiobooks, interactive media, and personal projects because it's easy to use and provides high-quality audio quickly.

https://github.com/Blaizzy/mlx-audio
#typescript #ai #chatgpt #docsgpt #hacktoberfest #information_retrieval #language_model #llm #machine_learning #natural_language_processing #python #pytorch #rag #react #semantic_search #transformers #web_app

DocsGPT is an open-source AI tool that helps you quickly find accurate answers from many types of documents and web sources without errors. It supports formats like PDF, DOCX, images, and integrates with websites, APIs, and chat platforms like Discord and Telegram. You can deploy it privately for security, customize it to fit your brand, and connect it to tools for advanced actions. This means you save time searching for information, get reliable answers with sources, and improve productivity whether you’re a developer, support team, or business user. It’s easy to set up and scales well for many users[2][3][4].

https://github.com/arc53/DocsGPT
1
#python #language_models #linux #machine_translation #nlp #open_source #python #transformers #translation

Argos Translate is a free, open-source tool that lets you translate text offline using your own computer. It works as a Python library, command-line tool, or with a graphical interface, and supports many languages. You can install language packages for direct translations, and it can even translate between languages that don’t have a direct package by using a middle language. This means you can translate more language pairs, though the quality might be a little lower. Argos Translate is fast, private, and does not need an internet connection after setup, making it useful for secure or offline translation needs.

https://github.com/argosopentech/argos-translate
#python #agents #ai #ai_agents #ai_engineering #computer_vision #course #deep_learning #from_scratch #generative_ai #llm #machine_learning #mcp #nlp #python #reinforcement_learning #rust #swarm_intelligence #transformers #tutorial #typescript

This is a free MIT learning guide for AI engineering with 428 lessons in 20 phases. It teaches you AI from the math up, then moves into machine learning, deep learning, LLMs, agents, tools, safety, and production. Each lesson helps you build useful code or AI tools, not just read theory. You can start at the right level, follow a clear path, and keep reusable artifacts for real work. The benefit is simple: you learn how AI actually works and gain practical skills you can use to build and ship better AI systems.

https://github.com/rohitg00/ai-engineering-from-scratch
1👍1
#jupyter_notebook #gemini #large_language_models #llm #openai #training #transformers

This project shows how to build and train a transformer language model from scratch in PyTorch. It uses the Pile dataset, tokenizes text with tiktoken, and stores tokens in HDF5 files for faster training. The code includes attention, MLP, transformer blocks, training, saving, and text generation. The benefit is that you can learn how LLMs work and train your own small or large model on a single GPU, then use it to generate text for your own tasks.

https://github.com/FareedKhan-dev/train-llm-from-scratch
1