Code Stars

MaximeVandegar/Papers-in-100-Lines-of-Code
Implementation of papers in 100 lines of code.
Language:Python
Total stars: 875
Stars trend:

6 Dec 2024
12am ▏ +1
 1am ▌ +4
 2am ▋ +5
 3am ▍ +3
 4am ▌ +4
 5am ▌ +4
 6am ▌ +4
 7am █▎ +10
 8am █▎ +10
 9am █▊ +14
10am ▊ +6
11am █▎ +10

#python
#3d, #aes, #artificialintelligence, #deeplearning, #diffusionmodels, #educational, #gans, #generativemodel, #implementationofresearchpaper, #inverserendering, #machinelearning, #metalearning, #nerf, #neuralradiancefields, #papers, #python, #pytorch, #reinforcementlearning, #research, #rl

133 views12:17

Code Stars

labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python
Total stars: 58051
Stars trend:

19 Jan 2025
 9pm ▏ +1
10pm █▏ +9
11pm ▌ +4
20 Jan 2025
12am ▍ +3
 1am ▍ +3
 2am ▋ +5
 3am █▏ +9
 4am █▎ +10
 5am █▏ +9
 6am █▏ +9
 7am █ +8
 8am ▋ +5

#python
#attention, #deeplearning, #deeplearningtutorial, #gan, #literateprogramming, #lora, #machinelearning, #neuralnetworks, #optimizers, #pytorch, #reinforcementlearning, #transformer, #transformers

93 views09:18

Code Stars

turningpoint-ai/VisualThinker-R1-Zero
Explore the Multimodal “Aha Moment” on 2B Model
Language:Python
Total stars: 208
Stars trend:

5 Mar 2025
 1am ▍ +3
 2am  +0
 3am ▏ +1
 4am █ +8
 5am ██▉ +23
 6am ██▉ +23
 7am ██▌ +20
 8am ▉ +7
 9am █▏ +9
10am ▉ +7
11am ▉ +7
12pm █▌ +12

#python
#deepseek, #deepseekr1, #deepseekr1zero, #grpo, #multimodal, #multimodaljourney, #multimodalr1, #posttraining, #r1, #r1zero, #reasoning, #reinforcementlearning

54 views13:18

Code Stars

FareedKhan-dev/all-rl-algorithms
Implementation of all RL algorithms in a simpler way
Language:Jupyter Notebook
Total stars: 87
Stars trend:

30 Mar 2025
 3pm ██▋ +21
 4pm ███▏ +25
 5pm █▉ +15
 6pm █▍ +11
 7pm █▊ +14

#jupyternotebook
#agent, #llm, #openai, #python, #reinforcementlearning, #rl

❤1

78 views20:17

Code Stars

inclusionAI/AReaL
Distributed RL System for LLM Reasoning
Language:Python
Total stars: 374
Stars trend:

31 Mar 2025
12am ▏ +1
 1am ▍ +3
 2am ███████▉ +63
 3am ███████▌ +60
 4am ███▍ +27

#python
#llm, #llmreasoning, #machinelearningsystems, #mlsys, #reinforcementlearning, #rl

84 views05:17

Code Stars

girafe-ai/ml-course
Open Machine Learning course
Language:Jupyter Notebook
Total stars: 2606
Stars trend:

9 Apr 2025
11am ▌ +4
12pm ▉ +7
 1pm █▏ +9
 2pm █ +8
 3pm ▊ +6
 4pm ▏ +1
 5pm ▎ +2
 6pm ▌ +4
 7pm █▎ +10
 8pm ▉ +7
 9pm █▏ +9
10pm █▏ +9

#jupyternotebook
#computervision, #course, #deeplearning, #machinelearning, #materials, #naturallanguageprocessing, #python, #pytorch, #reinforcementlearning, #seminars

88 views23:17

Code Stars

ivanbelenky/RL
R.L. methods and techniques.
Language:Python
Total stars: 108
Stars trend:

6 May 2025
10pm ▏ +1
11pm ██ +16
7 May 2025
12am █▎ +10
 1am █▉ +15
 2am ▉ +7
 3am ▊ +6
 4am ▌ +4
 5am ▍ +3
 6am █ +8
 7am ▉ +7
 8am ▍ +3

#python
#gridworld, #markov, #markovdecisionprocesses, #qlearning, #qlearning, #reinforcementlearning, #sarsa, #tabularmethods

88 views09:18

Code Stars

NVlabs/Long-RL
Long-RL: Scaling RL to Long Sequences
Language:Python
Total stars: 241
Stars trend:

11 Jul 2025
 2am ███▌ +28
 3am ███▌ +28
 4am █▉ +15
 5am █▎ +10
 6am ██▌ +20
 7am ▉ +7
 8am ▊ +6
 9am ▉ +7
10am ▎ +2
11am ▏ +1
12pm ▍ +3
 1pm ▌ +4

#python
#efficientai, #largelanguagemodels, #longsequence, #multimodality, #reinforcementlearning, #sequenceparallelism

85 views14:17

Code Stars

PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
Language:C
Total stars: 2477
Stars trend:

11 Jul 2025
 2pm ▌ +4
 3pm ▍ +3
 4pm █ +8
 5pm █▍ +11
 6pm █ +8
 7pm █▌ +12
 8pm █ +8
 9pm █▏ +9
10pm █▎ +10
11pm ▎ +2

#c
#reinforcementlearning

84 views00:17

Code Stars

OpenPipe/ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!
Language:Python
Total stars: 1030
Stars trend:

11 Jul 2025
 5pm ▉ +7
 6pm █▌ +12
 7pm ██ +16
 8pm █▌ +12
 9pm █▎ +10
10pm █▎ +10
11pm ▋ +5
12 Jul 2025
12am ▉ +7

#python
#agent, #agenticai, #grpo, #kimiai, #llms, #lora, #qwen, #qwen3, #reinforcementlearning, #rl

86 views01:17

About

Blog

Apps

Platform