PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python
#ai_safety #alpaca #datasets #deepspeed #large_language_models #llama #llm #llms #reinforcement_learning #reinforcement_learning_from_human_feedback #rlhf #safe_reinforcement_learning #safe_reinforcement_learning_from_human_feedback #safe_rlhf #safety #transformers #vicuna
Stars: 279 Issues: 0 Forks: 14
https://github.com/PKU-Alignment/safe-rlhf
Gen-Verse/OpenClaw-RL
OpenClaw-RL: Personalize openclaw simply by talking to it
Language: TypeScript
#async #grpo #memory_systems #on_policy_distillation #open_claw #openclaw_skills #rlhf #sglang #skill_learning #slime
Stars: 672 Issues: 3 Forks: 60
https://github.com/Gen-Verse/OpenClaw-RL