bytedance/UI-TARS-desktop
A GUI agent application based on UI-TARS (Vision-Language Model) that lets you control your computer using natural language.
Language: TypeScript
#agent #browser_use #computer_use #electron #gui_agents #vision #vite #vlm
Stars: 505 Issues: 8 Forks: 35
https://github.com/bytedance/UI-TARS-desktop
  
  GitHub - bytedance/UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
THUDM/GLM-4.1V-Thinking
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Language: Python
#image2text #reasoning #video_understanding #vlm
Stars: 449 Issues: 9 Forks: 8
https://github.com/THUDM/GLM-4.1V-Thinking
  
  GitHub - zai-org/GLM-V: GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Hunyuan-PromptEnhancer/PromptEnhancer
PromptEnhancer is a prompt-rewriting tool that refines prompts into clearer, structured versions for better image generation.
Language: Python
#hunyuan #hunyuan_image #prompt #prompt_engineering #prompt_enhancer #vlm
Stars: 260 Issues: 2 Forks: 24
https://github.com/Hunyuan-PromptEnhancer/PromptEnhancer
  