GitHub repos

Danily07/Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, etc.
Language: C#
#autotranslate #easyocr #game_translation #mlnet #ocr #translation
Stars: 239 Issues: 5 Forks: 4
https://github.com/Danily07/Translumo

GitHub

GitHub - ramjke/Translumo: Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, etc.

Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, etc. - ramjke/Translumo

GitHub repos

junhoyeo/BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract) with 🧠 LLM.
Language: Python
#ai #chatgpt #chatgpt_api #easyocr #llm #ocr #openai #openai_api #tesseract #tesseract_ocr
Stars: 154 Issues: 4 Forks: 7
https://github.com/junhoyeo/BetterOCR

GitHub

GitHub - junhoyeo/BetterOCR: 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠…

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM. - junhoyeo/BetterOCR

GitHub repos

reworkd/tarsier
Vision utilities for web interaction agents 👀
Language: Jupyter Notebook
#gpt4v #llms #ocr #playwright #pypi_package #python #selenium #webscraping
Stars: 236 Issues: 3 Forks: 14
https://github.com/reworkd/tarsier

GitHub repos

VikParuchuri/texify
OCR model for math that outputs LaTeX and markdown
Language: Python
#deep_learning #latex #markdown #ocr
Stars: 142 Issues: 0 Forks: 7
https://github.com/VikParuchuri/texify

GitHub repos

robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs

GitHub

GitHub - robertknight/ocrs: Rust library and CLI tool for OCR (extracting text from images)

Rust library and CLI tool for OCR (extracting text from images) - robertknight/ocrs

GitHub repos

VikParuchuri/tabled
Detect and extract tables to markdown and csv
Language: Python
#deep_learning #ocr #tables
Stars: 245 Issues: 4 Forks: 7
https://github.com/VikParuchuri/tabled

GitHub repos

umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!
Language: C#
#asr #csharp #flyleaf #language_learning #media_player #ocr #player #tesseract #video #video_player #whisper #wpf #yt_dlp
Stars: 253 Issues: 5 Forks: 4
https://github.com/umlx5h/LLPlayer

GitHub

GitHub - umlx5h/LLPlayer: The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation…

The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more! - umlx5h/LLPlayer

GitHub repos

ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program

GitHub

GitHub - raphael-seo/Versatile-OCR-Program: Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) - raphael-seo/Versatile-OCR-Program

GitHub repos

TimmyOVO/deepseek-ocr.rs
Rust implementation of DeepSeek-OCR with OpenAI-compatible server & CLI. No Python environment needed - just download and run.
Language: Rust
#candle #ocr #ocr_recognition #openai #rust
Stars: 808 Issues: 4 Forks: 61
https://github.com/TimmyOVO/deepseek-ocr.rs

GitHub

GitHub - TimmyOVO/deepseek-ocr.rs: Rust multi‑backend OCR/VLM engine (DeepSeek‑OCR-1/2, PaddleOCR‑VL, DotsOCR) with DSQ quantization…

Rust multi‑backend OCR/VLM engine (DeepSeek‑OCR-1/2, PaddleOCR‑VL, DotsOCR) with DSQ quantization and an OpenAI‑compatible server & CLI – run locally without Python. - TimmyOVO/deepseek-ocr.rs

GitHub repos

majcheradam/ocrbase
📄 PDF -> .MD/.JSON Document OCR and structured data extraction API. PaddleOCR + LLM-powered parsing. Real-time WebSocket updates. Type-safe TypeScript SDK with React hooks. Self-hostable.
Language: TypeScript
#ai #bun #document_processing #drizzle #elysia #json #markdown #ocr #paddleocr #pdf #react_hooks #self_hosted #typescript #websocket
Stars: 523 Issues: 8 Forks: 31
https://github.com/majcheradam/ocrbase

GitHub

GitHub - ocrbase-hq/ocrbase: 📄 PDF/IMG -> .MD/JSON Document OCR API for PaddleOCR and GLMOCR. Self-hostable.

📄 PDF/IMG -> .MD/JSON Document OCR API for PaddleOCR and GLMOCR. Self-hostable. - ocrbase-hq/ocrbase

GitHub repos

zai-org/GLM-OCR
GLM-OCR: Accurate × Fast × Comprehensive
Language: Python
#glm #image2text #ocr
Stars: 439 Issues: 18 Forks: 21
https://github.com/zai-org/GLM-OCR
