Danily07/Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc.
Language: C#
#autotranslate #easyocr #game_translation #mlnet #ocr #translation
Stars: 239 Issues: 5 Forks: 4
https://github.com/Danily07/Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc.
Language: C#
#autotranslate #easyocr #game_translation #mlnet #ocr #translation
Stars: 239 Issues: 5 Forks: 4
https://github.com/Danily07/Translumo
GitHub
GitHub - ramjke/Translumo: Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc.
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc. - ramjke/Translumo
👍9👏3🤔2
junhoyeo/BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract) with 🧠 LLM.
Language: Python
#ai #chatgpt #chatgpt_api #easyocr #llm #ocr #openai #openai_api #tesseract #tesseract_ocr
Stars: 154 Issues: 4 Forks: 7
https://github.com/junhoyeo/BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract) with 🧠 LLM.
Language: Python
#ai #chatgpt #chatgpt_api #easyocr #llm #ocr #openai #openai_api #tesseract #tesseract_ocr
Stars: 154 Issues: 4 Forks: 7
https://github.com/junhoyeo/BetterOCR
GitHub
GitHub - junhoyeo/BetterOCR: 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠…
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM. - junhoyeo/BetterOCR
👍3👎1
reworkd/tarsier
Vision utilities for web interaction agents 👀
Language: Jupyter Notebook
#gpt4v #llms #ocr #playwright #pypi_package #python #selenium #webscraping
Stars: 236 Issues: 3 Forks: 14
https://github.com/reworkd/tarsier
Vision utilities for web interaction agents 👀
Language: Jupyter Notebook
#gpt4v #llms #ocr #playwright #pypi_package #python #selenium #webscraping
Stars: 236 Issues: 3 Forks: 14
https://github.com/reworkd/tarsier
GitHub
GitHub - reworkd/tarsier: Vision utilities for web interaction agents 👀
Vision utilities for web interaction agents 👀. Contribute to reworkd/tarsier development by creating an account on GitHub.
VikParuchuri/texify
OCR model for math that outputs LaTeX and markdown
Language: Python
#deep_learning #latex #markdown #ocr
Stars: 142 Issues: 0 Forks: 7
https://github.com/VikParuchuri/texify
OCR model for math that outputs LaTeX and markdown
Language: Python
#deep_learning #latex #markdown #ocr
Stars: 142 Issues: 0 Forks: 7
https://github.com/VikParuchuri/texify
GitHub
GitHub - VikParuchuri/texify: Math OCR model that outputs LaTeX and markdown
Math OCR model that outputs LaTeX and markdown. Contribute to VikParuchuri/texify development by creating an account on GitHub.
👍1
robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs
GitHub
GitHub - robertknight/ocrs: Rust library and CLI tool for OCR (extracting text from images)
Rust library and CLI tool for OCR (extracting text from images) - robertknight/ocrs
🥰1👏1
VikParuchuri/tabled
Detect and extract tables to markdown and csv
Language: Python
#deep_learning #ocr #tables
Stars: 245 Issues: 4 Forks: 7
https://github.com/VikParuchuri/tabled
Detect and extract tables to markdown and csv
Language: Python
#deep_learning #ocr #tables
Stars: 245 Issues: 4 Forks: 7
https://github.com/VikParuchuri/tabled
GitHub
GitHub - VikParuchuri/tabled: Detect and extract tables to markdown and csv
Detect and extract tables to markdown and csv. Contribute to VikParuchuri/tabled development by creating an account on GitHub.
👍1
umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!
Language: C#
#asr #csharp #flyleaf #language_learning #media_player #ocr #player #tesseract #video #video_player #whisper #wpf #yt_dlp
Stars: 253 Issues: 5 Forks: 4
https://github.com/umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!
Language: C#
#asr #csharp #flyleaf #language_learning #media_player #ocr #player #tesseract #video #video_player #whisper #wpf #yt_dlp
Stars: 253 Issues: 5 Forks: 4
https://github.com/umlx5h/LLPlayer
GitHub
GitHub - umlx5h/LLPlayer: The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation…
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more! - umlx5h/LLPlayer
❤2👍1
ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Language: Python
#doclayout #educational_data #exam_ocr #machine_learning #ml_datasets #multi_modal #ocr #openai #paper_ocr #table_parsing
Stars: 250 Issues: 0 Forks: 11
https://github.com/ses4255/Versatile-OCR-Program
GitHub
GitHub - raphael-seo/Versatile-OCR-Program: Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) - raphael-seo/Versatile-OCR-Program
❤1👍1
TimmyOVO/deepseek-ocr.rs
Rust implementation of DeepSeek-OCR with OpenAI-compatible server & CLI No Python environment needed - just download and run.
Language: Rust
#candle #ocr #ocr_recognition #openai #rust
Stars: 808 Issues: 4 Forks: 61
https://github.com/TimmyOVO/deepseek-ocr.rs
Rust implementation of DeepSeek-OCR with OpenAI-compatible server & CLI No Python environment needed - just download and run.
Language: Rust
#candle #ocr #ocr_recognition #openai #rust
Stars: 808 Issues: 4 Forks: 61
https://github.com/TimmyOVO/deepseek-ocr.rs
GitHub
GitHub - TimmyOVO/deepseek-ocr.rs: Rust multi‑backend OCR/VLM engine (DeepSeek‑OCR-1/2, PaddleOCR‑VL, DotsOCR) with DSQ quantization…
Rust multi‑backend OCR/VLM engine (DeepSeek‑OCR-1/2, PaddleOCR‑VL, DotsOCR) with DSQ quantization and an OpenAI‑compatible server & CLI – run locally without Python. - TimmyOVO/deepseek-ocr.rs
❤1
majcheradam/ocrbase
📄 PDF ->.MD/.JSON Document OCR and structured data extraction API. PaddleOCR + LLM-powered parsing. Real-time WebSocket updates. Type-safe TypeScript SDK with React hooks. Self-hostable.
Language: TypeScript
#ai #bun #document_processing #drizzle #elysia #json #markdown #ocr #paddleocr #pdf #react_hooks #self_hosted #typescript #websocket
Stars: 523 Issues: 8 Forks: 31
https://github.com/majcheradam/ocrbase
📄 PDF ->.MD/.JSON Document OCR and structured data extraction API. PaddleOCR + LLM-powered parsing. Real-time WebSocket updates. Type-safe TypeScript SDK with React hooks. Self-hostable.
Language: TypeScript
#ai #bun #document_processing #drizzle #elysia #json #markdown #ocr #paddleocr #pdf #react_hooks #self_hosted #typescript #websocket
Stars: 523 Issues: 8 Forks: 31
https://github.com/majcheradam/ocrbase
GitHub
GitHub - ocrbase-hq/ocrbase: 📄 PDF/IMG ->.MD/JSON Document OCR API for PaddleOCR and GLMOCR. Self-hostable.
📄 PDF/IMG ->.MD/JSON Document OCR API for PaddleOCR and GLMOCR. Self-hostable. - ocrbase-hq/ocrbase
zai-org/GLM-OCR
GLM-OCR: Accurate × Fast × Comprehensive
Language: Python
#glm #image2text #ocr
Stars: 439 Issues: 18 Forks: 21
https://github.com/zai-org/GLM-OCR
GLM-OCR: Accurate × Fast × Comprehensive
Language: Python
#glm #image2text #ocr
Stars: 439 Issues: 18 Forks: 21
https://github.com/zai-org/GLM-OCR
GitHub
GitHub - zai-org/GLM-OCR: GLM-OCR: Accurate × Fast × Comprehensive
GLM-OCR: Accurate × Fast × Comprehensive. Contribute to zai-org/GLM-OCR development by creating an account on GitHub.