GitHub Trends
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
#rust #document_ocr #document_processing #ocr #ocr_recognition #pdf #pdf_parser #text_extraction

LiteParse is a fast, local PDF parser that extracts text along with bounding boxes, supports optional OCR, and runs in Rust, Python, Node.js, and the browser. It can also produce screenshots and handles DOCX, XLSX, PPTX, and image files by converting them first. Benefit: you can turn documents into clean text or JSON on your own machine, which keeps document processing private, fast, and structured.

https://github.com/run-llama/liteparse