oomol-lab/pdf-craft
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books. The project has just started.
Language:Python
Total stars: 1537
Stars trend:
#python
#ai, #document, #ocr, #pdf
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books. The project has just started.
Language:Python
Total stars: 1537
Stars trend:
10 Apr 2025
4pm ▏ +1
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm ▏ +1
11pm ▏ +1
11 Apr 2025
12am ████▊ +38
1am ██████████▊ +86
2am ████████ +64
#python
#ai, #document, #ocr, #pdf
QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 6094
Stars trend:
#python
#docx, #llm, #parser, #pdf, #powerpoint
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 6094
Stars trend:
25 Apr 2025
3pm █▎ +10
4pm █▍ +11
5pm █▏ +9
6pm █▍ +11
7pm ▌ +4
8pm █ +8
9pm ▉ +7
10pm ▋ +5
11pm ▉ +7
26 Apr 2025
12am █▎ +10
1am █▎ +10
#python
#docx, #llm, #parser, #pdf, #powerpoint
👍1
EvanZhouDev/llm.pdf
Run LLMs inside a PDF file.
Language:Python
Total stars: 251
Stars trend:
#python
#ai, #llm, #pdf
Run LLMs inside a PDF file.
Language:Python
Total stars: 251
Stars trend:
26 Apr 2025
5pm ▎ +2
6pm ▉ +7
7pm █▍ +11
8pm █ +8
9pm █▍ +11
10pm █ +8
11pm ▋ +5
27 Apr 2025
12am █▎ +10
1am █ +8
2am ▍ +3
3am █▏ +9
4am █ +8
#python
#ai, #llm, #pdf
😁1
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java
Total stars: 58829
Stars trend:
#java
#docker, #java, #pdf, #pdfconverter, #pdfeditor, #pdfmanipulation, #pdfmerger, #pdfocr, #pdftools, #pdfwebapps, #pdfmerger
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java
Total stars: 58829
Stars trend:
16 May 2025
4am ▋ +5
5am █ +8
6am ▉ +7
7am █ +8
8am █▉ +15
9am █▌ +12
10am █ +8
11am ██▎ +18
12pm █▎ +10
1pm █▏ +9
2pm ▉ +7
3pm █▎ +10
#java
#docker, #java, #pdf, #pdfconverter, #pdfeditor, #pdfmanipulation, #pdfmerger, #pdfocr, #pdftools, #pdfwebapps, #pdfmerger
clawsoftware/clawPDF
Open Source Virtual (Network) Printer for Windows that allows you to create PDFs, OCR text, and print images, with advanced features usually available only in enterprise solutions.
Language:C#
Total stars: 1043
Stars trend:
#csharp
#imageprocessing, #merge, #networkprinter, #ocr, #pdf, #pdfmerger, #pdfprinter, #print, #printer, #terminalserver, #windows
Open Source Virtual (Network) Printer for Windows that allows you to create PDFs, OCR text, and print images, with advanced features usually available only in enterprise solutions.
Language:C#
Total stars: 1043
Stars trend:
19 May 2025
12pm ▍ +3
1pm █████▌ +44
2pm ███████▎ +58
3pm ██████▌ +52
4pm ██▋ +21
#csharp
#imageprocessing, #merge, #networkprinter, #ocr, #pdf, #pdfmerger, #pdfprinter, #print, #printer, #terminalserver, #windows
iamgio/quarkdown
🪐 Markdown with superpowers — from ideas to presentations, articles and books.
Language:Kotlin
Total stars: 1292
Stars trend:
#kotlin
#compiler, #markdown, #markdownparser, #markuplanguage, #paper, #pdf, #presentations, #programminglanguage, #scriptinglanguage, #slides, #typesetting, #typesettingsystem
🪐 Markdown with superpowers — from ideas to presentations, articles and books.
Language:Kotlin
Total stars: 1292
Stars trend:
2 Jun 2025
10pm ▏ +1
11pm +0
3 Jun 2025
12am +0
1am ▏ +1
2am ▏ +1
3am ▏ +1
4am +0
5am +0
6am +0
7am +0
8am ██████▍ +51
9am █████████ +72
#kotlin
#compiler, #markdown, #markdownparser, #markuplanguage, #paper, #pdf, #presentations, #programminglanguage, #scriptinglanguage, #slides, #typesetting, #typesettingsystem
T8RIN/ImageToolbox
🖼️ Image Toolbox is a powerful app for advanced image manipulation. It offers dozens of features, from basic tools like crop and draw to filters, OCR, and a wide range of image processing options
Language:Kotlin
Total stars: 7435
Stars trend:
#kotlin
#android, #backgroundremoval, #cleanarchitecture, #crop, #editphoto, #exif, #fdroid, #filterimage, #imagemanipulation, #jetpackcompose, #jxl, #kotlin, #materialyou, #ocrrecognition, #pdf, #photocollage, #photoeditor, #psd, #qrcodescanner, #watermark
🖼️ Image Toolbox is a powerful app for advanced image manipulation. It offers dozens of features, from basic tools like crop and draw to filters, OCR, and a wide range of image processing options
Language:Kotlin
Total stars: 7435
Stars trend:
7 Jun 2025
9pm ▍ +3
10pm ▏ +1
11pm ▏ +1
8 Jun 2025
12am ▋ +5
1am █ +8
2am █▌ +12
3am █▌ +12
4am █▍ +11
5am ▉ +7
6am █ +8
7am ▉ +7
#kotlin
#android, #backgroundremoval, #cleanarchitecture, #crop, #editphoto, #exif, #fdroid, #filterimage, #imagemanipulation, #jetpackcompose, #jxl, #kotlin, #materialyou, #ocrrecognition, #pdf, #photocollage, #photoeditor, #psd, #qrcodescanner, #watermark
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Language:Python
Total stars: 35977
Stars trend:
#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Language:Python
Total stars: 35977
Stars trend:
22 Jun 2025
3pm ▍ +3
4pm ▎ +2
5pm ██▍ +19
6pm ██▍ +19
7pm █▏ +9
8pm ▍ +3
9pm ▍ +3
10pm █▌ +12
11pm ██▍ +19
23 Jun 2025
12am ███ +24
1am ████▋ +37
2am █████▊ +46
#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
docling-project/docling
Get your documents ready for gen AI
Language:Python
Total stars: 33012
Stars trend:
#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx
Get your documents ready for gen AI
Language:Python
Total stars: 33012
Stars trend:
29 Jun 2025
6am ▎ +2
7am ▉ +7
8am ▎ +2
9am ▌ +4
10am █ +8
11am █▎ +10
12pm █ +8
1pm █▋ +13
2pm █▎ +10
3pm █▊ +14
4pm █▋ +13
5pm ▌ +4
#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx
forthespada/CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
Language:
Total stars: 23299
Stars trend:
#algorithms, #c, #cpp, #csbooks, #database, #interview, #java, #javascript, #linux, #os, #pdf, #python, #redis, #sql
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
Language:
Total stars: 23299
Stars trend:
8 Jul 2025
3pm █▋ +13
4pm ▉ +7
5pm ▏ +1
6pm ▏ +1
7pm ▏ +1
8pm ▎ +2
9pm ▏ +1
10pm ▎ +2
11pm ▋ +5
9 Jul 2025
12am █▌ +12
1am ████▎ +34
2am ███ +24
#algorithms, #c, #cpp, #csbooks, #database, #interview, #java, #javascript, #linux, #os, #pdf, #python, #redis, #sql
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Language:Python
Total stars: 30169
Stars trend:
#python
#imageprocessing, #ocr, #pdf, #python, #tesseract
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Language:Python
Total stars: 30169
Stars trend:
12 Jul 2025
10am ▍ +3
11am █ +8
12pm ▉ +7
1pm ▋ +5
2pm █▍ +11
3pm ▌ +4
4pm ▎ +2
5pm ▊ +6
6pm ▋ +5
7pm █▌ +12
8pm ▉ +7
9pm █▍ +11
#python
#imageprocessing, #ocr, #pdf, #python, #tesseract
❤1
MarkPDFdown/markpdfdown
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
Language:Python
Total stars: 1036
Stars trend:
#python
#llm, #markdown, #pdf, #pdfconverter, #pdfmarkdown, #pdf2markdown, #pdf2md
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
Language:Python
Total stars: 1036
Stars trend:
26 Jul 2025
2pm ▋ +5
3pm ▍ +3
4pm █ +8
5pm ▎ +2
6pm █▉ +15
7pm ▍ +3
8pm █▊ +14
9pm █▍ +11
10pm ▉ +7
11pm ▋ +5
27 Jul 2025
12am █ +8
#python
#llm, #markdown, #pdf, #pdfconverter, #pdfmarkdown, #pdf2markdown, #pdf2md
DDULDDUCK/every-pdf
✍️ A powerful, all-in-one desktop PDF toolkit to edit, convert, merge, and secure your documents. Built with Electron, Next.js, and Python.
Language:HTML
Total stars: 554
Stars trend:
#html
#crossplatform, #desktopapplication, #electron, #fastapi, #nextjs, #pdf, #pdfconverter, #pdfviewer, #python
✍️ A powerful, all-in-one desktop PDF toolkit to edit, convert, merge, and secure your documents. Built with Electron, Next.js, and Python.
Language:HTML
Total stars: 554
Stars trend:
10 Aug 2025
6pm ▏ +1
7pm +0
8pm +0
9pm ▌ +4
10pm █ +8
11pm ▉ +7
11 Aug 2025
12am ██▍ +19
1am █▊ +14
2am ▊ +6
3am ▊ +6
4am ▉ +7
5am ▌ +4
#html
#crossplatform, #desktopapplication, #electron, #fastapi, #nextjs, #pdf, #pdfconverter, #pdfviewer, #python
❤1