Code Stars
1.87K subscribers
Code Stars provides notifications about GitHub repositories that are gaining a significant number of stars in a short period of time. Be the first to find out about trending repositories that everybody will be talking about soon.
#AI #chatGPT #python
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML
Total stars: 6025
Stars trend:
17 Apr 2024
5pm ▎ +2
6pm ▌ +4
7pm ▍ +3
8pm ▋ +5
9pm ▊ +6
10pm ▋ +5
11pm ▋ +5
18 Apr 2024
12am ▉ +7
1am █▏ +9
2am █▋ +13
3am █▎ +10
4am ██▏ +17

#html
#datapipelines, #deeplearning, #documentimageanalysis, #documentimageprocessing, #documentparser, #documentparsing, #docx, #donut, #informationretrieval, #langchain, #llm, #machinelearning, #ml, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdftojson, #pdftotext, #preprocessing
gotenberg/gotenberg
A developer-friendly API for converting numerous document formats into PDF files, and more!
Language:Go
Total stars: 6772
Stars trend:
18 Apr 2024
9pm ▌ +4
10pm ▏ +1
11pm ▍ +3
19 Apr 2024
12am ▏ +1
1am ▏ +1
2am +0
3am ▍ +3
4am ▍ +3
5am █▍ +11
6am ███▍ +27
7am ███▎ +26

#go
#api, #chrome, #chromium, #converter, #csv, #docker, #docx, #excel, #html, #http2, #libreoffice, #markdown, #pdf, #pdftk, #pptx, #puppeteer, #unoconv, #wkhtmltopdf, #word, #xlsx
koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
Language:JavaScript
Total stars: 16335
Stars trend:
28 May 2024
12am ▎ +2
1am █▊ +14
2am █▉ +15
3am ▉ +7
4am ▊ +6
5am █ +8
6am █▏ +9
7am ▏ +1
8am ▎ +2
9am ▋ +5
10am ▍ +3
11am ▍ +3

#javascript
#book, #cb7, #cbr, #cbt, #cbz, #comic, #docx, #ebook, #epub, #fb2, #html, #markdown, #mobi, #pdf, #reader, #rtf, #txt, #xml
brsloan/warewoolf
A minimalist novel-writing system/rich text editor designed to be usable without a mouse. For desktop and standalone word processors/digital typewriters/writerDecks.
Language:JavaScript
Total stars: 116
Stars trend:
14 Sep 2024
11pm ▎ +2
15 Sep 2024
12am █ +8
1am █▍ +11
2am █▋ +13
3am █ +8
4am ▋ +5
5am ▏ +1
6am █ +8
7am ▊ +6
8am ▍ +3
9am ▍ +3
10am █ +8

#javascript
#docx, #editor, #fiction, #markdown, #novelwriting, #quill, #richtexteditor, #texteditor, #wordprocessor, #writerdeck, #writingsoftware
QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 807
Stars trend:
3 Dec 2024
6am ▏ +1
7am +0
8am ██▋ +21
9am █▌ +12
10am █▍ +11
11am █▎ +10
12pm █▏ +9
1pm ▉ +7
2pm █▎ +10

#python
#docx, #llm, #parser, #pdf, #powerpoint
DS4SD/docling
Get your documents ready for gen AI
Language:Python
Total stars: 18111
Stars trend:
13 Jan 2025
11pm ▌ +4
14 Jan 2025
12am ▏ +1
1am ▍ +3
2am ▎ +2
3am ▋ +5
4am ▊ +6
5am ██▌ +20
6am █▋ +13
7am ▉ +7
8am █▏ +9
9am ▉ +7
10am ▊ +6

#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx
yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
29 Jan 2025
10pm █▏ +9
11pm ▌ +4
30 Jan 2025
12am █▎ +10
1am ▋ +5
2am █▏ +9
3am ▊ +6
4am ▉ +7
5am █ +8
6am ▉ +7
7am █ +8
8am ▋ +5
9am █ +8

#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata
Goldziher/kreuzberg
A text extraction library supporting PDFs, images, office documents and more
Language:Python
Total stars: 304
Stars trend:
15 Feb 2025
12am █ +8
1am ▋ +5
2am █ +8
3am ▊ +6
4am ▉ +7
5am ▉ +7
6am ▊ +6
7am ▎ +2
8am █ +8
9am █ +8
10am █▋ +13

#python
#asyncio, #docx, #ocr, #pdf, #textextraction
koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
Language:JavaScript
Total stars: 21188
Stars trend:
4 Mar 2025
12am █▉ +15
1am ▌ +4
2am █▏ +9
3am █▏ +9
4am ▎ +2
5am █▏ +9
6am ▊ +6
7am ▍ +3
8am ▌ +4
9am ▋ +5
10am ▉ +7
11am ▋ +5

#javascript
#book, #cb7, #cbr, #cbt, #cbz, #comic, #docx, #ebook, #epub, #fb2, #html, #markdown, #mobi, #pdf, #reader, #rtf, #txt, #xml
docling-project/docling
Get your documents ready for gen AI
Language:Python
Total stars: 26148
Stars trend:
6 Apr 2025
11am ▏ +1
12pm +0
1pm ▍ +3
2pm ▏ +1
3pm ▌ +4
4pm ▌ +4
5pm ██▏ +17
6pm █ +8
7pm ▊ +6
8pm █▏ +9
9pm █▌ +12
10pm ██ +16

#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx