QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss ๐ง Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 6094
Stars trend:
#python
#docx, #llm, #parser, #pdf, #powerpoint
File Parser optimised for LLM Ingestion with no loss ๐ง Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 6094
Stars trend:
25 Apr 2025
3pm โโ +10
4pm โโ +11
5pm โโ +9
6pm โโ +11
7pm โ +4
8pm โ +8
9pm โ +7
10pm โ +5
11pm โ +7
26 Apr 2025
12am โโ +10
1am โโ +10#python
#docx, #llm, #parser, #pdf, #powerpoint
๐1
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.ไธ็ซๅผๅผๆบ้ซ่ดจ้ๆฐๆฎๆๅๅทฅๅ ท๏ผๅฐPDF่ฝฌๆขๆMarkdownๅJSONๆ ผๅผใ
Language:Python
Total stars: 35977
Stars trend:
#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
A high-quality tool for convert PDF to Markdown and JSON.ไธ็ซๅผๅผๆบ้ซ่ดจ้ๆฐๆฎๆๅๅทฅๅ ท๏ผๅฐPDF่ฝฌๆขๆMarkdownๅJSONๆ ผๅผใ
Language:Python
Total stars: 35977
Stars trend:
22 Jun 2025
3pm โ +3
4pm โ +2
5pm โโโ +19
6pm โโโ +19
7pm โโ +9
8pm โ +3
9pm โ +3
10pm โโ +12
11pm โโโ +19
23 Jun 2025
12am โโโ +24
1am โโโโโ +37
2am โโโโโโ +46#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
run-llama/semtools
Semantic search and document parsing tools for the command line
Language:Rust
Total stars: 410
Stars trend:
#rust
#cli, #embeddings, #parser, #rust, #search, #semantic, #semanticsearch, #staticembedding
Semantic search and document parsing tools for the command line
Language:Rust
Total stars: 410
Stars trend:
29 Aug 2025
8pm โ +5
9pm โโ +9
10pm โโ +11
11pm โโ +10
30 Aug 2025
12am โ +6
1am โ +5
2am โ +6
3am โ +4
4am โ +4
5am โโ +11
6am โ +7
7am โโ +10#rust
#cli, #embeddings, #parser, #rust, #search, #semantic, #semanticsearch, #staticembedding
mangiucugna/json_repair
A python module to repair invalid JSON from LLMs
Language:Python
Total stars: 3022
Stars trend:
#python
#deeplearning, #gpt4, #json, #llama3, #llm, #machinelearning, #mistral, #openaiapi, #parser, #repair
A python module to repair invalid JSON from LLMs
Language:Python
Total stars: 3022
Stars trend:
17 Oct 2025
3am โ +5
4am โ +2
5am โ +1
6am โ +3
7am โ +3
8am โโ +9
9am โ +4
10am โ +6
11am โ +3
12pm โ +3#python
#deeplearning, #gpt4, #json, #llama3, #llm, #machinelearning, #mistral, #openaiapi, #parser, #repair
โค1
serkodev/markdown-exit
Fast, customizable Markdown parser and renderer with full CommonMark support. TypeScript rewrite of markdown-it with enhancements.
Language:TypeScript
Total stars: 150
Stars trend:
#typescript
#commonmark, #javascript, #markdown, #parser, #renderer, #typescript
Fast, customizable Markdown parser and renderer with full CommonMark support. TypeScript rewrite of markdown-it with enhancements.
Language:TypeScript
Total stars: 150
Stars trend:
31 Oct 2025
3pm โโ +15
4pm โโ +11
5pm โโ +10
6pm โ +4
7pm โ +5#typescript
#commonmark, #javascript, #markdown, #parser, #renderer, #typescript
boa-dev/boa
Boa is an embeddable Javascript engine written in Rust.
Language:Rust
Total stars: 6214
Stars trend:
#rust
#ecmascript, #hacktoberfest, #interpreter, #javascript, #javascriptengine, #javascriptinterpreter, #parser, #runtime, #rust, #rustcrate, #wasm, #webassembly
Boa is an embeddable Javascript engine written in Rust.
Language:Rust
Total stars: 6214
Stars trend:
15 Nov 2025
8am โ +1
9am +0
10am +0
11am +0
12pm +0
1pm +0
2pm +0
3pm +0
4pm +0
5pm โ +1
6pm โโ +10
7pm โ +8#rust
#ecmascript, #hacktoberfest, #interpreter, #javascript, #javascriptengine, #javascriptinterpreter, #parser, #runtime, #rust, #rustcrate, #wasm, #webassembly
MadAppGang/dingo
A meta-language for Go that adds Result types, error propagation (?), and pattern matching while maintaining 100% Go ecosystem compatibility
Language:Go
Total stars: 89
Stars trend:
#go
#ast, #compiler, #developertools, #go, #golang, #gopls, #languageserver, #lsp, #parser
A meta-language for Go that adds Result types, error propagation (?), and pattern matching while maintaining 100% Go ecosystem compatibility
Language:Go
Total stars: 89
Stars trend:
23 Nov 2025
2am โ +4
3am โ +4
4am โ +2
5am โ +7
6am โ +6
7am โโ +15#go
#ast, #compiler, #developertools, #go, #golang, #gopls, #languageserver, #lsp, #parser
EmilStenstrom/justhtml
A pure Python HTML5 parser that just works. No C extensions to compile. No system dependencies to install. No complex API to learn.
Language:Python
Total stars: 182
Stars trend:
#python
#html5, #parser, #python
A pure Python HTML5 parser that just works. No C extensions to compile. No system dependencies to install. No complex API to learn.
Language:Python
Total stars: 182
Stars trend:
14 Dec 2025
4pm โ +5
5pm โ +6
6pm โ +2
7pm โ +6
8pm โโ +10
9pm โโ +11
10pm โโ +12
11pm โ +8
15 Dec 2025
12am โ +3#python
#html5, #parser, #python
LibPDF-js/core
A modern PDF library for TypeScript. Parse, modify, and generate PDFs with a clean, intuitive API.
Language:TypeScript
Total stars: 373
Stars trend:
#typescript
#digitalsignature, #document, #esign, #generator, #parser, #pdf, #pdfdocument, #pdfgenerator, #pkcs7, #signing, #typescript
A modern PDF library for TypeScript. Parse, modify, and generate PDFs with a clean, intuitive API.
Language:TypeScript
Total stars: 373
Stars trend:
24 Jan 2026
3pm โโ +12
4pm โโ +12
5pm โโ +11
6pm โโ +11
7pm โโ +11
8pm โโ +11
9pm โ +6
10pm โ +6
11pm โ +5
25 Jan 2026
12am โ +4
1am โ +4
2am โ +7#typescript
#digitalsignature, #document, #esign, #generator, #parser, #pdf, #pdfdocument, #pdfgenerator, #pkcs7, #signing, #typescript
โค1๐1
cheeriojs/cheerio
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Language:TypeScript
Total stars: 30172
Stars trend:
#typescript
#cheerio, #dom, #hacktoberfest, #html, #htmlparser, #htmlparser2, #jquery, #parser, #scraper, #selector
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Language:TypeScript
Total stars: 30172
Stars trend:
8 Mar 2026
5pm โ +2
6pm โ +1
7pm โ +7
8pm โ +1
9pm โ +8
10pm โ +5
11pm โ +5
9 Mar 2026
12am โ +6
1am โ +2
2am โ +6
3am โ +1
4am โโโ +17#typescript
#cheerio, #dom, #hacktoberfest, #html, #htmlparser, #htmlparser2, #jquery, #parser, #scraper, #selector
productdevbook/hucre
Zero-dependency spreadsheet engine. Read & write XLSX, CSV, ODS. Pure TypeScript, works everywhere.
Language:TypeScript
Total stars: 146
Stars trend:
#typescript
#csv, #csvparser, #esm, #excel, #ods, #odsparser, #parser, #spreadsheet, #streaming, #treeshakeable, #typescript, #xlsx, #xlsxparser, #xlsxwriter, #zerodependency
Zero-dependency spreadsheet engine. Read & write XLSX, CSV, ODS. Pure TypeScript, works everywhere.
Language:TypeScript
Total stars: 146
Stars trend:
29 Mar 2026
6pm โ +2
7pm โ +4
8pm โ +4
9pm โ +6
10pm โ +6
11pm โ +6
30 Mar 2026
12am โ +3#typescript
#csv, #csvparser, #esm, #excel, #ods, #odsparser, #parser, #spreadsheet, #streaming, #treeshakeable, #typescript, #xlsx, #xlsxparser, #xlsxwriter, #zerodependency