Dicklesworthstone/llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
Language:Python
Total stars: 1576
Stars trend:
#python
#aiassist, #llama2, #llm, #ocr, #ocrcorrection, #tesseract
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
Language:Python
Total stars: 1576
Stars trend:
9 Aug 2024
1pm ▏ +1
2pm +0
3pm +0
4pm ▍ +3
5pm ██████▊ +54
6pm ███████▏ +57
7pm ██████ +48
8pm ████▍ +35
9pm ████▊ +38
#python
#aiassist, #llama2, #llm, #ocr, #ocrcorrection, #tesseract
❤1
xushengfeng/eSearch
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator
Language:TypeScript
Total stars: 4089
Stars trend:
#typescript
#clipboard, #colorpicker, #crossplatform, #electron, #imageediting, #imageeditor, #livetext, #ocr, #paddleocr, #screencapture, #screenrecorder, #screenshot, #search, #searchphotos
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator
Language:TypeScript
Total stars: 4089
Stars trend:
13 Oct 2024
2pm █▌ +12
3pm █▎ +10
4pm █ +8
5pm ▌ +4
6pm ▎ +2
7pm ▍ +3
8pm ▏ +1
9pm ▌ +4
10pm ▍ +3
11pm ▊ +6
14 Oct 2024
12am █▊ +14
1am ██▉ +23
#typescript
#clipboard, #colorpicker, #crossplatform, #electron, #imageediting, #imageeditor, #livetext, #ocr, #paddleocr, #screencapture, #screenrecorder, #screenshot, #search, #searchphotos
siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Language:TypeScript
Total stars: 19204
Stars trend:
#typescript
#anki, #chatgpt, #electron, #evernote, #knowledgebase, #localfirst, #markdown, #notetaking, #notebook, #notesapp, #notion, #obsidian, #ocr, #openai, #pdf, #pkm, #s3, #selfhosted, #webdav
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Language:TypeScript
Total stars: 19204
Stars trend:
14 Oct 2024
2am ▎ +2
3am ▏ +1
4am +0
5am ██▋ +21
6am ██ +16
7am █▏ +9
8am █▍ +11
9am ▌ +4
10am ▋ +5
11am ▋ +5
12pm ▋ +5
#typescript
#anki, #chatgpt, #electron, #evernote, #knowledgebase, #localfirst, #markdown, #notetaking, #notebook, #notesapp, #notion, #obsidian, #ocr, #openai, #pdf, #pkm, #s3, #selfhosted, #webdav
VikParuchuri/tabled
Detect and extract tables to markdown and csv
Language:Python
Total stars: 91
Stars trend:
#python
#deeplearning, #ocr, #tables
Detect and extract tables to markdown and csv
Language:Python
Total stars: 91
Stars trend:
15 Oct 2024
11am ▊ +6
12pm █▋ +13
1pm █▋ +13
2pm █▎ +10
3pm ▏ +1
4pm █▏ +9
5pm ▍ +3
6pm ▊ +6
7pm ▎ +2
8pm ▊ +6
9pm ▌ +4
10pm ▎ +2
#python
#deeplearning, #ocr, #tables
❤2
CatchTheTornado/pdf-extract-api
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Language:Python
Total stars: 250
Stars trend:
#python
#anonymization, #api, #extract, #json, #llm, #ocr, #ocrpython, #pdf, #pii
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Language:Python
Total stars: 250
Stars trend:
3 Nov 2024
2pm ▏ +1
3pm █▊ +14
4pm █▉ +15
5pm ▋ +5
6pm ▍ +3
7pm ▌ +4
8pm ▍ +3
9pm ▌ +4
10pm ▍ +3
11pm ▉ +7
4 Nov 2024
12am ▊ +6
1am █▉ +15
#python
#anonymization, #api, #extract, #json, #llm, #ocr, #ocrpython, #pdf, #pii
👍1
getomni-ai/zerox
PDF to Markdown with vision models
Language:Python
Total stars: 8139
Stars trend:
#python
#ocr, #pdf
PDF to Markdown with vision models
Language:Python
Total stars: 8139
Stars trend:
16 Jan 2025
3am ▍ +3
4am ▊ +6
5am +0
6am ▌ +4
7am ▉ +7
8am ▌ +4
9am ▌ +4
10am ▌ +4
11am █▍ +11
12pm █▏ +9
1pm █▎ +10
2pm ██▉ +23
#python
#ocr, #pdf
siyuan-note/siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Language:TypeScript
Total stars: 26661
Stars trend:
#typescript
#anki, #chatgpt, #electron, #evernote, #knowledgebase, #localfirst, #markdown, #notetaking, #notebook, #notesapp, #notion, #obsidian, #ocr, #openai, #pdf, #pkm, #s3, #selfhosted, #webdav
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Language:TypeScript
Total stars: 26661
Stars trend:
17 Jan 2025
7pm ▏ +1
8pm +0
9pm +0
10pm +0
11pm ▏ +1
18 Jan 2025
12am ███ +24
1am ███▍ +27
2am ███▏ +25
3am ███▍ +27
4am ██▉ +23
#typescript
#anki, #chatgpt, #electron, #evernote, #knowledgebase, #localfirst, #markdown, #notetaking, #notebook, #notesapp, #notion, #obsidian, #ocr, #openai, #pdf, #pkm, #s3, #selfhosted, #webdav
codexu/note-gen
一款专注于记录和写作的跨端 AI 笔记
Language:TypeScript
Total stars: 265
Stars trend:
#typescript
#ai, #app, #chatgpt, #markdown, #nextjs, #notes, #ocr, #openai, #rust, #shadcnui, #tailwindcss, #tauri
一款专注于记录和写作的跨端 AI 笔记
Language:TypeScript
Total stars: 265
Stars trend:
19 Jan 2025
9am ▎ +2
10am █▌ +12
11am ▎ +2
12pm █ +8
1pm █▉ +15
2pm █▊ +14
3pm █▋ +13
4pm ▋ +5
5pm ▍ +3
6pm ▏ +1
#typescript
#ai, #app, #chatgpt, #markdown, #nextjs, #notes, #ocr, #openai, #rust, #shadcnui, #tailwindcss, #tauri
yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
29 Jan 2025
10pm █▏ +9
11pm ▌ +4
30 Jan 2025
12am █▎ +10
1am ▋ +5
2am █▏ +9
3am ▊ +6
4am ▉ +7
5am █ +8
6am ▉ +7
7am █ +8
8am ▋ +5
9am █ +8
#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata