ashvardanian/StringZilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc.
Language:C++
Total stars: 1230
Stars trend:
24 Feb 2024
8am +6
9am +3
10am +5
11am +9
12pm +6
1pm +7
2pm +8
3pm +9
4pm +9
5pm +8
6pm +5
7pm +3
#cplusplus
#beautifulsoup, #commoncrawl, #csv, #dataset, #html, #informationretrieval, #json, #laion, #ndjson, #parser, #patternrecognition, #simd, #sortingalgorithms, #string, #stringmanipulation, #stringmatching, #stringparsing, #stringsearch, #substring
apify/crawlee-python
Crawlee: A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
Language:Python
Total stars: 208
Stars trend:
9 Jul 2024
7am +17
8am +11
9am +10
10am +25
11am +22
12pm +24
1pm +27
#python
#apify, #automation, #beautifulsoup, #crawler, #crawling, #headless, #headlesschrome, #pip, #playwright, #python, #scraper, #scraping, #webcrawler, #webcrawling, #webscraping