o-oconnell/minixfromscratch
Development and compilation setup for the book version of MINIX (3.1.0) on QEMU
Language: C
#bash #compilers #computer_architecture #data_structures_and_algorithms #filesystem #kernel #networking #operating_systems #programming #system_administration #system_programming
Stars: 921 Issues: 3 Forks: 41
https://github.com/o-oconnell/minixfromscratch
  
  ArroyoSystems/arroyo
Arroyo is a distributed stream processing engine written in Rust
Language: Rust
#data #dev_tools #infrastructure #kafka #rust #sql #stream_processing
Stars: 294 Issues: 1 Forks: 5
https://github.com/ArroyoSystems/arroyo
  
  nicolas-hbt/pygraft
Configurable Generation of Schemas and Knowledge Graphs at Your Fingertips
Language: Python
#data_generator #graph_generator #knowledge_base #knowledge_graph #ontology #ontology_generation #python #schema #semantic_web #synthetic_data #synthetic_dataset_generation
Stars: 188 Issues: 0 Forks: 15
https://github.com/nicolas-hbt/pygraft
  
  DataEngineer-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
#apachespark #awesome #bigdata #data #dataengineering #sql
Stars: 1054 Issues: 9 Forks: 126
https://github.com/DataEngineer-io/data-engineer-handbook
  
  sdg-1/consulting-handbook
A guide for technical professionals looking to start consulting
#career #consulting #data #software
Stars: 272 Issues: 0 Forks: 28
https://github.com/sdg-1/consulting-handbook
  
  mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown
Language: TypeScript
#ai #crawler #data #html_to_markdown #llm #markdown #rag #scraper #scraping #web_crawler
Stars: 469 Issues: 8 Forks: 34
https://github.com/mendableai/firecrawl
  
  jofpin/synthBTC
A tool that uses advanced Monte Carlo simulations and Turbit parallel processing to create possible Bitcoin prediction scenarios.
Language: JavaScript
#bitcoin #data_processing #monte_carlo_simulation #nodejs #prediction #synthetic_data #turbit
Stars: 1565 Issues: 1 Forks: 1013
https://github.com/jofpin/synthBTC
  
  timelinize/timelinize
Store your data from all your accounts and devices in a single cohesive timeline on your own computer
Language: Go
#archival #data_archiving #data_import #timeline
Stars: 340 Issues: 17 Forks: 5
https://github.com/timelinize/timelinize
  
  JUSTSUJAY/nlp-zero-to-hero
NLP Zero to Hero in just 10 Kernels
Language: Jupyter Notebook
#ai #andrej_karpathy #data_science #machine_learning #nlp #zero_to_hero
Stars: 244 Issues: 0 Forks: 16
https://github.com/JUSTSUJAY/nlp-zero-to-hero
  
  briefercloud/briefer
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Language: TypeScript
#analytics #bi #bigquery #briefer #business_intelligence #businessintelligence #dashboard #data_analysis #data_visualization #jupyter #notebook #postgres #postgresql #reporting #visualization
Stars: 671 Issues: 4 Forks: 24
https://github.com/briefercloud/briefer
  
  hugohadfield/kalmangrad
Automated, smooth, N'th order derivatives of non-uniformly sampled time series data
Language: Python
#data_science #derivatives #kalman_filter #signal_processing #smoothing
Stars: 168 Issues: 1 Forks: 3
https://github.com/hugohadfield/kalmangrad
  
  Wainberg/ryp
R inside Python
Language: Python
#bioinformatics #data_science #python #python_to_r #r #r_to_python #rstats #statistics
Stars: 92 Issues: 0 Forks: 1
https://github.com/Wainberg/ryp
  
  BemiHQ/BemiDB
Postgres read replica optimized for analytics
Language: Go
#analytics #data_lakehouse #data_movement #data_warehouse #duckdb #iceberg #olap #parquet #postgresql #replication #zero_etl
Stars: 427 Issues: 2 Forks: 5
https://github.com/BemiHQ/BemiDB
  
  deepseek-ai/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
Language: Python
#data_processing #duckdb
Stars: 724 Issues: 2 Forks: 51
https://github.com/deepseek-ai/smallpond
  
  gszfwsb/NCFM
Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function" (NCFM) in CVPR 2025.
Language: Python
#computer_vision #data_centric_ai #dataset_distillation #synthetic_data
Stars: 268 Issues: 2 Forks: 15
https://github.com/gszfwsb/NCFM
  
  subnetmarco/pgmcp
An MCP server to query any Postgres database in natural language.
Language: Go
#agent #agentic_ai #ai #analytics #artificial_intelligence #data_analysis #database #kong #mcp #mcp_server #postgres #postgresql
Stars: 318 Issues: 1 Forks: 29
https://github.com/subnetmarco/pgmcp
  
  