πβοΈ CSV Trimming: a one-line to clean up (most) messy CSVs! βοΈπ
Hi r/Python!
Last week, I shared my **ugly-csv-generator** tool with this community, and the response blew me away! π Thank you so much for the support!
As I promised during the last post, I composed a decent set of heuristics that can often address those hideous CSV monstrosities. So Iβm back with a Python package that does just that: **CSV Trimming**.
# π§ What My Project Does
[**CSV Trimming**](https://github.com/LucaCappelletti94/csv_trimming) is a Python package designed to take messy CSVs β the kind you get from scraping websites, legacy systems, or poorly managed data β and transform them into clean, well-formatted CSVs with just **one line of code**. No need for complex setups or large language models. Itβs simple, straightforward, and generally gets the job done.
# π οΈ Target Audience
This package is made by a data wrangler for data wranglers. It is not made for people who make terrible CSVs, it is made for those who have to deal with them.
Whether you're dealing with:
* Duplicated schema headers
* Corrupted NaN-like data entries (hello, `#RIF!`, I'm looking at you)
* Or even padding and partial rows...
**CSV Trimming** can handle it all. It's like Marie Kondo for your CSVs β if it doesnβt spark joy, it gets trimmed! β¨
# π¦ Installation
As
/r/Python
https://redd.it/1f2bwx5
Hi r/Python!
Last week, I shared my **ugly-csv-generator** tool with this community, and the response blew me away! π Thank you so much for the support!
As I promised during the last post, I composed a decent set of heuristics that can often address those hideous CSV monstrosities. So Iβm back with a Python package that does just that: **CSV Trimming**.
# π§ What My Project Does
[**CSV Trimming**](https://github.com/LucaCappelletti94/csv_trimming) is a Python package designed to take messy CSVs β the kind you get from scraping websites, legacy systems, or poorly managed data β and transform them into clean, well-formatted CSVs with just **one line of code**. No need for complex setups or large language models. Itβs simple, straightforward, and generally gets the job done.
# π οΈ Target Audience
This package is made by a data wrangler for data wranglers. It is not made for people who make terrible CSVs, it is made for those who have to deal with them.
Whether you're dealing with:
* Duplicated schema headers
* Corrupted NaN-like data entries (hello, `#RIF!`, I'm looking at you)
* Or even padding and partial rows...
**CSV Trimming** can handle it all. It's like Marie Kondo for your CSVs β if it doesnβt spark joy, it gets trimmed! β¨
# π¦ Installation
As
/r/Python
https://redd.it/1f2bwx5
GitHub
GitHub - LucaCappelletti94/csv_trimming: Package python to remove common ugliness from a csv-like file
Package python to remove common ugliness from a csv-like file - LucaCappelletti94/csv_trimming