Data Science by ODS.ai 🦜
First Telegram Data Science channel. Covering all technical and popular stuff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of the former. To reach the editors, contact @malev
MetroTwitter - What Twitter reveals about the differences between cities and the monoculture of the Bay Area

A researcher collected 96K bios and 180M tweets from Twitter users in 13 major cities and visualized the differences between these cities:

- How people describe themselves
- What they talk about
- Popular emojis
- Most unique city

Code and data are open-sourced.

Website: https://huyenchip.com/2019/05/28/metrotwitter.html
GitHub: https://github.com/chiphuyen/MetroTwitter

#openresearch
Benford’s Law, DS and the 2020 Election

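This law can be used as a very basic check on whether data was artificially generated. It states that in many naturally occurring datasets, smaller leading digits appear more often than larger ones: a leading digit d is expected with probability log10(1 + 1/d), so about 30% of values start with 1 and fewer than 5% start with 9.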
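
A minimal sketch of such a check, assuming the data is just a list of positive integer counts. The function names and the sample data (powers of 2, which are known to follow the law closely) are illustrative and are not taken from the tutorial or the repo linked in this post.

```python
import math
from collections import Counter


def benford_expected():
    """Expected share of each leading digit 1-9 under Benford's Law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(n):
    """Leading decimal digit of a non-zero integer."""
    return int(str(abs(int(n)))[0])


def first_digit_frequencies(values):
    """Observed share of each leading digit 1-9 (zeros are skipped)."""
    digits = [first_digit(v) for v in values if int(v) != 0]
    counts = Counter(digits)
    return {d: counts.get(d, 0) / len(digits) for d in range(1, 10)}


def chi_square_statistic(values):
    """Pearson chi-square distance between observed and expected digit counts."""
    digits = [first_digit(v) for v in values if int(v) != 0]
    counts = Counter(digits)
    total = len(digits)
    return sum(
        (counts.get(d, 0) - total * p) ** 2 / (total * p)
        for d, p in benford_expected().items()
    )


# Hypothetical sample data: the first 200 powers of 2.
sample = [2 ** k for k in range(200)]

observed = first_digit_frequencies(sample)
for d, p in benford_expected().items():
    print(f"digit {d}: expected {p:.3f}, observed {observed[d]:.3f}")

# With 8 degrees of freedom, a statistic above roughly 15.5 would reject
# a Benford fit at the 5% level.
print("chi-square:", round(chi_square_statistic(sample), 3))
```

A large statistic only flags the data for a closer look. Counts that span a narrow range of magnitudes can deviate from the law without any manipulation.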

And there can be nothing better for promoting the #reproducibleresearch concept than #openresearch on poll data, because it shows that such data can and should be transparent and open.

With the help of the repo below, anyone can check whether poll results comply with #BenfordsLaw using unofficial data (or official data, if you are able to get it).

KDnuggets tutorial: https://www.kdnuggets.com/2020/09/diy-election-fraud-analysis-benfords-law.html
GitHub repo with examples on unofficial US election data: https://github.com/cjph8914/2020_benfords

#statistics
Entropy and complexity unveil the landscape of memes evolution

Sunday research on how memes have evolved from 2011 to the present.
TLDR: memes are getting more complex and require more contextual knowledge to understand.

Link: https://www.nature.com/articles/s41598-021-99468-6
Data: https://github.com/cdcslab/MemesEvolution

#memes #openresearch