uv ill like uv - My first blog post
https://zaloog.github.io/2025/01/19/uv.html
This year starting a blog and writing about more tech related topics besides just developing hobby projects is on my bucket list.
Since I still see a lot of people not knowing about uv, and I adopted it pretty much asap, I decided to write a bit about it and hope other feel it helpful and might also join the journey.
Have fun :)
/r/Python
https://redd.it/1i50f7i
https://zaloog.github.io/2025/01/19/uv.html
This year starting a blog and writing about more tech related topics besides just developing hobby projects is on my bucket list.
Since I still see a lot of people not knowing about uv, and I adopted it pretty much asap, I decided to write a bit about it and hope other feel it helpful and might also join the journey.
Have fun :)
/r/Python
https://redd.it/1i50f7i
Lars Grams
uv ill like uv
What is uv If you are following the recent developments in the python ecosystem, theres one new tool that stands out by far and might change the way you work with python. I am talking about uv, a super fast all-in-one python project management tool to replace…
Video Editing with Python for Subtitles
Hello everyone, I was looking for an open source video subtitle generator. something like this one:
https://www.captions.media/
I've tried moviepy but in my experiences it's not very useful. any other open source tool recommendations or python just not useful for this kind of tasks. Thanks!
/r/Python
https://redd.it/1i51o3d
Hello everyone, I was looking for an open source video subtitle generator. something like this one:
https://www.captions.media/
I've tried moviepy but in my experiences it's not very useful. any other open source tool recommendations or python just not useful for this kind of tasks. Thanks!
/r/Python
https://redd.it/1i51o3d
www.captions.media
Video captions done right!
OSS billing library based in SQL and Python
I just released the 0.1.0 of the project I’ve been working on called bframe. It’s a billing library that is built on top of duckdb and has a client written in python. The project is in active development and I’m looking for feedback!
Repository
Documentation
## What My Project Does
bframe connects to your database (or just files) and generates invoices, line items and the like based off of source data. The library doesn’t require a server and can be fully executed in process which makes it easy to run and test. bframe offers git like branching, time travel and extensive documentation making it a good fit for operators who want better control over their tools.
## Target Audience
bframe is a production application designed for developers and operators. The library supports subscriptions, usage based billing and more; meaning most software based businesses are a good fit.
## Comparison
The largest difference between bframe and alternatives is how we treat persistence. This library connects to an existing store (bring your own infra), while other solutions use their own storage. The approach bframe employs has all of the advantages of running your own infrastructure along with a few others:
1. No server or internet connection is required which makes it
/r/Python
https://redd.it/1i53m5k
I just released the 0.1.0 of the project I’ve been working on called bframe. It’s a billing library that is built on top of duckdb and has a client written in python. The project is in active development and I’m looking for feedback!
Repository
Documentation
## What My Project Does
bframe connects to your database (or just files) and generates invoices, line items and the like based off of source data. The library doesn’t require a server and can be fully executed in process which makes it easy to run and test. bframe offers git like branching, time travel and extensive documentation making it a good fit for operators who want better control over their tools.
## Target Audience
bframe is a production application designed for developers and operators. The library supports subscriptions, usage based billing and more; meaning most software based businesses are a good fit.
## Comparison
The largest difference between bframe and alternatives is how we treat persistence. This library connects to an existing store (bring your own infra), while other solutions use their own storage. The approach bframe employs has all of the advantages of running your own infrastructure along with a few others:
1. No server or internet connection is required which makes it
/r/Python
https://redd.it/1i53m5k
GitHub
GitHub - bframe-work/bframelib: Open source billing library based in SQL. Generate, view and diff invoices locally or in the cloud.
Open source billing library based in SQL. Generate, view and diff invoices locally or in the cloud. - bframe-work/bframelib
Adding Metadata-Driven User Defined Fields. Will This Work?
In my Loan Origination System project, I have various models with predefined default fields. When a given institution integrates its data, I'm cognizant that each different institution is likely to have its own user-defined fields that won't have a (key-value) match to the default fields.
I need to be able to allow system admins to make use of their user-defined fields on the front-end. Additionally, allowing the ability to create new user defined fields within the system, for data they may want stored in the application but not necessarily on their core. Ideally, I'd accomplish this without substantially changing the structure of each model and changes to the schemas.
I realize I could just add a single JSON field to each model. However, wouldn't I then be required to handle validation and field-types at the application level?
Instead, will something like this work? Are there better approaches?
from django.db import models
from django.contrib.contenttypes.fields import GenericForeignKey
from django.contrib.contenttypes.models import ContentType
FIELDSOURCECHOICES = (
('integration', 'Uploaded/Integrated'),
('internal', 'Admin/Customer-Created'),
)
/r/django
https://redd.it/1i52ktd
In my Loan Origination System project, I have various models with predefined default fields. When a given institution integrates its data, I'm cognizant that each different institution is likely to have its own user-defined fields that won't have a (key-value) match to the default fields.
I need to be able to allow system admins to make use of their user-defined fields on the front-end. Additionally, allowing the ability to create new user defined fields within the system, for data they may want stored in the application but not necessarily on their core. Ideally, I'd accomplish this without substantially changing the structure of each model and changes to the schemas.
I realize I could just add a single JSON field to each model. However, wouldn't I then be required to handle validation and field-types at the application level?
Instead, will something like this work? Are there better approaches?
from django.db import models
from django.contrib.contenttypes.fields import GenericForeignKey
from django.contrib.contenttypes.models import ContentType
FIELDSOURCECHOICES = (
('integration', 'Uploaded/Integrated'),
('internal', 'Admin/Customer-Created'),
)
/r/django
https://redd.it/1i52ktd
Reddit
From the django community on Reddit
Explore this post and more from the django community
Send sign up confirmation emails via zoho
Hi everybody,
I am working on a webapplication and creating the sign up form.
I want to send an automated email to the user to confirm his email adress.
We are using Zoho as our email provider.
I cant find documentation from zoho on how to connect to the email. Can you provide some assistance?
Best
/r/django
https://redd.it/1i5e689
Hi everybody,
I am working on a webapplication and creating the sign up form.
I want to send an automated email to the user to confirm his email adress.
We are using Zoho as our email provider.
I cant find documentation from zoho on how to connect to the email. Can you provide some assistance?
Best
/r/django
https://redd.it/1i5e689
Reddit
From the django community on Reddit
Explore this post and more from the django community
Monday Daily Thread: Project ideas!
# Weekly Thread: Project Ideas 💡
Welcome to our weekly Project Ideas thread! Whether you're a newbie looking for a first project or an expert seeking a new challenge, this is the place for you.
## How it Works:
1. **Suggest a Project**: Comment your project idea—be it beginner-friendly or advanced.
2. **Build & Share**: If you complete a project, reply to the original comment, share your experience, and attach your source code.
3. **Explore**: Looking for ideas? Check out Al Sweigart's ["The Big Book of Small Python Projects"](https://www.amazon.com/Big-Book-Small-Python-Programming/dp/1718501242) for inspiration.
## Guidelines:
* Clearly state the difficulty level.
* Provide a brief description and, if possible, outline the tech stack.
* Feel free to link to tutorials or resources that might help.
# Example Submissions:
## Project Idea: Chatbot
**Difficulty**: Intermediate
**Tech Stack**: Python, NLP, Flask/FastAPI/Litestar
**Description**: Create a chatbot that can answer FAQs for a website.
**Resources**: [Building a Chatbot with Python](https://www.youtube.com/watch?v=a37BL0stIuM)
# Project Idea: Weather Dashboard
**Difficulty**: Beginner
**Tech Stack**: HTML, CSS, JavaScript, API
**Description**: Build a dashboard that displays real-time weather information using a weather API.
**Resources**: [Weather API Tutorial](https://www.youtube.com/watch?v=9P5MY_2i7K8)
## Project Idea: File Organizer
**Difficulty**: Beginner
**Tech Stack**: Python, File I/O
**Description**: Create a script that organizes files in a directory into sub-folders based on file type.
**Resources**: [Automate the Boring Stuff: Organizing Files](https://automatetheboringstuff.com/2e/chapter9/)
Let's help each other grow. Happy
/r/Python
https://redd.it/1i5d1ag
# Weekly Thread: Project Ideas 💡
Welcome to our weekly Project Ideas thread! Whether you're a newbie looking for a first project or an expert seeking a new challenge, this is the place for you.
## How it Works:
1. **Suggest a Project**: Comment your project idea—be it beginner-friendly or advanced.
2. **Build & Share**: If you complete a project, reply to the original comment, share your experience, and attach your source code.
3. **Explore**: Looking for ideas? Check out Al Sweigart's ["The Big Book of Small Python Projects"](https://www.amazon.com/Big-Book-Small-Python-Programming/dp/1718501242) for inspiration.
## Guidelines:
* Clearly state the difficulty level.
* Provide a brief description and, if possible, outline the tech stack.
* Feel free to link to tutorials or resources that might help.
# Example Submissions:
## Project Idea: Chatbot
**Difficulty**: Intermediate
**Tech Stack**: Python, NLP, Flask/FastAPI/Litestar
**Description**: Create a chatbot that can answer FAQs for a website.
**Resources**: [Building a Chatbot with Python](https://www.youtube.com/watch?v=a37BL0stIuM)
# Project Idea: Weather Dashboard
**Difficulty**: Beginner
**Tech Stack**: HTML, CSS, JavaScript, API
**Description**: Build a dashboard that displays real-time weather information using a weather API.
**Resources**: [Weather API Tutorial](https://www.youtube.com/watch?v=9P5MY_2i7K8)
## Project Idea: File Organizer
**Difficulty**: Beginner
**Tech Stack**: Python, File I/O
**Description**: Create a script that organizes files in a directory into sub-folders based on file type.
**Resources**: [Automate the Boring Stuff: Organizing Files](https://automatetheboringstuff.com/2e/chapter9/)
Let's help each other grow. Happy
/r/Python
https://redd.it/1i5d1ag
YouTube
Build & Integrate your own custom chatbot to a website (Python & JavaScript)
In this fun project you learn how to build a custom chatbot in Python and then integrate this to a website using Flask and JavaScript.
Starter Files: https://github.com/patrickloeber/chatbot-deployment
Get my Free NumPy Handbook: https://www.python-engi…
Starter Files: https://github.com/patrickloeber/chatbot-deployment
Get my Free NumPy Handbook: https://www.python-engi…
I created a simple mailing program; pymailer
# What My Project Does
pymailer is a simple program the uses SMTP and IMAP to connect to your gmail account to send and fetch emails.
# Target Audience
No one in particular, I made this mainly because I need it for my other projects. I just thought it would be cool to share.
# Comparison
I'm sure there are many mailing programs written in python and other languages but I think mine is pretty simple and easy to understand.
https://github.com/cipherodev/pymailer
/r/Python
https://redd.it/1i5l8j5
# What My Project Does
pymailer is a simple program the uses SMTP and IMAP to connect to your gmail account to send and fetch emails.
# Target Audience
No one in particular, I made this mainly because I need it for my other projects. I just thought it would be cool to share.
# Comparison
I'm sure there are many mailing programs written in python and other languages but I think mine is pretty simple and easy to understand.
https://github.com/cipherodev/pymailer
/r/Python
https://redd.it/1i5l8j5
GitHub
GitHub - cipherodev/pymailer: Lightweight mail sender and fetcher
Lightweight mail sender and fetcher. Contribute to cipherodev/pymailer development by creating an account on GitHub.
Boosts AI LLMs For Multilingual And Multimodal Tasks
Distributed computing is revolutionizing AI by speeding up the training and deployment of language models, enhancing their ability to perform multilingual and multimodal tasks. This technology enables AI systems to process vast datasets, improving their accuracy and efficiency.
Learn more about how distributed computing boosts AI LLMs For Multilingual.
/r/Python
https://redd.it/1i5m9jd
Distributed computing is revolutionizing AI by speeding up the training and deployment of language models, enhancing their ability to perform multilingual and multimodal tasks. This technology enables AI systems to process vast datasets, improving their accuracy and efficiency.
Learn more about how distributed computing boosts AI LLMs For Multilingual.
/r/Python
https://redd.it/1i5m9jd
Yourquorum
How Distributed Computing Boosts AI LLMs for Multilingual and Multimodal Tasks
GPT, BERT, T5 and similar models generally referred to as large language models are the foundations in natural language processing to enable machines to understand and
My first steps with Playwright
In my previous company, I developed a batch job that tracked metrics across social media, such as Twitter, LinkedIn, Mastodon, Bluesky, Reddit, etc. Then I realized I could duplicate it for my own "persona". The problem is that some media don’t provide an HTTP API for the metrics I want.
I searched for a long time but found no API access for the metrics above. I scraped the metrics manually every morning for a long time and finally decided to automate this tedious task. Here’s what I learned.
https://blog.frankel.ch/first-steps-playwright/
/r/Python
https://redd.it/1i55qc9
In my previous company, I developed a batch job that tracked metrics across social media, such as Twitter, LinkedIn, Mastodon, Bluesky, Reddit, etc. Then I realized I could duplicate it for my own "persona". The problem is that some media don’t provide an HTTP API for the metrics I want.
I searched for a long time but found no API access for the metrics above. I scraped the metrics manually every morning for a long time and finally decided to automate this tedious task. Here’s what I learned.
https://blog.frankel.ch/first-steps-playwright/
/r/Python
https://redd.it/1i55qc9
A Java geek
My first steps with Playwright
In my previous company, I developed a batch job that tracked metrics across social media, such as Twitter, LinkedIn, Mastodon, Bluesky, Reddit, etc. Then I realized I could duplicate it for my own 'persona'. The problem is that some media don’t provide an…
MStock: The Tool I Built to Track Macy’s Restocks 🛍️
Hey everyone 👋
I wanted to share a personal project I made: MStock. I kept running into the same problem—I wanted multiple items from Macy’s that were out of stock, and I was tired of constantly checking for updates. So, I built this tool to notify me the moment something comes back in stock!
What My Project Does
MStock is a Python tool that:
• Monitors Macy’s Product Pages: Tracks multiple items at once.
• Sends Notifications: Alerts me via email or SMS (through iMessage on macOS).
• Provides Product Details: Like price, ratings, and reviews, so I don’t miss out on key info.
• Handles Failures Gracefully: Uses smart caching to keep product info even if a check fails.
Target Audience
If you frequently shop at Macy’s and hate missing out on restocks, MStock is perfect for you. It’s especially useful for anyone tracking multiple products or looking for a hands-off way to monitor stock status.
Why I Built It
There were a few items I really wanted, but they were sold out for weeks. I didn’t want to miss them when they came back, so I made MStock to handle the tracking for me. Now, I get a notification as soon as something is available again, and
/r/Python
https://redd.it/1i5b6su
Hey everyone 👋
I wanted to share a personal project I made: MStock. I kept running into the same problem—I wanted multiple items from Macy’s that were out of stock, and I was tired of constantly checking for updates. So, I built this tool to notify me the moment something comes back in stock!
What My Project Does
MStock is a Python tool that:
• Monitors Macy’s Product Pages: Tracks multiple items at once.
• Sends Notifications: Alerts me via email or SMS (through iMessage on macOS).
• Provides Product Details: Like price, ratings, and reviews, so I don’t miss out on key info.
• Handles Failures Gracefully: Uses smart caching to keep product info even if a check fails.
Target Audience
If you frequently shop at Macy’s and hate missing out on restocks, MStock is perfect for you. It’s especially useful for anyone tracking multiple products or looking for a hands-off way to monitor stock status.
Why I Built It
There were a few items I really wanted, but they were sold out for weeks. I didn’t want to miss them when they came back, so I made MStock to handle the tracking for me. Now, I get a notification as soon as something is available again, and
/r/Python
https://redd.it/1i5b6su
Reddit
From the Python community on Reddit
Explore this post and more from the Python community
MathSpell v0.1.0: Expanded Features and Test cases!
Hello everyone!
A couple of weeks ago I shared my first ever python package MathSpell \- a context-aware number-to-word conversion library built on the `spaCy` and `num2words` python library. After receiving valuable feedback I started working on some improvements. I really thought I could update it in a day, but days turned into weeks. Well, I've released an update with v0.1.0.
# What’s New in v0.1.0?
Expanded Features:
More Contexts Handled:
Quantities: "5 m/s" is now "five meter per second". Used the `unit_parse` library (which also uses `pint` under the hood) to achieve this.
Currencies: "$3.25" converts to "three dollars and twenty five cents".
Exponential Notation: "3e8" becomes "three times ten to the power of eight".
Fractions: "1/2" converts to "one over two". Also preprocessed to avoid datetime format (dd/mm/yyyy) confusion with the mathematical division sign.
Test Cases:
Added 26 new test cases with edge cases\~
I also added docstrings for better understanding!
# Target Audience:
Mainly me. But I was really happy to see that it was positively received last time! The main use case of this library is for data preprocessing tasks for applications such as
/r/Python
https://redd.it/1i5apa9
Hello everyone!
A couple of weeks ago I shared my first ever python package MathSpell \- a context-aware number-to-word conversion library built on the `spaCy` and `num2words` python library. After receiving valuable feedback I started working on some improvements. I really thought I could update it in a day, but days turned into weeks. Well, I've released an update with v0.1.0.
# What’s New in v0.1.0?
Expanded Features:
More Contexts Handled:
Quantities: "5 m/s" is now "five meter per second". Used the `unit_parse` library (which also uses `pint` under the hood) to achieve this.
Currencies: "$3.25" converts to "three dollars and twenty five cents".
Exponential Notation: "3e8" becomes "three times ten to the power of eight".
Fractions: "1/2" converts to "one over two". Also preprocessed to avoid datetime format (dd/mm/yyyy) confusion with the mathematical division sign.
Test Cases:
Added 26 new test cases with edge cases\~
I also added docstrings for better understanding!
# Target Audience:
Mainly me. But I was really happy to see that it was positively received last time! The main use case of this library is for data preprocessing tasks for applications such as
/r/Python
https://redd.it/1i5apa9
Reddit
From the Python community on Reddit: My first python package - MathSpell. Convert numbers to words contextually.
Explore this post and more from the Python community
Problem Understanding Django-allauth's headless social login ( Skill Issue)
I'm trying to implement
/r/django
https://redd.it/1i5ugnb
I'm trying to implement
django-allauth and learn about its headless URLs for social login and implement it django-ninja. However, when following the social login section of the documentation, I keep getting a 409 response. Can someone guide me in the right direction?/r/django
https://redd.it/1i5ugnb
Reddit
From the django community on Reddit
Explore this post and more from the django community
Where can I learn complete django from the basics ??
Hey , I am new to programming with a basic syntax knowledge of python, what should I do now to learn django. I need sort of a roadmap on the topics and the concepts.It would be great if you guys suggest me few.
/r/django
https://redd.it/1i5m139
Hey , I am new to programming with a basic syntax knowledge of python, what should I do now to learn django. I need sort of a roadmap on the topics and the concepts.It would be great if you guys suggest me few.
/r/django
https://redd.it/1i5m139
Reddit
From the django community on Reddit
Explore this post and more from the django community
Flask - Hosting - Requests
Hey, I am currently using a simple Flask app with a basic database connection to store various inputs (spread across 5 tables). The app also includes an admin login with user authentication and database queries for logging in.
The app is hosted on a VPS with 2 vCores and 2GB of RAM using Docker, Nginx, and Gunicorn.
This project originated during my studies and is now being used for the first time. Approximately 200 requests (in the worst case, simultaneously) are expected.
I would like to test how many requests the server can handle and determine whether 2 vCores and 2GB of RAM are sufficient for handling \~200 requests. I’ve noticed there are various tools for load testing, but since the VPS is hosted by a third-party provider, I would need to request permission before conducting such tests (even if the load is minimal).
Perhaps I am overthinking this, as 200 requests might not actually be a significant load at all ? If you need any additional information, feel free to ask, I didn’t want to go into every tiny detail here.
Thanks for taking the time to read this!
/r/flask
https://redd.it/1i5pcgv
Hey, I am currently using a simple Flask app with a basic database connection to store various inputs (spread across 5 tables). The app also includes an admin login with user authentication and database queries for logging in.
The app is hosted on a VPS with 2 vCores and 2GB of RAM using Docker, Nginx, and Gunicorn.
This project originated during my studies and is now being used for the first time. Approximately 200 requests (in the worst case, simultaneously) are expected.
I would like to test how many requests the server can handle and determine whether 2 vCores and 2GB of RAM are sufficient for handling \~200 requests. I’ve noticed there are various tools for load testing, but since the VPS is hosted by a third-party provider, I would need to request permission before conducting such tests (even if the load is minimal).
Perhaps I am overthinking this, as 200 requests might not actually be a significant load at all ? If you need any additional information, feel free to ask, I didn’t want to go into every tiny detail here.
Thanks for taking the time to read this!
/r/flask
https://redd.it/1i5pcgv
Reddit
From the flask community on Reddit
Explore this post and more from the flask community
XSS in django-allauth <0.63.6 when using Facebook provider with js_sdk method
https://stsewd.dev/posts/xss-in-django-allauth-fb-provider/
/r/django
https://redd.it/1i5v8bv
https://stsewd.dev/posts/xss-in-django-allauth-fb-provider/
/r/django
https://redd.it/1i5v8bv
Santos Gallegos
XSS in django-allauth <0.63.6
Details about a cross-site scripting vulnerability that I reported to django-allauth.
django course
would you guys suggest me the best free course for learning django for someone who has worked with laravel before
/r/django
https://redd.it/1i640y9
would you guys suggest me the best free course for learning django for someone who has worked with laravel before
/r/django
https://redd.it/1i640y9
Reddit
From the django community on Reddit
Explore this post and more from the django community
Tuesday Daily Thread: Advanced questions
# Weekly Wednesday Thread: Advanced Questions 🐍
Dive deep into Python with our Advanced Questions thread! This space is reserved for questions about more advanced Python topics, frameworks, and best practices.
## How it Works:
1. **Ask Away**: Post your advanced Python questions here.
2. **Expert Insights**: Get answers from experienced developers.
3. **Resource Pool**: Share or discover tutorials, articles, and tips.
## Guidelines:
* This thread is for **advanced questions only**. Beginner questions are welcome in our [Daily Beginner Thread](#daily-beginner-thread-link) every Thursday.
* Questions that are not advanced may be removed and redirected to the appropriate thread.
## Recommended Resources:
* If you don't receive a response, consider exploring r/LearnPython or join the [Python Discord Server](https://discord.gg/python) for quicker assistance.
## Example Questions:
1. **How can you implement a custom memory allocator in Python?**
2. **What are the best practices for optimizing Cython code for heavy numerical computations?**
3. **How do you set up a multi-threaded architecture using Python's Global Interpreter Lock (GIL)?**
4. **Can you explain the intricacies of metaclasses and how they influence object-oriented design in Python?**
5. **How would you go about implementing a distributed task queue using Celery and RabbitMQ?**
6. **What are some advanced use-cases for Python's decorators?**
7. **How can you achieve real-time data streaming in Python with WebSockets?**
8. **What are the
/r/Python
https://redd.it/1i656sb
# Weekly Wednesday Thread: Advanced Questions 🐍
Dive deep into Python with our Advanced Questions thread! This space is reserved for questions about more advanced Python topics, frameworks, and best practices.
## How it Works:
1. **Ask Away**: Post your advanced Python questions here.
2. **Expert Insights**: Get answers from experienced developers.
3. **Resource Pool**: Share or discover tutorials, articles, and tips.
## Guidelines:
* This thread is for **advanced questions only**. Beginner questions are welcome in our [Daily Beginner Thread](#daily-beginner-thread-link) every Thursday.
* Questions that are not advanced may be removed and redirected to the appropriate thread.
## Recommended Resources:
* If you don't receive a response, consider exploring r/LearnPython or join the [Python Discord Server](https://discord.gg/python) for quicker assistance.
## Example Questions:
1. **How can you implement a custom memory allocator in Python?**
2. **What are the best practices for optimizing Cython code for heavy numerical computations?**
3. **How do you set up a multi-threaded architecture using Python's Global Interpreter Lock (GIL)?**
4. **Can you explain the intricacies of metaclasses and how they influence object-oriented design in Python?**
5. **How would you go about implementing a distributed task queue using Celery and RabbitMQ?**
6. **What are some advanced use-cases for Python's decorators?**
7. **How can you achieve real-time data streaming in Python with WebSockets?**
8. **What are the
/r/Python
https://redd.it/1i656sb
Discord
Join the Python Discord Server!
We're a large community focused around the Python programming language. We believe that anyone can learn to code. | 412982 members
IP banning followup. My site is now being continuously scraped by robots.txt violating bots.
TL;DR: I need advice on:
How to implement a badbot honeypot.
How to implement an "are you human" check on account creation.
Any idea on why this is happening all of a sudden.
---
I posted a few days ago about banning a super racist IP, and implemented the changes. Since then there has been a wild amount of webscraping being done by a ton of IPs that are not displaying a proper user agent. I have no idea whether this is connected.
It may be that "Owler (ows.eu/owler)" is responsible, as it is the only thing that displays a proper useragent, and occationally checks Robots.txt, but the sheer numbers of bots hitting the site at the same time clearly violates the robots file, and I've since disallowed Owler's user agent, but it continues to check robots.txt.
These bots are almost all coming from "Hetzner Online GmbH" while the rest are all Tor exit nodes. I'm banning these IP ranges as fast as I can, but I think I need to automate it some how.
Does anyone have a good way to gather all the offending IP's without actually collecting normal user traffic? I'm tempted to just write a honeypot to collect robots.txt violating IP's, and just
/r/flask
https://redd.it/1i5d2gs
TL;DR: I need advice on:
How to implement a badbot honeypot.
How to implement an "are you human" check on account creation.
Any idea on why this is happening all of a sudden.
---
I posted a few days ago about banning a super racist IP, and implemented the changes. Since then there has been a wild amount of webscraping being done by a ton of IPs that are not displaying a proper user agent. I have no idea whether this is connected.
It may be that "Owler (ows.eu/owler)" is responsible, as it is the only thing that displays a proper useragent, and occationally checks Robots.txt, but the sheer numbers of bots hitting the site at the same time clearly violates the robots file, and I've since disallowed Owler's user agent, but it continues to check robots.txt.
These bots are almost all coming from "Hetzner Online GmbH" while the rest are all Tor exit nodes. I'm banning these IP ranges as fast as I can, but I think I need to automate it some how.
Does anyone have a good way to gather all the offending IP's without actually collecting normal user traffic? I'm tempted to just write a honeypot to collect robots.txt violating IP's, and just
/r/flask
https://redd.it/1i5d2gs
Reddit
From the flask community on Reddit
Explore this post and more from the flask community