Anthropic CEO says at the beginning of 2024, models scored ~3% at SWE-bench. Ten months later, we were at 50%. He thinks in another year we’ll probably be at 90% N
"One of the reasons I'm optimistic about the rapid progress of powerful AI is that, if you extrapolate the next few points on the curve, we’re quickly approaching human-level ability.
Some of the new models we've developed, as well as reasoning models from other companies, are starting to reach what I’d consider PhD or professional level. For example, our latest model, Sonnet 3.5, gets about 50% on SWE-bench, which is a benchmark for professional real-world software engineering tasks. At the start of the year, the state of the art was only around 3 or 4%. In just 10 months, we've gone from 3% to 50% on this task. I believe in another year, we could reach 90%.
We've seen similar advancements in graduate-level math, physics, and biology, with models like OpenAI’s GPT-3. If we continue to extrapolate this progress, in a few years, these models could surpass the highest professional human levels in skill.
Now, will that progress continue? There are various reasons why it might not, but if the current trajectory holds, that's where we're headed."
\- Dario Amodei. See the full interview here.
/r/MachineLearning
https://redd.it/1i8wkth
"One of the reasons I'm optimistic about the rapid progress of powerful AI is that, if you extrapolate the next few points on the curve, we’re quickly approaching human-level ability.
Some of the new models we've developed, as well as reasoning models from other companies, are starting to reach what I’d consider PhD or professional level. For example, our latest model, Sonnet 3.5, gets about 50% on SWE-bench, which is a benchmark for professional real-world software engineering tasks. At the start of the year, the state of the art was only around 3 or 4%. In just 10 months, we've gone from 3% to 50% on this task. I believe in another year, we could reach 90%.
We've seen similar advancements in graduate-level math, physics, and biology, with models like OpenAI’s GPT-3. If we continue to extrapolate this progress, in a few years, these models could surpass the highest professional human levels in skill.
Now, will that progress continue? There are various reasons why it might not, but if the current trajectory holds, that's where we're headed."
\- Dario Amodei. See the full interview here.
/r/MachineLearning
https://redd.it/1i8wkth
YouTube
Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity | Lex Fridman Podcast #452
Dario Amodei is the CEO of Anthropic, the company that created Claude. Amanda Askell is an AI researcher working on Claude's character and personality. Chris Olah is an AI researcher working on mechanistic interpretability.
Thank you for listening ❤ Check…
Thank you for listening ❤ Check…
Bagels v0.3 update! Expense tracker that lives in your terminal.
Hi r/Python! I'm excited to share about the launch of Bagels 0.3 - a terminal (UI) expense tracker built with the textual TUI library! Check out the git repo for screenshots!
This new major version adds a whole new
The plotting is implemented with plotext!
# Target audience
Pain point: I find it annoying that my mobile budget tracker often gets out of sync with my actual balance when a record is missing, and I have no clue when that was. Also, it was frustrating that the most feature-rich budget trackers require you to pay to export your data.
Bagels is designed for you to conveniently enter your records at the end of each day, and store them in sqlite for easy export and processing if needed!
Comparison: Unlike traditional expense trackers that are accessed by web or mobile, Bagels lives in your terminal. Intended for you to check in and add records for the day, instead of doing so on the go with a mobile app.
#
/r/Python
https://redd.it/1i8xjry
Hi r/Python! I'm excited to share about the launch of Bagels 0.3 - a terminal (UI) expense tracker built with the textual TUI library! Check out the git repo for screenshots!
This new major version adds a whole new
manager page, equipped with a display of 3 new plots (spending per day, cummulative spending trajectory and balance over time). The new budget section is designed to assist with saving part of your income and limit unnecessary spending!The plotting is implemented with plotext!
# Target audience
Pain point: I find it annoying that my mobile budget tracker often gets out of sync with my actual balance when a record is missing, and I have no clue when that was. Also, it was frustrating that the most feature-rich budget trackers require you to pay to export your data.
Bagels is designed for you to conveniently enter your records at the end of each day, and store them in sqlite for easy export and processing if needed!
Comparison: Unlike traditional expense trackers that are accessed by web or mobile, Bagels lives in your terminal. Intended for you to check in and add records for the day, instead of doing so on the go with a mobile app.
#
/r/Python
https://redd.it/1i8xjry
GitHub
GitHub - EnhancedJax/Bagels: Powerful expense tracker that lives in your terminal.
Powerful expense tracker that lives in your terminal. - EnhancedJax/Bagels
Django + HTMX
I am playing around for a while now with this combo with mixed feelings.
It’s really hard to remain a clear structure when rendering only with partials, snippets and including it then in content…
Do you struggle with that too? Is there any helpful resource how not to lose track with the overall structure?
/r/django
https://redd.it/1i8zs5l
I am playing around for a while now with this combo with mixed feelings.
It’s really hard to remain a clear structure when rendering only with partials, snippets and including it then in content…
Do you struggle with that too? Is there any helpful resource how not to lose track with the overall structure?
/r/django
https://redd.it/1i8zs5l
Reddit
From the django community on Reddit
Explore this post and more from the django community
Example large open source rest API?
I started reading Indico, but looks like it is mostly a server rendered app rather than an API and SPA. Are there are other large examples that I can read?
/r/flask
https://redd.it/1i9cewi
I started reading Indico, but looks like it is mostly a server rendered app rather than an API and SPA. Are there are other large examples that I can read?
/r/flask
https://redd.it/1i9cewi
Reddit
From the flask community on Reddit
Explore this post and more from the flask community
Need Help Improving My Resume – Seeking Ideas and Feedback!
I’m currently in the process of updating my resume and could really use some fresh ideas or suggestions to make it stand out. I’m applying for a Python Django Developer/Full Stack Developer role, and I want my resume to effectively highlight my skills and experience.
Here’s a bit about me:
Experience: I have 2 years of experience as a Python Django Trainer, where I’ve taught aspiring developers and built a strong foundation in Python, Django, and web development. I’m now transitioning into Full Stack Development roles to apply my technical and teaching expertise in real-world projects.
Skills: Python, Django, JavaScript, REST APIs, SQL, Git, and front-end tools like Tailwind CSS and Bootstrap. I also have experience with creating and deploying projects using Django, collaborating with teams, and building RESTful applications.
Career Goal: I aim to grow as a Full Stack Developer by contributing to innovative projects, building scalable solutions, and exploring modern web development practices.
If you have any suggestions for improving the structure, format, or content of my resume, I’d really appreciate your input.
Are there specific sections or layouts that work best for tech resumes?
What’s the best way to showcase teaching experience in a way that resonates with recruiters?
/r/djangolearning
https://redd.it/1i8122h
I’m currently in the process of updating my resume and could really use some fresh ideas or suggestions to make it stand out. I’m applying for a Python Django Developer/Full Stack Developer role, and I want my resume to effectively highlight my skills and experience.
Here’s a bit about me:
Experience: I have 2 years of experience as a Python Django Trainer, where I’ve taught aspiring developers and built a strong foundation in Python, Django, and web development. I’m now transitioning into Full Stack Development roles to apply my technical and teaching expertise in real-world projects.
Skills: Python, Django, JavaScript, REST APIs, SQL, Git, and front-end tools like Tailwind CSS and Bootstrap. I also have experience with creating and deploying projects using Django, collaborating with teams, and building RESTful applications.
Career Goal: I aim to grow as a Full Stack Developer by contributing to innovative projects, building scalable solutions, and exploring modern web development practices.
If you have any suggestions for improving the structure, format, or content of my resume, I’d really appreciate your input.
Are there specific sections or layouts that work best for tech resumes?
What’s the best way to showcase teaching experience in a way that resonates with recruiters?
/r/djangolearning
https://redd.it/1i8122h
Reddit
From the djangolearning community on Reddit
Explore this post and more from the djangolearning community
Need Help deploying React + Flask
Hi, flask newbie here
i spent quite a while developing a react and flask application thats pretty simple,
react fronend, sends post requests and get requests to flask backend, uses routing aswell.
i had it up working fine with localhost:5000 and localhost:3000 and now I'm trying to deploy it,
tried to use Dockerfile with render and deploy them both at the same time, ended up being really frustrating so i switched to using a digitalocean droplet, and following a tutorial that got me setup with nginx, but because my flask backed doesnt really display anything i am having trouble debugging - i set up a system service on the droplet (just a linux vm) so a gunicorn process is alway running, but i cant visit it properly. also i have a domain but it doenst seemt o work with nginx.
any advice on deployment?
Thanks
/r/flask
https://redd.it/1i95cz0
Hi, flask newbie here
i spent quite a while developing a react and flask application thats pretty simple,
react fronend, sends post requests and get requests to flask backend, uses routing aswell.
i had it up working fine with localhost:5000 and localhost:3000 and now I'm trying to deploy it,
tried to use Dockerfile with render and deploy them both at the same time, ended up being really frustrating so i switched to using a digitalocean droplet, and following a tutorial that got me setup with nginx, but because my flask backed doesnt really display anything i am having trouble debugging - i set up a system service on the droplet (just a linux vm) so a gunicorn process is alway running, but i cant visit it properly. also i have a domain but it doenst seemt o work with nginx.
any advice on deployment?
Thanks
/r/flask
https://redd.it/1i95cz0
Reddit
From the flask community on Reddit
Explore this post and more from the flask community
Any reason to NOT use Pyright?
Based on this comparison (by Microsoft): https://htmlpreview.github.io/?https://github.com/python/typing/blob/main/conformance/results/results.html
It seems Pyright more or less implements nearly every specification in the Python Type System, while it's competitors are still lagging behind. Is there even any reason to not use Pyright (other than it relying on Node.js, but I don't think it's that big of a deal)? I know MyPy is the so-called 'Reference Implementation' but for a Reference Implementation it sure is lagging behind a lot.
EDIT: I context is which Type Checker is best to use as a Language Server, rather than CI/CD.
/r/Python
https://redd.it/1i8pm2f
Based on this comparison (by Microsoft): https://htmlpreview.github.io/?https://github.com/python/typing/blob/main/conformance/results/results.html
It seems Pyright more or less implements nearly every specification in the Python Type System, while it's competitors are still lagging behind. Is there even any reason to not use Pyright (other than it relying on Node.js, but I don't think it's that big of a deal)? I know MyPy is the so-called 'Reference Implementation' but for a Reference Implementation it sure is lagging behind a lot.
EDIT: I context is which Type Checker is best to use as a Language Server, rather than CI/CD.
/r/Python
https://redd.it/1i8pm2f
Reddit
From the Python community on Reddit
Explore this post and more from the Python community
blackjack from 100 days of python code.
Wow. This was rough on me. This is the 3rd version after I got lost in the sauce of my own spaghetti code. So nested in statements I gave my code the bird.
Things I learned:
write your pseudo code. if you don't know **how** you'll do your pseudo code, research on the front end.
always! debug before writing a block of something
if you don't understand what you wrote when you wrote it, you wont understand it later. Breakdown functions into something logical, then test them step by step.
good times. Any pointers would be much appreciated. Thanks everyone :)
from random import randint
import art
def checkscore(playerlist, dealerlist): #get win draw bust lose continue
if len(playerlist) == 5 and sum(playerlist) <= 21:
return "win"
elif sum(playerlist) >= 22:
return "bust"
elif sum(playerlist) == 21 and not sum(dealerlist)
/r/Python
https://redd.it/1i8xqld
Wow. This was rough on me. This is the 3rd version after I got lost in the sauce of my own spaghetti code. So nested in statements I gave my code the bird.
Things I learned:
write your pseudo code. if you don't know **how** you'll do your pseudo code, research on the front end.
always! debug before writing a block of something
if you don't understand what you wrote when you wrote it, you wont understand it later. Breakdown functions into something logical, then test them step by step.
good times. Any pointers would be much appreciated. Thanks everyone :)
from random import randint
import art
def checkscore(playerlist, dealerlist): #get win draw bust lose continue
if len(playerlist) == 5 and sum(playerlist) <= 21:
return "win"
elif sum(playerlist) >= 22:
return "bust"
elif sum(playerlist) == 21 and not sum(dealerlist)
/r/Python
https://redd.it/1i8xqld
Reddit
From the Python community on Reddit
Explore this post and more from the Python community
Django Ninja vs DRF for Async ? Seeking Advice
Hey everyone,
We’re building an API for our app, and while we really want to use Django, we need robust async capabilities since we’ll be making external API requests (e.g., OpenAI, LLMs).
We’re torn between Django REST Framework (DRF) and Django Ninja. While Django Ninja supports async out of the box and looks very developer-friendly, we’re concerned about its long-term support and community size compared to DRF. Future-proofing our app is important since this API will be a core part of our project.
For those who have used Django Ninja in production, would you recommend it? What are your experiences with its stability and scalability?
If not Django Ninja, how would you approach incorporating async functionality into a DRF-based API? Any advice or recommendations would be greatly appreciated! We will definitely use Celery for some tasks but rn we're looking for a solution for immediate responses.
Thanks in advance! 🙏
P.S. We also need robust permission handling since we need to be HIPAA compliant.
/r/django
https://redd.it/1i9f9bb
Hey everyone,
We’re building an API for our app, and while we really want to use Django, we need robust async capabilities since we’ll be making external API requests (e.g., OpenAI, LLMs).
We’re torn between Django REST Framework (DRF) and Django Ninja. While Django Ninja supports async out of the box and looks very developer-friendly, we’re concerned about its long-term support and community size compared to DRF. Future-proofing our app is important since this API will be a core part of our project.
For those who have used Django Ninja in production, would you recommend it? What are your experiences with its stability and scalability?
If not Django Ninja, how would you approach incorporating async functionality into a DRF-based API? Any advice or recommendations would be greatly appreciated! We will definitely use Celery for some tasks but rn we're looking for a solution for immediate responses.
Thanks in advance! 🙏
P.S. We also need robust permission handling since we need to be HIPAA compliant.
/r/django
https://redd.it/1i9f9bb
Reddit
From the django community on Reddit
Explore this post and more from the django community
For my Textual users! Neovim plugin to support Textual CSS Syntax Highlighting
Here's the GitHub repo
Added support for all the same rules as seen in the VSCode Extension for Textual CSS. Hope it's of use to anyone!
/r/Python
https://redd.it/1i9stwk
Here's the GitHub repo
Added support for all the same rules as seen in the VSCode Extension for Textual CSS. Hope it's of use to anyone!
/r/Python
https://redd.it/1i9stwk
GitHub
GitHub - cachebag/nvim-tcss: A Neovim plugin for Textual CSS (.tcss) syntax highlighting as seen on www.github.com/Textualize/transcendent…
A Neovim plugin for Textual CSS (.tcss) syntax highlighting as seen on www.github.com/Textualize/transcendent-textual - cachebag/nvim-tcss
Understanding .get vs get_object_or_404 in Django
https://www.youtube.com/watch?v=J7r5K5kLGWY&t=281s
/r/django
https://redd.it/1i9xvlm
https://www.youtube.com/watch?v=J7r5K5kLGWY&t=281s
/r/django
https://redd.it/1i9xvlm
YouTube
Understanding .get vs get_object_or_404 in Django | How To Use get_object_or_404 In Django
In this video, I will break down the difference between .get() and get_object_or_404() in Django. Learn when to use each, how they handle missing objects, and why get_object_or_404 is a lifesaver for clean and error-proof code! Perfect for Django beginners…
Help Needed: Unable to Update Field Values in Web App (304 Not Modified Issue)
Hi All,
Hi everyone,
I'm working on a small project involving web application development. While I can successfully create records for users, I'm running into trouble updating field values. Every time I try to update, I encounter a
I suspect there's something wrong in my code or configuration, but I can't pinpoint the exact issue.
Here’s what I’d like help with:
Understanding why I might be receiving a `304 Not Modified` status.
Identifying the part of the code I should focus on (frontend or backend).
Below is a brief overview of the technologies I’m using and relevant details:
Frontend: \[HTML, CSS, JavaSCript\]
Backend: [Python\]
Database: \[SQLAlchemy, MySQL\]
HTTP Method for Update: POST, GET
Error Details:
127.0.0.1 \- - [25/Jan/2025 12:03:07\] "GET /static/css/style.css HTTP/1.1" 304 -
[127.0.0.1](http://127.0.0.1) \- - \[25/Jan/2025 12:03:07\] "GET /static/js/profile\_details.js HTTP/1.1" 304 -
127.0.0.1 \- - [25/Jan/2025 12:03:07\] "GET /static/images/default_placeholder.png HTTP/1.1" 304 -
[127.0.0.1](http://127.0.0.1) \- - \[25/Jan/2025 12:03:07\] "GET /static/js/calendar\_availability.js HTTP/1.1" 304 -
127.0.0.1 \- - [25/Jan/2025 12:03:23\] "GET /static/css/style.css HTTP/1.1" 304 -
I’d appreciate any guidance or suggestions. If needed, I can share snippets of the relevant code. Thank you in
/r/flask
https://redd.it/1i9uqpw
Hi All,
Hi everyone,
I'm working on a small project involving web application development. While I can successfully create records for users, I'm running into trouble updating field values. Every time I try to update, I encounter a
304 Not Modified status response.I suspect there's something wrong in my code or configuration, but I can't pinpoint the exact issue.
Here’s what I’d like help with:
Understanding why I might be receiving a `304 Not Modified` status.
Identifying the part of the code I should focus on (frontend or backend).
Below is a brief overview of the technologies I’m using and relevant details:
Frontend: \[HTML, CSS, JavaSCript\]
Backend: [Python\]
Database: \[SQLAlchemy, MySQL\]
HTTP Method for Update: POST, GET
Error Details:
127.0.0.1 \- - [25/Jan/2025 12:03:07\] "GET /static/css/style.css HTTP/1.1" 304 -
[127.0.0.1](http://127.0.0.1) \- - \[25/Jan/2025 12:03:07\] "GET /static/js/profile\_details.js HTTP/1.1" 304 -
127.0.0.1 \- - [25/Jan/2025 12:03:07\] "GET /static/images/default_placeholder.png HTTP/1.1" 304 -
[127.0.0.1](http://127.0.0.1) \- - \[25/Jan/2025 12:03:07\] "GET /static/js/calendar\_availability.js HTTP/1.1" 304 -
127.0.0.1 \- - [25/Jan/2025 12:03:23\] "GET /static/css/style.css HTTP/1.1" 304 -
I’d appreciate any guidance or suggestions. If needed, I can share snippets of the relevant code. Thank you in
/r/flask
https://redd.it/1i9uqpw
Reddit
From the flask community on Reddit
Explore this post and more from the flask community
Possibility of Django being helpful to land a job
Hey guys I am an aspiring software engineer who has made and launched a live website using Django. I really like Django but I don’t see that many career opportunities that Django can give me, so I am thinking if I should focus on data science/analytics. What are your honest thoughts about this?
/r/django
https://redd.it/1ia5gbk
Hey guys I am an aspiring software engineer who has made and launched a live website using Django. I really like Django but I don’t see that many career opportunities that Django can give me, so I am thinking if I should focus on data science/analytics. What are your honest thoughts about this?
/r/django
https://redd.it/1ia5gbk
Reddit
From the django community on Reddit
Explore this post and more from the django community
Sunday Daily Thread: What's everyone working on this week?
# Weekly Thread: What's Everyone Working On This Week? 🛠️
Hello /r/Python! It's time to share what you've been working on! Whether it's a work-in-progress, a completed masterpiece, or just a rough idea, let us know what you're up to!
## How it Works:
1. Show & Tell: Share your current projects, completed works, or future ideas.
2. Discuss: Get feedback, find collaborators, or just chat about your project.
3. Inspire: Your project might inspire someone else, just as you might get inspired here.
## Guidelines:
Feel free to include as many details as you'd like. Code snippets, screenshots, and links are all welcome.
Whether it's your job, your hobby, or your passion project, all Python-related work is welcome here.
## Example Shares:
1. Machine Learning Model: Working on a ML model to predict stock prices. Just cracked a 90% accuracy rate!
2. Web Scraping: Built a script to scrape and analyze news articles. It's helped me understand media bias better.
3. Automation: Automated my home lighting with Python and Raspberry Pi. My life has never been easier!
Let's build and grow together! Share your journey and learn from others. Happy coding! 🌟
/r/Python
https://redd.it/1ia0tm7
# Weekly Thread: What's Everyone Working On This Week? 🛠️
Hello /r/Python! It's time to share what you've been working on! Whether it's a work-in-progress, a completed masterpiece, or just a rough idea, let us know what you're up to!
## How it Works:
1. Show & Tell: Share your current projects, completed works, or future ideas.
2. Discuss: Get feedback, find collaborators, or just chat about your project.
3. Inspire: Your project might inspire someone else, just as you might get inspired here.
## Guidelines:
Feel free to include as many details as you'd like. Code snippets, screenshots, and links are all welcome.
Whether it's your job, your hobby, or your passion project, all Python-related work is welcome here.
## Example Shares:
1. Machine Learning Model: Working on a ML model to predict stock prices. Just cracked a 90% accuracy rate!
2. Web Scraping: Built a script to scrape and analyze news articles. It's helped me understand media bias better.
3. Automation: Automated my home lighting with Python and Raspberry Pi. My life has never been easier!
Let's build and grow together! Share your journey and learn from others. Happy coding! 🌟
/r/Python
https://redd.it/1ia0tm7
Reddit
From the Python community on Reddit
Explore this post and more from the Python community