AI & ML Papers
32.8K subscribers
7.07K photos
523 videos
24 files
7.72K links
Advancing research in Machine Learning โ€“ practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
๐Ÿ”น Title: AMO-Bench: Large Language Models Still Struggle in High School Math Competitions

๐Ÿ”น Publication Date: Published on Oct 30

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/pdf/2510.26768
โ€ข PDF: https://arxiv.org/pdf/2510.26768
โ€ข Project Page: https://amo-bench.github.io/
โ€ข Github: https://amo-bench.github.io/

๐Ÿ”น Datasets citing this paper:
โ€ข https://huggingface.co/datasets/meituan-longcat/AMO-Bench

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis

๐Ÿ”น Publication Date: Published on Oct 29

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.25628
โ€ข PDF: https://arxiv.org/pdf/2510.25628
โ€ข Github: https://github.com/MAGIC-AI4Med/EHR-R1

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
A title

I guess this is a start
https://xn--r1a.website/frommeforworld



Sponsored By WaybienAds
๐Ÿ”น Title: The Era of Agentic Organization: Learning to Organize with Language Models

๐Ÿ”น Publication Date: Published on Oct 30

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.26658
โ€ข PDF: https://arxiv.org/pdf/2510.26658

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

๐Ÿ”น Publication Date: Published on Oct 30

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.26213
โ€ข PDF: https://arxiv.org/pdf/2510.26213

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
โค1
๐Ÿ”น Title: Exploring Conditions for Diffusion models in Robotic Control

๐Ÿ”น Publication Date: Published on Oct 17

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.15510
โ€ข PDF: https://arxiv.org/pdf/2510.15510
โ€ข Project Page: https://orca-rc.github.io/
โ€ข Github: https://orca-rc.github.io/

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: ChartAB: A Benchmark for Chart Grounding & Dense Alignment

๐Ÿ”น Publication Date: Published on Oct 30

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.26781
โ€ข PDF: https://arxiv.org/pdf/2510.26781
โ€ข Project Page: https://huggingface.co/datasets/umd-zhou-lab/ChartAlignBench
โ€ข Github: https://github.com/tianyi-lab/ChartAlignBench

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

๐Ÿ”น Publication Date: Published on Oct 29

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.25897
โ€ข PDF: https://arxiv.org/pdf/2510.25897
โ€ข Project Page: https://nicolas-dufour.github.io/miro/
โ€ข Github: https://nicolas-dufour.github.io/miro/

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
Forwarded from Kaggle Data Hub
Is Your Crypto Transfer Secure?

Score Your Transfer analyzes wallet activity, flags risky transactions in real time, and generates downloadable compliance reportsโ€”no technical skills needed. Protect funds & stay compliant.



Sponsored By WaybienAds
๐Ÿ”น Title: Surfer 2: The Next Generation of Cross-Platform Computer Use Agents

๐Ÿ”น Publication Date: Published on Oct 22

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.19949
โ€ข PDF: https://arxiv.org/pdf/2510.19949

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: CLASS-IT: Conversational and Lecture-Aligned Small-Scale Instruction Tuning for BabyLMs

๐Ÿ”น Publication Date: Published on Oct 29

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.25364
โ€ข PDF: https://arxiv.org/pdf/2510.25364

๐Ÿ”น Datasets citing this paper:
โ€ข https://huggingface.co/datasets/colinglab/CLASS_IT

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: The End of Manual Decoding: Towards Truly End-to-End Language Models

๐Ÿ”น Publication Date: Published on Oct 30

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.26697
โ€ข PDF: https://arxiv.org/pdf/2510.26697
โ€ข Github: https://github.com/Zacks917/AutoDeco

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

๐Ÿ”น Publication Date: Published on Oct 29

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.25867
โ€ข PDF: https://arxiv.org/pdf/2510.25867
โ€ข Project Page: https://ucsc-vlaa.github.io/MedVLSynther/
โ€ข Github: https://ucsc-vlaa.github.io/MedVLSynther/

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: CityRiSE: Reasoning Urban Socio-Economic Status in Vision-Language Models via Reinforcement Learning

๐Ÿ”น Publication Date: Published on Oct 25

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.22282
โ€ข PDF: https://arxiv.org/pdf/2510.22282
โ€ข Github: https://github.com/tsinghua-fib-lab/CityRiSE

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: PORTool: Tool-Use LLM Training with Rewarded Tree

๐Ÿ”น Publication Date: Published on Oct 29

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.26020
โ€ข PDF: https://arxiv.org/pdf/2510.26020

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

๐Ÿ”น Publication Date: Published on Oct 23

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.20976
โ€ข PDF: https://arxiv.org/pdf/2510.20976

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
๐Ÿ”น Title: Performance Trade-offs of Optimizing Small Language Models for E-Commerce

๐Ÿ”น Publication Date: Published on Oct 24

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.21970
โ€ข PDF: https://arxiv.org/pdf/2510.21970

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
โค1
๐Ÿ”น Title: POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

๐Ÿ”น Publication Date: Published on Oct 28

๐Ÿ”น Paper Links:
โ€ข arXiv Page: https://arxiv.org/abs/2510.24992
โ€ข PDF: https://arxiv.org/pdf/2510.24992

๐Ÿ”น Datasets citing this paper:
No datasets found

๐Ÿ”น Spaces citing this paper:
No spaces found
==================================

For more data science resources:
โœ“ https://xn--r1a.website/DataScienceT
โค2
nature papers: 2000$

Q1 and  Q2 papers    1000$

Q3 and Q4 papers   500$

Doctoral thesis (complete)    700$

M.S thesis         300$

paper simulation   200$

Contact me @husseinsheikho
โค3
AI & ML Papers pinned ยซnature papers: 2000$ Q1 and  Q2 papers    1000$ Q3 and Q4 papers   500$ Doctoral thesis (complete)    700$ M.S thesis         300$ paper simulation   200$ Contact me @husseinsheikhoยป
Top 100 Data Analyst Interview Questions & Answers

#DataAnalysis #InterviewQuestions #SQL #Python #Statistics #CaseStudy #DataScience

Part 1: SQL Questions (Q1-30)

#1. What is the difference between DELETE, TRUNCATE, and DROP?
A:
โ€ข DELETE is a DML command that removes rows from a table based on a WHERE clause. It is slower as it logs each row deletion and can be rolled back.
โ€ข TRUNCATE is a DDL command that quickly removes all rows from a table. It is faster, cannot be rolled back, and resets table identity.
โ€ข DROP is a DDL command that removes the entire table, including its structure, data, and indexes.

#2. Select all unique departments from the employees table.
A: Use the DISTINCT keyword.

SELECT DISTINCT department
FROM employees;


#3. Find the top 5 highest-paid employees.
A: Use ORDER BY and LIMIT.

SELECT name, salary
FROM employees
ORDER BY salary DESC
LIMIT 5;


#4. What is the difference between WHERE and HAVING?
A:
โ€ข WHERE is used to filter records before any groupings are made (i.e., it operates on individual rows).
โ€ข HAVING is used to filter groups after aggregations (GROUP BY) have been performed.

-- Find departments with more than 10 employees
SELECT department, COUNT(employee_id)
FROM employees
GROUP BY department
HAVING COUNT(employee_id) > 10;


#5. What are the different types of SQL joins?
A:
โ€ข (INNER) JOIN: Returns records that have matching values in both tables.
โ€ข LEFT (OUTER) JOIN: Returns all records from the left table, and the matched records from the right table.
โ€ข RIGHT (OUTER) JOIN: Returns all records from the right table, and the matched records from the left table.
โ€ข FULL (OUTER) JOIN: Returns all records when there is a match in either the left or right table.
โ€ข SELF JOIN: A regular join, but the table is joined with itself.

#6. Write a query to find the second-highest salary.
A: Use OFFSET or a subquery.

-- Method 1: Using OFFSET
SELECT salary
FROM employees
ORDER BY salary DESC
LIMIT 1 OFFSET 1;

-- Method 2: Using a Subquery
SELECT MAX(salary)
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);


#7. Find duplicate emails in a customers table.
A: Group by the email column and use HAVING to find groups with a count greater than 1.

SELECT email, COUNT(email)
FROM customers
GROUP BY email
HAVING COUNT(email) > 1;


#8. What is a primary key vs. a foreign key?
A:
โ€ข A Primary Key is a constraint that uniquely identifies each record in a table. It must contain unique values and cannot contain NULL values.
โ€ข A Foreign Key is a key used to link two tables together. It is a field (or collection of fields) in one table that refers to the Primary Key in another table.

#9. Explain Window Functions. Give an example.
A: Window functions perform a calculation across a set of table rows that are somehow related to the current row. Unlike aggregate functions, they do not collapse rows.

-- Rank employees by salary within each department
SELECT
name,
department,
salary,
RANK() OVER (PARTITION BY department ORDER BY salary DESC) as dept_rank
FROM employees;


#10. What is a CTE (Common Table Expression)?
A: A CTE is a temporary, named result set that you can reference within a SELECT, INSERT, UPDATE, or DELETE statement. It helps improve readability and break down complex queries.
โค2