Just links – Telegram

Just links

6.59K subscribers

362 photos

43 videos

10 files

7.8K links

That's just link aggregator of everything I consider interesting, especially DL and topological condensed matter physics. @EvgeniyZh

Download Telegram

About

Blog

Apps

Platform

6.59K subscribers

Refer-Agent: A Collaborative Multi-Agent System with Reasoning and Reflection for Referring Video Object Segmentation https://arxiv.org/abs/2602.03595

Refer-Agent: A Collaborative Multi-Agent System with Reasoning and...

Referring Video Object Segmentation (RVOS) aims to segment objects in videos based on textual queries. Current methods mainly rely on large-scale supervised fine-tuning (SFT) of Multi-modal Large...

1.81K views08:35

Learning to Repair Lean Proofs from Compiler Feedback https://arxiv.org/abs/2602.02990

Learning to Repair Lean Proofs from Compiler Feedback

As neural theorem provers become increasingly agentic, the ability to interpret and act on compiler feedback is critical. However, existing Lean datasets consist almost exclusively of correct...

❤1

1.99K views08:48

First Proof https://arxiv.org/abs/2602.05192

To assess the ability of current AI systems to correctly answer research-level mathematics questions, we share a set of ten math questions which have arisen naturally in the research process of...

🤔3

1.92K views14:24

Universal Topological Gates from Braiding and Fusing Anyons on Quantum Hardware https://arxiv.org/abs/2601.20956

Universal Topological Gates from Braiding and Fusing Anyons on...

Topological quantum computation encodes quantum information in the internal fusion space of non-Abelian anyonic quasiparticles, whose braiding implements logical gates. This goes beyond Abelian...

1.67K views16:08

Expanding the Capabilities of Reinforcement Learning via Text Feedback https://arxiv.org/abs/2602.02482

Expanding the Capabilities of Reinforcement Learning via Text Feedback

The success of RL for LLM post-training stems from an unreasonably uninformative source: a single bit of information per rollout as binary reward or preference label. At the other extreme,...

1.88K views16:09

https://fixupx.com/i/status/2021239388173213737

🧵 Thread • FixupX

Charlie (Zixi) Chen (@charllechen)

Why is nanochat's optimal tokens per param 8, much smaller than 20 from Chinchilla? We had similar findings in our NeurIPS work https://arxiv.org/abs/2512.05620. We hypothesize two key factors: (1) improved optimization and (2) higher-quality data. 1/n🧵
…

🔥2

9.78K views22:16

Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning https://arxiv.org/abs/2509.22008

Goal-Guided Efficient Exploration via Large Language Model in...

Real-world decision-making tasks typically occur in complex and open environments, posing significant challenges to reinforcement learning (RL) agents' exploration efficiency and long-horizon...

1.64K views23:42

Self-dual Higgs transitions: Toric code and beyond https://arxiv.org/abs/2601.20945

Self-dual Higgs transitions: Toric code and beyond

The toric code, when deformed in a way that preserves the self-duality $\mathbb{Z}_2$ symmetry exchanging the electric and magnetic excitations, admits a transition to a topologically trivial...

🤯2👾1

1.53K views13:13

We hid backdoors in binaries — Opus 4.6 found 49% of them https://quesma.com/blog/introducing-binaryaudit/

We hid backdoors in binaries — Opus 4.6 found 49% of them - Quesma Blog

BinaryAudit benchmarks AI agents using Ghidra to find backdoors in compiled binaries of real open-source servers, proxies, and network infrastructure.

👍6

2.11K viewsedited 15:03

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC

Op1 - Partial 8-piece tablebase available

63 TiB of chess knowledge sent across the Atlantic and now available on the Lichess analysis board

1.55K views09:03

An X-ray-emitting protocluster at z ≈ 5.7 reveals rapid structure growth https://www.nature.com/articles/s41586-025-09973-1

An X-ray-emitting protocluster at z ≈ 5.7 reveals rapid structure growth

Nature - Discovery of a protocluster at z = 5.68, merely one billion years after the Big Bang, suggests that large-scale structure must have formed more rapidly in some regions of the...

1.53K views11:18

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-deep-think/

Gemini 3 Deep Think: Advancing science, research and engineering

We’re releasing a major upgrade to Gemini 3 Deep Think, our specialized reasoning mode.

🤷‍♂1

1.67K views15:16

https://github.com/hacker-fab/gitbook

GitHub - hacker-fab/gitbook

Contribute to hacker-fab/gitbook development by creating an account on GitHub.

🤯2❤1

1.6K views07:12

Soft Contamination Means Benchmarks Test Shallow Generalization https://arxiv.org/abs/2602.12413

Soft Contamination Means Benchmarks Test Shallow Generalization

If LLM training data is polluted with benchmark test data, then benchmark performance gives biased estimates of out-of-distribution (OOD) generalization. Typical decontamination filters use n-gram...

1.17K views14:46

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning https://arxiv.org/abs/2602.11149

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

Supervised fine-tuning (SFT) on chain-of-thought data is an essential post-training step for reasoning language models. Standard machine learning intuition suggests that training with more unique...

1.04K views06:53

rePIRL: Learn PRM with Inverse RL for LLM Reasoning https://arxiv.org/abs/2602.07832

rePIRL: Learn PRM with Inverse RL for LLM Reasoning

Process rewards have been widely used in deep reinforcement learning to improve training efficiency, reduce variance, and prevent reward hacking. In LLM reasoning, existing works also explore...

❤3

1.07K views07:14

Tensor Decomposition for Non-Clifford Gate Minimization https://arxiv.org/abs/2602.15285

Tensor Decomposition for Non-Clifford Gate Minimization

Fault-tolerant quantum computation requires minimizing non-Clifford gates, whose implementation via magic state distillation dominates the resource costs. While $T$-count minimization is...

👍3😈1

944 views05:36

Forwarded from Love. Death. Transformers.

>We throw away gradient updates randomly
>Outperforms Muon with RMSProp

paper

🔥8👍2

660 views16:30

BRIDGE: Predicting Human Task Completion Time From Model Performance https://arxiv.org/abs/2602.07267

BRIDGE: Predicting Human Task Completion Time From Model Performance

Evaluating the real-world capabilities of AI systems requires grounding benchmark performance in human-interpretable measures of task difficulty. Existing approaches that rely on direct human task...

😁1

867 views20:02

Forwarded from Love. Death. Transformers.

Если вы готовитесь к собесу в норм место вам будет полезно почитать

https://djdumpling.github.io/2026/01/31/frontier_training.html

Alex Wa’s Blog

frontier model training methodologies

How do labs train a frontier, multi-billion parameter model? We look towards seven open-weight frontier models: Hugging Face’s SmolLM3, Prime Intellect’s Intellect 3, Nous Research’s Hermes 4, OpenAI’s gpt-oss-120b, Moonshot’s Kimi K2, DeepSeek’s DeepSeek…

👍3🔥3

391 views12:01