All about AI, Web 3.0, BCI
3.3K subscribers
729 photos
26 videos
161 files
3.14K links
This channel about AI, Web 3.0 and brain computer interface(BCI)

owner @Aniaslanyan
Download Telegram
Anthropic published a new constitution for Claude.

The new constitution discusses Claude in terms previously reserved for humans—incorporating concepts like virtue, psychological security, and ethical maturity.
4🔥3👏2
Amazon is rolling out Health AI for One Medical members where an AI assistant, built on Amazon Bedrock, uses your medical records, labs & meds.

It can answer health questions, manage prescriptions & book appointments pushing Amazon deeper into this space now too.
2
China has launched its first open-source, vertical LLM dedicated to the general agricultural sector, marking a significant breakthrough in foundational AI model research and its applications for agriculture in the country.

The model, Sinong, which is named after the ancient Chinese officials overseeing agriculture and finance, integrates content from nearly 9,000 books, over 240,000 academic papers, approximately 20,000 policy documents and standards, and extensive web-based knowledge.
Sinong is now fully open-sourced on platforms like ModelScope and GitHub.
👏6🔥3🥰2
This paper from Google DeepMind, Meta, Amazon, and Yale University quietly explains why most AI agents feel smart in demos and dumb in real work.

The authors formalize agentic reasoning as a loop, not a prompt:

observe → plan → act → reflect → update state → repeat.

Instead of one long chain-of-thought, the model maintains an internal task state. It decides what to think about next, not just how to finish the sentence.

This is why classic tricks like longer CoT plateau. You get more words, not better decisions.

One of the most important insights: reasoning quality collapses when control and reasoning are mixed. When the same prompt tries to plan, execute, critique, and finalize, errors compound silently. Agentic setups separate these roles.

Planning is explicit. Execution is scoped. Reflection is delayed and structured.

The paper shows that even strong frontier models improve dramatically when given:

• explicit intermediate goals
• checkpoints for self-evaluation
• the ability to abandon bad paths
• memory of past attempts

The takeaway is brutal for the industry: scaling tokens and parameters won’t give us reliable agents. Architecture will. Agentic reasoning isn’t a feature it’s the missing operating system for LLMs.
🔥6👍4👏3
Google DeepMind looking to hire a Senior Economist to lead a small team investigating post-AGI economics.
🔥5👏2🤩2💯2
How to get AI to make discoveries on open scientific problems?

Most methods just improve the prompt with more attempts. But the AI itself doesn't improve.

With test-time training, AI can continue to learn on the problem it’s trying to solve.

Meet TTT-Discover, which enables open models to beat the prior art from both humans and AI based on closed frontier models:

1. Mathematics: new bounds on Erdős' minimum overlap problem and an autocorrelation inequality

2. Kernel Engineering: 2× faster than top humans in GPUMode

3. Algorithms: top scores on past AtCoder contests

4. Biology: SOTA for single-cell RNA-seq denoising.

All of code is public + results are reproducible here.

Everyone can now discover new SOTA in science with a few hundred $.

Test-Time Training + open model > prompt engineering + closed frontier model (Gemini, GPT-5), for discovery problems in Mathematics, Kernel Engineering, Algorithms and Biology.
4👍4🔥4
LLM in sandbox elicits general agentic intelligence

Giving LLMs access to a code sandbox unlocks emergent capabilities for non-code tasks.
Emergent capabilities for non-code tasks.

Contributions:
1. LLMs spontaneously exploit sandbox capabilities (external access, file I/O, code execution) without training
2. RL with non-agentic data enables agentic generalization
3. Efficient deployment: up to 8× token savings

HuggingFace
GitHub
3🔥3👏3
A new work from Yoshua Bengio’s lab: Recursive Self-Aggregation > Gemini DeepThink.

it really is the best test-time scaling algorithm. Just crushed ARC-AGI 2 public evals with Gemini 3 Flash and RSA.
5🔥4🥰4
Nvidia introduced 3 new open source models in the NV Earth-2 family, enabling weather forecasting with tools for data assimilation, forecasting, nowcasting, and downscaling.

Developers can also build climate simulations using PhysicsNeMo and create inference pipelines with the open source Earth2Studio framework.
👍4🔥43
DeepSeek just released #DeepSeek-OCR 2

Now, AI could "see" an image in the same logical order as a human!

Its new method, the DeepEncoder V2, teaches the AI to dynamically reorder the pieces of an image based on its meaning, instead of just scanning it rigidly from left to right. This mimics how humans follow the logical flow of a scene.

The result is a model that outperforms conventional vision-language models, especially on images with complex layouts like documents or diagrams, by enabling more intelligent, causally-informed visual understanding.
🔥43👍2
The “One Person Company” (OPC) model is booming, especially in innovation hubs like Shenzhen, where AI-powered entrepreneurship is reshaping the business landscape.

These OPCs, often led by a single founder supported by AI and minimal staff, offer fast decision-making, low costs, and high flexibility. Shenzhen is building dedicated OPC hubs, attracting creators nationwide.
🔥7💯4👏3🤡1
Moonshot AI released Kimi K2.5, Open-Source Visual Agentic Intelligence

Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%)

Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%)

Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion.

Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5× faster compared with single-agent setup.

K2.5 is now live on kimi.com in chat mode and agent mode.

K2.5 Agent Swarm in beta for high-tier users.

For production-grade coding, you can pair K2.5 with Kimi Code

Weights & code.
❤‍🔥7🔥4👏3
Qwen released Qwen3-Max-Thinking, its flagship reasoning model and DeepPlanning

It says demonstrates performance comparable to models such as GPT-5.2 Thinking and Opus 4.5 (Qwen).

Key innovations:
1. Adaptive tool-use: intelligently leverages Search, Memory & Code Interpreter without manual selection
2. Test-time scaling: multi-round self-reflection beats Gemini 3 Pro on reasoning
3. From complex math (98.0 on HMMT Feb) to agentic search (49.8 on HLE)—it just thinks better.

DeepPlanning is a new benchmark for long-horizon agent planning in real-world scenarios.

HF
ModelScope.
5🔥5👍3
OpenAI introduced Prism a free, AI-native workspace for scientists to write and collaborate on research, powered by GPT-5.2.

Accelerating science requires progress on two fronts:

1. Frontier AI models that use scientific tools and can tackle the hardest problems
2. Integrating that AI into the products scientists use every day

Prism is free to anyone with a ChatGPT account, with unlimited projects and collaborators.
6🔥2👏2
Google introduced ATLAS: new scaling laws for massively multilingual language models.

Practical, data-driven guidance to balance data mix and model size, helping global developers better serve billions of non-English speakers.
2🔥2🆒2👏1
Big news in clinical AI: Aidoc secured FDA clearance for healthcare’s first comprehensive AI triage solution for body CT, powered by their CARE foundation model.
👏3🔥2💯2
Fidelity to launch dollar-backed stablecoin FIDD on Ethereum in coming weeks

The firm first said it was testing a stablecoin in early 2025, but had not committed to a launch at the time.

The token will be issued by Fidelity Digital Assets’ national trust bank and is expected to roll out to both retail and institutional customers.

Fidelity said it will oversee issuance and management of reserves for the stablecoin, leaning on its asset management arm, Fidelity Management & Research Company LLC, to handle reserve assets.

Customers will be able to purchase or redeem FIDD for $1 through Fidelity Digital Assets, Fidelity Crypto and Fidelity Crypto for Wealth Managers, with the stablecoin also transferable to any Ethereum mainnet address and available on major crypto exchanges where it is listed.
3🔥2👏2
In the last month, 1X, Skild, and Physical Intelligence all signaled a shift to human data.

Robotics is caught in a tug-of-war between quality and scale, where reality is the referee.

This essay explains why the robot models that best navigate the “Data Pareto Frontier” will win in 2026.
🔥3🥰2👏2