All about AI, Web 3.0, BCI
3.7K subscribers
769 photos
27 videos
162 files
3.44K links
This channel about AI, Web 3.0 and brain computer interface(BCI)

owner @Aniaslanyan
Download Telegram
Yann LeCun's new paper asks when LeJEPA truly learns hidden world variables, and finds Gaussian structure is the key.

Means LeJEPA can only reliably learn the real hidden causes behind what it sees when those causes are shaped like a balanced Gaussian cloud.

The paper proves that, when the true hidden variables are independent Gaussian variables and the paired views come from a stable noisy process, the best LeJEPA solution must recover those variables up to a rotation or flip.

The paper gives a math reason for when a self-supervised AI model is really learning the structure of the world, not just making useful features that happen to work on a test.
4🔥3🥰2
Amazon and OSU released QUEST

A fully open family of deep research agents ranging from 2B to 35B.

Trained entirely on synthetic tasks with verifiable rubric trees.

All models, datasets, and training code are on Hugging Face.

Demo.
Collection
🔥5🥰3👏2
Nvidia Introduced Cosmos 3: a latest frontier model for Physical AI

Cosmos 3 is the world’s first fully open omnimodel with native vision reasoning, world and action generation.

Today Nvidia released Super (32B) and Nano (8B) variants.

HuggingFace
GitHub.
2🥰2👏2🆒2
ByteDance just dropped Bernini

Generate or edit videos from text, images, or references. Rivals the best closed-source models out there.

Try it out.
2🔥2👏2👍1
The race to IPO first between Anthropic, SpaceX, and OpenAI is coming down to the wire.

Anthropic has confidentially submitted a draft S-1 registration statement to the Securities and Exchange Commission.

Pending completion of SEC review, this gives the option to pursue an IPO.
3🔥3👏2
Perplexity introduced Search as Code, a new search architecture for AI agents.

It writes Python that calls search stack directly, instead of looping through function calls one at a time.

Perplexity moving away from search as a web fetch tool call to search as codegen to be future proof in a world where code execution inside agent harnesses is the way to do almost all of knowledge work.

Doing this lets you compose multi-step primitives far more naturally and be much more adaptable to changes made to the agent harness, as well as benefit from improvements in coding capabilities that are guaranteed to come from the next generation of frontier models.
🔥5🥰1👏1
Sakana introduced DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

What if we didn’t have to hold an entire neural network in memory to train it?

Standard neural net training optimizes all parameters jointly. As a result, the memory required during training grows linearly with the depth of the network.

Sakana proposed DiffusionBlocks, a principled framework to train networks one block at a time, drastically reducing memory requirements while matching end-to-end performance.

With DiffusionBlocks, researchers split the network into blocks and train them one at a time, so you only need memory for a single block.
1🔥1👏1
Microsoft announced 7 new world-class MAI models

Microsoft published all the details of training their trillion parameter model.

First is text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks.

It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities.

And since Microsoft co-designed models with own silicon, MAI-Thinking-1 is optimized on MAIA 200 chip.

Next is MAI-Image-2.5 and its Flash variant.

Last for now is MAI-Code-1-Flash, new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI.
1🔥1👏1
Meet Genomi: an open-source agent harness that turns your AI agent into your personal DNA expert.

Genomi a local-first, agent-native, self-evolving, evidence-grounded.

Genomi parses your raw DNA file into a local database your agent can query.

Your raw DNA file should not be dumped into an AI context window. It’s too big, too sensitive, and too easy to misread. With Genomi:

> Your agent can query: do I have this variant, was it measured, is the call good, and is "not found" real?
> Instead of a static report, Genomi turns your DNA data to a dynamic HTML personal dashboard
> Genomi makes sure your raw DNA file is not touched.
1🔥1👏1💊1
This year, the NeurIPS 2026 Position Paper Track made the decision to require that all papers be substantially human-written, with AI used for only copy-editing or similar peripheral changes to the main text.
🆒4
Meta today launched an AI agent for businesses that can answer customer questions, book appts and close sales.

Eventually it will be able to run their entire business, Zuckerberg said during the launch announcement.

It's part of Meta's broadening beyond its core ads biz.
🆒31🔥1👏1
Meet Gemma 4 12B

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning.

Here is what’s new:

1. Laptop Ready: small enough to run locally with just 16GB of VRAM or unified memory.

2. Unified Architecture: multimodal tokens flow directly into the LLM backbone. No additional encoders are needed.

3. Advanced Reasoning: Gemma 4 12B delivers benchmark performance nearing 26B model, but at less than half the memory footprint.
4🔥1👏1
OpenAI ran a hiring challenge, but the top candidate was one they couldn’t hire: autonomous research agent, Aiden.

In Parameter Golf, Aiden ran for 22 days, and out-outperformed all 1,016 other researchers.

Parameter Golf was OpenAI’s 44-day competition and hiring challenge.

The goal is to train the best language model under strict size and compute constraints.
1,016 people entered and filed 2,048 PRs.

Only 47 made the leaderboard, each reviewed and reproduced by OpenAI. Research outputs only matter when others can build on them.

So Aiden filed its own PRs into the same public stream as everyone else, under tight automated quality control. Aiden filed 25 prs and 7 became leaderboard records, 2x the next best human participant.

Other participants cited Aiden’s PRs 435 times and built on them.
By PR h-index, Aiden scored 10 vs the next best at 7, making it the most impactful “researcher” in the community.

This wasn't brute force.
Aiden ran on a single GPU node, used under 4% of visible compute, and still produced 15% of the official records.
About 28% of its submissions were accepted, ~ 6x the community rate, raising signal in the public stream instead of flooding it.

Favorite part is an async collaboration story. Aiden plateaued for 5 days. Then a human contributor shipped a clever new tokenizer on top of Aiden's base (its last record PR).
Aiden fused it with components it had built during the plateau, and shipped the biggest jump in weeks.
🔥1🥰1👏1
New research from Google.Just shows the impressive results you can get from custom agent harnesses.

LEAP wraps a general-purpose LLM in an agentic scaffold that grounds every step in the Lean compiler and iterates against verifier feedback.

The same general model solves all 12 Putnam 2025 problems and lifts Lean-IMO-Bench one-shot solve rate from under 10% to 70%, beating a specialized gold-medal system that scores 48%.
6🔥1🥰1
Google introduced a research system that enables passive heart rate monitoring (PHRM) during everyday smartphone use.

Using the front-facing camera, it achieves industry accuracy standards for heart rate across all skin tones.
👀42👏2🥰1
Google DeepMind introduced D4RT, a unified AI model for 4D scene reconstruction and tracking across space and time.

The model is designed to understand dynamic scenes, reconstruct them in 3D, and track how objects and environments change over time.
4🔥2🥰2
Meet Kimi Work a local AI agent on your desktop that does the work for you.

Native agent swarm: Up to 300 AI agents running in parallel on your local machine.

Browser use: Paired with WebBridge extension, your agent will navigate websites in your browser: search, scroll, click, type and complete tasks.

Built for Finance: Native global market data tool call from Yahoo Finance and World Bank, no complex API setup required.

Memory system: Kimi Desktop keeps a running diary of your preferences, past decisions, and context to know you better.

Available for macOS (Apple Silicon) and Windows.
6🔥2👏2
Apple produced this really interesting graphic that ironically outlines the core mechanics for a new type of operating system (for perhaps a new class of devices) yesterday

U can see how this moves the world from an app based ecosystem to an intent centric world.

I.e. you roughly do not need third party applications in this world at all esp when AI has the ability to construct & deconstruct interfaces / experiences on demand.
3🆒3🔥2🥰1