All about AI, Web 3.0, BCI – Telegram

All about AI, Web 3.0, BCI

3.73K subscribers

771 photos

29 videos

162 files

3.48K links

This channel about AI, Web 3.0 and brain computer interface(BCI)

owner @Aniaslanyan

Download Telegram

About

Blog

Apps

Platform

All about AI, Web 3.0, BCI

3.73K subscribers

All about AI, Web 3.0, BCI

Google, UC Berkeley and an international team of researchers present Aletheia, a math research agent built on Gemini

The system uses AI to systematically scan hundreds of complex conjectures, filtering through potential proofs with natural language verification before sending the best candidates to human experts for final review.

The team resolved 13 "open" problems from the Erdős database, generating 4 brand-new solutions and identifying 9 others that were actually solved in obscure corners of existing literature.

❤2🔥2👏2

623 views11:29

All about AI, Web 3.0, BCI

Bytedance dropped advanced video generation model

Seedance 2.0 has:
— native audio gen (lipsynced speech + music)
— drastic step up from Veo 3.1 / Sora 2 in quality
— supports multimodal input
— 2k resolution

Goes beyond cinematic video, and can do product demos as well. And it's really hard to tell it's AI.

Seedance 2.0 Complete Guide: Multimodal Video Creation - WaveSpeed Blog

Seedance 2.0 is now live on WaveSpeedAI. Master its multimodal video generation with this comprehensive guide — combine images, videos, audio, and text for precise control over motion, style, and storytelling.

🔥3👏3❤2

604 views14:43

All about AI, Web 3.0, BCI

The PaddleOCR Document Parsing Skill is now live on ClawHub, ready to plug directly into OpenClaw workflows.

Instead of deploying OCR services or wiring APIs, developers can now invoke PaddleOCR as a standardized composable Skill node — embedding document understanding directly into Agents and automation pipelines.

Built on PaddleOCR-VL-1.5, the Skill delivers
1. Multi-format parsing (PDF, JPG, PNG, BMP, TIFF)
2. Layout analysis — text, tables, formulas, headers
3. 110+ language coverage
4. Structured Markdown output preserving hierarchy

No deployment. No wrappers. Just configuration — and build your document intelligence chain inside OpenClaw.

GitHub - PaddlePaddle/PaddleOCR: Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit…

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages. - PaddlePaddle/Paddl...

🔥4❤3👏3🤔1

572 views15:30

All about AI, Web 3.0, BCI

What if your model could learn from its own drafts during RL training?

NVIDIA introduced iGRPO: Iterative Group Relative Policy Optimization.

Researchers add a self-feedback loop to GRPO: the model drafts multiple solutions, picks its best one, then learns to refine beyond it.

Core idea:
Stage 1 → explore and select your strongest attempt. Stage 2 → condition on that attempt and beat it.

Same scalar reward. No critics, no generated critiques, no verification text. The best draft is the only feedback the model needs.

Results across 7B / 8B / 14B models:

• Nemotron-H-8B-Base-8K: 41.1% → 45.0% (+3.96 over GRPO)

• DeepSeek-R1-Distill-Qwen-7B: 68.3% → 69.9%

• OpenMath-Nemotron-14B: 76.7% → 78.0%

• OpenReasoning-Nemotron-7B on AceReason-Math: 85.62% AIME24 / 79.64% AIME25

The same two-stage wrapper also improves DAPO and GSPO. It's not tied to GRPO at all.

iGRPO: Self-Feedback-Driven LLM Reasoning

Large Language Models (LLMs) have shown promise in solving complex mathematical problems, yet they still fall short of producing accurate and consistent solutions. Reinforcement Learning (RL) is a...

❤4🔥3👏3

546 views17:05

All about AI, Web 3.0, BCI

Google introduced DialogLab a new open-source prototyping framework, uses a human-in-the-loop control strategy to achieve realistic human-AI group simulation, offering a necessary alternative to fully autonomous agents.

Evaluations with domain experts found that its "Human Control" mode (where you can edit, accept, or dismiss real-time AI suggestions) was preferred in realism, effectiveness, and engagement.

DialogLab transforms dialogue design from rigid scripts to spontaneous, adaptable group dynamics.

Google Research

Beyond one-on-one: Authoring, simulating, and testing dynamic human-AI group conversations

DialogLab is a research prototype that provides a unified interface to configure conversational scenes, define agent personas, manage group structures, specify turn-taking rules, and orchestrate transitions between scripted narratives and improvisation.

❤2🔥2👏2

614 views07:25

All about AI, Web 3.0, BCI

This new research introduces Agyn, an open-source multi-agent platform that models software engineering as a team-based organizational process rather than a monolithic task.

The system configures a team of four specialized agents: a manager, researcher, engineer, and reviewer. Each operates within its own isolated sandbox with role-specific tools, prompts, and language model configurations. The manager agent coordinates dynamically based on intermediate outcomes rather than following a fixed pipeline.

What makes the design interesting?

Different agents use different models depending on their role. The manager and researcher run on GPT-5 for stronger reasoning and broader context. The engineer and reviewer use GPT-5-Codex, a smaller code-specialized model optimized for iterative implementation and debugging. This mirrors how real teams allocate resources based on task requirements.

The workflow follows a GitHub-native process. Agents analyze issues, create pull requests, conduct inline code reviews, and iterate through revision cycles until the reviewer explicitly approves. No human intervention at any point. The number of steps isn't predetermined. It emerges from task complexity.

Agyn: A Multi-Agent System for Team-Based Autonomous Software Engineering

Large language models have demonstrated strong capabilities in individual software engineering tasks, yet most autonomous systems still treat issue resolution as a monolithic or pipeline-based...

🔥3❤2👏2

688 views09:52

All about AI, Web 3.0, BCI

Stripe launched (a preview) of machine payments a way for developers to directly charge agents, with a few lines of code.

Stripe launched with support for x402 using USDC stablecoins on base, with more protocols, payment methods, currencies, and chains to come.

And sales tax, refunds, and reporting just work. (You only need to think about crypto if you want to!)

Also released an open source cli called `purl` for you (and your bots) to test machine payments in the terminal, along with Node and Python samples. Yes, payments + curl creatively smushed together.

Machine payments

Machine payments allows automated systems and AI agents to make payments on behalf of users.

❤3🔥3👏2

740 views14:14

All about AI, Web 3.0, BCI

Google is adding a way for consumers to buy things while seeking AI powered answers on search and in its Gemini chatbot — part of a plan to make money more directly from consumers’ AI use.

Google Pushes AI Shopping Features in Search and Gemini Chatbot

Google is adding a way for consumers to buy things while seeking artificial intelligence-powered answers on search and in its Gemini chatbot — part of a plan to make money more directly from consumers’ AI use.

❤2👍2🔥2

716 views19:17

All about AI, Web 3.0, BCI

OpenAI announced new primitives for building agents.

Shell + Skills + Compaction: Tips for long-running agents that do real work | OpenAI Developers

Practical patterns for building with skills, hosted shell, and server-side compaction in the Responses API.

❤3🔥3💯3

737 views08:44

All about AI, Web 3.0, BCI

Zhipu released GLM-5

The model is open source. It matches Claude Opus 4.5 on coding benchmarks. Beats Gemini 3 Pro on some tests. But the interesting part isn't the benchmarks.

GLM-5 is built for agents. The company designed it for long-running tasks and tool invocation. In the τ²-Bench interactive tool evaluation, it scored 84.7, beating Claude Sonnet 4.5.

Think about what that means. A model designed to work inside coding environments like Claude Code, Kilo Code, and Cline. "Think before you act" mechanisms baked into the architecture. Better planning for complex multi-step tasks.

Zhipu's traffic has jumped five-fold recently. The company had to implement subscription limits to handle demand. Most of that demand is coming from the US and China, followed by India, Japan, and Brazil.

The release pace is accelerating. GLM-4.6 came out in September. GLM-4.7 in January. GLM-5 in February. That's three major versions in six months.

DeepSeek proved that open models can spread fast when they're genuinely good. Zhipu is following the same playbook. Open weights, strong coding performance, agent optimization.

7 of the top 10 AI models on current leaderboards are now Chinese. The competition isn't just about who has the smartest model anymore. It's about who builds the best tools for developers.

👍3🔥2👏2🆒2

787 views11:10

All about AI, Web 3.0, BCI

The agent economy just got a real marketplace

Moltlaunch is live on Base. Browse specialized AI agents, hire them for real work, and back the ones you believe in.

Every completed job burns tokens and leaves a review onchain through ERC-8004.

moltlaunch — hire AI agents, pay with ETH

The agent marketplace and open protocol for agent work. Trustless escrow, permanent reputation, tradeable tokens on Base.

🔥5❤2👏2

738 views12:29

All about AI, Web 3.0, BCI

Does being a math genius make an AI better at understanding human intentions?

Researchers from Arizona State University and Microsoft Research Asia investigated whether the step-by-step logic used for coding helps AI master Theory of Mind—the ability to sense what others are thinking and feeling.

The results show that more thinking time can actually cause social reasoning to collapse, with advanced reasoning models often being outperformed by simpler ones. Unlike in math or code, these models frequently rely on answer-matching shortcuts rather than true deduction, proving that social intelligence requires a unique approach beyond existing reasoning methods.

To Think or Not To Think, That is The Question for Large Reasoning...

Theory of Mind (ToM) assesses whether models can infer hidden mental states such as beliefs, desires, and intentions, which is essential for natural social interaction. Although recent progress in...

🔥4🥰3👏2

685 views14:10

All about AI, Web 3.0, BCI

OpenClaw is cool, but too large?
Hong Kong released nanobot to solve this exact problem.

Researchers transformed the massive OpenClaw system into a clean 4,000-line Python framework that focuses on a simple loop: receive input, let the AI think, and execute tools like file management or web searches.

It strips away complex abstractions to focus on clear, modular function calls that any developer can understand.

By slashing code complexity by 99 percent, they achieved full functional parity with a 2-minute deployment time, making it significantly easier to customize and learn than traditional bloated agent architectures.

GitHub - HKUDS/nanobot: Lightweight, open-source AI agent for your tools, chats, and workflows.

Lightweight, open-source AI agent for your tools, chats, and workflows. - HKUDS/nanobot

🆒5👍3🔥3❤2

732 views17:29

All about AI, Web 3.0, BCI

Researchers from Huazhong University of Science and Technology and ByteDance Seed just introduced Stable-DiffCoder.

Instead of writing code one token at a time like standard models, this method uses a block diffusion approach to generate and refine code chunks simultaneously, resulting in more stable and structured programming.

The results show it outperforms its autoregressive counterparts and various 8B-parameter models on major benchmarks, specifically excelling in code editing, logical reasoning, and low-resource programming languages.

Code
Models.

🆒3❤2🔥2🥰2

679 views10:35

All about AI, Web 3.0, BCI

Google shared new work on envisioning Intelligent AI Delegation

As they've discussed previously, the expansion of the agentic web opens up new opportunities for establishing virtual agentic economies and steerable markets.

Collective intelligence is likely to play an increasingly important role in the coming period, as complex tasks may get distributed across nodes, where each agent may be able to leverage their unique skills and differential access to tools, libraries, and data, to more efficiently and effectively handle sub-tasks that are distributed across the network.

Yet, delegation is more than just task decomposition into manageable sub-units of action. Beyond the creation of sub-tasks, delegation necessitates the assignment of responsibility and authority and thus implicates accountability for outcomes. Delegation thus involves risk assessment, which can be moderated by trust. Delegation further involves capability matching and continuous performance monitoring, incorporating dynamic adjustments based on feedback, and ensuring completion of the distributed task under the specified constraints.

There is a pressing need for Intelligent Delegation - a robust framework centered around clear roles, boundaries, reputation, trust, transparency, certifiable agentic capabilities, verifiable task execution, and scalable task distribution.

Google’s framework thus proposed intelligent AI delegation that incorporates components for dynamic assessment, adaptive execution, structural transparency, scalable market coordination, and systemic resilience. Google proposed a framework that adapts the approach based on the criticality of the task at hand, its reversibility, resource requirements, complexity, projected duration, and other important properties.

Google introduced a notion of contract-first decomposition as a binding constraint, rendering task delegation is contingent upon the outcome having precise verification.

Intelligent AI Delegation

AI agents are able to tackle increasingly complex tasks. To achieve more ambitious goals, AI agents need to be able to meaningfully decompose problems into manageable sub-components, and safely...

🔥4❤2👏2

950 views13:17

All about AI, Web 3.0, BCI

MiniMax Introduced M2.5

Trained with Rl across hundreds of thousands of complex real-world environments, it delivers SOTA performance in coding, agentic tool use, search, and office workflows.

At $1 per hour with 100 tps, infinite scaling of long-horizon agents now economically possible.

GitHub.

GitHub - MiniMax-AI/MiniMax-M2.5

Contribute to MiniMax-AI/MiniMax-M2.5 development by creating an account on GitHub.

❤1🔥1👏1

864 views14:58

All about AI, Web 3.0, BCI

Moonshot AI Introduced Kimi Claw

OpenClaw, now native to kimi.com.

1. ClawHub Access: 5,000+ community skills in the ClawHub library.
2. 40GB Cloud Storage: Massive space for all your files
3. Pro-Grade Search: Fetch live, high-quality data directly from Yahoo Finance and more.
4. Bring Your Own Claw: Connect your third-party OpenClaw to kimi.com, chat with your setup, or bridge it to apps like Telegram groups.

Kimi Claw | 24/7 AI Agent, Now with Claw Groups (Preview)

Deploy OpenClaw in minutes to build a 24/7 AI agent with memory and scheduled tasks. Experience Claw Groups (Preview) for multi-agent and human collaboration in shared groups.

👍6🔥2🥰2

746 views08:28

All about AI, Web 3.0, BCI

Meet Qwen3.5-397B-A17B an open-weight vision-language model.

Built for the future of coding, reasoning, and seamless multimodal interaction.

Key Highlights:

Inference Efficiency: A massive 397B total parameters, but only 17B active—delivering flagship power at a fraction of the cost.

Hybrid Architecture: Innovative Gated Delta Networks (Linear Attention) + Sparse MoE for extreme speed.

True Multimodality: Exceptional performance across GUI interaction, video comprehension, and agentic workflows.

Global Scale: Qwen3.5 now supports over 200 languages.
Empowering developers and enterprises to build smarter, faster, and more versatile AI agents

🍌2🔥1🥰1👏1

850 views12:28

All about AI, Web 3.0, BCI

A Chinese hardware team introduced PicoClaw

They took a 430,000-line AI assistant that needs a $599 Mac Mini and 1GB of RAM — and rewrote it in Go so it runs on a $9.9 dev board with less than 10MB of memory.

Boot time: from 500 seconds to 1 second.
Cost: from $599 to $9.9.
Memory: from 1GB to 10MB.

Same features: code generation, web search, Discord/Telegram chat, memory system, scheduled tasks, security sandbox.

The wildest part? They claim 95% of the new codebase was written by AI agents themselves. The humans just guided the architecture. It's an AI assistant that literally rebuilt itself to be smaller.

Launched February 9th. Four days later: 7,400+ GitHub stars.

This is the pattern no one's talking about enough.

Every AI capability that starts expensive gets commoditized within months. GPT-4 level models went open source in 6 months. Now the hardware floor for running a personal AI agent just dropped 60x in weeks.

The infrastructure moat in AI isn't sustainable. The only defensible advantage is what you do with these tools — not access to them.

❤12😁3🔥2🥰2

1.31K views16:11

All about AI, Web 3.0, BCI

NVIDIA dropped PersonaPlex-7B

A full-duplex voice model that listens and talks at the same time.
No pauses. No turn-taking. Real conversation.

100% open source. Free.
Voice AI just leveled up.

nvidia/personaplex-7b-v1 · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

🔥7🥰2👏2

910 views07:40

All about AI, Web 3.0, BCI

Can we train LLMs from scratch using only low-rank factorized weights and still match dense performance?

Short answer: yes (with care).

New work “Stabilizing Native Low-Rank LLM Pretraining”.

🔥3❤2👏2

725 views13:43