ReSearch: Teaching LLMs to Make Better Decisions Through Search
Baichuan AI has unveiled an exciting open-source project called ReSearch.
This innovative system teaches Large Language Models to improve their reasoning capabilities by actively searching for information when needed.
How ReSearch Works:
ReSearch combines Reinforcement Learning (RL) with Retrieval-Augmented Generation (RAG) to empower LLMs with a crucial skill: determining when to search for external information.
Similar to how humans look up facts when uncertain, these enhanced models learn to:
- Identify knowledge gaps requiring external information
- Formulate effective search queries
- Execute multi-step, multi-hop searches for complex problems
- Integrate search results into their reasoning process
What makes this approach particularly impressive is that the model learns these search patterns without direct supervision.
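A minimal sketch of the inference-time loop this describes, assuming the model marks queries with `<search>` tags, retrieved text is appended in `<result>` tags, and the final answer comes in `<answer>` tags. The tag names, `toy_generate` and `toy_retrieve` are illustrative stand-ins, not the project's actual API, and the RL training that teaches the model when to search is not shown.

```python
import re
from typing import Callable, List

# Assumed tag conventions for this sketch only.
SEARCH_TAG = re.compile(r"<search>(.*?)</search>", re.DOTALL)
ANSWER_TAG = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def reason_with_search(
    question: str,
    generate: Callable[[str], str],
    retrieve: Callable[[str], List[str]],
    max_steps: int = 4,
) -> str:
    """Alternate between generation and retrieval until an answer appears."""
    context = f"Question: {question}\n"
    for _ in range(max_steps):
        step = generate(context)
        context += step
        answer = ANSWER_TAG.search(step)
        if answer:
            return answer.group(1).strip()
        query = SEARCH_TAG.search(step)
        if query:
            # Feed the retrieved passages back into the reasoning context.
            docs = retrieve(query.group(1).strip())
            context += "\n<result>" + " | ".join(docs) + "</result>\n"
    return ""  # no answer within the step budget


# Hypothetical stand-ins: a scripted "model" and a dictionary "search engine".
def toy_generate(context: str) -> str:
    if "<result>" not in context:
        return "I need the author first. <search>author of Dune</search>"
    if context.count("<result>") == 1:
        return "Now the birth year. <search>Frank Herbert birth year</search>"
    return "<answer>1920</answer>"


def toy_retrieve(query: str) -> List[str]:
    corpus = {
        "author of Dune": ["Dune was written by Frank Herbert."],
        "Frank Herbert birth year": ["Frank Herbert was born in 1920."],
    }
    return corpus.get(query, [])


print(reason_with_search("When was the author of Dune born?", toy_generate, toy_retrieve))
```

The toy question needs two hops (author, then birth year), which is the multi-step pattern from the list above. In a real setup `generate` would call the trained LLM and `retrieve` a search backend.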
GitHub
GitHub - Agent-RL/ReCall: ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason…
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning - Agent-RL/ReCall
🔥4❤3👍3
Sophgo has introduced the first RISC-V servers that support #DeepSeek R1 models (1.5B to 70B)
Can do 11.8 tps for 70B
The SRA3-40 computing server uses Sophgo's latest SG2044 64-core server CPU.
Sophgo also released the SRB3-40 storage server and the SRM3-40 convergence server, both based on the SG2044.
Ithome
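Sophgo launches SRA3-40: the world's first RISC-V many-core server to support DeepSeek - IT之家
The SRA3-40 is a computing server based on the SG2044, a new-generation server-grade 64-core RISC-V processor developed by Sophgo's Sophon team.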
👍3🔥3❤2
Huge VLM release from Cohere for AI is just in
Aya-Vision is a new VLM family based on SigLIP and Aya, and it outperforms many larger models.
> 8B and 32B models covering 23 languages and two new benchmark datasets
> supported by HF transformers from the get-go
huggingface.co
Cohere Labs Aya Vision - a CohereLabs Collection
Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages.
👍3👏3🔥2
Forwarded from @kurilo (Dmitri)
Looking for exceptionally strong engineers.
iOS (native, swift, obj-c)
Backend (GCP, Node.js, NestJS, Nx, Kubernetes)
Location: Europe. Remote is OK.
DM for more info @masterrr
Products I'm hiring for:
https://bereal.com/
https://carrotcare.health/
carrotcare.health
Organise your blood test data | Carrot Care
❤3👍3🔥3🤡1
Cohere released Aya Vision on Hugging Face
Aya Vision outperforms the leading open-weight models in multilingual text generation and image understanding.
In its parameter class, Aya Vision 8B achieves the best performance in combined multilingual multimodal tasks, outperforming Qwen2.5-VL 7B, Gemini Flash 1.5 8B, Llama-3.2 11B Vision, and Pangea 7B by up to 70% win rates on AyaVisionBench and 79% on m-WildVision.
Aya Vision 32B sets a new frontier in multilingual vision open-weights models, outperforming Llama-3.2 90B Vision, Molmo 72B and Qwen2-VL 72B by up to 64% win rates on AyaVisionBench and 72% win rates on m-WildVision.
huggingface.co
Cohere Labs Aya Vision - a CohereLabs Collection
Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages.
🔥4❤3👏2
Commerce Secretary confirms the US Bitcoin Strategic Reserve is likely on the cards:
“A Bitcoin strategic reserve is something the President’s interested in and I think you’re going to see it executed on Friday.”
Trump will unveil the Bitcoin reserve strategy at the White House Crypto Summit. "So Bitcoin is one thing, and then the other currencies, the other crypto tokens, I think, will be treated differently—positively, but differently."
The Pavlovic Today
Howard Lutnick Reveals: Trump to Unveil Bitcoin Reserve Strategy at White House Crypto Summit - The Pavlovic Today
Commerce Secretary Howard Lutnick tells The Pavlovic Today that President Trump will unveil a Bitcoin reserve strategy at the White House Crypto Summit, marking a major shift in U.S. crypto policy.
🔥4❤2👏2
The 2024 Turing Award, the Nobel for Computer Science, goes to the inventors of reinforcement learning.
The work of Andrew Barto and his former PhD student Rich Sutton (famous for his essay The Bitter Lesson) is foundational for ChatGPT post-training, AlphaGo, robotics and more.
awards.acm.org
Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic…
❤3🔥3👏2🦄2
Full list of confirmed attendees for the White House Crypto Summit this Friday with Trump
❤5🔥5👏3
Alibaba released QwQ-32B, a new reasoning model with 32B parameters that rivals cutting-edge reasoning models such as DeepSeek-R1.
HF
ModelScope
Demo
Chat.
Qwen
QwQ-32B: Embracing the Power of Reinforcement Learning
Scaling Reinforcement Learning (RL) has the potential to enhance model performance beyond conventional pretraining and post-training methods. Recent studies have demonstrated that RL can significantly improve…
👍3🔥3❤2
Emirates NBD, a wholly owned bank of the Dubai government, launched the Liv X app on March 6, offering cryptocurrency buying and selling services.
The service is based on the infrastructure of Aquanow and is hosted by Zodia, which is supported by Standard Chartered.
Coindesk
Dubai Government-Owned Bank Emirates NBD Offers Crypto Trading Through Liv X App
Liv is offering its crypto service using infrastructure operated by Aquanow, a digital asset platform licensed by Dubai's VARA.
❤3👍2🔥2🆒2
Today Anthropic submitted their recommendations to the OSTP for the U.S. AI Action Plan
Anthropic predicts powerful AI systems will appear by late 2026 or early 2027, with intellectual abilities matching Nobel Prize winners, able to autonomously handle digital tasks (text, audio, video, internet browsing), reason independently over hours or weeks, and control physical equipment digitally
They recommend stronger national security actions, including government testing of AI models for security risks, stricter export controls on key chips like the H20, and secure communication channels between AI labs and intelligence agencies
They suggest the government build 50 gigawatts of additional power capacity dedicated to AI by 2027, speed up AI adoption across federal agencies, and improve economic data collection to prepare for AI’s impact on jobs and society
Anthropic
Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
👍5🔥4👏2
AI21 launched Jamba 1.6, which it bills as the best open model for private enterprise deployment
AI21’s Jamba outperforms Cohere, Mistral and Llama on key benchmarks, including Arena Hard, and rivals leading closed models while maintaining unmatched speed and quality.
huggingface.co
ai21labs/AI21-Jamba-Large-1.6 · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
👍5❤4👏2
PDF parsing is solved (again). Mistral's new OCR API
— parses 1000-2000 pages for $1
— achieves state-of-the-art results on tables and multilingual documents
— supports structure: images, bounding boxes, scans, equations
90% of the world's organizational data is in PDFs.
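As a back-of-envelope reading of the quoted pricing (1000-2000 pages per $1), the sketch below estimates a cost range for a given page count. The function name and the 50,000-page archive are illustrative assumptions; actual billing may differ.

```python
from typing import Tuple


def ocr_cost_range_usd(
    pages: int,
    pages_per_dollar_low: int = 1000,
    pages_per_dollar_high: int = 2000,
) -> Tuple[float, float]:
    """Return (cheapest, most expensive) cost in USD for the quoted price band."""
    cheapest = pages / pages_per_dollar_high
    most_expensive = pages / pages_per_dollar_low
    return cheapest, most_expensive


# Hypothetical archive of 50,000 pages.
low, high = ocr_cost_range_usd(50_000)
print(f"${low:.2f} to ${high:.2f}")
```

Per page, the quoted band works out to between 0.05 and 0.1 cents.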
mistral.ai
Mistral OCR | Mistral AI
Introducing the world’s best document understanding API.
❤7👍3🥰2
Trump has just signed an Executive Order establishing the Strategic Bitcoin Reserve and U.S. Digital Asset Stockpile
The Reserve will be capitalized with Bitcoin owned by the federal government that was forfeited as part of criminal or civil asset forfeiture proceedings. This means it will not cost taxpayers a dime.
It is estimated that the U.S. government owns about 200,000 bitcoin; however, there has never been a complete audit. The E.O. directs a full accounting of the federal government’s digital asset holdings.
The U.S. will not sell any bitcoin deposited into the Reserve. It will be kept as a store of value. The Reserve is like a digital Fort Knox for the cryptocurrency often called “digital gold.”
Premature sales of bitcoin have already cost U.S. taxpayers over $17 billion in lost value. Now the federal government will have a strategy to maximize the value of its holdings.
The Secretaries of Treasury and Commerce are authorized to develop budget-neutral strategies for acquiring additional bitcoin, provided that those strategies have no incremental costs on American taxpayers.
IN ADDITION, the Executive Order establishes a U.S. Digital Asset Stockpile, consisting of digital assets other than bitcoin forfeited in criminal or civil proceedings.
The government will not acquire additional assets for the Stockpile beyond those obtained through forfeiture proceedings.
The purpose of the Stockpile is responsible stewardship of the government’s digital assets under the Treasury Department.
X (formerly Twitter)
David Sacks (@davidsacks47) on X
Just a few minutes ago, President Trump signed an Executive Order to establish a Strategic Bitcoin Reserve.
The Reserve will be capitalized with Bitcoin owned by the federal government that was forfeited as part of criminal or civil asset forfeiture proceedings.…
🔥5❤2👍2👎1
Google co-founder Larry Page is building a new company called Dynatomics that’s focused on applying AI to product manufacturing
Page is reportedly working with a small group of engineers on AI that can create “highly optimized” designs for objects and then have a factory build them.
Chris Anderson, previously the CTO of Page-backed electric airplane startup Kittyhawk, is running the stealth effort.
The Information
Larry Page Has a New AI Startup
Google co-founder Larry Page has formed a new company, Dynatomics, to upend manufacturing with artificial intelligence. Page and a small group of engineers are working on ways to use large language models to design flying cars and other types of planes—and…
🆒3
a16z introduced the Top 100 Gen AI Consumer Apps
In just 6 months, the consumer AI landscape has shifted—some AI apps surged, others stalled, and a few unexpected players vaulted over the competition.
A few key insights:
• DeepSeek is outpacing competing general assistant LLMs — in growth and engagement
• The AI apps people are willing to pay for diverge from the most popular
• After a year-long plateau, ChatGPT's growth has come roaring back.
• AI video has finally broken through, with some high-quality new players
• So-called “vibecoding” tools are reshaping who can create with AI, not just who can use it
👍5❤4🔥2
Cortical Labs announced the world's first biocomputer
CL1 merges real living neurons with a chip to solve complex problems and redefine research.
Tom's Hardware
World's first 'body in a box' biological computer uses human brain cells with silicon-based computing
Cortical Labs said the CL1 will be available from June, priced at around $35,000.
❤4👍3🔥2
FoundationStereo: A Revolutionary Breakthrough in Stereo Vision
NVIDIA researchers have just introduced FoundationStereo, a groundbreaking foundation model for stereo depth estimation that works right out of the box without any fine-tuning.
Stereo vision enables computer systems to perceive depth and create 3D representations of the world using images from two or more cameras - similar to how humans perceive depth through two eyes.
Code.
This capability is crucial for:
- Robotics and automation
- Autonomous vehicles
- AR/VR applications
- Industrial inspection
- Medical imaging
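The depth perception described above rests on a standard relation for rectified stereo pairs: depth = focal length × baseline / disparity. The sketch below is a generic illustration of that formula, not FoundationStereo's code. The function name and the camera numbers are made up for the example.

```python
def disparity_to_depth(
    disparity_px: float,
    focal_length_px: float,
    baseline_m: float,
) -> float:
    """Convert a pixel disparity to metric depth for a rectified stereo pair."""
    if disparity_px <= 0:
        # Zero disparity corresponds to a point at infinity; negative is invalid.
        raise ValueError("disparity must be positive")
    return focal_length_px * baseline_m / disparity_px


# Hypothetical camera: 700 px focal length, 12 cm baseline, 35 px disparity.
depth_m = disparity_to_depth(disparity_px=35.0, focal_length_px=700.0, baseline_m=0.12)
print(f"{depth_m:.2f} m")
```

Because depth is inversely proportional to disparity, a fixed disparity error costs more accuracy at long range. That is one reason the quality of the stereo matching model matters.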
The Breakthrough
Until now, high-quality depth estimation from low-cost cameras has been a persistent challenge in industrial applications, often requiring investments of $10,000+ in specialized structured-light scanners.
FoundationStereo changes the game by:
- Zero-shot generalization: The model works across diverse scenarios without domain-specific fine-tuning
- Handling challenging cases: Exceptional performance on transparent objects, reflective surfaces, and complex lighting conditions
- Leveraging monocular priors: Adapting rich geometric priors from vision foundation models
- Massive training dataset: Built on a 1-million stereo pair synthetic dataset with unprecedented diversity and photorealism
Technical Innovation
The researchers developed several novel components:
- Side-Tuning Adapter that leverages internet-scale knowledge from monocular depth estimation models
- Attentive Hybrid Cost Filtering with 3D Axial-Planar Convolution
- Disparity Transformer for long-range context reasoning
arXiv.org
FoundationStereo: Zero-Shot Stereo Matching
Tremendous progress has been made in deep stereo matching to excel on benchmark datasets through per-domain fine-tuning. However, achieving strong zero-shot generalization - a hallmark of...
🔥4❤2👏1