Alibaba Introduced Qwen3
Open-weight Qwen3, latest large language models, including 2 MoE models and 6 dense models, ranging from 0.6B to 235B.
Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro.
Additionally, the small MoE model, Qwen3-30B-A3B, outcompetes QwQ-32B with 10 times of activated parameters, and even a tiny model like Qwen3-4B can rival the performance of Qwen2.5-72B-Instruct.
Trained on 36T tokens, covering 119 languages! Data extracted from PDFs, synthetic data, etc.
Thinking and non-thinking modes
Improved agentic, coding capabilities, support for MCP
Training pipeline similar to DeepSeek R1
Small distilled models, such as Qwen3-4B that can rival the performance of Qwen2.5-72B-Instruct, even a Qwen3-0.6B model
GitHub
HuggingFace
Modelscope
Open-weight Qwen3, latest large language models, including 2 MoE models and 6 dense models, ranging from 0.6B to 235B.
Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro.
Additionally, the small MoE model, Qwen3-30B-A3B, outcompetes QwQ-32B with 10 times of activated parameters, and even a tiny model like Qwen3-4B can rival the performance of Qwen2.5-72B-Instruct.
Trained on 36T tokens, covering 119 languages! Data extracted from PDFs, synthetic data, etc.
Thinking and non-thinking modes
Improved agentic, coding capabilities, support for MCP
Training pipeline similar to DeepSeek R1
Small distilled models, such as Qwen3-4B that can rival the performance of Qwen2.5-72B-Instruct, even a Qwen3-0.6B model
GitHub
HuggingFace
Modelscope
Qwen
Qwen3: Think Deeper, Act Faster
QWEN CHAT GitHub Hugging Face ModelScope Kaggle DEMO DISCORD
Introduction Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models. Our flagship model, Qwen3-235B-A22B, achieves competitive resultsโฆ
Introduction Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models. Our flagship model, Qwen3-235B-A22B, achieves competitive resultsโฆ
๐4โค3๐2
New work on automated prompt engineering for personalized text-to-image generation:
PRISM: Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation
Paper + Code
Prompt engineering for personalized image generation is labor-intensive or requires model-specific tuning, limiting generalization.
Key Idea: PRISM uses VLMs and iterative in-context learning to automatically generate effective, human-readable prompts using only black-box access to image generation models.
This approach shows strong generalization and versatility in generating accurate prompts for objects, styles and images across multiple T2I models, including Stable Diffusion, DALL-E, and Midjourney. It also enables easy editing and multi-concept prompt generation.
PRISM: Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation
Paper + Code
Prompt engineering for personalized image generation is labor-intensive or requires model-specific tuning, limiting generalization.
Key Idea: PRISM uses VLMs and iterative in-context learning to automatically generate effective, human-readable prompts using only black-box access to image generation models.
This approach shows strong generalization and versatility in generating accurate prompts for objects, styles and images across multiple T2I models, including Stable Diffusion, DALL-E, and Midjourney. It also enables easy editing and multi-concept prompt generation.
kellyyutonghe.github.io
PRISM: Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation
We propose an algorithm that automatically identifies human-interpretable and transferable prompts that can effectively generate desired concepts given only black-box access to T2I models.
BCG_AI_Agents_MCP_1745919815.pdf
22.8 MB
BCG ๐ฑ๐ฟ๐ผ๐ฝ๐ฝ๐ฒ๐ฑ ๐๐ต๐ฒ๐ถ๐ฟ ๐น๐ฎ๐๐ฒ๐๐ ๐ฃ๐ข๐ฉ ๐ผ๐ป ๐๐ ๐๐ด๐ฒ๐ป๐๐ ๐ฎ๐ป๐ฑ ๐๐ต๐ฒ ๐ ๐ผ๐ฑ๐ฒ๐น ๐๐ผ๐ป๐๐ฒ๐
๐ ๐ฃ๐ฟ๐ผ๐๐ผ๐ฐ๐ผ๐น (๐ ๐๐ฃ)
๐๐ฒ๐ฟ๐ฒ ๐ฎ๐ฟ๐ฒ ๐ธ๐ฒ๐ ๐๐ฎ๐ธ๐ฒ๐ฎ๐๐ฎ๐๐:
1. ๐๐๐๐ผ๐ป๐ผ๐บ๐ผ๐๐ ๐๐ด๐ฒ๐ป๐๐ ๐๐ฟ๐ฒ ๐ ๐ผ๐๐ถ๐ป๐ด ๐๐ฟ๐ผ๐บ ๐๐ผ๐ป๐ฐ๐ฒ๐ฝ๐ ๐๐ผ ๐ฅ๐ฒ๐ฎ๐น๐ถ๐๐:
โ Early deployments are already delivering 30โ90% improvements in speed, productivity, and cost across coding, compliance, and supply chain domains.
2. ๐ ๐๐ฃ ๐๐ ๐๐ฒ๐ฐ๐ผ๐บ๐ถ๐ป๐ด ๐๐ต๐ฒ ๐๐ฎ๐ฐ๐ธ๐ฏ๐ผ๐ป๐ฒ ๐ผ๐ณ ๐ฆ๐ฐ๐ฎ๐น๐ฎ๐ฏ๐น๐ฒ ๐๐ด๐ฒ๐ป๐๐:
โ The Model Context Protocol (MCP) is the new open standard adopted by Anthropic, OpenAI, Microsoft, Google, and Amazon to expose tools, prompts, and resources reliably.
3. ๐๐ด๐ฒ๐ป๐ ๐๐ป๐๐ฒ๐น๐น๐ถ๐ด๐ฒ๐ป๐ฐ๐ฒ ๐๐ ๐ฃ๐ฟ๐ผ๐ด๐ฟ๐ฒ๐๐๐ถ๐ป๐ด ๐ฅ๐ฎ๐ฝ๐ถ๐ฑ๐น๐:
โ Agents today can automate tasks up to one hour long โ and this limit is doubling every seven months, pushing toward multi-day autonomous workflows by the end of the decade.
4. ๐๐ด๐ฒ๐ป๐ ๐๐ฟ๐ฐ๐ต๐ถ๐๐ฒ๐ฐ๐๐๐ฟ๐ฒ๐ ๐ ๐๐๐ ๐๐ฒ ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐-๐๐ถ๐ฟ๐๐:
โ Security challenges grow as agents gain system access. OAuth, RBAC, permission isolation, eval-driven development, and real-time monitoring are mandatory to deploy agents safely.
5. ๐ง๐ต๐ฒ ๐ฅ๐ถ๐๐ฒ ๐ผ๐ณ ๐๐ด๐ฒ๐ป๐-๐ข๐ฟ๐ฐ๐ต๐ฒ๐๐๐ฟ๐ฎ๐๐ถ๐ผ๐ป ๐ฃ๐น๐ฎ๐๐ณ๐ผ๐ฟ๐บ๐:
โ Platforms like Azure Foundry, Vertex AI, Bedrock Agents, and Lindy are positioning themselves as the orchestration layer to create, manage, and scale enterprise agent ecosystems.
6. ๐๐ฟ๐ผ๐บ ๐ช๐ผ๐ฟ๐ธ๐ณ๐น๐ผ๐๐ ๐๐ผ ๐๐๐น๐น๐ ๐๐๐๐ผ๐ป๐ผ๐บ๐ผ๐๐ ๐๐ด๐ฒ๐ป๐๐:
โ Enterprises are shifting from prompt chaining (rigid workflows) to fully autonomous agents capable of observing, reasoning, and acting dynamically based on real-world feedback.
7. ๐ ๐๐ฃ ๐ฎ๐ป๐ฑ ๐2๐ ๐ช๐ถ๐น๐น ๐๐ฒ๐ณ๐ถ๐ป๐ฒ ๐๐ต๐ฒ ๐๐ด๐ฒ๐ป๐ ๐๐ฐ๐ผ๐ป๐ผ๐บ๐:
โ MCP connects agents to tools and data. A2A (Agent-to-Agent communication) will enable agents to negotiate, collaborate, and coordinate across systems โ forming true multi-agent networks.
๐๐ฒ๐ฟ๐ฒ ๐ฎ๐ฟ๐ฒ ๐ธ๐ฒ๐ ๐๐ฎ๐ธ๐ฒ๐ฎ๐๐ฎ๐๐:
1. ๐๐๐๐ผ๐ป๐ผ๐บ๐ผ๐๐ ๐๐ด๐ฒ๐ป๐๐ ๐๐ฟ๐ฒ ๐ ๐ผ๐๐ถ๐ป๐ด ๐๐ฟ๐ผ๐บ ๐๐ผ๐ป๐ฐ๐ฒ๐ฝ๐ ๐๐ผ ๐ฅ๐ฒ๐ฎ๐น๐ถ๐๐:
โ Early deployments are already delivering 30โ90% improvements in speed, productivity, and cost across coding, compliance, and supply chain domains.
2. ๐ ๐๐ฃ ๐๐ ๐๐ฒ๐ฐ๐ผ๐บ๐ถ๐ป๐ด ๐๐ต๐ฒ ๐๐ฎ๐ฐ๐ธ๐ฏ๐ผ๐ป๐ฒ ๐ผ๐ณ ๐ฆ๐ฐ๐ฎ๐น๐ฎ๐ฏ๐น๐ฒ ๐๐ด๐ฒ๐ป๐๐:
โ The Model Context Protocol (MCP) is the new open standard adopted by Anthropic, OpenAI, Microsoft, Google, and Amazon to expose tools, prompts, and resources reliably.
3. ๐๐ด๐ฒ๐ป๐ ๐๐ป๐๐ฒ๐น๐น๐ถ๐ด๐ฒ๐ป๐ฐ๐ฒ ๐๐ ๐ฃ๐ฟ๐ผ๐ด๐ฟ๐ฒ๐๐๐ถ๐ป๐ด ๐ฅ๐ฎ๐ฝ๐ถ๐ฑ๐น๐:
โ Agents today can automate tasks up to one hour long โ and this limit is doubling every seven months, pushing toward multi-day autonomous workflows by the end of the decade.
4. ๐๐ด๐ฒ๐ป๐ ๐๐ฟ๐ฐ๐ต๐ถ๐๐ฒ๐ฐ๐๐๐ฟ๐ฒ๐ ๐ ๐๐๐ ๐๐ฒ ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐-๐๐ถ๐ฟ๐๐:
โ Security challenges grow as agents gain system access. OAuth, RBAC, permission isolation, eval-driven development, and real-time monitoring are mandatory to deploy agents safely.
5. ๐ง๐ต๐ฒ ๐ฅ๐ถ๐๐ฒ ๐ผ๐ณ ๐๐ด๐ฒ๐ป๐-๐ข๐ฟ๐ฐ๐ต๐ฒ๐๐๐ฟ๐ฎ๐๐ถ๐ผ๐ป ๐ฃ๐น๐ฎ๐๐ณ๐ผ๐ฟ๐บ๐:
โ Platforms like Azure Foundry, Vertex AI, Bedrock Agents, and Lindy are positioning themselves as the orchestration layer to create, manage, and scale enterprise agent ecosystems.
6. ๐๐ฟ๐ผ๐บ ๐ช๐ผ๐ฟ๐ธ๐ณ๐น๐ผ๐๐ ๐๐ผ ๐๐๐น๐น๐ ๐๐๐๐ผ๐ป๐ผ๐บ๐ผ๐๐ ๐๐ด๐ฒ๐ป๐๐:
โ Enterprises are shifting from prompt chaining (rigid workflows) to fully autonomous agents capable of observing, reasoning, and acting dynamically based on real-world feedback.
7. ๐ ๐๐ฃ ๐ฎ๐ป๐ฑ ๐2๐ ๐ช๐ถ๐น๐น ๐๐ฒ๐ณ๐ถ๐ป๐ฒ ๐๐ต๐ฒ ๐๐ด๐ฒ๐ป๐ ๐๐ฐ๐ผ๐ป๐ผ๐บ๐:
โ MCP connects agents to tools and data. A2A (Agent-to-Agent communication) will enable agents to negotiate, collaborate, and coordinate across systems โ forming true multi-agent networks.
๐3๐ฅ2
U.S. Secretary of Commerce Howard Lutnick said the U.S. will accelerate bitcoin mining, support the construction of its own power infrastructure, and reduce reliance on the public power grid.
The U.S. will allow miners to build power plants and data centers near natural gas fields and consider incorporating bitcoin into the national economic account.
The U.S. will allow miners to build power plants and data centers near natural gas fields and consider incorporating bitcoin into the national economic account.
Bitcoin Magazine
U.S. Secretary Of Commerce Howard Lutnick Has A Bitcoin Vision For America
Secretary Lutnick encourages Bitcoin businesses to set up shop in the United States, as he claims that the Trump administration is doing everything in its power to welcome such companies to the U.S. in the wake of the Biden administrationโs hostile treatmentโฆ
The U.K. government released consultation papers on crypto legislation.
It sees the creation of new regulated activities, such as operating a crypto exchange and stablecoin issuance.
It sees the creation of new regulated activities, such as operating a crypto exchange and stablecoin issuance.
GOV.UK
Regulatory regime for cryptoassets (regulated activities) โ Draft SI and Policy Note
A draft of statutory provisions to create new regulated activities for cryptoassets, and an explainer document detailing the intended policy outcomes of these provisions. The government laid the final legislation in Parliament on 15 December 2025.
โค3๐ฅ3๐2
Meta is releasing a standalone mobile app for its ChatGPT competitor, Meta AI.
There's a Discover feed that shows interactions that others (including your IG/FB friends) are having with the assistant. Meta tells the idea is to demystify AI and show โpeople what they can do with it."
OpenAI is working on a similar feed for ChatGPT.
There's a Discover feed that shows interactions that others (including your IG/FB friends) are having with the assistant. Meta tells the idea is to demystify AI and show โpeople what they can do with it."
OpenAI is working on a similar feed for ChatGPT.
The Verge
Metaโs ChatGPT competitor shows how your friends use AI
What if Instagram only showed people talking with AI?
Stanford and Google DeepMind released SWiRL: A synthetic data generation and multi-step RL approach for reasoning and tool use!
With SWiRL, the modelโs capability generalizes to new tasks and tools. For example, a model trained to use a retrieval tool to solve multi-hop knowledge-intensive question answering tasks becomes significantly better at using Python to solve math problems (and vice versa).
As they scale the synthetic data size, the generalization gains continue to improve.
This suggests new possibilities for self improvement, where researchers use the model to synthetically generate data on multi-step tasks in more accessible (or affordable) domains and improve it on other domains.
With SWiRL, the modelโs capability generalizes to new tasks and tools. For example, a model trained to use a retrieval tool to solve multi-hop knowledge-intensive question answering tasks becomes significantly better at using Python to solve math problems (and vice versa).
As they scale the synthetic data size, the generalization gains continue to improve.
This suggests new possibilities for self improvement, where researchers use the model to synthetically generate data on multi-step tasks in more accessible (or affordable) domains and improve it on other domains.
๐ฅ2
Amazon introduces an architecture to migrate from various models to Amazon Nova models using DSPy and its MIPROv2 algorithm.
Amazon
Improve Amazon Nova migration performance with data-aware prompt optimization | Amazon Web Services
In this post, we present an LLM migration paradigm and architecture, including a continuous process of model evaluation, prompt generation using Amazon Bedrock, and data-aware optimization. The solution evaluates the model performance before migration andโฆ
โค3๐3๐2
Xiaomi MiMo-7B: a 7B reasoning model series trained from scratch, outperforms 32B+ baselines on math and code via dense RL
- pretrained on 25T tokens w/ multi-token prediction
- RL rewards from rule-verifiable math/code tasks
- cold-start RL model (MiMo-7B-RL-Zero) hits 93.6% MATH-500, 49.1% LCB v5
- SFTโRL variant matches OpenAI o1-mini
- also open: base + SFT checkpoints
- seamless rollout engine: 2.29ร faster RL training
- vLLM + MTP inference ready
- strong AIME 2025 (55.4%) and LCB v6 (49.3%) results
- pretrained on 25T tokens w/ multi-token prediction
- RL rewards from rule-verifiable math/code tasks
- cold-start RL model (MiMo-7B-RL-Zero) hits 93.6% MATH-500, 49.1% LCB v5
- SFTโRL variant matches OpenAI o1-mini
- also open: base + SFT checkpoints
- seamless rollout engine: 2.29ร faster RL training
- vLLM + MTP inference ready
- strong AIME 2025 (55.4%) and LCB v6 (49.3%) results
huggingface.co
XiaomiMiMo (Xiaomi MiMo)
Org profile for Xiaomi MiMo on Hugging Face, the AI community building the future.
๐4โค3๐2๐ฅ1๐1
Google DeepMind introduced the SAS prompt: LLM as Numerical Optimizers for Robot Self-Improvement
Large language models like Gemini have an inherent ability to problem solve, without needing to retrain for specific jobs.
Robots can use these models to improve how they operate over time, by interacting with the world, and learning from those interactions.
With the SAS prompt, you can now use language models like Gemini to learn from a robot's history.
This allows the model to analyze parameter effects, and suggest ways to improve - similar to a real-life table tennis coach.
Also Google released a dataset of table tennis ball throws, and a simple MuJoCo simulation environment able to replicate trajectories from the real world, with data on specific serves and rallies.
Paper.
Large language models like Gemini have an inherent ability to problem solve, without needing to retrain for specific jobs.
Robots can use these models to improve how they operate over time, by interacting with the world, and learning from those interactions.
With the SAS prompt, you can now use language models like Gemini to learn from a robot's history.
This allows the model to analyze parameter effects, and suggest ways to improve - similar to a real-life table tennis coach.
Also Google released a dataset of table tennis ball throws, and a simple MuJoCo simulation environment able to replicate trajectories from the real world, with data on specific serves and rallies.
Paper.
Google
SAS-Prompt
SAS-Prompt: Large Language Models
as Numerical Optimizers
for Robot Self-Improvement
as Numerical Optimizers
for Robot Self-Improvement
โค3๐ฏ3๐2๐ฅ1
Sam Altman's World Brings Biometric Verification and Digital Payments to US Market
World (formerly Worldcoin) has chosen six key innovation hubs for its American debut: Atlanta, Austin, Los Angeles, Miami, Nashville, and San Francisco. Americans in these cities can now:
1. Verify their unique World ID using the company's advanced biometric technology
2. Access the complete World App experience
3. Claim the Worldcoin (WLD) token airdrop.
The company's signature NVIDIA-powered Orbs โ the biometric verification devices that distinguish humans from AI โ will be available across the USA via standalone World Spaces and partner locations including Razer stores.
Alongside its identity verification system, World has announced the World Card a financial product that connects directly to users' World App wallets, enabling them to spend digital assets anywhere Visa is accepted.
Key features include:
1. Seamless integration with verified human identities through World ID
2. Ability to spend digital assets at over 150 million Visa-accepting locations globally
3. Merchants receive fiat currency without needing to understand crypto.
4. A rewards program specifically optimized for the AI economy, with enhanced rewards on AI subscriptions and services
5. Rewards paid directly in WLD tokens to connected wallets.
World emphasizes that its architecture places Americans in complete control of their digital identity:
- Personal data remains exclusively on users' devices through "Personal Custody"
- Advanced cryptographic systems, including Anonymized Multi-Party Computation and zero-knowledge proofs, ensure data privacy
- Verification of humanity without compromising personal information
World (formerly Worldcoin) has chosen six key innovation hubs for its American debut: Atlanta, Austin, Los Angeles, Miami, Nashville, and San Francisco. Americans in these cities can now:
1. Verify their unique World ID using the company's advanced biometric technology
2. Access the complete World App experience
3. Claim the Worldcoin (WLD) token airdrop.
The company's signature NVIDIA-powered Orbs โ the biometric verification devices that distinguish humans from AI โ will be available across the USA via standalone World Spaces and partner locations including Razer stores.
Alongside its identity verification system, World has announced the World Card a financial product that connects directly to users' World App wallets, enabling them to spend digital assets anywhere Visa is accepted.
Key features include:
1. Seamless integration with verified human identities through World ID
2. Ability to spend digital assets at over 150 million Visa-accepting locations globally
3. Merchants receive fiat currency without needing to understand crypto.
4. A rewards program specifically optimized for the AI economy, with enhanced rewards on AI subscriptions and services
5. Rewards paid directly in WLD tokens to connected wallets.
World emphasizes that its architecture places Americans in complete control of their digital identity:
- Personal data remains exclusively on users' devices through "Personal Custody"
- Advanced cryptographic systems, including Anonymized Multi-Party Computation and zero-knowledge proofs, ensure data privacy
- Verification of humanity without compromising personal information
world.org
World Card: Your Digital Assets, Accepted Anywhere Visa Is
As AI advances, itโs increasingly important to distinguish between humans and bots online.
โค3๐ฅฐ3๐2
Morgan Stanley plans to offer crypto trading to E-Trade clients.
Morgan Stanley is working on a plan to add cryptocurrency trading to its E-Trade platform, in what would be the most significant move by a major US bank to help everyday customers buy into the asset class since the Trump administration began removing regulatory barriers.
The project is nascent and executives envision launching the service sometime next year, according to people familiar with the matter. The firm is considering partnering with one or multiple established crypto firms as it sets up the mechanics for the brokerageโs clients to buy and sell popular tokens including Bitcoin and Ether.
Morgan Stanley is working on a plan to add cryptocurrency trading to its E-Trade platform, in what would be the most significant move by a major US bank to help everyday customers buy into the asset class since the Trump administration began removing regulatory barriers.
The project is nascent and executives envision launching the service sometime next year, according to people familiar with the matter. The firm is considering partnering with one or multiple established crypto firms as it sets up the mechanics for the brokerageโs clients to buy and sell popular tokens including Bitcoin and Ether.
Bloomberg.com
Morgan Stanley Plans to Offer Crypto Trading to E*Trade Clients
Morgan Stanley is working on a plan to add cryptocurrency trading to its E*Trade platform, in what would be the most significant move by a major US bank to help everyday customers buy into the asset class since the Trump administration began removing regulatoryโฆ
โค3๐ฅ3๐2
Microsoft Introduced Phi-4-reasoning, adding reasoning models to the Phi family of SLMs.
The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstration) and Reinforcement Learning.
- Competitive results on reasoning benchmarks with much larger top-tier models up to DeepSeek R1.
- Strong performance on new tests released after data collection (AIME 2025, HMMT).
- Reasoning transfers/generalizes well to new domains even with only SFT (e.g. k-SAT, Mae Solving, Calendar Planning, etc.)
- Retains and often significantly improves general-purpose capabilities (e.g. instruction following).
HuggingFace Phi-4-reasoning
HF Phi-4-reasoning-plus
Hf Phi-4-mini-reasoning
The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstration) and Reinforcement Learning.
- Competitive results on reasoning benchmarks with much larger top-tier models up to DeepSeek R1.
- Strong performance on new tests released after data collection (AIME 2025, HMMT).
- Reasoning transfers/generalizes well to new domains even with only SFT (e.g. k-SAT, Mae Solving, Calendar Planning, etc.)
- Retains and often significantly improves general-purpose capabilities (e.g. instruction following).
HuggingFace Phi-4-reasoning
HF Phi-4-reasoning-plus
Hf Phi-4-mini-reasoning
โค3๐3๐2๐1
Microsoft is getting ready to host Elon Muskโs Grok AI model. Microsoft has been in discussions with xAI to make Grok AI available on Azure's AI Foundry service.
In recent weeks Microsoft has been in discussions with xAI to host the Grok AI model and make it available to customers and Microsoftโs own product teams through the Azure cloud service.
The move could prove controversial internally and further inflame tensions with Microsoftโs partner OpenAI.
In recent weeks Microsoft has been in discussions with xAI to host the Grok AI model and make it available to customers and Microsoftโs own product teams through the Azure cloud service.
The move could prove controversial internally and further inflame tensions with Microsoftโs partner OpenAI.
The Verge
Microsoft is getting ready to host Elon Muskโs Grok AI model
Grok AI might appear on Azure AI Foundry soon
๐4๐3โค2๐ค2
Huawei is building a 7nm fab in Shenzhen for its smartphone and Ascend chips, its first effort to manufacture its own high-end chips.
The Guanlan site is part of a sprawling network of new chip manufacturing sites all working on various elements of Huawei's push to become a semiconductor champion, from equipment to fabrication.
Huawei wasn't considered a serious player in chip manufacturing before it was sanctioned in 2019. The move kickstarted massive investment to localise chip technology, aided by state funds and led by the tech giant. The Guanlan network is part of this effort.
The Guanlan site is part of a sprawling network of new chip manufacturing sites all working on various elements of Huawei's push to become a semiconductor champion, from equipment to fabrication.
Huawei wasn't considered a serious player in chip manufacturing before it was sanctioned in 2019. The move kickstarted massive investment to localise chip technology, aided by state funds and led by the tech giant. The Guanlan network is part of this effort.
Ft
Satellite images reveal Huaweiโs advanced chip production line in China
Rapid expansion of Shenzhen facilities designed to break dependence on foreign technologies
๐5โค3๐1
Cisco's Foundation AI released Foundation-Sec-8B
Built on Llama 3.1, the LLM matches Llama 3.1-70B & GPT-4o-mini on multiple security tasks
It will help with use cases like threat detection, vulnerability assessment, security automation, and more.
Built on Llama 3.1, the LLM matches Llama 3.1-70B & GPT-4o-mini on multiple security tasks
It will help with use cases like threat detection, vulnerability assessment, security automation, and more.
huggingface.co
fdtn-ai/Foundation-Sec-8B ยท Hugging Face
Weโre on a journey to advance and democratize artificial intelligence through open source and open science.
๐4โค3๐2
Carnegie Mellon University started company with only AI employees
They got OpenAI, Gemini, Anthropic, etc models and gave them job roles. They were retarded and costed a fuckton. Claude 3.5 was the best employee and only did 24% of its tasks.
Paper.
They got OpenAI, Gemini, Anthropic, etc models and gave them job roles. They were retarded and costed a fuckton. Claude 3.5 was the best employee and only did 24% of its tasks.
Paper.
Futurism
Professors Staffed a Fake Company Entirely With AI Agents, and You'll Never Guess What Happened
An experiment by researchers at Carnegie Melon University staffed a fake software company with AI Agents, and the results were dismal.
๐3โค2๐ฆ2๐1๐1
Google DeepMind presented Evaluating Frontier Models for Stealth and Situational Awareness:
- 5 evals of ability to reason about and circumvent oversight
- 11 evals for measuring a modelโs ability to instrumentally reason about itself, its environment and its deployment
No SotA model currently shows concerning levels of either capabillity.
- 5 evals of ability to reason about and circumvent oversight
- 11 evals for measuring a modelโs ability to instrumentally reason about itself, its environment and its deployment
No SotA model currently shows concerning levels of either capabillity.
๐ฅ6โค3๐ฅฐ2
Anthropic launched a new "AI for Science" program
Under the initiative, the company will provide up to $20,000 in free API credits (for 6 months) to researchers in โhigh-impactโ scientific fields like drug discovery, genomics, and agriculture
Under the initiative, the company will provide up to $20,000 in free API credits (for 6 months) to researchers in โhigh-impactโ scientific fields like drug discovery, genomics, and agriculture
Anthropic
Introducing Anthropic's AI for Science Program
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
๐ฅฐ3โค2๐2
The world's first $500 Full Body MRI: Ezra has been acquired by Function.
Together, they're introducing the world's first $500 Full Body MRI.
Together, they're introducing the world's first $500 Full Body MRI.
โค3๐ฅ3๐ฆ3