Below are the best open small language models available right now. Although it gets less visibility than it deserves, StableLM2 1.6B leads here (and note that Gemma 2B is 56% larger!).
Microsoft Copilot for Finance is out in public preview.
The Official Microsoft Blog
Introducing Microsoft Copilot for Finance – the newest Copilot offering in Microsoft 365 designed to transform modern finance
Today we’re announcing the public preview of Microsoft Copilot for Finance, the newest Copilot offering designed for business functions that extends Microsoft Copilot for Microsoft 365 and revolutionizes how finance teams approach their daily work. Copilot…
Elon Musk has filed a lawsuit against OpenAI for breach of contract, breach of fiduciary duty and unfair business practices. He is asking for OpenAI to return to open source and to share all its research for the benefit of humanity.
He's arguing they have already achieved AGI, and are thus outside the scope of the agreement with Microsoft, which only applies to pre-AGI tech.
What can LLMs do in and for games? NPCs, narratives, game mastering, playing, level generation, new game mechanics... player modeling?
Researchers presented "Large Language Models and Games: A Survey and Roadmap".
arXiv.org
Large Language Models and Games: A Survey and Roadmap
Recent years have seen an explosive increase in research on large language models (LLMs), and accompanying public engagement on the topic. While starting as a niche area within natural language...
New Resource: Foundation Model Development Cheatsheet for best practices
250+ resources & tools for:
1. sourcing data
2. documentation & audits
3. environmental impact
4. risks & harms eval
5. release & monitoring
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Presents:
- ArXivCap, a million-scale figure-caption dataset built from arXiv papers
- ArXivQA, a QA dataset generated by prompting GPT-4V with arXiv figures
Paper here.
mm-arxiv.github.io
Multimodal ArXiv
Vision-Language Feedback
Intel's NPU Acceleration Library goes open source — Meteor Lake CPUs can now run TinyLlama and other lightweight LLMs.
Tom's Hardware
Intel's NPU Acceleration Library goes open source — Meteor Lake CPUs can now run TinyLlama and other lightweight LLMs
LLMs on the go.
Anthropic announced the Claude 3 model family
The family includes three state-of-the-art models in ascending order of capability:
Claude 3 Haiku, Claude 3 Sonnet, and Claude 3 Opus.
Each successive model offers increasingly powerful performance, allowing users to select the optimal balance of intelligence, speed, and cost for their specific application.
Anthropic
Introducing the next generation of Claude
Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in ascending order of capability: Claude 3 Haiku, Claude 3 Sonnet, and Claude 3…
ByteDance introduced Diffusion Protein Language Models (DPLM), a new suite of discrete diffusion-based protein language models.
With versatility in both generative and predictive tasks, DPLM is poised to set the new SOTA in protein language models, excelling across a spectrum of benchmark tasks.
arXiv.org
Diffusion Language Models Are Versatile Protein Learners
This paper introduces diffusion protein language model (DPLM), a versatile protein language model that demonstrates strong generative and predictive capabilities for protein sequences. We first...
A new article: "Creative Flow as Optimized Processing: Evidence from Brain Oscillations During Jazz Improvisations by Expert and Non-Expert Musicians."
This is the first neuroimaging study to isolate the neural correlates of the flow experience during a creative production task, in this case, jazz improvisation. Flow is not hyperfocus. It results from an expert brain network plus release of executive control.
Drexel News
Your Brain in the Zone: A New Neuroimaging Study Reveals How the Brain Achieves a Creative Flow State
A new neuroimaging study from Drexel University’s Creativity Research Lab is the first to reveal how the brain gets to the creative flow state.
Very cool data analysis from Paradigm showing the breakdown of Ethereum's state.
ERC20s make up 27% of total state, while ERC721s make up 21.6%. Accounts total 14.1%.
XEN makes up 3.5% of Ethereum's state, which is more than any other single protocol.
Andrew Ng: we will control and steer superhuman AI, so if we want humanity to survive and thrive we should develop AI "as fast as possible"
This is big! A first-of-its-kind supplement clinically proven to slow the effects of aging in dogs is available at LeapYears.com.
The world’s 4 biggest cloud firms (Amazon AWS, Microsoft, Google and Meta) will spend a record US$200 billion on capex in 2025, up from $140 billion last year, according to the Wells Fargo Investment Institute.
MorningStar
AI boom in data centers has top tech companies spending more than major oil companies on capex
By Joy Wiltermuth
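For a sense of scale, here is a minimal sketch of the year-over-year growth implied by the two capex figures quoted in the post. The variable names are illustrative assumptions and do not come from the article; the numbers are the $200 billion and $140 billion totals as stated above.

```python
# Capex totals quoted in the post, in billions of US dollars.
capex_last_year = 140
capex_2025 = 200

# Relative increase from last year's total to the 2025 figure.
growth_pct = (capex_2025 - capex_last_year) / capex_last_year * 100
print(f"Implied capex growth: {growth_pct:.1f}%")
```

On these figures the increase works out to roughly 43% in a single year.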
MindSpeaker BCI has built its “MindSpeaker+MindClick”.
The integrated product concept is designed to improve communication for patients and elderly people with speech disorders via in-ear EEG sensing.
MindSpeaker builds Augmentative and Alternative Communication (AAC) products. This product will address patients with speech paralysis (dysarthria).
What if you and your friends could see through each other’s eyes all at once?
Researchers found that elephantnose fish may do exactly this kind of group sensing with their electrolocation system.
A 7B-parameter DNA language model trained on 2.7M prokaryotic genomes can perform generation and prediction at the DNA, RNA, and protein levels.
bioRxiv
Sequence modeling and design from molecular to genome scale with Evo
The genome is a sequence that completely encodes the DNA, RNA, and proteins that orchestrate the function of a whole organism. Advances in machine learning combined with massive datasets of whole genomes could enable a biological foundation model that accelerates…