#llm #gpt #cost #best_practice #RAG
RouteLLM: Learning to Route LLMs with Preference Data
https://arxiv.org/pdf/2406.18665
Searching for Best Practices in Retrieval-Augmented Generation
https://arxiv.org/pdf/2407.01219
A Survey on Efficient Inference for Large Language Models
https://arxiv.org/pdf/2404.14294
#vLLM #vs #deepspeed #overview #survey #inference #optimization
#fingpt #rag #llm #gpt
https://arxiv.org/abs/2310.04027v1
#structured_output #vs #outlines #vs #mirascope #vs #instructor #langchain #guidance
https://simmering.dev/blog/structured_output/
https://simmering.dev/blog/openai_structured_output/
#aws #team #sagemaker #genai #inference #better #autoscale #subminute #metrics #cloudwatch
https://aws.amazon.com/about-aws/whats-new/2024/07/amazon-sagemaker-faster-auto-scaling-generative-ai-models/
https://aws.amazon.com/blogs/machine-learning/amazon-sagemaker-inference-launches-faster-auto-scaling-for-generative-ai-models/
arXiv.org
Enhancing Financial Sentiment Analysis via Retrieval Augmented...
Financial sentiment analysis is critical for valuation and investment decision-making. Traditional NLP models, however, are limited by their parameter size and the scope of their training...
#cancer #bacteria_programming #bacteria
https://www.cuimc.columbia.edu/news/hacking-bacteria-attack-cancer
#aws #sagemaker #autoscale #cloudwatch
https://www.youtube.com/watch?v=1B2cRMoPpSk
https://galileo.ai/blog/mastering-agents-langgraph-vs-autogen-vs-crew#:~:text=Autogen%3A%20Autogen%20supports%20human%2Din,flag%20in%20the%20task%20definition.
#crewai #vs #autogen #vs #langgraph ; #ai_agents
galileo.ai
Mastering Agents: LangGraph Vs Autogen Vs Crew AI
Select the best framework for building intelligent AI Agents
ReAct: Synergizing reasoning and acting in language models
https://scholar.google.com/scholar?cites=15164492138064021676&as_sdt=2005&sciodt=0,5&hl=en
Self-refine: Iterative refinement with self-feedback
https://scholar.google.com/scholar?cites=8414000456339217032&as_sdt=2005&sciodt=0,5&hl=en
Communicative agents for software development
https://scholar.google.com/scholar?cites=168100539275365535&as_sdt=2005&sciodt=0,5&hl=en
Code generation with alphacodium: From prompt engineering to flow engineering
https://scholar.google.com/scholar?cites=4650119543966656826&as_sdt=2005&sciodt=0,5&hl=en
#ai_agents
#azure #openai #vs #aws #bedrock #vs #google #vertexai #vertex_ai
https://www.ankursnewsletter.com/p/aws-bedrock-vs-google-vertex-ai-vs
#nvidia #team #qpu
https://techcrunch.com/2024/11/02/quantum-machines-and-nvidia-use-machine-learning-to-get-closer-to-an-error-corrected-quantum-computer/
TechCrunch
Quantum Machines and Nvidia use machine learning to get closer to an error-corrected quantum computer | TechCrunch
About a year and a half ago, quantum control startup Quantum Machines and Nvidia announced a deep partnership that would bring together Nvidia's DGX
#llm #open_ai #o1 #vs #deepseek #kimi
https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf
https://www.youtube.com/watch?v=LYxQbgAUzsQ
https://x.com/deepseek_ai/status/1881318130334814301
https://x.com/DrJimFan/status/1881382618627019050
https://pandaily.com/kimi-k1-5-the-first-non-openai-model-to-match-full-powered-o1-performance/
https://github.com/MoonshotAI/Kimi-k1.5/blob/main/Kimi_k1.5.pdf
GitHub
DeepSeek-R1/DeepSeek_R1.pdf at main · deepseek-ai/DeepSeek-R1
Contribute to deepseek-ai/DeepSeek-R1 development by creating an account on GitHub.
#llm #openai #stem_cells
https://www.technologyreview.com/2025/01/17/1110086/openai-has-created-an-ai-model-for-longevity-science/
https://www.technologyreview.com/2023/03/08/1069523/sam-altman-investment-180-million-retro-biosciences-longevity-death/
https://www.youtube.com/watch?v=D43-YFauw58
MIT Technology Review
OpenAI has created an AI model for longevity science
The company is making a foray into scientific discovery with an AI built to help manufacture stem cells.
Forwarded from HN Best Comments
Re: The Era of 1-bit LLMs: ternary parameters for cost...
Fun to see ternary weights making a comeback. This was hot back in 2016 with BinaryConnect and TrueNorth chip from IBM research (disclosure, I was one of the lead chip architects there).
The authors seem to have missed the history. They should at least cite BinaryConnect or Straight-Through Estimators (not my work).
Helpful hint to authors: you can get down to 0.68 bits / weight using a similar technique, good chance this will work for LLMs too.
https://arxiv.org/abs/1606.01981
This was a passion project of mine in my last few months at IBM research :).
I am convinced there is a deep connection to understanding why backprop is unreasonably effective, and the result that you can train low-precision DNNs. For those not familiar, the technique is to compute the loss with respect to the low-precision parameters (e.g., projected to ternary) but apply the gradient to a high-precision copy of the parameters (known as the straight-through estimator). This is a biased estimator and there is no theoretical underpinning for why this should work, but in practice it works well.
My best guess is that it is encouraging the network to choose good underlying subnetworks to solve the problem, similar to the Lottery Ticket Hypothesis. With ternary weights it is just about which unit connects to which (i.e., a graph), and not about the individual weight values anymore.
paul_mk1, 9 hours ago
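The straight-through estimator described in the comment above can be sketched in a few lines: the forward pass uses weights projected to {-1, 0, +1}, while the gradient (taken with respect to those ternary weights) is applied to a full-precision shadow copy. This is a minimal illustrative NumPy sketch on a toy linear-regression task; the `ternarize` function, threshold, and task setup are assumptions for the demo, not from the cited paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def ternarize(w, threshold=0.05):
    """Project full-precision weights to {-1, 0, +1}."""
    return np.sign(w) * (np.abs(w) > threshold)

# Toy realizable task: the target weights are themselves ternary.
x = rng.normal(size=(256, 8))
w_true = ternarize(rng.normal(size=(8, 1)))
y = x @ w_true

w = rng.normal(scale=0.1, size=(8, 1))  # high-precision shadow weights
lr = 0.01
for _ in range(500):
    w_t = ternarize(w)              # forward pass uses the ternary projection
    err = x @ w_t - y
    grad = x.T @ err / len(x)       # gradient w.r.t. the ternary weights...
    w -= lr * grad                  # ...applied to the high-precision copy (STE)

mse = float(np.mean((x @ ternarize(w) - y) ** 2))
```

Note that the projection is non-differentiable (its true gradient is zero almost everywhere), so passing the gradient "straight through" to the shadow weights is exactly the biased trick the comment refers to.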
arXiv.org
Deep neural networks are robust to weight binarization and other...
Recent results show that deep neural networks achieve excellent performance even when, during training, weights are quantized and projected to a binary representation. Here, we show that this is...