What Are the Benchmark Results of GPT-4-Turbo, GPT-4, and GPT-3.5-Turbo?
#llms #gptbenchmarkresults #bigbenchmistake #directtracelevelprompting #cotsteplevelprompting #directsteplevelprompting #llmoutputcorrection #usingllmstofindmistakes
https://hackernoon.com/what-are-the-benchmark-results-of-gpt-4-turbo-gpt4-and-gpt-35-turbo
All models are given the same 3-shot prompts. We compare three prompting methods: direct trace-level prompting, which takes the whole trace as input, along with CoT step-level and direct step-level prompting.
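As a rough illustration of what direct trace-level prompting could look like in practice, the sketch below assembles a 3-shot prompt by showing each example's whole trace followed by its mistake label. The prompt layout, field names, and "first mistaken step" label are assumptions for illustration, not the paper's exact format.

# Hypothetical sketch of direct trace-level prompting: the full CoT trace is shown at
# once and the model is asked to name the first mistaken step, or 'none' if correct.
def build_trace_level_prompt(few_shot_examples, task, trace_steps):
    """Assemble a 3-shot prompt where each example shows a whole trace plus its label."""
    parts = []
    for ex in few_shot_examples:  # expected: 3 examples, per the article
        steps = "\n".join(f"Step {i + 1}: {s}" for i, s in enumerate(ex["steps"]))
        parts.append(
            f"Task: {ex['task']}\n{steps}\n"
            f"First mistaken step (or 'none'): {ex['label']}"
        )
    # The query trace is appended last, with the label left for the model to fill in.
    steps = "\n".join(f"Step {i + 1}: {s}" for i, s in enumerate(trace_steps))
    parts.append(f"Task: {task}\n{steps}\nFirst mistaken step (or 'none'):")
    return "\n\n".join(parts)

In step-level variants, by contrast, the trace would be revealed one step at a time and the model queried after each step, rather than once over the whole trace.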
LLMs Can Correct Reasoning Errors! But Not Without Limitations
#llms #bigbenchmistake #cotstyletraces #usingllmstocorrecterrors #rewardmodels #usingllmstofindmistakes #humanannotation #llmbacktracking
https://hackernoon.com/llms-can-correct-reasoning-errors-but-not-without-limitations
In this paper, we describe and release our dataset BIG-Bench Mistake for mistake-finding and propose a backtracking method to correct logical errors.
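A minimal sketch of the backtracking idea follows; the function names, the generate_step callable, and the stopping condition are illustrative assumptions rather than the paper's API. The point is that once a mistake location is identified, the steps before it are kept and generation resumes from that point.

# Hypothetical backtracking sketch: truncate the trace just before the identified
# mistake and re-sample new steps conditioned on the kept prefix.
def backtrack_and_regenerate(trace_steps, mistake_index, generate_step, max_new_steps=8):
    """Keep steps before the mistake, then regenerate the continuation from there."""
    corrected = list(trace_steps[:mistake_index])  # steps before the first mistake are kept
    for _ in range(max_new_steps):
        next_step = generate_step(corrected)  # e.g. an LLM call conditioned on the prefix
        corrected.append(next_step)
        if next_step.strip().lower().startswith("answer:"):  # assumed stop condition
            break
    return corrected

The mistake index could come from human annotation in BIG-Bench Mistake, from a prompted mistake-finding LLM, or from a reward model, which is what makes mistake-finding accuracy the limiting factor for this kind of correction.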