LLMs Cannot Find Reasoning Errors, but They Can Correct Them!
#llms #llmmistakefinding #llmoutputcorrection #bigbenchmistake #chainofthought #nlp #selfconsistency #zeroshotprompting
https://hackernoon.com/llms-cannot-find-reasoning-errors-but-they-can-correct-them
In this paper, we break down the self-correction process into two core components: mistake finding and output correction.
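A minimal sketch of that two-stage split, assuming a generic `call_llm` completion helper and illustrative prompt wording (placeholders, not the paper's actual prompts):

```python
# Minimal sketch: self-correction split into (1) mistake finding and (2) output
# correction. `call_llm` is a hypothetical stand-in for any completion API.
from typing import Optional

def call_llm(prompt: str) -> str:
    raise NotImplementedError("wire this to your LLM client")

def find_mistake(question: str, steps: list[str]) -> Optional[int]:
    """Stage 1: ask the model for the index of the first incorrect step, if any."""
    trace = "\n".join(f"Thought {i + 1}: {s}" for i, s in enumerate(steps))
    reply = call_llm(
        f"{question}\n\n{trace}\n\n"
        "Which thought, if any, contains the first logical mistake? "
        "Answer with a thought number or 'none'."
    )
    digits = "".join(c for c in reply if c.isdigit())
    return int(digits) if digits else None

def correct_output(question: str, steps: list[str], mistake_at: int) -> str:
    """Stage 2: keep the steps before the mistake and regenerate from there."""
    kept = "\n".join(f"Thought {i + 1}: {s}" for i, s in enumerate(steps[:mistake_at - 1]))
    return call_llm(f"{question}\n\n{kept}\nThought {mistake_at}:")
```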
Backtracking: Why We Replaced External Feedback With a Lightweight Classifier
#llms #lightweightclassifier #externalfeedback #cottrace #llmbacktracking #bigbenchmistake #rewardmodeling #generatormodel
https://hackernoon.com/backtracking-why-we-replaced-external-feedback-with-a-lightweight-classifier
We propose a simple backtracking method to improve model outputs based on the location of logical errors. Backtracking reduces the computational cost
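A rough sketch of backtracking under that framing, assuming a small step-level classifier `step_is_correct` (standing in for a trained reward model) and a `generate_step` sampler; both are placeholders rather than the paper's code:

```python
# Backtracking sketch: the first step flagged as incorrect is discarded along
# with everything after it, and the generator re-samples from that point at a
# higher temperature. `step_is_correct` and `generate_step` are assumed helpers.
def step_is_correct(question: str, partial_trace: list[str]) -> bool:
    raise NotImplementedError("lightweight classifier / reward model goes here")

def generate_step(question: str, partial_trace: list[str], temperature: float) -> str:
    raise NotImplementedError("generator model call goes here")

def backtrack(question: str, trace: list[str], max_new_steps: int = 10) -> list[str]:
    for i in range(len(trace)):
        if not step_is_correct(question, trace[: i + 1]):
            new_trace = trace[:i]  # keep only the steps before the flagged one
            for _ in range(max_new_steps):
                step = generate_step(question, new_trace, temperature=1.0)
                new_trace.append(step)
                if "answer is" in step.lower():  # crude stop heuristic for the sketch
                    break
            return new_trace
    return trace  # no mistake flagged; keep the original trace
```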
What Are the Benchmark Results of GPT-4-Turbo, GPT-4, and GPT-3.5-Turbo?
#llms #gptbenchmarkresults #bigbenchmistake #directtracelevelprompting #cotsteplevelprompting #directsteplevelprompting #llmoutputcorrection #usingllmstofindmistakes
https://hackernoon.com/what-are-the-benchmark-results-of-gpt-4-turbo-gpt4-and-gpt-35-turbo
All models are given the same 3-shot prompts. We use three different prompting methods. Direct trace-level prompting involves using the whole trace as input
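The three formats can be sketched roughly as follows; the prompt wording here is illustrative only and not the 3-shot prompts used in the paper:

```python
# Sketch of the three mistake-finding prompt formats: the whole trace at once
# (direct trace-level), or one step at a time with / without chain of thought.
def _render(question: str, steps: list[str], upto: int | None = None) -> str:
    shown = steps if upto is None else steps[:upto]
    trace = "\n".join(f"Thought {i + 1}: {s}" for i, s in enumerate(shown))
    return f"{question}\n{trace}"

def direct_trace_level(question: str, steps: list[str]) -> str:
    return (_render(question, steps)
            + "\nIs there a mistake? If so, which thought is the first incorrect one?")

def direct_step_level(question: str, steps: list[str], k: int) -> str:
    return _render(question, steps, upto=k) + f"\nIs Thought {k} correct? Answer yes or no."

def cot_step_level(question: str, steps: list[str], k: int) -> str:
    return (_render(question, steps, upto=k)
            + f"\nThink step by step about whether Thought {k} is correct, "
              "then answer yes or no.")
```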
BIG-Bench Mistake: What Is It?
#llms #bigbenchmistake #cotstyletraces #automatedannotation #dycklanguages #bigbenchdatasets #humanannotation #llmmistakefinding
https://hackernoon.com/big-bench-mistake-what-is-it
BIG-Bench Mistake consists of 2186 sets of CoT-style traces. Each trace was generated by PaLM 2-L-Unicorn.
Our Annotations Guide for BIG-Bench Mistake
#llms #bigbenchmistake #multisteparithmetic #cotsteplevelprompting #bigbenchdatasets #whatisbigbenchmistake #usingllmstocorrecterrors #canllmsfindmistakes
https://hackernoon.com/our-annotations-guide-for-big-bench-mistake
Annotators can click on words to highlight the same word across the trace and the question text. Buttons on the right automatically become inactive
BIG-Bench Mistake: Implementational Details That Are Important
#llms #bigbenchmistake #cotprompting #bigbenchdatasets #cotstyletraces #palm2 #3shotprompting #aiprompts
https://hackernoon.com/big-bench-mistake-implementational-details-that-are-important
We use PaLM 2 L (Unicorn) to generate the traces used in BIG-Bench Mistake. All traces are generated at temperature = 0. We algorithmically append “Thought N:”
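A hedged sketch of that generation loop, assuming a generic `complete(prompt, temperature, stop)` wrapper around whatever model is being queried (it is not PaLM-specific code):

```python
# Trace generation sketch: greedy decoding (temperature = 0) with "Thought N:"
# appended before each step so the model writes the trace one thought at a time.
def generate_trace(prompt: str, complete, max_steps: int = 10) -> list[str]:
    steps: list[str] = []
    context = prompt
    for n in range(1, max_steps + 1):
        context += f"\nThought {n}:"
        step = complete(context, temperature=0.0, stop=["\nThought"]).strip()
        steps.append(step)
        context += " " + step
        if "answer is" in step.lower():  # crude stopping heuristic for the sketch
            break
    return steps
```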
LLMs Can Correct Reasoning Errors! But Not Without Limitations
#llms #bigbenchmistake #cotstyletraces #usingllmstocorrecterrors #rewardmodels #usingllmstofindmistakes #humanannotation #llmbacktracking
https://hackernoon.com/llms-can-correct-reasoning-errors-but-not-without-limitations
In this paper, we describe and release our dataset BIG-Bench Mistake for mistake-finding and propose a backtracking method to correct logical errors.
Using LLMs to Correct Reasoning Mistakes: Related Works That You Should Know About
#llms #bigbenchmistake #bigbenchdatasets #apibasedllms #usingllmstocorrecterrors #reasoncapacitiesofllms #posthoccorrection #selfcorrectionmethods
https://hackernoon.com/using-llms-to-correct-reasoning-mistakes-related-works-that-you-should-know-about
This paper explores few-shot in-context learning methods, which are typically used in real-world applications with API-based LLMs.
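For context, a few-shot (here 3-shot) in-context prompt of the kind used with API-based LLMs can be assembled as below; the exemplar contents are placeholders, not the paper's prompts:

```python
# 3-shot in-context prompt sketch for mistake finding with an API-based LLM.
# The exemplar contents below are placeholders.
EXEMPLARS = [
    {"question": "Example question 1", "trace": "Thought 1: ...", "label": "Thought 2"},
    {"question": "Example question 2", "trace": "Thought 1: ...", "label": "none"},
    {"question": "Example question 3", "trace": "Thought 1: ...", "label": "Thought 1"},
]

def build_3shot_prompt(question: str, trace: str) -> str:
    blocks = [
        f"Q: {ex['question']}\n{ex['trace']}\nFirst mistake: {ex['label']}"
        for ex in EXEMPLARS
    ]
    blocks.append(f"Q: {question}\n{trace}\nFirst mistake:")
    return "\n\n".join(blocks)
```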