OpenAI unveils CriticGPT, a model that is built to critique GPT4

OpenAI’s newest model will assist it discover and repair the errors made by GPT4.

CriticGPT CriticGPT is based mostly on GPT4 (Picture credit score: OpenAI)

OpenAI not too long ago introduced a new model known as CriticGPT based mostly on GPT4. In contrast to the opposite fashions from the corporate, that are consumer-facing, CriticGPT is designed to “write critiques of ChatGPT responses to assist human trainers spot errors throughout reinforcement studying from human suggestions (RLHF).”

CriticGPT is based mostly on GPT-4 and can assist the human trainers at OpenAI to “catch errors in ChatGPT’s code output.” In accordance to OpenAI, code reviewed by CriticGPT can outperform unreviewed code by 60 per cent. The corporate is at present integrating CriticGPT-like fashions into the RLHF labelling pipeline to help AI trainers in evaluating outputs from superior AI programs.

=metering.exceededMeter.max AND metering.userProperties.premium=’false’ )” amp-access-hide>

OpenAI says fashions like CriticGPT will help make ChatGPT extra correct with refined errors and can even spot errors that people would possibly miss, as fashions change into extra educated.

The method of coaching CriticGPT included modifying ChatGPT-generated code manually and introducing new errors into the code together with pattern suggestions to prepare the model to simply determine widespread and not-so-common errors.

Once more, similar to human ideas, CriticGPT’s ideas will not be at all times right, nevertheless, the mix of the Human+CriticGPT staff is mentioned to outperform unassisted human trainers, and likewise helps trainers write “complete critiques” whereas producing fewer hallucinations.

OpenAI additionally says CriticGPT would possibly unfold real-world errors throughout many components of the reply, and it can’t consider an especially advanced process or response. This new AI model, in accordance to the corporate, will assist human trainers to “produce higher RLHF information for GPT-4,” and OpenAI is additionally planning to scale this work additional.

First uploaded on: 01-07-2024 at 10:27 IST

Source link

OpenAI’s newest model will assist it discover and repair the errors made by GPT4.

Leave a Reply Cancel reply