Edit Content

-
-
OpenAI announced CriticGPT: a new model for improving GPT-4 accuracy

OpenAI announced CriticGPT: a new model for improving GPT-4 accuracy

OpenAI announced CriticGPT_ a new model to improve the accuracy of GPT-4

OpenAI worked out CriticGPT, based on GPT-4, to help human trainers validate the program code generated by ChatGPT. The model analyzes the code and points out potential bugs, making it easier to detect flaws that might have gone unnoticed.

Benefits of CriticGPT

In studies conducted by OpenAI, annotators preferred CriticGPT remarks to human remarks in 63% cases with natural language model errors. This preference is explained by the fact that CriticGPT generated fewer false positives and useless remarks. Working humans and CriticGPT together also resulted in more complete error reports than when humans alone were used. In addition, the use of CriticGPT helped to reduce the level of hallucinations that occurred when the model was running alone.

CriticGPT Training

A large dataset of deliberately introduced errors was used to develop CriticGPT. Experts modified the ChatGPT code by introducing errors and providing examples of feedback, allowing the model to learn how to identify and criticize different types of errors.

New techniques and opportunities

Researchers have developed a new Force Sampling Beam Search (FSBS) technique that helps CriticGPT write more detailed bug reports. This technique allows for adjusting the thoroughness of the problem search and the frequency of non-existent error generation, which can be customized depending on specific tasks.

Interestingly, CriticGPT demonstrated its capabilities not only in code analysis. In experiments, the model identified errors in 24% cases in ChatGPT training data, which were previously considered flawless in human evaluation. These errors were later confirmed by experts.

Limitations and perspectives

Despite its successes, CriticGPT has limitations. The model is trained on relatively short ChatGPT responses, which may not be sufficient to evaluate longer and more complex tasks. Although CriticGPT reduces false positives, false positives cannot be completely eliminated, and human trainers may err on the side of labeling based on false positives. The model is more effective at detecting errors localized to a specific point in the code, whereas errors may be distributed across multiple parts of the response, which presents a challenge for future versions of the model.

Future plans

OpenAI plans to use models like CriticGPT to help trainers evaluate the output of language models, which will improve assessment tools and increase efficiency. However, even with AI, complex tasks can be a challenge for humans.

More in the category

OpenAI GPT-4.5 System Card
Translation of the full GPT-4.5 system report into Russian and its conclusions. The development of language models does not stand still:...
sam altman
OpenAI, a leader in artificial intelligence, is once again surprising with innovative plans. In this article, we will cover the latest roadmap update,...
o3 mini
OpenAI officially launches the new o3-mini artificial intelligence model, which will be available today.
Goodbye 3.5! OpenAI introduces GPT-4o mini model
OpenAI has unveiled its latest artificial intelligence model, the GPT-4o mini, which will be the replacement for the GPT-3.5. This model promises to significantly improve the quality of...
openai_prezentovala_novuyu_model_gpt_5_na_konferenczii_microsoft
At the recent Microsoft conference, OpenAI CEO, Sam Altman, unveiled the long-awaited GPT-5. This event was an important milestone in the...
OpenAI's newest free model is GPT-4o
OpenAI is releasing a new flagship generative AI model called GPT-4o, which will be "iteratively" deployed in the company's products for developers and...
The musical debut of "Sora"_ her music video has become a major topic of discussion online
Fantastic images, animated art and the magic of sound - all this is embodied in a new video clip created with the help of OpenAI neural network for...
OpenAI_announced_the_release_of_GPT_4_Turbo,_an artificial_intelligence model
An improved version of the GPT-4 language model, called GPT-4 Turbo, was introduced at the first OpenAI developer conference. The new model has more...
OpenAI_has_hosted_its_first_conference_for_developers
OpenAI, the developer of ChatGPT, held its first developer conference, in which it unveiled some big news. One of the most interesting announcements was...
The new_model_of_text_sound_from_OpenAI_can_be_tried_for_free
OpenAI has introduced several APIs for developers to use the new speech synthesis model in their projects. One of them is.
ChatGPT_can_now_create_images_OpenAI_announced_new
OpenAI happily announced the release of an update to its generative artificial intelligence system, ChatGPT. On its official blog, the company shared the news of...
Open AI announced - ChatGPT4 Vision
Open AI have made significant changes to their platform, specifically announcing ChatGPT4 Vision. This update will bring new multimodal features that...