OpenAI developed CriticGPT, a model based on GPT-4, to help human trainers validate program code generated by ChatGPT. The model analyzes code and flags potential bugs, making it easier to catch flaws that might otherwise go unnoticed.
In OpenAI's studies, annotators preferred CriticGPT's critiques over human-written ones in 63% of cases involving naturally occurring model errors, largely because CriticGPT produced fewer false positives and fewer unhelpful nitpicks. Human trainers working together with CriticGPT also produced more complete error reports than humans working alone, and these human-AI teams hallucinated fewer problems than the model did when running on its own.
CriticGPT was developed using a large dataset of deliberately introduced errors: experts modified ChatGPT-generated code by inserting bugs and wrote example feedback on them, allowing the model to learn to identify and critique different types of errors.
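The tampering pipeline described above pairs bugged code with an expert's write-up of the inserted bug. A minimal sketch of what one such training record might look like (the field names and example code are hypothetical, for illustration only; the actual data format is not public):

```python
from dataclasses import dataclass

@dataclass
class TamperedExample:
    # Hypothetical record structure, not OpenAI's actual schema.
    original_code: str       # code as ChatGPT originally produced it
    tampered_code: str       # the same code with a bug inserted by an expert
    reference_critique: str  # the expert's description of the inserted bug

example = TamperedExample(
    original_code="def mean(xs):\n    return sum(xs) / len(xs)",
    tampered_code="def mean(xs):\n    return sum(xs) / (len(xs) - 1)",
    reference_critique="The denominator should be len(xs), not len(xs) - 1.",
)
```

A critic model can then be trained to reproduce critiques like `reference_critique` when shown only `tampered_code`, which is what lets it learn to spot and describe bugs it has never seen before.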
Researchers also developed a technique called Force Sampling Beam Search (FSBS) that helps CriticGPT write more detailed bug reports. It exposes a trade-off between the thoroughness of the search for problems and the rate at which non-existent errors are hallucinated, which can be tuned for specific tasks.
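The precision-versus-thoroughness trade-off can be pictured as selecting among candidate critiques with a score that rewards both quality and coverage. A minimal sketch, assuming (hypothetically) that candidates are scored as a reward-model score plus a tunable bonus per flagged issue; the reward numbers and issue lists below are toy data, not CriticGPT's actual scoring:

```python
# Toy reward-model scores: precision-oriented, so the critique that adds a
# spurious "style nit" scores lower (hypothetical numbers for illustration).
REWARD = {
    ("off-by-one in loop bound",): 0.9,
    ("off-by-one in loop bound", "unchecked None return"): 0.8,
    ("off-by-one in loop bound", "unchecked None return", "style nit"): 0.4,
}

def select_critique(candidates, a):
    # Pick the candidate maximizing reward + a * (number of flagged issues).
    # A larger `a` favors longer, more thorough critiques at the cost of a
    # higher risk of hallucinated errors; a smaller `a` favors precision.
    return max(candidates, key=lambda c: REWARD[tuple(c)] + a * len(c))

candidates = list(REWARD)
conservative = select_critique(candidates, a=0.0)  # flags 1 issue
balanced = select_critique(candidates, a=0.3)      # flags 2 issues
thorough = select_critique(candidates, a=1.0)      # flags all 3
```

Sweeping `a` is what lets trainers dial the same model from precise-but-terse toward exhaustive-but-noisier critiques, matching the tunable behavior the article describes.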
Interestingly, CriticGPT's capabilities extend beyond code analysis. In experiments, the model identified errors in 24% of ChatGPT training data that human evaluators had previously rated as flawless; these errors were later confirmed by experts.
Despite these successes, CriticGPT has limitations. It was trained on relatively short ChatGPT responses, which may not prepare it to evaluate longer, more complex tasks. Although CriticGPT reduces false positives, it cannot eliminate them entirely, and human trainers may still make labeling mistakes when relying on them. Finally, the model is most effective at detecting errors localized to a single point in the code, whereas real errors may be spread across multiple parts of a response, a challenge left to future versions of the model.
OpenAI plans to use models like CriticGPT to help trainers evaluate language model output, improving assessment tools and efficiency. Even with AI assistance, however, the most complex tasks may remain difficult for human evaluators.