Edit Content

-
-
Minecraft bot Voyager programs itself with GPT-4

Minecraft bot Voyager programs itself with GPT-4

Minecraft_bot_Voyager_programs_its_self_with_GPT_4

Voyager uses GPT-4 to control a learning Minecraft agent in a pixel world. Instead of reinforcement learning, Voyager relies on code generation.

Researchers from Nvidia, Caltech, UT Austin, Stanford, and ASU present Voyager, the first lifelong learning agent to play Minecraft. Unlike other Minecraft agents that use classic reinforcement learning methods, for example, Voyager uses GPT-4 to continuously improve himself. It does this by writing, improving, and passing code stored in an external skill library.

The result is small programs that help you navigate, open doors, mine resources, make a pickaxe, or fight zombies. "GPT-4 opens up a new paradigm," says researcher Jim Phan of Nvidia, who advised the project. "In this paradigm, 'training' is code execution, and the 'trained model' is the skill base that Voyager iteratively builds."

The Voyager consists of three main components:

  1. An iterative hinting mechanism that includes feedback from the game, execution errors, and self-checking for program improvement.
  2. Skills library with code to store and retrieve complex behaviors.
  3. Automated curriculum for maximum learning.

Agent Voyager Minecraft learns in context

Minecraft Agent learns in an iterative way: Voyager writes a program using GPT-4 to achieve the goal, and uses feedback from the game environment and possible Javascript errors to refine the program using GPT-4. In this way, Voyager gradually builds a library of skills and stores successful programs in a vector database. Complex skills are formed from simpler ones.

To explore Minecraft's diverse world, the team uses an automated learning program that suggests appropriate exploration tasks based on the agent's current skills and the current state of the world. For example, an agent learns to collect sand and cacti in the desert before digging for iron.

Minecraft bot Voyager programs itself with GPT-4

All of this together creates an agent who is constantly learning and can perform various tasks. The team conducts all experiments in the MineDojo environment.

Currently, Voyager can only build homes with human feedback.

The team compares Voyager to other language model-based agents such as ReAct, Reflection, or Auto-GPT in Minecraft. Voyager found 63 different objects in 160 iterations of cues - 3.3 times more than the next best approach, the team says.

Minecraft bot Voyager programs itself with GPT-4

Auto-GPT makes Voyager travel a lot: In general, the Minecraft agent travels more than twice the distance and visits more biomes. Auto-GPT and other methods, on the other hand, often get stuck in their local area.

Minecraft bot Voyager programs itself with GPT-4

The skill library created by Voyager is also compatible with Auto-GPT: the AI agent in Minecraft achieves significantly better results with it, but still lags behind Voyager.

Voyager currently only works with text and cannot see what is happening in the block world. Therefore, it can't build houses. However, in an early experiment, the team used humans to give the agent visual feedback - so Voyager can learn to build houses and portals to Nether, for example.

More information and examples are available on the project page Voyager.

The code is available at GhitHub.

More in the category

and gpt
ChatGPT is a powerful artificial intelligence-based tool that can be an indispensable assistant for programmers. Below are 25 ways,...
OpenAI GPT-4.5 System Card
Translation of the full GPT-4.5 system report into Russian and its conclusions. The development of language models does not stand still:...
sam altman
OpenAI, a leader in artificial intelligence, is once again surprising with innovative plans. In this article, we will cover the latest roadmap update,...
laywer
What is ChatGPT and how does it work? ChatGPT is an artificial intelligence based program. It is able to answer questions...
o3 mini
OpenAI officially launches the new o3-mini artificial intelligence model, which will be available today.
Stable Diffusion 3.5 update
Stability AI recently introduced three new ControlNet models for Stable Diffusion 3.5 Large: Blur, Canny, and Depth. These models, available for...
The new_model_of_text_sound_from_OpenAI_can_be_tried_for_free
Unlike the GPT-4o, the new model is able to build logical chains, analyze tasks sequentially and draw conclusions. This has significantly improved the accuracy of...
Goodbye 3.5! OpenAI introduces GPT-4o mini model
OpenAI has unveiled its latest artificial intelligence model, the GPT-4o mini, which will be the replacement for the GPT-3.5. This model promises to significantly improve the quality of...
gpt4o_i_gpt_store_stali_dostupny_dlya_besplatnyh_polzovatelej
Now everyone can try the coolest OpenAI model and custom GPT bots! Catch the list of the coolest bots created by enthusiasts for all occasions...
OpenAI's newest free model is GPT-4o
OpenAI is releasing a new flagship generative AI model called GPT-4o, which will be "iteratively" deployed in the company's products for developers and...
LLaMa_3_absolutely_free_through_Perplexity_Labs
Meet LLaMa 3 - a text-based neural network whose skills are as good as GPT and even superior in some aspects....
8_ways_to_improve_the_prompt_for_ChatGPT
Ask the ChatGPT to introduce themselves as a specific professional, character, or member of a profession. Example: Take on the role of a lawyer (poet, psychologist, critic,.....