We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over 20 minutes (24,000 actions). Our model uses the native human interface of keypresses and mouse movements, making it quite general, and represents a step towards general computer-using agents.
Back to articles
AIOpenAI News
Learning to play Minecraft with Video PreTraining
We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our mo...
This source only provides an excerpt in its RSS feed. FlowMarket displays all content available from the feed and keeps the original publication link for attribution.
Need an n8n workflow or help installing it?
After the briefing, move to execution: find an n8n template or a creator who can adapt it to your tools.