NeurIPS 2026 · Competition Track

Foundation Agent Challenge

One agent. A thousand games. Train a single model and harness to play across text games, web games, and emulated games. The competition runs on Kaggle; this site is the landing page for rules, resources, and updates.

100 public-crawl games

1,000 private-crawl games

1 model

The Challenge

Can one agent play many games?

Participants submit one self-contained agent: a model of up to 40B parameters plus a harness. The public crawl gives teams a broad training set; the held-out private crawl determines the primary ranking.

Scores are normalized per game, averaged within each runtime category, then averaged across categories with equal weight.
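The two-level averaging above can be sketched in Python. This is a minimal illustration, not the official scorer: the page does not say how per-game normalization is done, so the `normalize` helper assumes a simple rescaling between hypothetical per-game floor and ceiling reference scores, and the function names and category labels are illustrative only.

```python
from collections import defaultdict
from statistics import mean


def normalize(raw, floor, ceiling):
    """Rescale a raw game score so floor maps to 0.0 and ceiling to 1.0.

    ASSUMPTION: the competition's real normalization rule is not stated;
    floor/ceiling reference scores are hypothetical.
    """
    if ceiling == floor:
        raise ValueError("ceiling and floor must differ")
    return (raw - floor) / (ceiling - floor)


def overall_score(results):
    """Combine per-game normalized scores into one number.

    results: iterable of (category, normalized_score) pairs, one per game.
    Returns (overall, per_category_means).
    """
    by_category = defaultdict(list)
    for category, score in results:
        by_category[category].append(score)
    if not by_category:
        raise ValueError("no results to score")

    # Step 1: average within each runtime category.
    per_category = {c: mean(scores) for c, scores in by_category.items()}
    # Step 2: average the category means, each with equal weight.
    overall = mean(list(per_category.values()))
    return overall, per_category


results = [
    ("text", 0.5), ("text", 1.0),
    ("web", 0.2),
    ("emulated", 0.4), ("emulated", 0.6), ("emulated", 0.8),
]
overall, per_category = overall_score(results)
print(per_category, overall)
```

Because categories are weighted equally, a category with few games counts as much as one with many. In the sample data the single web game pulls the overall score to about 0.517, whereas a flat per-game average would give about 0.583.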

Text Games

Interactive fiction, structured text observations, menu actions, and token-level decisions.

Web Games

Browser games rendered through JS and HTML5, controlled with mouse and keyboard actions.

Emulated Games

Console and handheld titles with video-frame observations and emulator button inputs.
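One way to picture a single agent serving all three runtimes is a harness with one entry point that dispatches on the observation type. The sketch below is purely illustrative: every class, field, and method name is hypothetical, the real observation and action formats will come from the official starter kit, and the placeholder policy stands in for a call to the model.

```python
from dataclasses import dataclass
from typing import Tuple, Union

# All names below are hypothetical, chosen only to mirror the three
# runtime descriptions above. They are not the competition's API.


@dataclass
class TextObservation:
    text: str                     # interactive fiction or structured text
    menu: Tuple[str, ...] = ()    # optional menu choices


@dataclass
class WebObservation:
    screenshot: bytes             # rendered browser page


@dataclass
class EmulatorObservation:
    frame: bytes                  # one video frame


@dataclass
class TextAction:
    command: str


@dataclass
class MouseClick:
    x: int
    y: int


@dataclass
class ButtonPress:
    button: str


Observation = Union[TextObservation, WebObservation, EmulatorObservation]
Action = Union[TextAction, MouseClick, ButtonPress]


class PlaceholderAgent:
    """Stand-in policy; a real agent would query its model here."""

    def act(self, obs: Observation) -> Action:
        if isinstance(obs, TextObservation):
            # Pick the first menu item if one exists, else a generic command.
            return TextAction(obs.menu[0] if obs.menu else "look")
        if isinstance(obs, WebObservation):
            return MouseClick(x=0, y=0)
        if isinstance(obs, EmulatorObservation):
            return ButtonPress("A")
        raise TypeError(f"unsupported observation: {type(obs).__name__}")


agent = PlaceholderAgent()
print(agent.act(TextObservation("You are in a maze.", ("north", "south"))))
print(agent.act(EmulatorObservation(frame=b"")))
```

Keeping one `act` method for every runtime reflects the "one agent" framing: the harness adapts inputs and outputs, while the same model sits behind every branch.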

Kaggle Hosted

Kaggle is where the competition happens.

Registration, containers, submissions, and live competition operations will run through Kaggle. Official competition links and materials will be posted here as they become available.

Preview

Built for generalist game-playing agents.

The crawl spans action, platformer, puzzle, RPG, simulation, strategy, adventure, racing, and survival games across multiple runtimes.

Timeline

2026 competition schedule

July 1: Beta launch with public crawl, starter kit, baseline harness, and live leaderboard.
Late August: Hybrid hackathon and official kickoff; rules and crawl freeze.
Sept 1-15: Text games sprint.
Sept 15-Oct 1: Web games sprint.
Oct 1-15: Emulated games sprint.
Nov 15: Final submissions due before private-crawl evaluation and report review.