VideoGaming

Open-World Navigation

Long-horizon traversal of large, freely navigable game worlds, with every keypress, mouse movement, and controller input timestamped to microseconds and synchronized to video frames. Sessions cover full traversal arcs, including the open-ended decision-making that drives where the player goes next.

Hours61.2K+

Games100+

Genres4+

Training Use Cases

✓Action-conditioned video generation and learned world models

✓Embodied agent training and behavioral cloning

✓Vision-language-action models bridging visual observation and discrete action

✓Navigation policy learning in 3D environments

Key Highlights

✓100+ game titles and engines covering open-world action, RPG, survival, and exploration

✓Per-frame player input captured alongside the video

✓Both keyboard-and-mouse and controller input streams captured

✓Long, uninterrupted traversal sequences alongside short combat and interaction clips

Metadata Fields

durationLength of clip in HH:MM:SS

resolutionPixel dimensions (e.g., 1920x1080, 3840x2160)

frame_rateFrames per second (e.g., 24, 30, 60, 120)

contains_audioWhether the clip carries an audio track (boolean)

primary_categoryDominant content category assigned to a video

stylephotorealistic | stylized | low_poly | painterly

subgenreaction | rpg | survival | exploration | sandbox

game_titleGame name and version (e.g., Elden Ring 1.10, GTA V Build 3258)

input_devicekeyboard_mouse | controller

traversal_typewalking | vehicle | flight | swimming (one or more values)