Datasets
VideoGaming
First-Person Gameplay
Ego-perspective gameplay where the camera is the player character's view, paired frame-by-frame with the keypress, mouse, and controller inputs that produced it. The first-person framing matches the viewpoint that embodied AI agents and robotics perception systems deploy in.
Hours34.6K+
Games50+
Genres5+
Training Use Cases
✓Action-conditioned video generation and learned world models
✓Embodied agent training and behavioral cloning
✓First-person navigation and spatial reasoning
✓Vision-language-action models bridging visual observation and discrete action
Key Highlights
✓50+ game titles and engines spanning shooter, survival, simulation, immersive sim, and horror
✓Per-frame player input captured alongside the video
✓Both keyboard-and-mouse and controller input streams captured
✓Aim, look, weapon-handle, and environment-interaction events all isolatable in the input stream
Metadata Fields
durationLength of clip in HH:MM:SS
resolutionPixel dimensions (e.g., 1920x1080, 3840x2160)
frame_rateFrames per second (e.g., 24, 30, 60, 120)
contains_audioWhether the clip carries an audio track (boolean)
primary_categoryDominant content category assigned to a video
stylephotorealistic | stylized | retro_pixel | painterly
game_genreshooter | survival | simulation | immersive_sim | horror | other
game_titleGame name and version (e.g., Counter-Strike 2 Build 14189)
input_devicekeyboard_mouse | controller
weapon_or_tool_visibleWhether held weapon/tool is in frame (boolean)