Datasets
VideoRobotics
Device Usage POV
First-person footage of people picking up, navigating, and operating the devices that fill modern daily life, captured close to the user's natural viewpoint. The set covers phone use, tablet handling, laptop work, smart-home interaction, and the moment-to-moment switching between physical objects and on-screen tasks.
Hours2.2K+
Devices12+
Settings15+
Training Use Cases
✓Embodied AI training for AR and wearable systems
✓Egocentric action recognition for device interactions
✓Hand-object interaction modeling with consumer devices
✓UI and screen-context understanding from first-person view
Key Highlights
✓12+ device categories covered including smartphones, tablets, laptops, smart speakers, wearables, and home appliances
✓15+ everyday contexts represented including commute, kitchen, desk, couch, and outdoor settings
✓First-person framing throughout, captured close to the user's working viewpoint
✓Real-world device use rather than scripted demonstrations or product shots
Metadata Fields
durationLength of clip in HH:MM:SS
resolutionPixel dimensions (e.g., 1920x1080, 3840x2160)
frame_rateFrames per second (e.g., 24, 30, 60, 120)
contains_audioWhether the clip carries an audio track (boolean)
primary_categoryDominant content category assigned to a video
device_typesmartphone | tablet | laptop | smart_speaker | wearable | home_appliance
settingcommute | kitchen | desk | couch | bed | outdoor
camera_mounthead_mounted | chest_mounted | glasses
ui_visibleWhether the device's screen is in frame (boolean)