Datasets
VideoRobotics

Device Usage POV

First-person footage of people picking up, navigating, and operating the devices that fill modern daily life, captured close to the user's natural viewpoint. The set covers phone use, tablet handling, laptop work, smart-home interaction, and the moment-to-moment switching between physical objects and on-screen tasks.

Hours2.2K+
Devices12+
Settings15+

Training Use Cases

Embodied AI training for AR and wearable systems
Egocentric action recognition for device interactions
Hand-object interaction modeling with consumer devices
UI and screen-context understanding from first-person view
Key Highlights
12+ device categories covered including smartphones, tablets, laptops, smart speakers, wearables, and home appliances
15+ everyday contexts represented including commute, kitchen, desk, couch, and outdoor settings
First-person framing throughout, captured close to the user's working viewpoint
Real-world device use rather than scripted demonstrations or product shots

Metadata Fields

durationLength of clip in HH:MM:SS
resolutionPixel dimensions (e.g., 1920x1080, 3840x2160)
frame_rateFrames per second (e.g., 24, 30, 60, 120)
contains_audioWhether the clip carries an audio track (boolean)
primary_categoryDominant content category assigned to a video
device_typesmartphone | tablet | laptop | smart_speaker | wearable | home_appliance
settingcommute | kitchen | desk | couch | bed | outdoor
camera_mounthead_mounted | chest_mounted | glasses
ui_visibleWhether the device's screen is in frame (boolean)