Publications
publications in reversed chronological order.
2026
- CVPR
- CVPRFVGen: Scaling 3D Scene Datasets with Certainty-Aware Free-View Generation from Scene Geometry ReconstructionIn CVPR, 2026
- CVPRSemanticVLA: Towards Semantic Reasoning over Action Memorization via Synergistic Explicit Trace and Latent Action PlanningIn CVPR, 2026
- CVPRDo You See What I Am Pointing At? Gesture-Based Egocentric Video Question AnsweringIn CVPR, 2026
- CVPR
- CVPRVideoFocus-R1: Agentic RL with Dynamic Spatio-Temporal Focus for Long Video UnderstandingIn CVPR, 2026
- CVPR
- CVPRColor When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video SensingIn CVPR, 2026