Latest Posts by the Author
- Uncertain Object Representation for Image-Based 3D Object Perception (TPAMI 2025)
- Bootstrap Masked Visual Modeling via Hard Patch Mining (TPAMI 2025)
- Reconstructive Visual Instruction Tuning (ICLR 2025)
- Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness (ICCV 2025)
- CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-scale Scenes (ICLR 2025)
- MCOP: Multi-UAV Collaborative Occupancy Prediction (ICCV 2025)
- Enhancing End-to-End Autonomous Driving with Latent World Model (ICLR 2025)
- FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes (CVPR 2025)
- FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering (CVPR 2025)
- UIPro: Unleashing Superior Interaction Capability For GUI Agents (ICCV 2025)
- Continual Forgetting for Pre-trained Vision Models (CVPR 2024)
- Large-Scale Object Detection in the Wild with Imbalanced Data Distribution, and Multi-Labels (TPAMI 2024)
- RCL: Reliable Continual Learning for Unified Failure Detection (CVPR 2024)
- Fully Sparse Fusion for 3D Object Detection (TPAMI 2024)
- Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation (AAAI 2024)
- Learnable Graph Matching: A Practical Paradigm for Data Association (TPAMI 2024)
- MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection (ICLR 2024)
- HardMo: A Large-Scale Hardcase Dataset for Motion Capture (CVPR 2024)
- Monocular Occupancy Prediction for Scalable Indoor Scenes (ECCV 2024)
- OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers (ECCV 2024)
