Research

Embracing Single Stride 3D Object Detector with Sparse Transformer (CVPR2022)

2022-03-11

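The code is open-sourced at: https://github.com/TuSimple/SST

Research overview: In autonomous driving scenes, a single object is usually very small relative to the scale of the whole scene. The figure below shows the distribution of relative object scale on the COCO and Waymo datasets.

[Figure: Distribution of relative object scale on COCO and Waymo]

This property is often overlooked by pillar-based or voxel-based detectors, which typically borrow the architecture of mature 2D multi-scale detectors. Motivated by this observation, this paper explores single-stride (no downsampling) detector architectures. Naively converting a convolutional network into a single-stride network yields some performance gain, but it suffers from an insufficient receptive field and a very large computational cost. To obtain an efficient, high-performing single-stride detector, we borrow the architecture of the currently popular Swin Transformer, discard its multi-scale structure, and sparsify it to suit the characteristics of point cloud data. We name the result the Single-stride Sparse Transformer (SST). We conduct extensive experiments on the Waymo Open Dataset, currently the largest 3D detection dataset, examine the properties of SST from many angles, and achieve SoTA performance, with a particularly notable improvement over previous methods on small objects (reaching 83.8 Level 1 AP).

[Figure: Sparse Attention architecture design]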
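The idea of attention that is both sparse and single-stride can be illustrated with a small sketch. This is not the code from the SST repository: the function names (`group_by_window`, `attend`, `sparse_window_attention`), the 2D integer voxel coordinates, the window size, and the use of identity query/key/value projections are all assumptions made for illustration. Only non-empty voxels are stored, attention is computed among voxels that fall in the same window, and the output keeps one feature vector per input voxel, so nothing is downsampled.

```python
import math
from collections import defaultdict


def group_by_window(coords, window=4, shift=0):
    """Map each window index to the list of non-empty voxel indices inside it."""
    groups = defaultdict(list)
    for i, (x, y) in enumerate(coords):
        groups[((x + shift) // window, (y + shift) // window)].append(i)
    return groups


def attend(vectors):
    """Dot-product self-attention with identity Q/K/V (a simplification)."""
    scale = 1.0 / math.sqrt(len(vectors[0]))
    out = []
    for q in vectors:
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in vectors]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append([sum(w * v[d] for w, v in zip(weights, vectors)) / total
                    for d in range(len(q))])
    return out


def sparse_window_attention(feats, coords, window=4, shift=0):
    """Attend only within each window; return one output row per input voxel."""
    out = [None] * len(feats)
    for idx in group_by_window(coords, window, shift).values():
        for i, row in zip(idx, attend([feats[j] for j in idx])):
            out[i] = row
    return out


# Three non-empty voxels: two share a window, one is isolated.
coords = [(0, 0), (1, 1), (10, 10)]
feats = [[1.0, 0.0], [0.0, 1.0], [2.0, 3.0]]
out = sparse_window_attention(feats, coords, window=4)
```

Calling the function a second time with `shift` set to half the window size moves the window boundaries, so voxels separated by a boundary in the first pass can exchange information in the second. This mirrors the shifted-window idea of Swin Transformer in a simplified form.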

Instance Guided Proposal Network for Person Search (CVPR2020, oral)

Zhaoxiang Zhang

2020-06-02

Journal / Conference: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, 2020)
[PDF link: link] [Code link: link]
Keywords: Person Search, Siamese RPN, Relation Block
Abstract: […]

Bi-directional Interaction Network for Person Search (CVPR2020)

Zhaoxiang Zhang

2020-06-02

Journal / Conference: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, 2020)
[PDF link: link] [Code link: link]
Keywords: Person Search, Bi-directional Interaction
Abstract: Existing works […]

Learning Integral Objects with Intra-Class Discriminator for Weakly-Supervised Semantic Segmentation (CVPR2020)

Zhaoxiang Zhang

2020-04-21

Journal / Conference: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, 2020)
[PDF link: PENDING] [Code link: here]
Keywords: Weakly-Supervised Learning, Semantic Segmentation
Abstract: Image-level weakly-supervised […]

CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation (AAAI2020)

Zhaoxiang Zhang

2020-01-21

Journal / Conference: Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI, 2020)
[PDF link: here] [Code link: here]
Keywords: Weakly-Supervised Learning, Semantic Segmentation
Abstract: Weakly supervised semantic segmentation with only image-level […]

Efficient Neural Architecture Transformation Search in Channel-Level for Object Detection (NeurIPS2019)

Zhaoxiang Zhang

2019-10-01

Journal / Conference: Thirty-third Conference on Neural Information Processing Systems (NeurIPS, 2019)
[PDF link: here] [Code link: PENDING]
Keywords: Neural Architecture Transformation Search (NATS), Object Detection
Abstract: Recently, Neural Architecture Search has […]

POD: Practical Object Detection with Scale-Sensitive Network (ICCV2019)

Zhaoxiang Zhang

2019-08-23

Journal / Conference: The IEEE International Conference on Computer Vision (ICCV, 2019)
[PDF link: here] [Code link: Pending]
Keywords: Scale-sensitive object detection, Global Scale Learning module
Abstract: Scale-sensitive object detection remains […]

Sequence Level Semantics Aggregation for Video Object Detection (ICCV2019)

Zhaoxiang Zhang

2019-08-23

Journal / Conference: The IEEE International Conference on Computer Vision (ICCV, 2019)
[PDF link: here] [Code link: here]
Keywords: Video Object Detection, SEquence Level Semantics Aggregation (SELSA)
Abstract: Video object detection […]

Scale-Aware Trident Networks for Object Detection (ICCV2019)

Zhaoxiang Zhang

2019-08-18

Journal / Conference: The IEEE International Conference on Computer Vision (ICCV, 2019)
[PDF link: here] [Code link: here]
Keywords: Object Detection, TridentNet
Abstract: Scale variation is one of the key […]

Spectral Feature Transformation for Person Re-Identification (ICCV2019)

Zhaoxiang Zhang

2019-08-06

Journal / Conference: IEEE International Conference on Computer Vision (ICCV, 2019)
[PDF link: here] [Code link: here]
Keywords: Person Re-ID, Spectral Feature Transformation, Post-processing
Abstract: With the surge of deep learning […]

Zhaoxiang Zhang (张兆翔)
