Shuyang (Kevin) Sun

A soon-to-graduate DPhil (PhD) student at University of Oxford

photo.jpg

I am a DPhil (PhD) student of the Torr Vision Group, University of Oxford, supervised by Professor Philip Torr and Professor Victor Prisacariu. I received my M.Phil. degree from the University of Sydney, where I was with the SIGMA Lab, School of Electrical & Information Engineering, supervised by Professor Wanli Ouyang. I am fortunate to collaborate closely with Weijun Wang, and Liang-Chieh Chen at Google Research, Vladlen Koltun, Philipp Krähenbühl and René Ranftl at Intel ISL. I received a B.Eng. degree in 2016 in Software Engineering from Wuhan University, China. My current focus is on building a comprehensive visual system with a unified perception.

Personal email: kevin.sysun@gmail.com

Work email: kevinsun@robots.ox.ac.uk

news

Jul 11, 2022 Internship started at Google Research.

selected publications

For the full publication list, please refer to my Google Scholar.
  1. clip_as_rnn.jpg
    CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
    Shuyang Sun*, Runjia Li*, Philip Torr, Xiuye Gu, and Siyang Li
    In CVPR, 2024
  2. remax.jpg
    ReMaX: Relaxing for better training on efficient panoptic segmentation
    Shuyang Sun, Weijun Wang, Qihang Yu, Andrew Howard, Philip Torr, and Liang-Chieh Chen
    NeurIPS, 2023
  3. realfake.jpg
    Real-Fake: Effective Training Data Synthesis Through Distribution Matching
    Jianhao Yuan, Jie Zhang, Shuyang Sun, Philip Torr, and Bo Zhao
    ICLR, 2024
  4. synthesis.png
    Is synthetic data from generative models ready for image recognition?
    Ruifei He, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr, Song Bai, and Xiaojuan Qi
    ICLR, spotlight, 2023
  5. transmix.jpg
    TransMix: Attend to Mix for Vision Transformers
    Shuyang Sun*, Jie-Neng Chen*, Ju He, Philip Torr, Alan Yuille, and Song Bai
    CVPR, 2021
  6. vip.png
    Visual Parser: Representing Part-whole Hierarchies with Transformers
    Shuyang Sun, Xiaoyu Yue, Song Bai, and Philip Torr
    arXiv preprint arXiv:2107.05790, 2021
  7. psvit.png
    Vision transformer with progressive sampling
    Xiaoyu Yue*, Shuyang Sun*, Zhanghui Kuang, Meng Wei, Philip Torr, Wayne Zhang, and Dahua Lin
    In ICCV, 2021
  8. fish.png
    Fishnet: A versatile backbone for image, region, and pixel level prediction
    Shuyang Sun, Jiangmiao Pang, Jianping Shi, Shuai Yi, and Wanli Ouyang
    NeurIPS, 2018
  9. off.jpg
    Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
    Shuyang Sun, Zhanghui Kuang, Lu Sheng, Wanli Ouyang, and Wei Zhang
    In CVPR, 2018