publications | Xinyu Gong

2026

MV-S2V: Multi-View Subject-Consistent Video Generation

Ziyang Song, Xinyu Gong, Bangya Liu, and 1 more author

arXiv preprint arXiv:2601.17756, 2026

ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Bangya Liu, Xinyu Gong, Zelin Zhao, and 6 more authors

arXiv preprint arXiv:2512.22854, 2025
CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Zelin Zhao, Xinyu Gong, Bangya Liu, and 5 more authors

arXiv preprint arXiv:2512.19020, 2025

Grounded-instruct-pix2pix: Improving instruction based image editing with automatic target grounding

Artur Shagidanov, Hayk Poghosyan, Xinyu Gong, and 3 more authors

In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024

MMG-Ego4D: Multi-Modal Generalization in Egocentric Action Recognition

Xinyu Gong, Sreyas Mohan, Naina Dhingra, and 4 more authors

In The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023, 2023
WS-iFSD: Weakly Supervised Incremental Few-shot Object Detection Without Forgetting

Xinyu Gong, Li Yin, Juan-Manuel Perez-Rua, and 2 more authors

In Conference on Parsimony and Learning (Proceedings Track), 2023

Auto-X3D: Ultra-efficient video understanding via finer-grained neural architecture search

Yifan Jiang, Xinyu Gong, Junru Wu, and 3 more authors

In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
Sandwich batch normalization: A drop-in replacement for feature distribution heterogeneity

Xinyu Gong, Wuyang Chen, Tianlong Chen, and 1 more author

In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
Deep Architecture Connectivity Matters for Its Convergence: A Fine-Grained Analysis

Wuyang Chen, Wei Huang, Xinyu Gong, and 2 more authors

In NeurIPS 2022, 2022
Unified Implicit Neural Stylization

Zhiwen Fan, Yifan Jiang, Peihao Wang, and 3 more authors

In ECCV 2022, 2022
NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes

Zhiwen Fan, Peihao Wang, Yifan Jiang, and 3 more authors

In ICLR 2023, 2022

Neural architecture search on imagenet in four gpu hours: A theoretically inspired perspective

Wuyang Chen, Xinyu Gong, and Zhangyang Wang

In ICLR 2021, 2021
Understanding and Accelerating Neural Architecture Search with Training-Free and Theory-Grounded Metrics

Wuyang Chen, Xinyu Gong, Junru Wu, and 5 more authors

TPAMI, 2021
Searching for Two-Stream Models in Multivariate Space for Video Recognition

Xinyu Gong, Heng Wang, Zheng Shou, and 3 more authors

In International Conference on Computer Vision (ICCV) 2021, 2021
SAFIN: arbitrary style transfer with self-attentive factorized instance normalization

Aaditya Singh, Shreeshail Hingane, Xinyu Gong, and 1 more author

arXiv preprint arXiv:2105.06129, 2021

AutoSpeech: Neural Architecture Search for Speaker Recognition

Shaojin Ding, Tianlong Chen, Xinyu Gong, and 2 more authors

InterSpeech 2020, 2020
NADS: Neural Architecture Distribution Search for Uncertainty Awareness

Randy Ardywibowo, Shahin Boluki, Xinyu Gong, and 2 more authors

In International Conference on Machine Learning (ICML) 2020, 2020
Autopose: Searching multi-scale branch aggregation for pose estimation

Xinyu Gong, Wuyang Chen, Yifan Jiang, and 5 more authors

arXiv preprint arXiv:2008.07018, 2020

Conditional adversarial generative flow for controllable image synthesis

Rui Liu, Yu Liu, Xinyu Gong, and 2 more authors

In Computer Vision and Pattern Recognition (CVPR), 2019
Enlightengan: Deep light enhancement without paired supervision

Yifan Jiang, Xinyu Gong, Ding Liu, and 6 more authors

TIP, 2019
AutoGAN: Neural Architecture Search for Generative Adversarial Networks

Xinyu Gong, Shiyu Chang, Yifan Jiang, and 1 more author

In IEEE International Conference on Computer Vision (ICCV) 2019, 2019
FasterSeg: Searching for Faster Real-time Semantic Segmentation

Wuyang Chen, Xinyu Gong, Xianming Liu, and 3 more authors

In International Conference on Learning Representations (ICLR) 2020, 2019
EnlightenGAN: Deep light enhancement without paired supervision. arXiv 2019

Yifan Jiang, Xinyu Gong, Ding Liu, and 6 more authors

arXiv preprint arXiv:1906.06972, 2019

Neural Stereoscopic Image Style Transfer

Xinyu Gong, Haozhi Huang, Lin Ma, and 3 more authors

In The European Conference on Computer Vision (ECCV), 2018, pp. 54-69, 2018
Multitarget AOA estimation using wideband LFMCW signal and two receiver antennas

Dongheng Zhang, Ying He, Xinyu Gong, and 3 more authors

IEEE Transactions on Vehicular Technology (TVT), 2018