Publications
Publishing House of Electronics Industry 2024 (Chinese book). Efficient Deep Learning: Model Compression and Design (《高效深度学习:模型压缩与设计》), available on JD.com
Authors: Yu Wang, Xuefei Ning
ArXiv 2024. E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Authors: Zhihang Yuan*, Yuzhang Shang*, Hanling Zhang, Tongcheng Fang, Rui Xie, Bingxin Xu, Yan Yan, Shengen Yan, Guohao Dai, Yu Wang+
ArXiv 2024. Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
Authors: Yao Teng, Han Shi, Xian Liu, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu+
AAAI 2025. Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning
Authors: Evelyn Zhang, Jiayi Tang, Xuefei Ning, Linfeng Zhang
ArXiv 2024. A Survey on Efficient Inference for Large Language Models
Authors: Zixuan Zhou*, Xuefei Ning*+, Ke Hong*, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai+, Xiao-Ping Zhang, Yuhan Dong, Yu Wang+
ICCAD 2024. Towards Floating Point-Based Attention-Free LLM: Hybrid PIM with Non-Uniform Data Format and Reduced Multiplications
Authors: Lidong Guo*, Zhenhua Zhu*+, Tengxuan Liu, Xuefei Ning, Shiyao Li, Guohao Dai, Huazhong Yang, Wangyang Fu, Yu Wang+
FPGA 2024. FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs
Authors: Shulin Zeng*, Jun Liu*, Guohao Dai+, Xinhao Yang, Tianyu Fu, Hongyi Wang, Wenheng Ma, Hanbo Sun, Shiyao Li, Zixiao Huang, Yadong Dai, Jintao Li, Zehao Wang, Ruoyu Zhang, Kairui Wen, Xuefei Ning, Yu Wang+
DATE 2024. DyPIM: Dynamic-inference-enabled Processing-In-Memory Accelerator
Authors: Tongxin Xie, Tianchen Zhao, Zhenhua Zhu, Xuefei Ning, Bing Li, Guohao Dai, Huazhong Yang, Yu Wang
WACV 2024. TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning
Authors: Shiyao Li, Xuefei Ning+, Shanghang Zhang, Lidong Guo, Tianchen Zhao, Huazhong Yang, Yu Wang+
NeurIPS Workshop 2023. LLM-MQ: Mixed-precision Quantization for Efficient LLM Deployment
Authors: Shiyao Li, Xuefei Ning+, Ke Hong, Tengxuan Liu, Luning Wang, Xiuhong Li, Kai Zhong, Guohao Dai, Huazhong Yang, Yu Wang+
AAAI 2023. Memory-Oriented Structural Pruning for Efficient Image Restoration
Authors: Xiangsheng Shi*, Xuefei Ning*+, Lidong Guo*, Tianchen Zhao, Enshu Liu, Yi Cai, Yuhan Dong, Huazhong Yang, Yu Wang+
AAAI 2023. Ensemble-in-One: Ensemble Learning within Random Gated Networks for Enhanced Adversarial Robustness
Authors: Yi Cai, Xuefei Ning, Huazhong Yang, Yu Wang
DATE 2022 & TCAD 2023. Gibbon: Efficient Co-Exploration of NN Model and Processing-In-Memory Architecture
Authors: Hanbo Sun*, Chenyu Wang*, Zhenhua Zhu, Xuefei Ning+, Guohao Dai, Huazhong Yang, Yu Wang+
TCAD 2022. Exploring the Potential of Low-bit Training of Convolutional Neural Networks
Authors: Kai Zhong, Xuefei Ning, Guohao Dai, Zhenhua Zhu, Tianchen Zhao, Shulin Zeng, Yu Wang+, Huazhong Yang
CVPR 2022. CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance
Authors: Tianchen Zhao, Niansong Zhang, Xuefei Ning, He Wang, Li Yi, Yu Wang
CVPR 2022. FedCor: Correlation-Based Active Client Selection Strategy for Heterogeneous Federated Learning
Authors: Minxue Tang, Xuefei Ning, Yitu Wang, Jingwei Sun, Yu Wang, Hai Li, Yiran Chen+
ECCV 2022. CLOSE: Curriculum Learning On the Sharing Extent Towards Better One-shot NAS
Authors: Zixuan Zhou*, Xuefei Ning*+, Yi Cai, Jiashu Han, Yiping Deng, Yuhan Dong, Huazhong Yang, Yu Wang+
NeurIPS 2022 (Spotlight). TA-GATES: An Encoding Scheme for Neural Network Architectures
Authors: Xuefei Ning*+, Zixuan Zhou*, Junbo Zhao, Tianchen Zhao, Yiping Deng, Changcheng Tang, Shuang Liang, Huazhong Yang, Yu Wang+
Low-Power CV 2022. Hardware Design and Software Practices for Efficient Neural Network Inference
Authors: Yu Wang, Xuefei Ning, Shulin Zeng, Yi Cai, Kaiyuan Guo, Hanbo Sun, Changcheng Tang, Tianyi Lu, Shuang Liang, Tianchen Zhao
ECCV 2020 (Spotlight). DSA: More Efficient Budgeted Pruning via Differentiable Sparsity Allocation
Authors: Xuefei Ning*, Tianchen Zhao*, Wenshuo Li, Peng Lei, Yu Wang, Huazhong Yang