About me

I am currently a Research Assistant Professor at the Department of Computing, The Hong Kong Polytechnic University. I am also a member of the Internet and Mobile Computing Laboratory (IMCL).

Email: zhaorui.zhang@polyu.edu.hk

Address: PQ748, Mong Man Wai Building, PolyU

Research Interests:

I am broadly interested in building and optimizing AI systems (MLSys), from both the system side and the machine learning algorithm side, on a wide range of computing platforms (e.g., distributed, cloud, HPC, IoT, AIoT, and even quantum and photonic platforms) for emerging big data and AI applications. My topics include distributed communication reduction, gradient error constraints, and deep learning compilation optimization.

I am also interested in high-performance computing (HPC), distributed systems, data reduction, cloud computing, and FPGAs.

Opening:

We are currently looking for student interns who would like to work with us in the area of Machine Learning Systems (MLSys). Students who want to spend several months in Hong Kong for their Final Year Project (“毕业设计”) are also welcome to reach out with their CV.

Publications:

Haotian Xu, Zhaorui Zhang, Sheng Di, Benben Liu, Alharthi Khalid, Jiannong Cao, FedFa: A Fully Asynchronous Training Paradigm for Federated Learning, [IJCAI] 2024

Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur, An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression, [IPDPS] 2024

Zhuoran Ji, Zhaorui Zhang, Jiming Xu, Lei Ju, Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs, [PPoPP] 2024

Zhaorui Zhang, Efficient Parameter Update Strategy for Distributed Deep Learning Systems, HKU Theses Online (HKUTO), 2021.

Zhaorui Zhang, Cho-Li Wang, MIPD: An Adaptive Gradient Sparsification Framework for Distributed DNNs Training, IEEE Transactions on Parallel and Distributed Systems, Special Section on Parallel and Distributed Computing Techniques for AI, ML, and DL, 2022. [TPDS]

Zhaorui Zhang, Cho-Li Wang, SaPus: Self-Adaptive Parameter Update Strategy for DNN Training on Multi-GPU Clusters, IEEE Transactions on Parallel and Distributed Systems, 2021, accepted directly in the first round of review. [TPDS]

Zhaorui Zhang, Zhuoran Ji, Cho-Li Wang, Momentum-Driven Adaptive Synchronization Model for Distributed DNN Training on HPC Clusters, Journal of Parallel and Distributed Computing, 2021. [JPDC]

Xuebin Chi, Liping Liu, Yangang Wang, Zhaorui Zhang, et al., Development Report on National High-Performance Computing Environment, Book, published by Science Press, 2018.

Zhaorui Zhang, Xin Y, Liu B, Li WXY, Lee K.H., Ng C.F., Stoyanov D, Cheung RCC, Kwok KW, FPGA-based High-Performance Collision Detection: An Enabling Technique for Image-Guided Robotic Surgery, Frontiers in Robotics and AI, August 2016

Yao Xin, Will X. Y. Li, Zhaorui Zhang, Ray C. C. Cheung, Dong Song, Theodore W. Berger, An Application Specific Instruction Set Processor (ASIP) for Adaptive Filters in Neural Prosthetics, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2015. [TCBB]

Professional Services:

Conference Program Committee:

IEEE Cluster’24: IEEE International Conference on Cluster Computing

HiPC’24: IEEE International Conference on High-Performance Computing, Data, and Analytics

SSDBM’24: International Conference on Scientific and Statistical Database Management

Reviewer:

IEEE Transactions on Parallel and Distributed Systems (TPDS)

IEEE Transactions on Networking (TON)

Neural Computing and Applications (JCR Q1 journal)

Bio:

I received my Ph.D. degree from the System Research Group at the Department of Computer Science, The University of Hong Kong, where I was very fortunate to work with my supervisor, Professor Cho-Li Wang. Before joining HKU, I worked as a research assistant in the Department of Electrical Engineering at City University of Hong Kong. I received my Bachelor’s degree in Computer Science from Xi’an Jiaotong University.

Academic Employment Experiences:

The Hong Kong Polytechnic University

Research Assistant Professor in the Department of Computing

The University of Hong Kong

Postdoctoral Researcher in the Department of Computer Science

1) Optimization for distributed deep learning systems.

2) Communication reduction, gradient compression, and the gradient error constraint for distributed DNN training.

City University of Hong Kong

Research Assistant in the Department of Electrical Engineering

1) Algorithm optimization and acceleration based on the FPGA.

2) Acceleration for collision detection in image-guided robotic surgery based on the FPGA.

Talks:

  1. June 2022: Invited Talk at Shenzhen University: Efficient Parameter Update Strategy for Distributed Deep Learning Systems
  2. Dec. 2021: Invited Talk at the Future Network Theory Lab of Huawei Technologies (HK), on gradient error constraints for distributed deep learning systems

Professional Collaboration:

  1. HKUST: X-GPU cluster, computing resources.
  2. China National Grid: helped maintain the high-performance computing platform and co-authored a book, in collaboration with the Computer Network Information Center of the Chinese Academy of Sciences.
  3. AWS (public cloud): computing resources; helped invite AWS technical staff to give a talk at HKU.

Teaching Experiences:

COMP4442: Service and Cloud Computing, Department of Computing, The Hong Kong Polytechnic University, 2023 Spring Semester, 2024 Spring Semester

COMP7104: The Introduction of the Linux Operating System, Lecturer, The University of Hong Kong

COMP7305: Cloud and Cluster Computing, Teaching Assistant, in the Department of Computer Science, The University of Hong Kong

COMP8301: Advanced Topics in Computer Systems, Teaching Assistant, in the Department of Computer Science, The University of Hong Kong

COMP9301: Systems Design and Implementation, Teaching Assistant, in the Department of Computer Science, The University of Hong Kong


Last updated: Oct. 2022