Fold3D: Rethinking and Parallelizing Computational and Communicational Tasks in the Training of Large DNN Models
F. Li, S. Zhao* (corresponding author), Y. Qing, X. Chen, X. Guan, S. Wang, G. Zhang, and H. Cui
IEEE Transactions on Parallel and Distributed Systems 2021 (TPDS '23, accepted)
Publications
ROG: A High Performance and Robust Distributed Training System for Robotic IoT
X. Guan, Z. Sun, S. Deng, X. Chen, S. Zhao* (corresponding author), Z. Zhang, T. Duan, Y. Wang, C. Wu, Y. Cui, L. Zhang, Y. Wu, R. Wang, H. Cui
Proceedings of the 55th ACM/IEEE International Symposium on Microarchitecture (MICRO '22; ACM Reproducible Badge)
NASPipe: High Performance and Reproducible Pipeline Parallel Supernet Training via Causal Synchronous Parallel
S. Zhao, F. Li, X. Chen, T. Shen, L. Chen, S. Wang, G. Zhang, C. Li, H. Cui
Proceedings of the 22nd Architectural Support for Programming Languages and Operating Systems (ASPLOS '22; ACM Reproducible Badge)
A Geography-Based P2P Overlay Network for Fast and Robust Blockchain Systems
H. Qiu, T. Ji, S. Zhao* (corresponding author), X. Chen*, J. Qi, H. Cui, S. Wang
IEEE Transactions on Services Computing 2022 (TSC '22)
BIDL: A High-throughput, Low-latency Permissioned Blockchain Framework for Datacenter Networks
J. Qi, X. Chen, Y. Jiang, J. Jiang, T. Shen, S. Zhao, S. Wang, G. Zhang, L. Chen, M. Au, H. Cui
Proceedings of the 28th ACM Symposium on Operating Systems Principles (SOSP '21)
Efficient and DoS-resistant Consensus for Permissioned Blockchains
X. Chen, S. Zhao, J. Qi, J. Jiang, H. Song, C. Wang, T. O. Li, H. Chan, F. Zhang, X. Luo, S. Wang, G. Zhang, H. Cui
Proceedings of the 39th International Symposium on Computer Performance, Modeling, Measurements and Evaluation 2021 (Performance '21)
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training
S. Zhao, F. Li, X. Chen, X. Guan, J. Jiang, D. Huang, Y. Qing, S. Wang, P. Wang, G. Zhang, C. Li, P. Luo, H. Cui
IEEE Transactions on Parallel and Distributed Systems 2021 (TPDS '21)
DAENet: Making Strong Anonymity Scale in a Fully Decentralized Network
T. Shen, J. Jiang, Y. Jiang, X. Chen, J. Qi, S. Zhao, F. Zhang, X. Luo, H. Cui
IEEE Transactions on Dependable and Secure Computing 2021 (TDSC '21)
HAMS: High Availability for Distributed Machine Learning Service Graphs
S. Zhao, X. Chen, C. Wang, F. Li, J. Qi, H. Cui, C. Li, S. Wang
Proceedings of the 50th IEEE/IFIP International Conference on Dependable Systems and Networks (DSN '20)
Uranus: Simple, Efficient SGX Programming and Its Applications
J. Jiang, X. Chen, T.O. Li, C. Wang, T. Shen, S. Zhao, H. Cui, C.L. Wang, F. Zhang
Proceedings of the 15th ACM ASIA Conference on Computer and Communications Security (ASIA CCS '20 , accepted)
NFVactor: A Resilient NFV System using the Distributed Actor Model
J. Duan, X. Yi, S. Zhao, C. Wu, H. Cui and F. Le
IEEE Journal on Selected Areas in Communications, Feb 2019. PP(99):1-1 (JSAC '19)
OWL: Understanding and Detecting Concurrency Attacks
S. Zhao, R. Gu, H. Qiu, Y. Wang, H. Cui and J. Yang
Proceedings of The 48th IEEE/IFIP International Conference on Dependable Systems and Networks 2018 (DSN '18)
PLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance
C. Wang, X. Chen, W. Jia, H. Qiu, B. Li, S. Zhao and H. Cui
Proceedings of The 15th USENIX Symposium on Networked Systems Design and Implementation 2018 (NSDI '18)
Kakute: A Precise, Unified Information Flow Analysis System for Big-data Security
J. Jiang, S. Zhao, D. Alsayed, Y. Wang, H. Cui, F. Liang and Z. Gu
Proceedings of the Annual Computer Security Applications Conference 2017 (ACSAC '17). Best paper award.