近期论文
查看导师新发文章
(温馨提示:请注意重名现象,建议点开原文通过作者单位确认)
Linxiao Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai (2023). Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing. Thirty-Seventh AAAI Conference on Artificial Intelligence, AAAI 2023, Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence, IAAI 2023, Thirteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2023, Washington, DC, USA, February 7-14, 2023.
Zixuan Ma, Yuyang Jin, Shizhi Tang, Haojie Wang, Wei-Cheng Xue, Jidong Zhai, Wei-Min Zheng (2023). Unified Programming Models for Heterogeneous High-Performance Computers. J. Comput. Sci. Technol..
Sunita Chandrasekaran, Min Si, Jidong Zhai, Lena Oden (2023). Special issue on new trends in high-performance computing: Software systems and applications. Softw. Pract. Exp..
Mingshu Zhai, Jiaao He, Zixuan Ma, Zan Zong, Runqing Zhang, Jidong Zhai (2023). SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization. 2023 USENIX Annual Technical Conference, USENIX ATC 2023, Boston, MA, USA, July 10-12, 2023.
Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang (2023). ReFresh: Reducing Memory Access from Exploiting Stable Historical Embeddings for Graph Neural Network Training. CoRR.
Zixuan Ma, Haojie Wang, Jingze Xing, Liyan Zheng, Chen Zhang, Huanqi Cao, Kezhao Huang, Shizhi Tang, Penghan Wang, Jidong Zhai (2023). PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR. CoRR.
Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Zhiyuan Liu, Peng Zhang, Yuxiao Dong, Jie Tang (2023). GLM-130B: An Open Bilingual Pre-trained Model. The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023.
Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia (2023). EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. 17th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2023, Boston, MA, USA, July 10-12, 2023.
Juncheng Cao, Kaiyuan Rong, Mingshu Zhai, Zeyu Song, Yanyu Ren, Yuxi Zhu, Wentao Han, Jidong Zhai (2023). Critique of \"A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery\" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst..
Zheng Chen, Feng Zhang, Jiawei Guan, Jidong Zhai, Xipeng Shen, Huanchen Zhang, Wentong Shu, Xiaoyong Du (2023). CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression. Proc. ACM Manag. Data.
Chen Zhang, Lingxiao Ma, Jilong Xue, Yining Shi, Ziming Miao, Fan Yang, Jidong Zhai, Zhi Yang, Mao Yang (2023). Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. 17th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2023, Boston, MA, USA, July 10-12, 2023.
Wei Liu, Jiangming Jin, Hao Wu, Yifan Gong, Ziyue Jiang, Jidong Zhai (2022). Zoro: A robotic middleware combining high performance and high reliability. J. Parallel Distributed Comput..
Liyan Zheng, Jidong Zhai, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen (2022). Vapro: performance variance detection and diagnosis for production-run parallel applications. PPoPP ‘22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022.
Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai (2022). UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation. SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, TX, USA, November 13-18, 2022.
Lei Xie, Jidong Zhai, Zhenxing Zhang, Jonathan Allcock, Shengyu Zhang, Yicong Zheng (2022). Suppressing ZZ crosstalk of Quantum computers through pulse and scheduling co-optimization. ASPLOS ‘22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022.
Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du (2022). POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression. IEEE Trans. Parallel Distributed Syst..
Feng Zhang, Yani Liu, Ningxuan Feng, Cheng Yang, Jidong Zhai, Shuhao Zhang, Bingsheng He, Jiazao Lin, Xiao Zhang, Xiaoyong Du (2022). Periodic Weather-Aware LSTM With Event Mechanism for Parking Behavior Prediction. IEEE Trans. Knowl. Data Eng..
Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Jidong Zhai (2022). PerFlow: a domain specific framework for automatic performance analysis of parallel applications. PPoPP ‘22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022.
Qingyu Xu, Feng Zhang, Mingde Zhang, Jidong Zhai, Bingsheng He, Cheng Yang, Shuhao Zhang, Jiazao Lin, Haidi Liu, Xiaoyong Du (2022). Payment behavior prediction on shared parking lots with TR-GCN. VLDB J..
Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shizhi Tang, Lei Xie, Kezhao Huang, Zhihao Jia (2022). OLLIE: Derivation-based Tensor Program Optimizer. CoRR.
Yunquan Zhang, Jidong Zhai, Rajiv Ranjan (2022). Message from the High Performance Computing and Communications 2022 Program Chairs. 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application, HPCC/DSS/SmartCity/DependSys 2022, Hainan, China, December 18-20, 2022.
Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng (2022). Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems. IEEE Trans. Parallel Distributed Syst..
Linxiao Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai (2022). Guiding the PLMs with Semantic Anchors as Intermediate Supervision: Towards Interpretable Semantic Parsing. CoRR.
Jidong Zhai, Min Si, Antonio J. Peña (2022). Guest Editorial. IEEE Trans. Parallel Distributed Syst..
Linxiao Nie, Shulin Cao, Jiaxin Shi, Jiuding Sun, Qi Tian, Lei Hou, Juanzi Li, Jidong Zhai (2022). GraphQ IR: Unifying the Semantic Parsing of Graph Query Languages with One Intermediate Representation. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022.
Linxiao Nie, Shulin Cao, Jiaxin Shi, Qi Tian, Lei Hou, Juanzi Li, Jidong Zhai (2022). GraphQ IR: Unifying Semantic Parsing of Graph Query Language with Intermediate Representation. CoRR.
Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang (2022). GLM-130B: An Open Bilingual Pre-trained Model. CoRR.
Shizhi Tang, Jidong Zhai, Haojie Wang, Lin Jiang, Liyan Zheng, Zhenhao Yuan, Chen Zhang (2022). FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs. PLDI ‘22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13 - 17, 2022.
Jiaao He, Jidong Zhai, Tiago Antunes, Haojie Wang, Fuwen Luo, Shangfeng Shi, Qin Li (2022). FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models. PPoPP ‘22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022.
Jiesong Liu, Feng Zhang, Hourun Li, Dalin Wang, Weitao Wan, Xiaokun Fang, Jidong Zhai, Xiaoyong Du (2022). Exploring Query Processing on CPU-GPU Integrated Edge Device. IEEE Trans. Parallel Distributed Syst..
Zaifeng Pan, Feng Zhang, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du (2022). Exploring Data Analytics Without Decompression on Embedded GPU Systems. IEEE Trans. Parallel Distributed Syst..
Zixuan Ma, Haojie Wang, Guanyu Feng, Chen Zhang, Lei Xie, Jiaao He, Shengqi Chen, Jidong Zhai (2022). Efficiently emulating high-bitwidth computation with low-bitwidth hardware. ICS ‘22: 2022 International Conference on Supercomputing, Virtual Event, June 28 - 30, 2022.
Jidong Zhai, Liyan Zheng, Feng Zhang, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen (2022). Detecting Performance Variance for Parallel Applications Without Source Code. IEEE Trans. Parallel Distributed Syst..
Runxin Zhong, Jiajie Chen, Chen Zhang, Mingshu Zhai, Zeyu Song, Yutian Wang, Wentao Han, Lin Gan, Jidong Zhai (2022). Critique of \"MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization\" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst..
Feng Zhang, Weitao Wan, Chenyang Zhang, Jidong Zhai, Yunpeng Chai, Haixiang Li, Xiaoyong Du (2022). CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases. SIGMOD ‘22: International Conference on Management of Data, Philadelphia, PA, USA, June 12 - 17, 2022.
Zixuan Ma, Jiaao He, Jiezhong Qiu, Huanqi Cao, Yuanwei Wang, Zhenbo Sun, Liyan Zheng, Haojie Wang, Shizhi Tang, Tianyu Zheng, Junyang Lin, Guanyu Feng, Zeqiang Huang, Jie Gao, Aohan Zeng, Jianwei Zhang, Runxin Zhong, Tianhui Shi, Sha Liu, Weimin Zheng, Jie Tang, Hongxia Yang, Xin Liu, Jidong Zhai, Wenguang Chen (2022). BaGuaLu: targeting brain scale pretrained models with over 37 million cores. PPoPP ‘22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022.
Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin (2022). AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures. ASPLOS ‘22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022.
Kezhao Huang, Jidong Zhai, Zhen Zheng, Youngmin Yi, Xipeng Shen (2021). Understanding and bridging the gaps in current GNN performance optimizations. PPoPP ‘21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Virtual Event, Republic of Korea, February 27- March 3, 2021.
Feng Zhang, Jidong Zhai, Xipeng Shen, Dalin Wang, Zheng Chen, Onur Mutlu, Wenguang Chen, Xiaoyong Du (2021). TADOC: Text analytics directly on compression. VLDB J..
Xian-He Sun, Dong Li, Wen-Guang Chen, Tao Li, Jiwu Shu, Bo Wu, Jin Xiong, Jinging Xue, Feng Zhang, Jidong Zhai, Zhiia Zhao (2021). Preface. J. Comput. Sci. Technol..
Haojie Wang, Jidong Zhai, Mingyu Gao, Zixuan Ma, Shizhi Tang, Liyan Zheng, Yuanzhi Li, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia (2021). PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections. 15th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2021, July 14-16, 2021.
Lei Xie, Jidong Zhai, Weimin Zheng (2021). Mitigating Crosstalk in Quantum Computers through Commutativity-Based Instruction Reordering. 58th ACM/IEEE Design Automation Conference, DAC 2021, San Francisco, CA, USA, December 5-9, 2021.
Chen Zhang, Zeyu Song, Haojie Wang, Kaiyuan Rong, Jidong Zhai (2021). HyQuas: hybrid partitioner based quantum circuit simulation system on GPU. ICS ‘21: 2021 International Conference on Supercomputing, Virtual Event, USA, June 14-17, 2021.
Pavan Balaji, Jidong Zhai, Min Si (2021). Guest Editorial. IEEE Trans. Parallel Distributed Syst..
Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du (2021). G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression. CoRR.
Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du (2021). G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression. 37th IEEE International Conference on Data Engineering, ICDE 2021, Chania, Greece, April 19-22, 2021.
Jiaao He, Jiezhong Qiu, Aohan Zeng, Zhilin Yang, Jidong Zhai, Jie Tang (2021). FastMoE: A Fast Mixture-of-Expert Training System. CoRR.
Chen Zhang, Chenggang Zhao, Jiaao He, Shengqi Chen, Liyan Zheng, Kezhao Huang, Wentao Han, Jidong Zhai (2021). Critique of \"Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility\" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst..
Teng Yu, Runxin Zhong, Vladimir Janjic, Pavlos Petoumenos, Jidong Zhai, Hugh Leather, John Thomson (2021). Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors. IEEE Trans. Parallel Distributed Syst..
Feng Zhang, Jidong Zhai, Bo Wu, Bingsheng He, Wenguang Chen, Xiaoyong Du (2021). Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures. IEEE Trans. Knowl. Data Eng..
Feng Zhang, Zheng Chen, Chenyang Zhang, Amelie Chi Zhou, Jidong Zhai, Xiaoyong Du (2021). An Efficient Parallel Secure Machine Learning Framework on GPUs. IEEE Trans. Parallel Distributed Syst..
Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen (2021). AIPerf: Automated machine learning as an AI-HPC benchmark. Big Data Min. Anal..
Hao Wu, Jiangming Jin, Jidong Zhai, Yifan Gong, Wei Liu (2021). Accelerating GPU Message Communication for Autonomous Navigation Systems. IEEE International Conference on Cluster Computing, CLUSTER 2021, Portland, OR, USA, September 7-10, 2021.
Xiongchao Tang, Chen Zhang, Jidong Zhai, Xuehai Qian, Wenguang Chen, Yong Jiang (2021). A Fast Lock for Explicit Message Passing Architectures. IEEE Trans. Computers.
Feng Zhang, Jidong Zhai, Xipeng Shen, Dalin Wang, Zheng Chen, Onur Mutlu, Wenguang Chen, Xiaoyong Du (2020). TADOC: Text Analytics Directly on Compression. CoRR.
Yuyang Jin, Haojie Wang, Teng Yu, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai (2020). ScalAna: Automating Scaling Loss Detection with Graph Analysis. CoRR.
Yuyang Jin, Haojie Wang, Teng Yu, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai (2020). ScalAna: automating scaling loss detection with graph analysis. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2020, Virtual Event / Atlanta, Georgia, USA, November 9-19, 2020.
Feng Zhang, Ningxuan Feng, Yani Liu, Cheng Yang, Jidong Zhai, Shuhao Zhang, Bingsheng He, Jiazao Lin, Xiaoyong Du (2020). PewLSTM: Periodic LSTM with Weather-Aware Gating Mechanism for Parking Behavior Prediction. Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020.
Qingyu Xu, Feng Zhang, Mingde Zhang, Jidong Zhai, Jiazao Lin, Haidi Liu, Xiaoyong Du (2020). Payment Behavior Prediction and Statistical Analysis for Shared Parking Lots. Network and Parallel Computing - 17th IFIP WG 10.3 International Conference, NPC 2020, Zhengzhou, China, September 28-30, 2020, Revised Selected Papers.
Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du (2020). ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs. ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020.
Ziyue Jiang, Yifan Gong, Jidong Zhai, Yu-Ping Wang, Wei Liu, Hao Wu, Jiangming Jin (2020). Message Passing Optimization in Robot Operating System. Int. J. Parallel Program..
Wei Liu, Yifan Gong, Hao Wu, Jidong Zhai, Jiangming Jin (2020). Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications. ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020.
Yuyang Jin, Haojie Wang, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai (2020). Identifying scalability bottlenecks for large-scale parallel programs with graph analysis. PPoPP ‘20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, San Diego, California, USA, February 22-26, 2020.
Tianhui Shi, Mingshu Zhai, Yi Xu, Jidong Zhai (2020). GraphPi: High Performance Graph Pattern Matching through Effective Redundancy Elimination. CoRR.
Tianhui Shi, Mingshu Zhai, Yi Xu, Jidong Zhai (2020). GraphPi: high performance graph pattern matching through effective redundancy elimination. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2020, Virtual Event / Atlanta, Georgia, USA, November 9-19, 2020.
Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi (2020). GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU. PACT ‘20: International Conference on Parallel Architectures and Compilation Techniques, Virtual Event, GA, USA, October 3-7, 2020.
Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du (2020). Enabling Efficient Random Access to Hierarchically-Compressed Data. 36th IEEE International Conference on Data Engineering, ICDE 2020, Dallas, TX, USA, April 20-24, 2020.
Lei Xie, Jidong Zhai, Baodong Wu, Yuanbo Wang, Xingcheng Zhang, Peng Sun, Shengen Yan (2020). Elan: Towards Generic and Efficient Elastic Training for Deep Learning. 40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020, Singapore, November 29 - December 1, 2020.
Xiaoyang Wang, Zhe Zhou, Ping Han, Tong Meng, Guangyu Sun, Jidong Zhai (2020). Edge-Stream: a Stream Processing Approach for Distributed Applications on a Hierarchical Edge-computing System. 5th IEEE/ACM Symposium on Edge Computing, SEC 2020, San Jose, CA, USA, November 12-14, 2020.
Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen (2020). AIPerf: Automated machine learning as an AI-HPC benchmark. CoRR.
Jiaao He, Chenggang Zhao, Jiping Yu, Xinjian Yu, Liyan Zheng, Chenyao Lou, Shizhi Tang, Wentao Han, Jidong Zhai (2019). Student Cluster Competition 2018, Team Tsinghua University: Reproducing performance of multi-physics simulations of the Tsunamigenic 2004 Sumatra megathrust earthquake on the Intel Skylake Architecture. Parallel Comput..
Ningxuan Feng, Feng Zhang, Jiazao Lin, Jidong Zhai, Xiaoyong Du (2019). Statistical Analysis and Prediction of Parking Behavior. Network and Parallel Computing - 16th IFIP WG 10.3 International Conference, NPC 2019, Hohhot, China, August 23-24, 2019, Proceedings.
Xiongchao Tang, Haojie Wang, Xiaosong Ma, Nosayba El-Sayed, Jidong Zhai, Wenguang Chen, Ashraf Aboulnaga (2019). Spread-n-share: improving application performance and cluster throughput with resource-aware job placement. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2019, Denver, Colorado, USA, November 17-19, 2019.
Amelie Chi Zhou, Yao Xiao, Yifan Gong, Bingsheng He, Jidong Zhai, Rui Mao (2019). Privacy Regulation Aware Process Mapping in Geo-Distributed Cloud Data Centers. IEEE Trans. Parallel Distributed Syst..
Xiongchao Tang, Jidong Zhai, Xuehai Qian, Wenguang Chen (2019). pLock: A Fast Lock for Architectures with Explicit Inter-core Message Passing. Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2019, Providence, RI, USA, April 13-17, 2019.
Feng Zhang, Weifeng Liu, Ningxuan Feng, Jidong Zhai, Xiaoyong Du (2019). Performance evaluation and analysis of sparse matrix and graph kernels on heterogeneous processors. CCF Trans. High Perform. Comput..
Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen (2019). HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations. Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2019, Providence, RI, USA, April 13-17, 2019.
Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero (2019). Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications. Int. J. Parallel Program..
Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi (2019). GOPipe: a granularity-oblivious programming framework for pipelined stencil executions on GPU. Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2019, Washington, DC, USA, February 16-20, 2019.
Bin Yang, Xu Ji, Xiaosong Ma, Xiyang Wang, Tianyu Zhang, Xiupeng Zhu, Nosayba El-Sayed, Haidong Lan, Yibo Yang, Jidong Zhai, Weiguo Liu, Wei Xue (2019). End-to-end I/O Monitoring on a Leading Supercomputer. 16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019, Boston, MA, February 26-28, 2019.
Xu Ji, Bin Yang, Tianyu Zhang, Xiaosong Ma, Xiupeng Zhu, Xiyang Wang, Nosayba El-Sayed, Jidong Zhai, Weiguo Liu, Wei Xue (2019). Automatic, Application-Aware I/O Forwarding Resource Allocation. 17th USENIX Conference on File and Storage Technologies, FAST 2019, Boston, MA, February 25-28, 2019.
Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen (2018). Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data. Proceedings of the 32nd International Conference on Supercomputing, ICS 2018, Beijing, China, June 12-15, 2018.
Xiongchao Tang, Jidong Zhai, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen (2018). vSensor: leveraging fixed-workload snippets of programs for performance variance detection. Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2018, Vienna, Austria, February 24-28, 2018.
Ka Cheong Jason Lau, Yuxuan Li, Lei Xie, Qian Xie, Beichen Li, Yu Chen, Guanyu Feng, Jiping Yu, Xinjian Yu, Miao Wang, Wentao Han, Jidong Zhai (2018). Student cluster competition 2017, team Tsinghua University: Reproducing vectorization of the tersoff multi-body potential on the Intel Skylake and NVIDIA Volta architectures. Parallel Comput..
Haojie Wang, Jidong Zhai, Xiongchao Tang, Bowen Yu, Xiaosong Ma, Wenguang Chen (2018). Spindle: Informed Memory Access Monitoring. 2018 USENIX Annual Technical Conference, USENIX ATC 2018, Boston, MA, USA, July 11-13, 2018.
Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero (2018). Network and Parallel Computing - 15th IFIP WG 10.3 International Conference, NPC 2018, Muroran, Japan, November 29 - December 1, 2018, Proceedings. Springer.
Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen (2018). Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights. Proc. VLDB Endow..
Youwei Zhuo, Jinglei Cheng, Qinyi Luo, Jidong Zhai, Yanzhi Wang, Zhongzhi Luan, Xuehai Qian (2018). CSE: Parallel Finite State Machines with Convergence Set Enumeration. 51st Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2018, Fukuoka, Japan, October 20-24, 2018.
Yuwei Hu, Jidong Zhai, Dinghua Li, Yifan Gong, Yuhao Zhu, Wei Liu, Lei Su, Jiangming Jin (2018). BitFlow: Exploiting Vector Parallelism for Binary Neural Networks on CPU. 2018 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2018, Vancouver, BC, Canada, May 21-25, 2018.
Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng, Keqin Li (2018). An Efficient In-Memory Checkpoint Method and its Practice on Fault-Tolerant HPL. IEEE Trans. Parallel Distributed Syst..
Feng Zhang, Heng Lin, Jidong Zhai, Jie Cheng, Dingyi Xiang, Jizhong Li, Yunpeng Chai, Xiaoyong Du (2018). An adaptive breadth-first search algorithm on integrated architectures. J. Supercomput..
Jidong Zhai, Wen-Guang Chen (2018). A vision of post-exascale programming. Frontiers Inf. Technol. Electron. Eng..
Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen (2017). Versapipe: a versatile programming framework for pipelined computing on GPU. Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2017, Cambridge, MA, USA, October 14-18, 2017.
Feng Zhang, Jidong Zhai, Bingsheng He, Shuhao Zhang, Wenguang Chen (2017). Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures. IEEE Trans. Parallel Distributed Syst..
Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng (2017). Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL. Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, TX, USA, February 4-8, 2017.
Heng Lin, Xiongchao Tang, Bowen Yu, Youwei Zhuo, Wenguang Chen, Jidong Zhai, Wanwang Yin, Weimin Zheng (2017). Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores. 2017 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2017, Orlando, FL, USA, May 29 - June 2, 2017.
Feng Zhang, Bo Wu, Jidong Zhai, Bingsheng He, Wenguang Chen (2017). FinePar: irregularity-aware fine-grained workload partitioning on integrated architectures. Proceedings of the 2017 International Symposium on Code Generation and Optimization, CGO 2017, Austin, TX, USA, February 4-8, 2017.
Amelie Chi Zhou, Yifan Gong, Bingsheng He, Jidong Zhai (2017). Efficient process mapping in geo-distributed cloud data centers. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017, Denver, CO, USA, November 12 - 17, 2017.
Shuo Yang, Kai Wu, Yifan Qiao, Dong Li, Jidong Zhai (2017). Algorithm-Directed Crash Consistence in Non-Volatile Memory for HPC. CoRR.
Shuo Yang, Kai Wu, Yifan Qiao, Dong Li, Jidong Zhai (2017). Algorithm-Directed Crash Consistence in Non-volatile Memory for HPC. 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017, Honolulu, HI, USA, September 5-8, 2017.
Jidong Zhai, Wenguang Chen, Weimin Zheng, Keqin Li (2016). Performance Prediction for Large-Scale Parallel Applications Using Representative Replay. IEEE Trans. Computers.
Jidong Zhai, Feng Zhang, Qingwen Li, Wenguang Chen, Weimin Zheng (2016). Characterizing and optimizing TPC-C workloads on large-scale systems using SSD arrays. Sci. China Inf. Sci..
Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen, Weimin Zheng (2016). Building Semi-Elastic Virtual Clusters for Cost-Effective HPC Cloud Resource Provisioning. IEEE Trans. Parallel Distributed Syst..
Haibao Chen, Song Wu, Hai Jin, Wenguang Chen, Jidong Zhai, Yingwei Luo, Xiaolin Wang (2016). A survey of cloud resource management for complex engineering applications. Frontiers Comput. Sci..
Xinliang Wang, Wei Xue, Jidong Zhai, Yangtong Xu, Weimin Zheng, Hai-Xiang Lin (2016). A Fast Tridiagonal Solver for Intel MIC Architecture. 2016 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2016, Chicago, IL, USA, May 23-27, 2016.
Feng Zhang, Jidong Zhai, Wenguang Chen, Bingsheng He, Shuhao Zhang (2015). To Co-run, or Not to Co-run: A Performance Study on Integrated Architectures. 23rd IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, MASCOTS 2015, Atlanta, GA, USA, October 5-7, 2015.
Ikjoon Kim, Jidong Zhai, Yan Li, Wenguang Chen (2015). Optimizing seam carving on multi-GPU systems for real-time content-aware image resizing. J. Supercomput..
Jidong Zhai, Mingliang Liu, Ye Jin, Xiaosong Ma, Wenguang Chen (2015). Automatic Cloud I/O Configurator for I/O Intensive Parallel Applications. IEEE Trans. Parallel Distributed Syst..
Yunyun Jiang, Tian Xiao, Jidong Zhai, Ying Zhao, Wenguang Chen (2015). A Power-Conserving Online Scheduling Scheme for Video Streaming Services. Algorithms and Architectures for Parallel Processing - 15th International Conference, ICA3PP 2015, Zhangjiajie, China, November 18-20, 2015, Proceedings, Part I.
Ikjoon Kim, Jidong Zhai, Yan Li, Wenguang Chen (2014). Optimizing Seam Carving on multi-GPU systems for real-time image resizing. 20th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2014, Hsinchu, Taiwan, December 16-19, 2014.
Jidong Zhai, Jianfei Hu, Xiongchao Tang, Xiaosong Ma, Wenguang Chen (2014). CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression. International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014, New Orleans, LA, USA, November 16-21, 2014.
Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen (2013). Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters. International Conference for High Performance Computing, Networking, Storage and Analysis, SC'13, Denver, CO, USA - November 17 - 21, 2013.
Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen (2013). ACIC: automatic cloud I/O configurator for parallel applications. The 22nd International Symposium on High-Performance Parallel and Distributed Computing, HPDC'13, New York, NY, USA - June 17 - 21, 2013.
Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen (2013). ACIC: automatic cloud I/O configurator for HPC applications. International Conference for High Performance Computing, Networking, Storage and Analysis, SC'13, Denver, CO, USA - November 17 - 21, 2013.
Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Mingliang Liu, Yan Zhai, Wenguang Chen, Weimin Zheng (2012). Employing Checkpoint to Improve Job Scheduling in Large-Scale Systems. Job Scheduling Strategies for Parallel Processing, 16th International Workshop, JSSPP 2012, Shanghai, China, May 25, 2012. Revised Selected Papers.
Mingliang Liu, Jidong Zhai, Yan Zhai, Xiaosong Ma, Wenguang Chen (2011). One optimized I/O configuration per HPC application: leveraging the configurability of cloud. APSys ‘11 Asia Pacific Workshop on Systems, Shanghai, China, July 11-12, 2011.
Jidong Zhai, Tianwei Sheng, Jiangzhou He, Wenguang Chen, Weimin Zheng (2011). Efficiently Acquiring Communication Traces for Large-Scale Parallel Applications. IEEE Trans. Parallel Distributed Syst..
Yan Zhai, Mingliang Liu, Jidong Zhai, Xiaosong Ma, Wenguang Chen (2011). Cloud versus in-house cluster: evaluating Amazon cluster compute instances for running MPI applications. Conference on High Performance Computing Networking, Storage and Analysis - State of the Practice Reports, SC 2011, Seattle, Washington, USA, November 12-18, 2011.
Jidong Zhai, Wenguang Chen, Weimin Zheng (2010). PHANTOM: predicting performance of parallel applications on large-scale parallel machines using a single node. Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2010, Bangalore, India, January 9-14, 2010.
Jin Zhang, Jidong Zhai, Wenguang Chen, Weimin Zheng (2009). Process Mapping for MPI Collective Communications. Euro-Par 2009 Parallel Processing, 15th International Euro-Par Conference, Delft, The Netherlands, August 25-28, 2009. Proceedings.
Wenguang Chen, Jidong Zhai, Jin Zhang, Weimin Zheng (2009). LogGPO: An accurate communication model for performance prediction of MPI programs. Sci. China Ser. F Inf. Sci..
Jidong Zhai, Tianwei Sheng, Jiangzhou He, Wenguang Chen, Weimin Zheng (2009). FACT: fast communication trace collection for parallel applications through program slicing. Proceedings of the ACM/IEEE Conference on High Performance Computing, SC 2009, November 14-20, 2009, Portland, Oregon, USA.