Follow
Panruo Wu
Title
Cited by
Cited by
Year
Fast discrete distribution clustering using Wasserstein barycenter with sparse support
J Ye, P Wu, JZ Wang, J Li
IEEE Transactions on Signal Processing 65 (9), 2317-2332, 2017
1542017
Wukong: A scalable and locality-enhanced framework for serverless parallel computing
B Carver, J Zhang, A Wang, A Anwar, P Wu, Y Cheng
Proceedings of the 11th ACM symposium on cloud computing, 1-15, 2020
1352020
Investigating half precision arithmetic to accelerate dense linear system solvers
A Haidar, P Wu, S Tomov, J Dongarra
Proceedings of the 8th workshop on latest advances in scalable algorithms …, 2017
842017
PLASMA: Parallel linear algebra software for multicore using OpenMP
J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, P Wu, I Yamazaki, ...
ACM Transactions on Mathematical Software (TOMS) 45 (2), 1-35, 2019
712019
FT-ScaLAPACK: Correcting soft errors on-line for ScaLAPACK Cholesky, QR, and LU factorization routines
P Wu, Z Chen
Proceedings of the 23rd international symposium on High-performance parallel …, 2014
652014
Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach
D Li, Z Chen, P Wu, JS Vetter
Proceedings of the International Conference on High Performance Computing …, 2013
582013
New-sum: A novel online abft scheme for general iterative methods
D Tao, SL Song, S Krishnamoorthy, P Wu, X Liang, EZ Zhang, ...
Proceedings of the 25th ACM International Symposium on High-Performance …, 2016
562016
The design of fast and energy-efficient linear solvers: On the potential of half-precision arithmetic and iterative refinement techniques
A Haidar, A Abdelfattah, M Zounon, P Wu, S Pranesh, S Tomov, ...
International conference on computational science, 586-600, 2018
552018
Investigating the interplay between energy efficiency and resilience in high performance computing
L Tan, SL Song, P Wu, Z Chen, R Ge, DJ Kerbyson
2015 IEEE International Parallel and Distributed Processing Symposium, 786-796, 2015
552015
Towards practical algorithm based fault tolerance in dense linear algebra
P Wu, Q Guan, N DeBardeleben, S Blanchard, D Tao, X Liang, J Chen, ...
Proceedings of the 25th ACM International Symposium on High-Performance …, 2016
542016
Fail-stop failure algorithm-based fault tolerance for cholesky decomposition
D Hakkarinen, P Wu, Z Chen
IEEE Transactions on Parallel and Distributed Systems 26 (5), 1323-1335, 2014
542014
Algorithm-directed data placement in explicitly managed non-volatile memory
P Wu, D Li, Z Chen, JS Vetter, S Mittal
Proceedings of the 25th ACM International Symposium on High-Performance …, 2016
512016
Fault tolerant matrix-matrix multiplication: correcting soft errors on-line
P Wu, C Ding, L Chen, F Gao, T Davies, C Karlsson, Z Chen
Proceedings of the second workshop on Scalable algorithms for large-scale …, 2011
482011
Correcting soft errors online in fast fourier transform
X Liang, J Chen, D Tao, S Li, P Wu, H Li, K Ouyang, Y Liu, F Song, ...
Proceedings of the International Conference for High Performance Computing …, 2017
442017
Silent data corruption resilient two-sided matrix factorizations
P Wu, N DeBardeleben, Q Guan, S Blanchard, J Chen, D Tao, X Liang, ...
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of …, 2017
322017
On-line soft error correction in matrix–matrix multiplication
P Wu, C Ding, L Chen, T Davies, C Karlsson, Z Chen
Journal of Computational Science 4 (6), 465-472, 2013
272013
Fault tolerant one-sided matrix decompositions on heterogeneous systems with gpus
J Chen, H Li, S Li, X Liang, P Wu, D Tao, K Ouyang, Y Liu, K Zhao, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
262018
Design, use and evaluation of p-fsefi: A parallel soft error fault injection framework for emulating soft errors in parallel applications
Q Guan, N BeBardeleben, P Wu, S Eidenbenz, S Blanchard, L Monroe, ...
Proceedings of the 9th EAI International Conference on Simulation Tools and …, 2016
252016
Accelerated discrete distribution clustering under wasserstein distance
J Ye, J Li, JZ Wang
US Patent 10,013,477, 2018
242018
High accuracy matrix computations on neural engines: A study of QR factorization and its applications
S Zhang, E Baharlouei, P Wu
Proceedings of the 29th International Symposium on High-Performance Parallel …, 2020
172020
The system can't perform the operation now. Try again later.
Articles 1–20