High Performance Computing (HPC Group)

Selected Publications

[TPDS"14] Efficient GPU Spatial-Temporal Multitasking. Yun Liang, Huynh Phung Huynh (co-first author), Kyle Rupnow, Rick Siow Mong Goh, and Deming Chen. IEEE Transactions on Parallel and Distributed Systems, 2014.


[TPDS"14] A Family of Bit-Representation-Optimized Formats for Fast Sparse Matrix-Vector Multiplication on the GPU. Wai Teng Tang, Wen Jun Tan, Rick Siow Mong Goh, Stephen J. Turner, Weng-Fai Wong. IEEE Transactions on Parallel and Distributed Systems, 2014.


[TPDS"14] A Code Generation Framework for Targeting Optimized Library Calls for Multiple Platforms. Wen Jun Tan, Wai Teng Tang, Rick Siow Mong Goh, Stephen J. Turner, Weng-Fai Wong. IEEE Transactions on Parallel and Distributed Systems, 2014.


[TPDS"13] Mapping Streaming Applications onto GPU Systems. Huynh Phung Huynh, Andrei Hagiescu, Ong Zhong Liang, Weng-Fai Wong and Rick Siow Mong Goh. IEEE Transactions on Parallel and Distributed Systems, 2013.


[IEEE BigData"13] Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor. Mian Lu, Lei Zhang, Huynh Phung Huynh, Zhong Liang Ong, Yun Liang, Bingsheng He, Rick Siow Mong Goh, and Richard Huynh. IEEE International Conference on Big Data (IEEE BigData"13) 2013.


[EuroPar"13] Hierarchical Parallel Algorithm for Modularity-Based Community Detection using GPUs. Chun Yew Cheong, Huynh Phung Huynh, David Lo and Rick Siow Mong Goh. International European Conference on Parallel and Distributed Computing, 2013.


[SC"13] Accelerating Sparse Matrix-Vector Multiplication on GPUs using Bit-Representation-Optimized Schemes. Wai Teng Tang, Wen Jun Tan, Rajarshi Ray, Yi Wen Wong, Weiguang Chen, Shyh-hao Kuo, Rick Siow Mong Goh, Stephen J. Turner, Weng-Fai Wong. IEEE Internation Conference in High Performance Computing, Networking Storage and Analysis, Denver, USA, 2013.


[IPDPS"13] Optimizing and Auto-tuning Iterative Stencil Loops for GPUs with the In-plane Method. Wai Teng Tang, Wen Jun Tan, Ratna Krishnamoorthy, Yi Wen Wong, Shyh-hao Kuo, Rick Siow Mong Goh, Stephen J. Turner, Weng-Fai Wong. IEEE International Parallel and Distributed Processing Symposium, Boston, MA, USA, 2013.


[PPoPP"12] Scalable Framework for Mapping Streaming Applications onto Multi-GPU Systems. Huynh Phung Huynh, Andrei Hagiescu, Weng Fai Wong, Rick Siow Mong Goh. 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Feb 2012.


[EuroPar"12] Tulipse: A Visualization Framework for User-guided Parallelization. Yi Wen Wong, Tomasz Dubrownik, Wai Teng Tang, Wen Jun Tan, Rubing Duan, Rick Siow Mong Goh, Stephen J. Turner, Weng-Fai Wong. International European Conference on Parallel and Distributed Computing, 2013.


[ICPADS"12] GPGPU for Real-Time Data Analytics. Bingsheng He, Huynh Phung Huynh, Rick Siow Mong Goh. 18th IEEE International Conference on Parallel and Distributed Systems, 2012. (Invited Tutorial).


[SC"12 Companion] Mapping Streaming Applications onto GPU Systems. Huynh Phung Huynh, Andrei Hagiescu, Weng-Fai Wong, Rick Siow Mong Goh, Abhishek Ray. 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, Salt Lake City, UT, USA, November 10-16, 2012.


[IPDPS"11] Automated architecture-aware mapping of streaming applications onto GPUs. Andrei Hagiescu, Huynh Phung Huynh, Weng Fai Wong, Rick Siow Mong Goh. IEEE International Parallel and Distributed Processing Symposium 2011.


[GTC"11, Singapore] Mapping Framework for Streaming Applications on GPUs. Huynh Phung Huynh. GPU Technology Conference, 2011. (Invited talk)


[GTC"10] Migration of a Complete 3D Poisson Solver from Optimized Fortran77 to GPU. Huynh Phung Huynh, Shyh-hao Kuo, Rick Siow Mong Goh, Le Duc Vinh, Terence Hung Gih Guang. GPU Technology Conference, September 2010.