Publications

(2023). High Performance, Low Power Matrix Multiply Design on ACAP: from Architecture, Design Challenges and DSE Perspectives (🔥📣New Paper & Project🔥📣!). To appear in Proceedings of the 60th ACM/IEEE Design Automation Conference (DAC ’23), July 9–13, 2023, San Francisco, CA, USA. Full paper accepted (acceptance ratio: 23%).

(2020). Algorithm-Hardware Co-design for BQSR Acceleration in Genome Analysis ToolKit. 2020 IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM 20).

(2018). Caffeine: Towards Uniformed Representation and Acceleration for Deep Convolutional Neural Networks (🔥Best Paper). IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (Volume: 38, Issue: 11, Nov. 2019).

(2018). SODA: Stencil with Optimized Dataflow Architecture (🔥Best Paper Nominee). 2018 International Conference on Computer-Aided Design (ICCAD 18).

(2018). Latte: Locality Aware Transformation for High-Level Synthesis. 2018 IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM 18), short paper acceptance ratio: 7/48 = 14.6%.

(2018). ST-Accel: A High-Level Programming Platform for Streaming Applications on FPGA. 2018 IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM 18), full paper acceptance ratio: 22/106 = 20.7%.

(2017). Bandwidth Optimization Through On-Chip Memory Restructuring for HLS. 54th Annual Design Automation Conference (ACM DAC 17), acceptance rate: 161/676 = 24%.

(2016). Energy Efficiency of Full Pipelining: A Case Study for Matrix Multiplication. 24th IEEE International Symposium on Field-Programmable Custom Computing Machines (IEEE FCCM 16), acceptance rate: 32/133 = 24%.

(2016). ARAPrototyper: Enabling Rapid Prototyping and Evaluation for Accelerator-Rich Architecture. 24th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (ACM/SIGDA FPGA 16).

(2014). A Fully Pipelined and Dynamically Composable Architecture of CGRA. 22nd IEEE International Symposium on Field-Programmable Custom Computing Machines (IEEE FCCM 14).
