FILCO: Flexible Composing Architecture with Real-Time Reconfigurability for DNN Acceleration(🔥📣New Paper & Project🔥📣! )

Xingzhen Chen, Jinming Zhuang, Zhuoping Yang, Shixin Ji, Sarah Schultz, Zheng Dong, Weisong Shi, Peipei Zhou

March 2026

Abstract

With the development of deep neural network (DNN) enabled applications, achieving high hardware resource efficiency on diverse workloads is non-trivial in heterogeneous computing platforms. Prior works discuss dedicated architectures to achieve maximal resource efficiency. However, a mismatch between hardware and workloads always exists in various diverse workloads. Other works discuss overlay architecture that can dynamically switch dataflow for different workloads. However, these works are still limited by flexibility granularity and induce much resource inefficiency.

To solve this problem, we propose a flexible composing architecture, FILCO, that can efficiently match diverse workloads to achieve the optimal storage and computation resource efficiency. FILCO can be reconfigured in real-time and flexibly composed into a unified or multiple independent accelerators. We also propose the FILCO framework, including an analytical model with a two-stage DSE that can achieve the optimal design point. We also evaluate the FILCO framework on the 7nm AMD Versal VCK190 board. Compared with prior works, our design can achieve 1.3x∼5x throughput and hardware efficiency on various diverse workloads.

Type

Conference paper

Publication

Proceedings of the ACM/IEEE Design Automation Conference, DAC ’26, July 26 - July 29, 2026, Long Beach, CA, USA. Full Paper Accepted!

DAC

FILCO: Flexible Composing Architecture with Real-Time Reconfigurability for DNN Acceleration(🔥📣New Paper & Project🔥📣! )

Abstract

Xingzhen Chen

PhD Graduate Student

Jinming Zhuang

PhD Graduate Student

Zhuoping Yang

PhD Graduate Student

Shixin Ji

PhD Graduate Student

Sarah Schultz

PhD Graduate Student

Peipei Zhou

Assistant Professor of Engineering

Related