The intricate process of designing FPGA-based accelerators for AI workloads is notoriously time-consuming, demanding extensive domain expertise to navigate a vast design space of architectural parameters, data flow, and memory hierarchies. While existing tools offer rapid co-design, identifying optimal configurations remains a bottleneck. This paper introduces a novel approach to streamline this challenge, integrating Large Language Models (LLMs) into the SECDA framework to automate and guide the design space exploration (DSE) process for FPGA accelerators, as detailed in their work on SECDA-DSE.
Automated Kernel-Specific Accelerator Generation
SECDA-DSE leverages an LLM Stack, employing retrieval-augmented generation and chain-of-thought prompting, to perform reasoning-guided exploration. This system generates candidate architectures, which are then iteratively refined through a feedback loop. The framework's efficacy is demonstrated by successfully generating three distinct accelerator designs, for element-wise vector multiplication, 2D convolution, and matrix transpose, that were synthesized and executed end-to-end on FPGA hardware. This signifies a significant step towards automating the creation of efficient, hardware-specific AI accelerators.