The Library

Exploring the capabilities of FPGA DSP blocks in neural network accelerators

Tools

Ioannou, Lenos (2021) Exploring the capabilities of FPGA DSP blocks in neural network accelerators. PhD thesis, University of Warwick.

Preview

PDF
WRAP_Theses_Ioannou_L_2021.pdf - Submitted Version - Requires a PDF viewer.
Download (3661Kb) | Preview

Official URL: http://webcat.warwick.ac.uk/record=b3782408~S15

Request Changes to record.

Abstract

Neural networks have contributed significantly in applications that had been difficult to implement with the traditional programming concepts (e.g. computer vision, natural language processing). In many occasions, they outperform their hand coded counterparts and are increasingly popular in end user applications. Neural networks, however, are compute and memory demanding, making their execution in resource constraint devices more difficult, especially for real time applications. Custom computing architectures on Field-Programmable Gate Arrays (FPGAs) have traditionally been used to accelerate such computations to meet specific requirements. Nonetheless, most approaches in the literature do not consider in detail the underlying FPGA architecture, resulting in less efficient implementations. They additionally have focused on complex designs optimised for high throughput in a datacenter setting with access to large datasets in memory. Meanwhile real edge applications are often processing streaming sensor data and require consideration of efficiency. Detailed FPGA implementations involve time consuming low level design effort, which in turn result in long turnaround time. FPGAs have evolved over the years to include hard macro blocks, for example Digital Signal Processing (DSP) blocks, that map more efficiently widely used operations. In addition, FPGAs are often tightly coupled with embedded microprocessors in a System-on-Chip (SoC) arrangement that offers a complete system solution. This thesis explores the capabilities of FPGA DSP blocks in neural network accelerators. Within this context, practices and tools that improve turnaround time have been explored, drawing conclusions on how to exploit DSP blocks in a way that maximises performance and efficiency. Finally, the work in this thesis shows that designing overlays in an architecture-centric manner can result in high operating frequency, while scaling to better utilise FPGA resources.

Item Type:

Thesis (PhD)

Subjects:

Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software
T Technology > TK Electrical engineering. Electronics Nuclear engineering

Library of Congress Subject Headings (LCSH):

Neural networks (Computer science), Field programmable gate arrays, Signal processing -- Digital techniques, Computer architecture

Official Date:

November 2021

Dates:

Date	Event
November 2021	UNSPECIFIED

Institution:

University of Warwick

Theses Department:

School of Engineering

Thesis Type:

PhD

Publication Status:

Unpublished

Supervisor(s)/Advisor:

Fahmy, Suhaib

Sponsors:

Engineering and Physical Sciences Research Council ; University of Warwick. School of Engineering

Extent:

xv, 126 leaves : illustrations, charts

Language:

eng

Request changes or add full text files to a record

Repository staff actions (login required)

View Item

University of Warwick
Publications service & WRAP

Highlight your research

The Library

Exploring the capabilities of FPGA DSP blocks in neural network accelerators

Abstract

Repository staff actions (login required)

University of WarwickPublications service & WRAP

Highlight your research

The Library

Exploring the capabilities of FPGA DSP blocks in neural network accelerators

Abstract

Repository staff actions (login required)

University of Warwick
Publications service & WRAP