HLS/IP BRAM Usage

Hello. I'm trying to run a neural network with HLS. This is partly a Xilinx issue and partly an issue for this repository, I think. I'm using hls4ml to compile/synthesize the network, but this part of the problem is more Xilinx/PYNQ related. The board is a PYNQ-Z2, and my NN has a 512-element input and 10 outputs (for classification).
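For context, the conversion looks roughly like this. This is only a sketch: the model file, output directory and exact argument names are placeholders for my setup and may differ between hls4ml versions.

```python
import hls4ml
from tensorflow import keras

# Placeholder for my trained classifier: 512 inputs, 10 softmax outputs
model = keras.models.load_model('my_classifier.h5')

# Start from an auto-generated hls4ml config and convert the Keras model
config = hls4ml.utils.config_from_keras_model(model, granularity='model')

hls_model = hls4ml.converters.convert_from_keras_model(
    model,
    hls_config=config,
    output_dir='hls_prj',
    io_type='io_parallel',        # or 'io_serial', see below
    fpga_part='xc7z020clg400-1',  # PYNQ-Z2
)
```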

When the project is built, an HLS project is created. I can then open it with Vivado HLS (2019.2), run C Synthesis, and export the IP. Here is where the issues start.
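If it helps, the same synthesis/export steps can also be driven from Python instead of the Vivado HLS GUI; this is a sketch assuming the `hls_model` object from the conversion snippet above.

```python
# Roughly the equivalent of "Run C Synthesis" + "Export RTL" in the GUI
hls_model.build(csim=False, synth=True, export=True)

# Print the resource/latency report that Vivado HLS wrote into the project dir
hls4ml.report.read_vivado_report('hls_prj')
```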

  1. If I compile with `io_type='io_parallel'`, the following IP block is generated:

     [screenshot of the io_parallel IP block]

     but then this is my utilization estimate:

     [Screenshot from 2021-08-16 11-30-17: utilization estimate for io_parallel]

  2. If I compile with `io_type='io_serial'`, I get a massive IP block with 512 separate inputs, but the utilization estimate fits the board:

     [Screenshot from 2021-08-16 11-32-17: utilization estimate for io_serial]

As you can see, io_parallel gives the nicer IP block interface, but its BRAM usage is way over the board's maximum.

Here are my questions:

  1. MOST IMPORTANT: Is it possible for the PYNQ to implement the smaller io_parallel design and then load the BRAM contents from the SD card? How can I reduce the BRAM usage (see the first sketch after this list), and how would I do that from Python?

  2. If io_serial is used so that the BRAM fits the board, how do I make the connections in the block design so that I can generate the overlay and use it from Python (see the second sketch after this list)?
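To make question 1 concrete, these are the hls4ml configuration knobs I am aware of for trading parallelism against resources. This is only a sketch with arbitrary values, reusing the `model`/`config` objects from the conversion sketch above, and I am not sure it is the right approach for my case:

```python
# Assumed knobs (values are arbitrary): reuse each multiplier over several
# clock cycles and let hls4ml favour resource sharing over latency
config['Model']['ReuseFactor'] = 64
config['Model']['Strategy'] = 'Resource'

hls_model = hls4ml.converters.convert_from_keras_model(
    model,
    hls_config=config,
    output_dir='hls_prj_reuse',
    io_type='io_parallel',
    fpga_part='xc7z020clg400-1',  # PYNQ-Z2
)
```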
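And for question 2, this is roughly what I would like to end up with on the Python side. Again only a sketch: it assumes the exported IP is wrapped in a Vivado block design together with an AXI DMA, and the bitstream name, the `axi_dma_0` instance name and the float32 data type are all assumptions on my part:

```python
import numpy as np
from pynq import Overlay, allocate

# Load the overlay generated from the block design (file name is a placeholder)
ol = Overlay('nn_classifier.bit')
dma = ol.axi_dma_0  # assumed name of the AXI DMA instance

# One sample: 512 input features in, 10 class scores out
in_buf = allocate(shape=(512,), dtype=np.float32)
out_buf = allocate(shape=(10,), dtype=np.float32)
in_buf[:] = np.random.rand(512).astype(np.float32)  # dummy input

# Stream the input through the accelerator and read back the result
dma.sendchannel.transfer(in_buf)
dma.recvchannel.transfer(out_buf)
dma.sendchannel.wait()
dma.recvchannel.wait()

print('predicted class:', int(np.argmax(out_buf)))
```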