Can you elaborate on what you would like to accomplish?
Reading the code, it seems that you want to transfer the array element by element. With 32-bit elements in the array and 64-bit addressing, it makes sense that you have alignment problems.
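To illustrate the alignment point, here is a minimal sketch in plain C. It assumes the transfer needs 8-byte (64-bit) aligned start addresses, which I have not confirmed from your code. The helper names `is_aligned` and `count_aligned_elements` are made up for this example and are not part of any DMA driver.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Returns 1 if addr is a multiple of align (align must be a power of two). */
static int is_aligned(uintptr_t addr, size_t align)
{
    return (addr & (uintptr_t)(align - 1)) == 0;
}

/* Counts how many elements of a uint32_t array start on an
 * `align`-byte boundary. Hypothetical helper, for illustration only. */
static size_t count_aligned_elements(const uint32_t *buf, size_t n, size_t align)
{
    assert(buf != NULL);
    size_t count = 0;
    for (size_t i = 0; i < n; i++) {
        if (is_aligned((uintptr_t)&buf[i], align))
            count++;
    }
    return count;
}
```

If the array itself starts on an 8-byte boundary, only every other 32-bit element does too, so starting a separate transfer at each element would hit a misaligned address half of the time under that assumption. Transferring the whole buffer from its aligned base address avoids this.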
You typically want to use the DMA to transfer a large array in one go to maximize performance. If you haven’t done so yet, I suggest you check out the DMA tutorial series here