Hi @Xingda_Chen,
Welcome to the PYNQ community.
I am not an expert in RFSoC, however, I can tell you that I see a few issues with the DMA. First, the tlast is set to a constant, which from the results, I assume may be one. This is not correct. Second, the tkeep is unconnected, which mean it is being tied to 0.
You will need to create extra logic to set tkeep to all ones, and only set tlast in the appropriate beat, this will be at 512 samples (16-bit) each.
I wrote an extensive DMA debug post.
Mario