gpu: nvgpu: contiguous memory for userd

For a given channel, userd_iova is computed as an offset from the
fifo->userd address. If nvlink is enabled, the fifo->userd buffer
must be physically contiguous, since nvlink bypasses the IOMMU;
otherwise PBDMA may be loaded from an invalid memory location.
This typically manifests as either a channel timeout (GP_PUT
loaded with 0, hence no progress) or a GPPTR invalid error
(GP_PUT loaded with an out-of-range index). Use
NVGPU_DMA_FORCE_CONTIGUOUS for the fifo->userd buffer when
nvlink is enabled.

Bug 2422486

Change-Id: I99d585ee196534025522a1cbd74fb4e4c03df98e
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1954802
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Author: Thomas Fleury
Date: 2018-11-20 13:27:45 -08:00
Committed-by: mobile promotions
Parent: 38dee046b0
Commit: ae6e4d0572


@@ -967,7 +967,11 @@ int gk20a_init_fifo_setup_sw(struct gk20a *g)
 			(size_t)f->num_channels,
 			&f->userd);
 	} else {
-		err = nvgpu_dma_alloc_sys(g, (size_t)f->userd_entry_size *
+		u32 flags = nvgpu_is_enabled(g, NVGPU_MM_USE_PHYSICAL_SG) ?
+				NVGPU_DMA_FORCE_CONTIGUOUS : 0U;
+
+		err = nvgpu_dma_alloc_flags_sys(g, flags,
+			(size_t)f->userd_entry_size *
 			(size_t)f->num_channels, &f->userd);
 	}
 	if (err != 0) {