linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 18:16:01 +03:00

Author	SHA1	Message	Date
Deepak Nibade	330cc7d0e5	gpu: nvgpu: add profiler apis for resource reservation Add two new functions to reserve/release PM resources : nvgpu_prof_ioctl_reserve_pm_resource() nvgpu_prof_ioctl_release_pm_resource() Add ctxsw field to struct nvgpu_profiler_object to store per-resource context switch enable flag. Force resource reservation release while unbinding the context from profiler object or while closing the profiler object. Add this code in nvgpu_profiler_unbind_context() since both above paths will call this function. Bug 2510974 Jira NVGPU-5360 Change-Id: If334148e8df86360fba4162d1611187f3f04d01b Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389654 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Alex Waterman	7e99a68e34	gpu: nvgpu: Add basic recovery debugging messages Add basic recovery messages that describe what's happening during the recovery process. Hide this under a new recovery specific GPU debug log flag. The logs look like: [ 276.000733] nvgpu: 17000000.gv11b gv11b_fifo_recover:162 [DBG] REC \| Recovery starting [ 276.000737] nvgpu: 17000000.gv11b gv11b_fifo_recover:163 [DBG] REC \| ID = 0 [ 276.000741] nvgpu: 17000000.gv11b gv11b_fifo_recover:164 [DBG] REC \| id_type = TSG [ 276.000745] nvgpu: 17000000.gv11b gv11b_fifo_recover:165 [DBG] REC \| rc_type = MMU fault [ 276.000748] nvgpu: 17000000.gv11b gv11b_fifo_recover:166 [DBG] REC \| Engine bitmask: 0x0 [ 276.000753] nvgpu: 17000000.gv11b gv11b_fifo_recover:170 [DBG] REC \| Acquiring engines_reset_mutex [ 276.000756] nvgpu: 17000000.gv11b gv11b_fifo_recover:174 [DBG] REC \| Acquiring runlist_lock for active runlists [ 276.000764] nvgpu: 17000000.gv11b gv11b_fifo_recover:185 [DBG] REC \| Channels bound to this TSG: [ 276.000767] nvgpu: 17000000.gv11b gv11b_fifo_recover:190 [DBG] REC \| 0 \| chid 511 [ 276.001098] nvgpu: 17000000.gv11b gv11b_fifo_recover:222 [DBG] REC \| PBDMA Bitmask: 0x1 [ 276.001102] nvgpu: 17000000.gv11b gv11b_fifo_recover:228 [DBG] REC \| Runlist Bitmask: 0x1 [ 276.001106] nvgpu: 17000000.gv11b gv11b_fifo_recover:240 [DBG] REC \| Disabling RL scheduler now [ 276.001126] nvgpu: 17000000.gv11b gv11b_fifo_recover:246 [DBG] REC \| Disabling CG/PG now [ 276.189348] nvgpu: 17000000.gv11b gv11b_fifo_recover:259 [DBG] REC \| Clearing PBDMA_FAULTED, ENG_FAULTED in CCSR register [ 276.191972] nvgpu: 17000000.gv11b gv11b_fifo_recover:264 [DBG] REC \| Disabling TSG [ 276.191983] nvgpu: 17000000.gv11b gv11b_fifo_recover:279 [DBG] REC \| Preempting runlists for RC [ 276.192001] nvgpu: 17000000.gv11b gv11b_fifo_recover:288 [DBG] REC \| Polling for TSG to be off PBDMA [ 276.192012] nvgpu: 17000000.gv11b gv11b_fifo_recover:296 [DBG] REC \| Done! [ 276.192016] nvgpu: 17000000.gv11b gv11b_fifo_recover:306 [DBG] REC \| Resetting relevant engines [ 276.192020] nvgpu: 17000000.gv11b gv11b_fifo_recover:318 [DBG] REC \| Engine bitmask for RL 0: 0xd [ 276.192024] nvgpu: 17000000.gv11b gv11b_fifo_recover:323 [DBG] REC \| > Restting engine: ID=0 [ 276.209567] nvgpu: 17000000.gv11b gv11b_fifo_recover:347 [DBG] REC \| Done! [ 276.209572] nvgpu: 17000000.gv11b gv11b_fifo_recover:323 [DBG] REC \| > Restting engine: ID=2 [ 276.214290] nvgpu: 17000000.gv11b gv11b_fifo_recover:347 [DBG] REC \| Done! [ 276.214295] nvgpu: 17000000.gv11b gv11b_fifo_recover:323 [DBG] REC \| > Restting engine: ID=3 [ 276.224986] nvgpu: 17000000.gv11b gv11b_fifo_recover:347 [DBG] REC \| Done! [ 276.225013] nvgpu: 17000000.gv11b gv11b_fifo_recover:377 [DBG] REC \| Re-enabling runlists [ 276.225034] nvgpu: 17000000.gv11b gv11b_fifo_recover:383 [DBG] REC \| Re-enabling CG/PG [ 276.225134] nvgpu: 17000000.gv11b gv11b_fifo_recover:394 [DBG] REC \| Releasing engines reset mutex Note the "REC \|" which lets one easily do: $ dmesg \| grep "REC \|" To get a clear ubobstrructed view of the recovery progress in the dmesg log. JIRA NVGPU-5606 Change-Id: I183f2b5ac54edc60ee894a82111723e27aa5c46b Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2392991 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Konsta Hölttä	fcbd807842	gpu: nvgpu: remove lockless allocator The lockless allocator that spins in alloc and free ops using cmpxchg to mitigate race conditions has only ever been used for the post fences in preallocated job resources. Now each post fence has a clear owner (the job struct which already is allocated well) and lifetime, so this allocator has no longer a purpose. Delete it to avoid bitrot. (The design of the job queue has always been such that there's minimal contention in any case.) Jira NVGPU-5773 Change-Id: Ied98d977c2c75bacfd3d010ce60c80fe709231e0 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2392705 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	e6c0d84683	gpu: nvgpu: allocate fences in job structs As the submit job metadata has been simplified, the fence pool for job tracking fences is now just complex code for very simple purposes, so delete it. It's enough to hold the fence memory in the job struct itself instead of having separately allocated objects with different lifetimes. Each channel is using preallocated job arrays based on the prespecified inflight job count. The fences are used for tracking job completion, and a new job cannot be submitted before a previous wait has completed. This means that even with a ringbuffer with space for only one job, the previous job memory cannot get reclaimed by a new submit because the submits are ordered. Jira NVGPU-5773 Change-Id: I0c777df700aa7cfda6f971efa47aa72c5462b53a Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2392704 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	d7a1e0d4c9	gpu: nvgpu: update fmodel ctx_var init to add missing pm lists Update netlist parsing logic for fmodel to include the following lists: - LIST_compressed_pm_ctx_reg_ROP - LIST_compressed_pm_ctx_reg_unicast_GPC - LIST_compressed_pm_fbpa_ctx_regs - LIST_compressed_pm_ctx_reg_CAU - LIST_nv_perf_fbp_control_ctx_regs - LIST_nv_perf_gpc_control_ctx_regs - LIST_nv_perf_pma_control_ctx_regs Jira NVGPU-4711 Change-Id: Ie62784941c86ad42e06228875dea3254d8714be9 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2391709 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
mkumbar	e6c9c30b32	gpu: nvgpu: Add PMU ucode version for nvgpu-next Adding PMU ucode version for nvgpu-next chips_a P4 CL#: 28820694 JIRA NVGPU-5183 Change-Id: Id70bec5ad1422cce5fc0b0081f4d5924a4a15e09 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2378149 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
mkumbar	72e2f2e064	gpu: nvgpu: PMU NS ucode blob update Created PMU fw ops to support mutliple version of PMU NS boot blob creation as there is a pmu_ucode_desc interface change between legacy and new interface. Added pmu_ucode_desc_v1 interface to support igpu PMU on nvgpu-next JIRA NVGPU-5183 Change-Id: I9f132aa84681d78b05b03913c71a30dda08053f8 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2377832 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
mkumbar	918fa1a658	gpu: nvgpu: PMU NS ucode boot update Removed gpmu_ucode.bin usage by fetching PMU ucode descriptor and image from respective files for NS boot. JIRA NVGPU-5183 Change-Id: I597c5dd17b4a58603f550b32980d7d0ca9624aed Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376448 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
mkumbar	3f75e62c26	gpu: nvgpu: update super surface for igpu Add supper surface gpu_va details to super surface header member as needed by PMU ucode to process. This is required for iGPU PMU ucode on nvgpu-next to process command line args and ACK back with INIT message, without this PMU ucode ends up in hang due to DMA wait. Update super-surface details to cmd line args for PMU ucode to know the starting address of super-surface in SYSMEM. JIRA NVGPU-5186 Change-Id: I56d7d3e28527e46707663c97bc8e2a58000c7f5a Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376364 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
mkumbar	e6a3540ec1	gpu: nvgpu: support nvgpu-next for PMU on iGPU Support lsfm, perfmon and PG for iGPU PMU on nvgpu-next JIRA NVGPU-5183 Change-Id: Idbe1125c2a8f347de3f59c4ec824df9600573e7a Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376321 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	a8b06778bb	gpu: nvgpu: validate map request with nvmap dmabuf file mode fail RW map request with RO nvmap dmabuf. Bug 200630384 Change-Id: Ic18d348d30a638cd6cb098e9228f470809dbea42 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376131 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Alex Waterman	7c1c533a4a	gpu: nvgpu: Don't disable coalesce for gv11b+ Stop enabling LG and SU coalesce on gv11b and tu104. This is no longer required. Bug 1951653 Bug 1801194 Change-Id: I412be2caae6b841d5387ae5a153d38e49d3d61bc Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2392901 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	6daa0636d1	gpu: nvgpu: rework regops execution API Rework regops execution API to accomodate below updates for new profiler design - gops.regops.exec_regops() should accept TSG pointer instead of channel pointer. - Remove individual boolean parameters and add one flag field. Below new flags are added to this API : NVGPU_REG_OP_FLAG_MODE_ALL_OR_NONE NVGPU_REG_OP_FLAG_MODE_CONTINUE_ON_ERROR NVGPU_REG_OP_FLAG_ALL_PASSED NVGPU_REG_OP_FLAG_DIRECT_OPS Update other APIs, e.g. gr_gk20a_exec_ctx_ops() and validate_reg_ops() as per new API changes. Add new API gk20a_is_tsg_ctx_resident() to check context residency from TSG pointer. Convert gr_gk20a_ctx_patch_smpc() to a HAL gops.gr.ctx_patch_smpc(). Set this HAL only for gm20b since it is not required for later chips. Also, remove subcontext code from this function since gm20b does not support subcontext. Remove stale comment about missing vGPU support in exec_regops_gk20a() Bug 2510974 Jira NVGPU-5360 Change-Id: I3c25c34277b5ca88484da1e20d459118f15da102 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389733 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	a73b5d3c6f	gpu: nvgpu: use smpc global mode capability check In nvgpu_dbg_gpu_ioctl_smpc_ctxsw_mode(), check if SMPC global mode capability is supported instead of checking for the function pointer. Enable the capability only for Turing since pre-Turing GPUs don't support it. Bug 2510974 Jira NVGPU-5360 Change-Id: I352fb2a91b836cd8ef727966a53a28255d8ea834 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389653 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	ccba2e850b	gpu: nvgpu: add mutex to serialize profiler ioctl calls Add new mutex prof->ioctl_lock to serialize all IOCTL calls on profiler object. Running concurrent IOCTL calls could lead to races and corrupted state. Bug 2510974 Jira NVGPU-5360 Change-Id: I66a8d9078c35475a13442ccd34b61aca5b9c1d2b Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389652 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Seema Khowala	9ea21459b4	gpu: nvgpu: pascal+: trigger_suspend, wait_for/resume_from _pause set to NULL - NvRmGpuDeviceSetSmDebugMode uses regops interface. - NvRmGpuDeviceTriggerSuspend, NvRmGpuDeviceWaitForPause, and NvRmGpuDeviceResumeFromPause should return error on Pascal+. Use regops interface to suspend/resume. - On non-cilp devices(Maxwell), NvRmGpuDeviceTriggerSuspend, NvRmGpuDeviceWaitForPause, NvRmGpuDeviceResumeFromPause and NvRmGpuDeviceSetSmDebugMode are used when debugger(including coredump, memcheck) is attached or when CUDA application uses a syscall that requires traphandler(assert, cnp). Bug 2558022 Bug 2559631 Bug 2706068 JIRA NVGPU-5502 Change-Id: I9eb2ab0c8c75c50f53523d8bf39c75f98b34f3f0 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376159 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Shashank Singh	2fecf71e45	gpu: nvgpu: add disable nvlink option in gk20a_platform Add option to disable nvlink in struct gk20a_platform so that chips that do not support nvlink can work with pcie without compiling out nvlink code. Jira NVGPU-5870 Change-Id: Idc60418b5cf322ac81b241a4e59d25f5d8e6b9ca Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2332162 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	c99afa1766	gpu: nvgpu: add gr manager and mig infra This CL covers the code changes related to following support, - Added gr manager infra. - Added grmgr_gops infra. - Added mig infra. - Added log mask for MIG verbose support. JIRA NVGPU-5645 JIRA NVGPU-5646 Change-Id: Iec356e08e6cfee86ad9f59fdf6cfee9c38231359 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2385111 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	969b901999	gpu: nvgpu: create device/context profiler dev nodes Create new dev nodes for device and context profilers. Example of dev nodes on iGPU /dev/nvhost-prof-dev-gpu - device scope profiler /dev/nvhost-prof-ctx-gpu - context scope profiler Add below APIs to open/close above dev nodes : nvgpu_prof_dev_fops_open() nvgpu_prof_ctx_fops_open() nvgpu_prof_fops_release() Add common API nvgpu_prof_fops_ioctl() to handle IOCTL call on these dev nodes. Add IOCTL NVGPU_PROFILER_IOCTL_BIND_CONTEXT to bind the TSG to profiler objects. Add nvgpu_tsg_get_from_file() to retrieve TSG struct pointer from file descriptor. Also store profiler object pointer into TSG struct. Enable NVGPU_SUPPORT_PROFILER_V2_DEVICE capability on gv11b and tu104. Note that this is not yet enabled for vGPU. Keep NVGPU_SUPPORT_PROFILER_V2_CONTEXT capabiity disabled since this will take longer to support. Add new IOCTL NVGPU_PROFILER_IOCTL_UNBIND_CONTEXT so that userspace can explicitly unbind the context and release the resources before closing the profiler descriptor. Add context_init flag to profiler object for book keeping. Bug 2510974 Jira NVGPU-5360 Change-Id: Ie07e0cfd5a9da9d80008f79c955c7ef93b4bc60f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2384354 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	fb95b7efa7	gpu: nvgpu: move nvgpu_func io functions to common - Move nvgpu_func_writel and nvgpu_func_readl to common io file. - Add func.get_full_phys_offset() hal to gk20a_gops structure. - Add tu104_func_get_full_phys_offset() for tu104. JIRA NVGPU-5363 Change-Id: I2aa13862a37f48321510882053256e16ef3f7377 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2383483 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	dd82cdca97	gpu: nvgpu: introduce new ctxsw_addr_type LTS_MAIN The LTS_MAIN will be used by nvgpu-next chips. In addition, update gops_ltc.h to include nvgpu_next_gops_ltc.h and nvgpu_next_gops_ltc_intr.h Jira NVGPU-5352 Bug 200605474 Bug 200608785 Change-Id: Id77ddfc4c1aa2f93e98e05cfd8645f7ffb8f41c8 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2366350 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Alex Waterman	0f5818b89e	gpu: nvgpu: Condition debug dump on recovery profiling If recovery sequence profiling is enabled skip the debug dump that happens during an MMU fault. This prevents the debug dump from dominating the time spent by the recovery sequence. The debug dump is severly limited in speed by the (lack of) UART bandwidth. JIRA NVGPU-5606 Change-Id: Ifc7c326d33d9115d58b13c0fa42ec4bb7acb3075 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2382591 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Alex Waterman	d0714b40c1	gpu: nvgpu: Add engine reset profiling This is a key part of the fifo recovery sequence. JIRA NVGPU-5606 Change-Id: I8807884394834b912f25d7c535ee22f547988b2d Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2382590 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Alex Waterman	1bcdc306a0	gpu: nvgpu: Add gv11b recovery profiling Add some basic profiling to the gv11b recovery sequence. This captures the high level events. Subsequent patches start to dig into the subsections in more detail. JIRA NVGPU-5606 Change-Id: I488a448ca1cbf961651588e24685e2a5b4420c44 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2368302 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Alex Waterman	811ba85dc6	gpu: nvgpu: Add basic stats to profiler Add the ability to print some basic stats to the SW profiler. This doesn't replace a userspace application to do more sophisticated stats analysis if necessary, but it goves some quick basic info. The stats provided are: { Min, Max, Mean, Media, Sigma^2 } JIRA NVGPU-5606 Change-Id: Iadfa5cf1d57657182dcb63e66dd682b54a6fa0de Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2367421 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Tejal Kudav	ab2b0b5949	gpu: nvgpu: Set unserviceable flag early during RC During recovery, we set ch->unserviceable at the end after we preempt the TSG and reset the engines. It might be too late and user-space might submit more work to the broken channel which is not desirable. Move setting this unserviceable flag right at the start of recovery sequence. Another thread doing a submit can still read the unserviceable flag just before it is set here, leaving that submit stuck if recovery completes before the submit thread advances enough to set up a post fence visible for other threads. This could be fixed with a big lock or with a double check at the end of the submit code after the job data has been made visible. We still release the fences, semaphore and error notifier wait queues at the end; so user-space would not trigger channel unbind while channel is being recovered. Also, change the handle_mmu_fault APIs to return void as the debug_dump return value is not used in any of the caller APIs. JIRA NVGPU-5843 Change-Id: Ib42c2816dd1dca542e4f630805411cab75fad90e Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2385256 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
shashank singh	650ce63466	gpu: nvgpu: make iommu bit getting hal NULL for turing For dgpu iommu bit is causing smmu fault when sysmem is accessed via pcie. Since pcie is always having iommu enabled on linux that creates issue for linux. So don't set the iommu bit for dgpu in any case. Bug 200640033 Change-Id: I38556779db94289b0656cdb53d417e4ff83ed426 Signed-off-by: shashank singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2384653 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	e161c8d7fa	gpu: nvgpu: remove the root cap check in ctxsw device open The device node permission for the ctxsw should be set to "root:debug" instead. Bug 2823941 Change-Id: I523fdd298b70cac82c0a8d853f3e241a80a2ebf5 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372943 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	e8f2e3d514	gpu: nvgpu: add nvgpu-next sim function prototypes Add nvgpu-next sim function prototypes. This resolves qnx and userspace build errors. JIRA NVGPU-5363 Change-Id: I7b20917ec73b2ca3a1514872620266bb7a54097c Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2369657 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Dinesh	d0087f3ad8	gpu: nvgpu: Support for runlist_max_supported nvgpu_next needs support for max_runlist_supported by litter value. So the function is changed to support. JIRA NVGPU-5534 Change-Id: I097f6343295049532c46904316314dc82092a46b Signed-off-by: Dinesh <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2382882 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	58ef68e162	gpu: nvgpu: add gr manager gops for nvgpu-next-1 1) Included gr manager gops for nvgpu-next chip 2) Added conf flag to enable/disable MIG JIRA NVGPU-5646 Change-Id: I37d3b64fb8a49f97d37c89374241d0fc9c75891e Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2382270 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Shashank Singh	71c8d998d4	gpu: nvgpu: return error if therm is uninitialized If therm is not initialized then return error for getting temperature API. Bug 200638833 Jira NVGPU-5832 Change-Id: Iebe44218d76d39d5bf765e8de6fd74c3b64c8b68 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2382905 Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seeta Rama Raju	9dd17d29d7	gpu: nvgu: Build nvgpu-next files for internal build only Bug 200632047 JIRA NVGPU-5833 Change-Id: I17cb42f8b533925c9caba5e4b434c645adc2affb Signed-off-by: Seeta Rama Raju <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375170 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	14ad3b21af	gpu: nvgpu: add user interface for profiler v2 support Add user interface of new dev nodes and corresponding IOCTLs to support new profiler design. Bug 2510974 Jira NVGPU-5360 Change-Id: Ib257fa826a2282ffb463b1238f6092a1c9b9b2dc Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2340563 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Tejal Kudav	881a6f35be	gpu: nvgpu: Trigger quiesce on PBDMA preempt fail During recovery, we preempt the faulty TSG from PBDMA and engines. If the TSG preempt on PBDMA times out(timeout = 100ms), the PBDMA might be hung state. We do not reset the HOST during recovery, so stuck PBDMAs are unrecoverable. Abort the recovery and trigger GPU to quiesce as there is no way back. Triggering Quiesce from recovery sequence should be fine as the only redundant operation will be write to FIFO_RUNLIST_PREEMPT register. The error notifiers will eventually be set by Quiesce thread. Bug 2768005 JIRA NVGPU-4631 Change-Id: I914b9379aa8e48014e6ddace9abe47180a072863 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2368187 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Terje Bergstrom	f7a73f4ab8	gpu: nvgpu: Rename gpu_sys clock to pwr In upstream device trees pwr clock for gp10b and gv11b are called pwr. Rename the clock in downstream so that nvgpu can work with upstream kernel. Bug 3030537 Change-Id: I7738a5c73e893c6ee7ebbf859a347aa508bfcffa Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376595 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Terje Bergstrom	ebd3b18d27	gpu: nvgpu: Support nvidia,gp10b In upstream T186 GPU has compatiblity string nvidia,gp10b. Add support for it to nvgpu. Bug 3030537 Change-Id: Ia1c7a2b5cd0fd0ce35f2860ee82638f767bf9845 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376143 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seema Khowala	23f290a128	gpu: nvgpu: dbgr_control0 value can be different for different SM Do not assume dbgr_control0 register value uniformity as different SM can have different values. JIRA NVGPU-5502 Change-Id: Ib2e1f418f04f142b1948f5713b473df0f9b3ffc3 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2373946 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Seema Khowala	52271d7ab6	gpu: nvgpu: add check for SM debug mode Add check for SM debug mode in trigger_suspend, wait_for_pause and resume_from_pause hals. SMs cannot be suspended/resumed if all SMs are not in debug mode. JIRA NVGPU-5502 Change-Id: I790eb11405155a5e5d327ca048ebf21f9f8d2fab Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2373489 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	9d723a5f1f	gpu: nvgpu: add knob to control fecs_trace feature Currently, NVGPU_SUPPORT_FECS_CTXSW_TRACE enabled flag is set to true when fecs_trace s/w setup is executed successfully. Sometimes, fecs_trace is required to be disabled for debugging. This change will help disable/enable fecs_trace feature by modifying one of the enabled flags. Enable NVGPU_SUPPORT_FECS_CTXSW_TRACE during chip specific hal init. Control fec_trace init and ctxsw dev open depending on NVGPU_SUPPORT_FECS_CTXSW_TRACE flag status. JIRA NVGPU-5616 Change-Id: Id0754a5af7cd95a67a1f0ae5de36115d44e1111b Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2357501 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
mkumbar	8fbc4e5b56	gpu: nvgpu: update ACR sub-wpr support update ACR sub-wpr support by deleting FRTS_VBIOS_TABLES sub-wpr id support. FRTS_VBIOS_TABLES sub-wpr causing NEXT dGPU ACR AHESASC to hit ACR_ERROR_FLCN_ID_NOT_FOUND error and these tables are not supported by NVGPU. JIRA NVGPU-5462 Change-Id: I2de20b27a1a3ecbf4b3acb793eb22c637c4faba6 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2368213 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	f34711d3de	gpu: nvgpu: split perfbuf initialization gk20a_perfbuf_map() allocates perfbuf VM, maps the user buffer into new VM, and then triggers gops.perfbuf.perfbuf_enable(). This HAL then does following : - Allocate perfbuf instance block - Initialize perfbuf instance block - Reset stream buffer - Program instance block address in PMA registers - Program user buffer address into PMA registers New profiler interface will have it's own API to setup PMA strem, and it requires above setup to be done in two phases of perfbuf initialization and then user buffer setup. Split above functionalities into below functions - nvgpu_perfbuf_init_vm() - Allocate perfbuf VM - Call gops.perfbuf.init_inst_block() to initialize perfbuf instance block - gops.perfbuf.init_inst_block() - Allocate perfbuf instance block - Initialize perfbuf instance block - Program instance block address in PMA registers using gops.perf.init_inst_block() - In case of vGPU, trigger TEGRA_VGPU_CMD_PERFBUF_INST_BLOCK_MGT command to gpu server - gops.perf.init_inst_block() - Reset stream buffer - Program user buffer address into PMA registers Also add corresponding cleanup functions as below : gops.perf.deinit_inst_block() gops.perfbuf.deinit_inst_block() nvgpu_perfbuf_deinit_vm() Bug 2510974 Jira NVGPU-5360 Change-Id: I486370f21012cbb7fea84fe46fb16db95bc16790 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372984 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Alex Waterman	12e71f22f8	gpu: nvgpu: Use correct GPL for swprofile_debugfs.[ch] These two files were not using the correct GPL license format for the nvgpu Linux OS code. Corretc this. Also fix the header guard define in swprofile_debugfs.h since it did not quite match the header file name. Bug Change-Id: I317056823cade697fdf65f9ff83306129ee0ebe3 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2374698 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Antony Clince Alex	8a9acf8a7e	gpu: nvgpu: move set_hww_esr_report_mask to golden context init The driver configures the sm hww global, warp ESR report masks during poweron as part of gops_gr.gr_init_support. However, during golden context init, these are overwritten with default entries from sw_ctx_load list; this leaves the report masks in a state inconsistent with the driver expectation. The driver should configure the sm hww warp, global ESR report masks during golden context init and not before it; Hence, move set_hww_esr_report_mask from power-on path to golden context init. In addition, update set_hww_esr_report_mask to do RMW, so as to retain the values loaded from sw_ctx_load list. Update global ESR report mask to enable all exceptions. Bug 3029888 Bug 2997718 Change-Id: Id7ad4cff5409982143f49695c95c5e1d1c9fdec9 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2367466 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Sagar Kamble	1a4b454b56	gpu: nvgpu: advertise RESCHEDULE_RUNLIST capability only for realtime processes Below change added capability check in the ioctl. nvgpu is advertising the support for RESCHEDULE_RUNLIST for all processes even though it fails the ioctl for non-realtime processes. Clear the ioctl flag for RESCHEDULE_RUNLIST for non-realtime processes. commit `838ba0a14d` ("gpu: nvgpu: check capability for reschedule runlist submit flag") Author: David Li <davli@nvidia.com> Date: Tue Sep 12 18:37:00 2017 -0700 NVGPU_SUBMIT_GPFIFO_FLAGS_RESCHEDULE_RUNLIST is only used by realtime priority EGL context, which checks for CAP_SYS_NICE during context creation in userspace, so it wasn't secure against unprivileged program spoofing submit ioctl with this flag to stall GPU progress of others. This flag does increase duration of submit by approx 16us, mostly due to register accesses and PMU FIFO mutex. Bug 2823941 Change-Id: Iecee3989e5af035264b1ed5c1aa9a8576dd90883 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372957 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Sagar Kamble	d0848abee5	gpu: nvgpu: remove cap checks from fifo_sched & ctxsw_ring debugfs open Debugfs can be mounted with root-only permissions hence remove the extra cap checks in the debugfs open calls for fifo_sched & ctxsw_ring. Bug 2823941 Change-Id: I41668a887635f34897886b872ad435b183b85959 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372982 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Debarshi Dutta	4a54ccc3ef	gpu: nvgpu: move linux configs from Kconfig file Kstable branch is getting rid of all downstream patches. This also removes support for NVGPU build as kernel overlays will no longer be supported. In order to move towards a uniform out of tree build system, nvgpu must manage the CONFIGS present in Kconfig itself and stop relying on Kconfigs. A new file Makefile.linux.configs is created to house these configs temporarily. This file is included as part of the linux Makefile. Eventually the plan is to move towards using Makefile.shared.configs. This takes us one more step closer to having out of tree module building for NVGPU internal builds. With this change, kstable can still go ahead with building extmod builds for NVGPU. This also allows downstream builds to continue as in-tree builds as long as the overlays are set for the downstream kernels. Bug 200617256 Change-Id: I78aae6b02521e2a07e8e74aa401ffdfaf9d8cf7c Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2369209 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Debarshi Dutta	08ec6e874d	gpu: nvgpu: avoid using priv data for non-compressed buffer Instead of allocating priv data for all external buffers, allocate only on a demand basis for when compression is requested either in CDE or via libnvrm_gpu. This will allow allocators like nvidia-drm to use non-compressed buffers without needing to avoid the core drm checks. e.g. drm_gem_prime_import_dev that checks for if (dma_buf->ops == &drm_gem_prime_dmabuf_ops)" This patch also gets rid of optimization of dma_buf's attach/detach calls. Now, nvgpu instead needs to call attach/detach for everytime the dmabuf fd is imported. Change-Id: Idefd269b32974106e85ff09e17ebc752b92f830c Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372213 Tested-by: Yogish Kulkarni <yogishk@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	58ce9156a8	gpu: nvgpu: get gpc/tpc addr width from litter Some chips have GPC/TPC address width exposed through litter value. - Add GPC/TPC address width to litter value. - Update pri_gpccs_addr_width() and pri_tpccs_addr_width() to read value from litter value. JIRA NVGPU-5598 Change-Id: I534fa3188e3412f7e1b7bbf61c8227c966895ea5 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2371425 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
smadhavan	c261f7573b	gpu: nvgpu: support nvgpu-next secure boot Add NVGPU_NEXT_GPUID in nvgpu_acr_init, nvgpu_acr_lsf_fecs_ucode_details, and nvgpu_acr_lsf_gpccs_ucode_details functions. JIRA NVGPU-5323 Change-Id: I514ab6de08ffaad323072499a92acef24668d3fc Signed-off-by: smadhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2361630 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00

... 6 7 8 9 10 ...

8327 Commits