linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-25 11:04:51 +03:00

Author	SHA1	Message	Date
Rajesh Devaraj	4bb32e33e7	gpu: nvgpu: update pbdma_dump_intr_0 as an hal To reduce the duplication of HALs to new chips, this makes pbdma dump_intr_0 as an HAL. JIRA NVGPU-9325 JIRA NVGPU-9064 Change-Id: I737146068cb144165bae8666c04f876aed20a89c Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2847566 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-23 22:39:36 -08:00
Rajesh Devaraj	7ad50fee7c	gpu: nvgpu: add new device types This patch adds macros for new device types based on the device information table. Specifically, it adds the device type IDs for the following engines: - SEC - NVENC - NVDEC - NVJPG - OFA Further, the max dev types has been updated as 57 in the device info table. To support his, the corresponding macro has been updated as 58. JIRA NVGPU-9501 Change-Id: I23f17c91da8a6063457c27763a62b1a08beeed0d Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2846203 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-21 16:52:39 -08:00
Rajesh Devaraj	afb971b66e	gpu: nvgpu: update report_pbdma_error as hal To reduce the duplication pbdma_handle_intr_0 API to new chips, this patch converts report_pbdma_error as a HAL. JIRA NVGPU-9325 Change-Id: Ifcb0838037c750070c26343e008a176b26eebf16 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2845088 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-20 03:40:17 -08:00
prsethi	4dec633f5f	gpu:nvgpu: clean the domain queues in abrupt close Domain queues are being cleaned as part of graceful close but does not gets cleaned in case of abrupt app failures. This change cleanup the queues for abrupt failures. Bug 3884011 Change-Id: I8cd07b726718b1e13a05d4455a2670cedc9dee37 Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2842632 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-01-17 01:40:36 -08:00
Divya	04cd344b35	gpu: nvgpu: free rpc_payload when driver is dying - During nvgpu module unload, the poweroff sequence will not wait for the ACK from the PMU for the RPCs sent. - Due to this, rpc_payload struct info will be present in pmu seq struct. - This can lead to memory corruption during unload path. - To avoid this, return a different value for driver shutting down scenario from fw ack function and based on this return value free the RPC payload and release the respective pmu sequence struct. Bug 3789998 Change-Id: I25104828d836ae37e127b40c88209da81754ffb8 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2839968 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-13 06:10:52 -08:00
Rajesh Devaraj	3e2eff564f	gpu: nvgpu: update pbdma intr enable set/clear masks as hals To reduce the entire duplication of pbdma_intr_enable for future chips, make set and clear masks as HALs. JIRA NVGPU-9325 Change-Id: Id8434fc15ca4bf542680a8452dc294f2c4068084 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2838036 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ramalingam C <ramalingamc@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-09 20:04:11 -08:00
prsethi	8c710694e8	gpu:nvgpu: fix for consecutive domain submission When a user-domain gets removed, tsg belongs to this also gets removed and runlist update happens accordingly. If same tsg was submitted to gpu then updated runlist also needs to re-submit. This works fine with the existing legacy cases but if GPU is running the shadow domain submitted by manual mode scheduler and domain belongs to this gets removed then updated runlist is not being submitted to GPU. This runlist buffer inconsistency causes mmu fault later. This change adds a "remove" field in the runlist domain which gets set to true when runlist update happens for the channel removal. Later worker thread submit the updated runlist if this flag set to true. Bug 3884011 Change-Id: I3ce08a5a281e20661915746e70ac0dcd711f3f38 Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2838808 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-01-09 12:26:12 -08:00
Rajesh Devaraj	2d3745810b	gpu: nvgpu: add support flag for gsp stress test Add support flag for GSP stress test. JIRA NVGPU-9347 Change-Id: I6b93e085b4e25798f1227297fd1baba8c1380604 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2833485 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-03 19:09:17 -08:00
Rajesh Devaraj	95c5d3d776	gpu: nvgpu: add flag to disable static pg This patch introduces the flag NVGPU_DISABLE_STATIC_POWERGATE. Further, static pg related initialization APIs have been updated to check whether this newly added flag is enabled. If this flag is enabled, then static pg init gets skipped. JIRA NVGPU-9350 Change-Id: I450b6dd2c541d31fc40fb5540e557b5db730fee2 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2834021 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-01-02 01:09:34 -08:00
Rajesh Devaraj	536ff1001e	gpu: nvgpu: add flag to disable ecc stats This patch adds the flag NVGPU_DISABLE_ECC_STATS to disable the creation of sysfs nodes that will be used to provide ECC counter values. Further, this patch adds a check on whether NVGPU_DISABLE_ECC_STATS flag is disabled before performing ECC sysfs initialization. JIRA NVGPU-9349 Change-Id: I99bc3895d3a5ffcf73035351f55c63ed5c46356b Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2833907 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Prathap Kumar Valsan <prathapk@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-01-02 01:09:13 -08:00
Alex Waterman	a9e21b487a	nvgpu: Fixes for QNX build structure changes Minor updates in the path to header files. Change-Id: I6fe1db8f050d9b168a1662f0cb65e15bc13c2195 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2810665 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-12-28 06:56:34 -08:00
vivekku	470c1738b5	gpu: nvgpu: gsp: host routed interrupt handling Changes: - support for watchdog timer interrupt handling - code modified to support gsp scheduler interrupts for ecc errors directed to host - interrupts like IMEM, DMEM, EMEM, Delayed Lock Step and incorrect Register access NVGPU-9273 Change-Id: I93a2ef0961aaa40e76ca7efe8450ce07a6709453 Signed-off-by: vivekku <vivekku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2818202 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> Tested-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-12-21 11:59:25 -08:00
rmylavarapu	7bbf10b04a	gpu: nvgpu: gsp: bootstrap gsp scheduler firmware This change will call nvgpu_gsp_sched_bootstrap_hs which will bootstrap the gsp with gsp scheduler firmware. NVGPU-9297 Change-Id: If5de945dc7994666fd87ecf99e15ca2014c13573 Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2826165 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-18 11:43:14 -08:00
Seeta Rama Raju	e0a9553533	gpu: nvgpu: Add magic value at instance block This is adding magic value in instance block while initializing instance block for a context. This will be verified by FECS firmware. Bug 3638810 Change-Id: I7d304c1b622b3c9f50a7443e9fadce9bac869258 Signed-off-by: Seeta Rama Raju <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2786274 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-17 02:21:07 -08:00
Tejal Kudav	31b2738f6a	gpu: nvgpu: Add Epl Init EPL lib constructor is replaced by NvEplInit() by safety services. NvGPU, being an EPL user, needs to call NvEplInit before using EPL API to report errors. NvEplInit() usage is limited to user space in Linux, so NvGPU is not expected to call NvEplInit on Linux. Bug 3863536 Change-Id: I7fc9b33d3443bd39a6367a0c691bddb80b9edb68 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2817245 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-15 20:55:54 -08:00
Atul Anand	5db14f3bfb	nvgpu: Fix pm resource release sequence The memory leak issue was due to nvgpu_profiler_unbind_context() calling nvgpu_profiler_pm_resource_release() for all resources which clears the flag required by nvgpu_profiler_free_pma_stream() to release the memory for perf_buf instance block. Fixing this issue by splitting nvgpu_profiler_unbind_context() to release all the pm resources at a later time separately. Bug 3510455 Change-Id: Ibab8d071693e600c46f7e7f16575e36e6f62af3c Signed-off-by: atanand <atanand@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2825013 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-12-15 15:13:29 -08:00
mpoojary	9b73378362	gpu: nvgpu: Add support for loading ctxsw encrypted binaries Add checks to load encrypted CTXSW binaries for T234, when executing in silicon; else load the non encrypted binaries. Jira NVGPU-9303 Change-Id: Icf55ed76b1a7340006b00d1c24472d26462a880c Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2819642 GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Reviewed-by: Dinesh Kamalakannan <dineshka@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com>	2022-12-14 23:48:10 -08:00
Dinesh T	2509287e71	gpu: nvgpu: enable NVGPU on OOT This patch is required for - enabling NVGPU driver on OOT by enabling various configs required. - replacing new APIs for some deprecated APIs by guarding with linux version for soc.c. - CONFIG_TEGRA_HV_MANAGER is enabled by default in OOT kernel, removing CONFIG_TEGRA_HV_MANAGER check from various places. Bug 3812973 Change-Id: I07f0b738ca95d4a3996e7f3ee5e895463db0626b Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2822434 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-13 16:56:58 -08:00
vivekku	a92bca9772	gpu: nvgpu: gsp: created cmd for GSP to bind ctx reg Changes - create command for GSP firmware to bind ctx register. NVGPU-8730 Change-Id: If92bbbc0169b6466e55f3dff05828b2b649ad3de Signed-off-by: vivekku <vivekku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2815472 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-13 06:22:02 -08:00
rmylavarapu	01eb416745	gpu: nvgpu: gsp sched: enable gsp sw init for safety build Changes 1. Remove dGPU flag dependency on calling gsp sw init on tot. 2. Created Enable flag for gsp scheduler to enable them on ga10b platforms. 3. Engine config flag is only enabled for dGPU enabled platforms, as gsp is using engine functions it need to be enabled for all gsp sched enabled builds. 4. Changes in gsp_sequence_init/de_init where on qnx we are seeing issues. NVGPU-9297 Change-Id: Ia4bce85ae8fd2794da1553e9ea418c76845a10ac Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2822537 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-12 06:06:09 -08:00
rmylavarapu	398a30a546	gpu: nvgpu: gsp sched: get the binary file names as per debug fuse Changes 1. Created gsp hal function to read the hardware config register to tell whether the board is debug fused. 2. Created function to get the binary file names as per debug fuse. NVGPU-9295 Bug 3897331 Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Change-Id: Ia8462aa6f3d8d0d538c06f35245c965e106b3d37 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2822443 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-12-11 19:09:29 -08:00
prsethi	b4494a4b86	gpu:nvgpu: fix the nvs mmap issues - As part of mmap call handler mmap_sem is being acquired which is causing an BUG: scheduling while atomic:. This is happening because mmap callback gets called with mmap_sem held. The mmap file handler is called via call_mmap() via mmap_region() via do_mmap(), and the caller of do_mmap() must hold down_write(&current->mm->mmap_sem). To fix this issue removing the mmap_sem locking from nvs mmap handler. - If remap_vmalloc_range() is used to map the pages to userspace then allocated pages should be mapped into virtually contiguous space by passing VM_USERMAP flag to vmap(). Passing VM_USERMAP to vmap() call if API nvgpu_dma_alloc_flags_sys() is called with flag NVGPU_DMA_VM_USERMAP_ADDRESS. Bug 3884011 Change-Id: I64264f6f6e0dd75b1f828dc58355d740e6ef5ccb Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2820781 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-09 15:19:07 -08:00
Dinesh T	373398a46b	gpu: nvgpu: Add os specific call to initialize the channels Add OS specific function to return the number of syncpoints available to the GPU. This is required for making the syncpoints as configurable. Linux: The default number of syncpoints is 512. Bug 3644504 Change-Id: Iddbc38cb25480876d6d8f39f039218a1b2b22605 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2820152 Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-12-06 04:47:24 -08:00
Debarshi Dutta	8e60795b9c	gpu: nvgpu: Add correct nomenclature for NVS ioctls Its preferable to use the following naming convention NVGPU_<group>_IOCTL_<function>. The IOCTL interfaces are updated accordingly. Also, all KMD based defines as part of the UAPI need to be prefixed by NVGPU. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I2210336536cbcc0415885f3f92a2f7fa982fa39c Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2814484 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-12-06 04:40:05 -08:00
Debarshi Dutta	2d38294912	gpu: nvgpu: add Doxygen documentation for Control-Fifo Add Doxygen for Control-FIFO APIs. Add null checks where necessary. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I75f92108c73a521e45299b8870e106916954e7a8 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2805551 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Tested-by: Prateek Sethi <prsethi@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-29 04:06:53 -08:00
Shashank Singh	7abaeda619	gpu: nvgpu: add API to query page table memhandles Add API to query all memhandles used for pde and pte. - Some direct pde/pte allocation should also add entry to the pd-cache full list. - Add OS API for querying MemServ handle from nvgpu_mem. - Traverse through all pd-cache partial and full lists to get memhandles for all pde/pte buffers. Jira NVGPU-8284 Change-Id: I8e7adf1be1409264d24e17501eb7c32a81950728 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2735657 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-24 11:19:10 -08:00
Debarshi Dutta	63e8de5106	gpu: nvgpu: Remove NVGPU_SUPPORT_NVS_CTRL_FIFO Now that we are planning to enable CTRL_FIFO support with NVS, there is no need for a separate enabled flag for the same. CTRL_FIFO support is instead determined by the presence of NVGPU_SUPPORT_NVS enable flag alone. For non-auto platforms, Control-Fifo can be disabled by restricting access to /dev/nvsched_ctrl_fifo. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I9dbec60e5668f38e1460c43800584e88b16a2550 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2814435 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-11-24 00:47:37 -08:00
Debarshi Dutta	a8bdb67b2e	gpu: nvgpu: add doxygen comments for NVS Add doxygen comments for Domain Management APIs of NVS. Added NULL handling where required. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I23f45b95c070c8249bb83a336239b2b2d1a852a4 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2805043 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-11-24 00:39:21 -08:00
Debarshi Dutta	5d2dfc88a3	gpu: nvgpu: Replace CONFIG_NVS_KMD_BACKEND Use CONFIG_KMD_SCHEDULING_WORKER_THREAD instead of CONFIG_NVS_KMD_BACKEND to remove confusion about the CPU based KMD scheduling worker thread. The KMD based scheduling worker thread caters to both Manual Mode CPU based scheduler as well as Automatic Round Robin CPU based scheduler. For the traditional submit path, add correct handling of the CONFIG_NVS_PRESENT. CPU based worker thread should be part of CONFIG_NVS_PRESENT. Eventually, when DCONFIG_KMD_SCHEDULING_WORKER_THREAD is removed, the application must switch to GSP. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I0886ef3b2e0124b6fe22c2bf0bf7d1fa98039d00 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2810217 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-11-23 08:07:24 -08:00
Kishan	ea9aebb358	nvgpu:cic: API to handle fatal error interrupt Any corrected or uncorrected error reported by gpu hw will be seen by nvgpu-mon. nvgpu-mon will raise a devctl call to notify nvgpu-rm if its a fatal error interrupt. nvgpu_cic_mon_handle_fatal_intr is the corresponding handler which will walk through the entire tree structure of interrupts for all the subunits and enter quiesce state. Change-Id: I3c00c61a7f2c52ae1920f84ee7dfb65cba6b683d Signed-off-by: Kishan <kpalankar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2801693 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-17 19:34:43 -08:00
Divya	f6aedfc36b	gpu: nvgpu: adjust the max idle filter for AELPG - When we enable AELPG, we send maximum idle threshold value as one of the parameters. - This was set to 70 msec whereas the ELPG IDLE threshold was set to 15 msec. - Thus, when AELPG is enabled, the threshold is getting modified to a much higher value. So sometimes ELPG is not getting engaged and this is leading to timeout in ELPG MODS 101 test on GVS. Bug 3737783 Change-Id: Iac94f053e19cea5898b962c3d02369d556b8518f Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2786749 (cherry picked from commit 5010eaffda2ecaf6c9cc5f0fed68498cb2947071) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2809154 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-17 07:21:27 -08:00
rmylavarapu	e432d2a41c	gpu: nvgpu: gsp: fix update runlist info cmd Current runlist update command uses domain info present in runlist structure which would be changing depending upon the active domain, this would cause issue while sending runlist info to the gsp fw. This Cl will fix the issue by passing the domain parameters to the update runlist function for sending it to the gsp fw. NVGPU-8531 Change-Id: Id57b446b55c6e50833376767d672d873a1a9207e Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2803045 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-17 02:05:43 -08:00
Sagar Kamble	ae5488c495	gpu: nvgpu: add multi process tsg sharing char for linux Add the characteristic flag NVGPU_SUPPORT_MULTI_PROCESS_TSG_SHARING for Linux. Bug 3677982 JIRA NVGPU-8681 Change-Id: I774c1aa57f91704a28cfb18912eba4f5afe3b9b8 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792083 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:50:04 -08:00
Sagar Kamble	96f675595c	gpu: nvgpu: implement get and revoke share token ioctls Add share token list to gk20a_ctrl_priv. Implement GET_SHARE_TOKEN and REVOKE_SHARE_TOKEN ioctls. Revoke tokens while closing the TSG for all active devices. Bug 3677982 JIRA NVGPU-8681 Change-Id: I74455c21d881d5a0d381729fd695239722599980 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792081 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:54 -08:00
Sagar Kamble	675edd5053	gpu: nvgpu: maintain authorized devices in TSG When the TSG is successfully created first time or is opened with share token, the device instance id associated with the CTRL fd will be added to the TSG private data structure as authorized device instance ids. This is used for a security check when creating a TSG share token with nvgpu_tsg_get_share_token. Bug 3677982 JIRA NVGPU-8681 Change-Id: I67bb0514e1272dab15023cd3828a6a51e9a4c928 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792080 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:44 -08:00
Sagar Kamble	6e2b592ab9	gpu: nvgpu: add ctrl device instance ID In order to share the TSG across different devices securely, device instance IDs are to be exchanged for endpoint identification. Add device instance ID field to gk20a_ctrl_priv which is generated from gk20a level device instance id value. Share this ID to userspace via gpu characteristics. Bug 3677982 JIRA NVGPU-8681 Change-Id: I79d92a81c02272c52e24f5b12c452c8993137037 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792079 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:39 -08:00
Tejal Kudav	41c874a2d9	gpu: nvgpu: Fix error injection HAL init Currently, the registeration with error injection utility is done only for GA10b using HAL. But HALs are not initialized during the probe stage when we try to register the error injection utility. So, the callback registration does not happen HAL is set to NULL. Move the callback registration from probe to poweron stage when HAL is initialized. Update the nvgpu_cic_mon_init_lut() API name as it is no longer doing only LUT initialization. Bug 3828050 Change-Id: Ide718029e9317124749b4a51c423ae70dc8227c8 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2790269 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-08 13:11:58 -08:00
Sagar Kamble	d1b28712b6	gpu: nvgpu: implement VEID alloc/free Implement the ioctls NVGPU_TSG_IOCTL_CREATE_SUBCONTEXT and NVGPU_TSG_IOCTL_DELETE_SUBCONTEXT. These will allocate and free the VEID numbers. Address space association with the VEIDs is verified to ensure that channels association with VEIDs and address space remains consistent. Bug 3677982 JIRA NVGPU-8681 Change-Id: I2d913baf61a6bdeec412c58270c0024b80ca15c6 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2766765 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-01 00:05:18 -07:00
prsethi	de0808ea5b	gpu:nvgpu: fix below issue with ctrl nvs. - Move queue lock at correct place. - Free the allocated memory. Jira NVGPU-8622 Change-Id: Ia996d80498e53fb21ddf1f1202abd6fb8e3f6168 Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2791618 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-28 15:28:46 -07:00
Debarshi Dutta	b2e3810514	gpu: nvgpu: add support for manual mode NVS worker thread is changed to support manual mode exclusively with multi-domain round-robin scheduling. If control-fifo is enabled, NVS worker thread parses the ring buffer. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Icc78e0749d5e4ebdb52f0c503ec303947011b163 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2757241 Reviewed-by: Vivek Kumar (SW-TEGRA) <vivekku@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-10 14:07:58 -07:00
Debarshi Dutta	562c4f6ea3	gpu: nvgpu: add infra for manual mode submits in KMD Added infrastructure for enabling parsing Control-Fifo's ring buffers(i.e. send/receive). Initialization of these buffers are handled as part of nvgpu_nvs_buffer_alloc() call itself. A follow-up change shall implement the methods defined here as part of the existing NVS worker thread. The changes adhered to the design laid out in the header nvsched/include/nvs/nvs-control-interface.h. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I2050e6fb681eba80e01cf547ada37a955e58315a Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2764518 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-10 14:07:52 -07:00
Debarshi Dutta	17dc483a6b	gpu: nvgpu: enclose NVS KMD inside a config Use CONFIG_NVS_KMD_BACKEND to enclose all NVS KMD based scheduling code. Current configuration contains all the scheduling code managed within CONFIG_NVS_PRESENT. Eventually, scheduling code shall only use GSP. Hence, isolate KMD based scheduling code to a config CONFIG_NVS_KMD_BACKEND. This shall make it easier to remove this code later. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I9dc668e0fa3e7706c111fda7a5e2415e1fc0dd03 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2769465 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-10 14:07:37 -07:00
Tejal Kudav	724f49f6eb	gpu: nvgpu: Remove dependency on DGPU CONFIG The error injection code was enabled only when CONFIG_NVGPU_DGPU = n so that the dGPUs do not attempt any error injection callback function registration. But, this introduced dependency on DGPU config when needs to be explicitly set to n for error injection to be enabled. Remove the dependency by moving the error injection callback registration and deregistration to a HAL which is enabled only on GA10b. Bug 3819160 Change-Id: I4f4eb99189b1af3502d719536a91cc5e5d866bce Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2787202 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-06 17:25:52 -07:00
rmylavarapu	30e7a5e5ed	gpu: nvgpu: gsp sched: create and enable gsp virtual memory access Changes - Initialize virtual memory for gsp. This space is used for creating queues for ctrl fifo. Also can be used to ro map sync-pt to this instance where gsp firmware can poll the sync-pt with sync-pt id. - Enabled gsp context interface and written the instance block pointer to nxtctx register for the gsp firmware to access created virtual memory. - Added required gsp registers for this feature. NVGPU-8730 Bug 3770916 Change-Id: If538f615eca3f9b7840ffe2787826528b4808886 Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2764649 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-06 17:16:21 -07:00
Austin Tajiri	3761c468ad	gpu: nvgpu: add channel.get_vmid gops Add a channel.get_vmid gops so that we can pass the proper VMID to gr.fecs_trace.bind_channel in virtualized environments. Jira GVSCI-14708 Change-Id: Ifc4e6aafa33fa7274bdeb000e8c0fd1a7fc849c7 Signed-off-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2780108 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 20:03:53 -07:00
vivekku	5bb56723be	gpu: nvgpu: gsp: Create functions to pass nvs data to gsp firmware Changes: - created functions to populate gsp interface data from nvs and runlist structures. - Handled both user domains and shadow domains. - Provided support for four engines from two. NVGPU-8531 Signed-off-by: vivekku <vivekku@nvidia.com> Change-Id: I1d9ec9ded8a9b47a5b2a00c44dacbab22e3b04b1 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2743596 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 06:18:18 -07:00
Mahantesh Kumbar	8c36750fd8	gpu: nvgpu: cleanup the seq for railgate seq - Perfmon cmds are non-blocking calls and response may/may-not come during railgate sequence for the perfmon command sent as part of nvgpu_pmu_destroy call. - if response is missed then payload allocated will not be freed and allocation info will be present as part seq data structure. - This will be carried forward for multiple railgate/ rail-ungate sequence and that will cause the memleak when new allocation request is made for same seq-id. - Cleanup the sequence data struct as part of nvgpu_pmu_destroy call by freeing the memory if cb_params is not NULL. Bug 3747586 Bug 3722721 Change-Id: I1a0f192197769acec12993ae575277e38c9ca9ca Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2763054 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Divya Singhatwaria <dsinghatwari@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Tested-by: Divya Singhatwaria <dsinghatwari@nvidia.com>	2022-09-21 01:08:54 -07:00
atanand	f43897c940	gpu: nvgpu: GA10X_NEXT pulling GR1 out of reset This patch is to enable GR1 before resetting GR0 which is not visible to the driver. Bug 3690950 Change-Id: I8a1907349f5a4354c6b7f95f9904b52738f51f00 Signed-off-by: atanand <atanand@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2758161 (cherry picked from commit 48d925cacf373a97dbdb031a109b83be3bfe2972) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2765635 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:05:01 -07:00
Sagar Kamble	ef99d9f010	gpu: nvgpu: implement scg, pbdma and cilp rules Only certain combination of channels of GFX/Compute object classes can be assigned to particular pbdma and/or VEID. CILP can be enabled only in certain configs. Implement checks for the configurations verified during alloc_obj_ctx and/or setting preemption mode. Bug 3677982 Change-Id: Ie7026cbb240819c1727b3736ed34044d7138d3cd Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2719995 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:30 -07:00
Sagar Kamble	693305c0fd	gpu: nvgpu: subcontext add/remove support Subcontext PDBs and valid mask in the instance blocks of the channels in various subcontexts has to be updated when new subcontext is created or a subcontext is removed. Replayable fault state is cached in the channel structure. Replayable fault state for subcontext is set based on first channel's bind parameter. It was earlier programmed in function channel_setup_ramfc. init_inst_block_core is updated to setup TSG level pdb map and mask. Added new hal gv11b_channel_bind to enable the subcontext on channel bind. Bug 3677982 Change-Id: I58156c5b3ab6309b6a4b8e72b0e798d6a39c1bee Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2719994 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:20 -07:00

1 2 3 4 5 ...

3018 Commits