linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 10:34:43 +03:00

Author	SHA1	Message	Date
Sagar Kamble	2b69b9b264	gpu: nvgpu: return 40 bit addr from nvgpu_mem_userspace_get_addr For some of the unit tests cpu va for malloc'd buffers was going above 4gb and assert about 4gb is hit. HW supports 40 bit physical address. Hence return 40 bit address instead of 32 bit address. Bug 3862385 Change-Id: Ia8cc71d7e7356f2de8d0a4ba1e17f2a2cef0fe10 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2805596 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-14 20:59:06 -08:00
Sagar Kamble	ae5488c495	gpu: nvgpu: add multi process tsg sharing char for linux Add the characteristic flag NVGPU_SUPPORT_MULTI_PROCESS_TSG_SHARING for Linux. Bug 3677982 JIRA NVGPU-8681 Change-Id: I774c1aa57f91704a28cfb18912eba4f5afe3b9b8 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792083 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:50:04 -08:00
Sagar Kamble	ce26e92de6	gpu: nvgpu: open TSG with the share token Implement OPEN_TSG ioctl with share tokens. Bug 3677982 JIRA NVGPU-8681 Change-Id: If44aef863c932163df769acef5b3586f97aaecd3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792082 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:59 -08:00
Sagar Kamble	96f675595c	gpu: nvgpu: implement get and revoke share token ioctls Add share token list to gk20a_ctrl_priv. Implement GET_SHARE_TOKEN and REVOKE_SHARE_TOKEN ioctls. Revoke tokens while closing the TSG for all active devices. Bug 3677982 JIRA NVGPU-8681 Change-Id: I74455c21d881d5a0d381729fd695239722599980 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792081 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:54 -08:00
Sami Kiminki	31a4701931	gpu: nvgpu: UAPI specification for TSG sharing Add below ioctls for TSG share token management: 1. NVGPU_TSG_IOCTL_GET_SHARE_TOKEN 2. NVGPU_TSG_IOCTL_REVOKE_SHARE_TOKEN Update the ioctl NVGPU_GPU_IOCTL_OPEN_TSG to consider the creation of TSG with share token. Bug 3677982 JIRA NVGPU-8681 Change-Id: I436217061bc0e9f6424ea793cf7efbc3368d0817 Signed-off-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792078 Tested-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:49 -08:00
Sagar Kamble	675edd5053	gpu: nvgpu: maintain authorized devices in TSG When the TSG is successfully created first time or is opened with share token, the device instance id associated with the CTRL fd will be added to the TSG private data structure as authorized device instance ids. This is used for a security check when creating a TSG share token with nvgpu_tsg_get_share_token. Bug 3677982 JIRA NVGPU-8681 Change-Id: I67bb0514e1272dab15023cd3828a6a51e9a4c928 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792080 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:44 -08:00
Sagar Kamble	6e2b592ab9	gpu: nvgpu: add ctrl device instance ID In order to share the TSG across different devices securely, device instance IDs are to be exchanged for endpoint identification. Add device instance ID field to gk20a_ctrl_priv which is generated from gk20a level device instance id value. Share this ID to userspace via gpu characteristics. Bug 3677982 JIRA NVGPU-8681 Change-Id: I79d92a81c02272c52e24f5b12c452c8993137037 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2792079 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-10 11:49:39 -08:00
Tejal Kudav	41c874a2d9	gpu: nvgpu: Fix error injection HAL init Currently, the registeration with error injection utility is done only for GA10b using HAL. But HALs are not initialized during the probe stage when we try to register the error injection utility. So, the callback registration does not happen HAL is set to NULL. Move the callback registration from probe to poweron stage when HAL is initialized. Update the nvgpu_cic_mon_init_lut() API name as it is no longer doing only LUT initialization. Bug 3828050 Change-Id: Ide718029e9317124749b4a51c423ae70dc8227c8 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2790269 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-08 13:11:58 -08:00
Kishan	f29ed3a474	gpu: nvgpu: Makefile changes to enable IPC between nvgpu-mon and rm. CONFIG_NVGPU_MON_PRESENT is being enabled only for safety debug & release build. Change-Id: I5c58ea52a5a844483236927366e74faf800423b3 Signed-off-by: Kishan <kpalankar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2775941 GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com>	2022-11-04 20:49:39 -07:00
Sagar Kamble	d1b28712b6	gpu: nvgpu: implement VEID alloc/free Implement the ioctls NVGPU_TSG_IOCTL_CREATE_SUBCONTEXT and NVGPU_TSG_IOCTL_DELETE_SUBCONTEXT. These will allocate and free the VEID numbers. Address space association with the VEIDs is verified to ensure that channels association with VEIDs and address space remains consistent. Bug 3677982 JIRA NVGPU-8681 Change-Id: I2d913baf61a6bdeec412c58270c0024b80ca15c6 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2766765 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-01 00:05:18 -07:00
Sami Kiminki	9233886943	gpu: nvgpu: UAPI specification for VEID alloc/free nvgpu will be allocating and freeing the subcontext VEIDs for sharing the TSG across processes. Add interfaces for allocating and freeing the VEIDs. Bug 3677982 JIRA NVGPU-8681 Change-Id: I48b00609147e2696404c3aad14dae9bb940d04d4 Signed-off-by: Sami Kiminki <skiminki@nvidia.com> Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2497716 Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-11-01 00:05:12 -07:00
Rajesh Devaraj	7138e7e673	gpu: nvgpu: export function definitions across chips To avoid duplication of same code across multiple chips, export the following functions through the corresponding headers for the consumption of other GPU enabling functions: - ga10b_gr_intr_report_tpc_sm_rams_ecc_err - gv11b_gr_intr_report_l1_tag_uncorrected_err - gv11b_gr_intr_report_l1_tag_corrected_err - gv11b_gr_intr_report_icache_uncorrected_err JIRA NVGPU-9075 Change-Id: I927285b6e638479ac52cd5d214711e490e5f151e Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2798371 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-28 15:30:20 -07:00
prsethi	de0808ea5b	gpu:nvgpu: fix below issue with ctrl nvs. - Move queue lock at correct place. - Free the allocated memory. Jira NVGPU-8622 Change-Id: Ia996d80498e53fb21ddf1f1202abd6fb8e3f6168 Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2791618 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-28 15:28:46 -07:00
Mikko Perttunen	5c8e511e48	gpu: nvgpu: linux/host1x: Execute fence callback in non-atomic context Due to changes in the host1x driver, dma_fence callbacks will be executed in interrupt context instead of workqueue context as previously. To allow for that, this patch effectively moves the workqueue step into nvgpu so that the in-nvgpu fence callback gets executed in workqueue context. Signed-off-by: Mikko Perttunen <mperttunen@nvidia.com> Change-Id: I7bfa294aa3b4bea9888921b79175a8fc218d8e3f Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2785968 Reviewed-by: Jonathan Hunter <jonathanh@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-27 11:54:31 -07:00
Ramalingam C	e933a47bd8	gpu: nvgpu: Export func definitions across chips Export below functions through the corresponding headers for the consumption of other GPU enabling codes gr_gv11b_pri_pmmgpc_addr gr_gv11b_split_pmm_fbp_broadcast_address JIRA NVGPU-9073 Change-Id: I8ebaa5329352c1c0d5bb5f787736cbe04a61b809 Signed-off-by: Ramalingam C <ramalingamc@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2796095 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-26 12:09:31 -07:00
Ramalingam C	6c9ae09d93	gpu: nvgpu: Export func definions for future usage Export below functions through the corresponding headers for the consumption of other GPU enabling codes. gv11b_fb_copy_from_hw_fault_buf gv11b_mm_mmu_fault_handle_mmu_fault_refch gsp_get_emem_boundaries Change-Id: If041d1983a6981f510d8dd622c95b1e80fa50e16 Signed-off-by: Ramalingam C <ramalingamc@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2794239 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: V M S Seeta Rama Raju Mudundi <srajum@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-20 19:42:05 -07:00
Sagar Kamble	7ab770ae93	gpu: nvgpu: skip mapping global ctx buffer if already mapped Global context buffers were mapped on every ALLOC_OBJ_CTX call. If many channels are created sharing an address space they can exhaust the VA space by mapping same global context buffers again and again. Skip mapping the global context buffer if it is already mapped for an address space. Bug 3802863 Bug 3796293 Change-Id: I3844c211b3350aa06cabd92c415a34a83034dd43 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2789584 (cherry picked from commit 0611ec30c6a61b7e1b07d516b74d6eddb3c6b37e) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2789581 Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-12 22:16:36 -07:00
Debarshi Dutta	7ab3b9937d	gpu: nvgpu: plugin control-fifo ioctls Enable control-fifo IOCTL operations for Linux Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I112322d207f6e20e60e726c24f47c6f73035562c Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2789850 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-12 06:00:37 -07:00
Debarshi Dutta	280b69e66d	nvgpu: userspace: add unit test for nvs Add a unit test to add verification for S/W parts of NVGPU-KMD based scheduler Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I266cb4167074dc5f7da647ce627e96188fc6bdcb Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2767591 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-10 14:08:03 -07:00
Debarshi Dutta	b2e3810514	gpu: nvgpu: add support for manual mode NVS worker thread is changed to support manual mode exclusively with multi-domain round-robin scheduling. If control-fifo is enabled, NVS worker thread parses the ring buffer. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Icc78e0749d5e4ebdb52f0c503ec303947011b163 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2757241 Reviewed-by: Vivek Kumar (SW-TEGRA) <vivekku@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-10 14:07:58 -07:00
Debarshi Dutta	562c4f6ea3	gpu: nvgpu: add infra for manual mode submits in KMD Added infrastructure for enabling parsing Control-Fifo's ring buffers(i.e. send/receive). Initialization of these buffers are handled as part of nvgpu_nvs_buffer_alloc() call itself. A follow-up change shall implement the methods defined here as part of the existing NVS worker thread. The changes adhered to the design laid out in the header nvsched/include/nvs/nvs-control-interface.h. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I2050e6fb681eba80e01cf547ada37a955e58315a Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2764518 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-10 14:07:52 -07:00
Debarshi Dutta	17dc483a6b	gpu: nvgpu: enclose NVS KMD inside a config Use CONFIG_NVS_KMD_BACKEND to enclose all NVS KMD based scheduling code. Current configuration contains all the scheduling code managed within CONFIG_NVS_PRESENT. Eventually, scheduling code shall only use GSP. Hence, isolate KMD based scheduling code to a config CONFIG_NVS_KMD_BACKEND. This shall make it easier to remove this code later. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I9dc668e0fa3e7706c111fda7a5e2415e1fc0dd03 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2769465 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-10 14:07:37 -07:00
Sagar Kamble	6d836becf5	gpu: nvgpu: retry unbind when force killing the channel If NEXT bit remains set for a channel being unbound, it can lead to MMU fault of type unbound inst block. When userspace is closing the channel and NEXT bit is set, userspace retries. When force killing the channel, nvgpu can retry few iterations to ensure the channel is truly idle and unbound. If the channel is really stuck then unbind will fail and TSG will be aborted. Bug 3800844 Change-Id: I8fb024630ff2dd272245ae27116f3db6d6e0f788 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2787533 (cherry picked from commit 99e39f4b387743a93b05ba4b097c33b23fbbcf68) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2786479 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-10 08:17:12 -07:00
Jon Hunter	6bef424e1e	gpu: nvgpu: Update include paths for OOT module When building NVGPU as an OOT module for upstream Linux kernels, the NVGPU driver source is now copied into a common location with all the other OOT modules. Therefore, we can now use the 'srctree.nvidia' path for finding the necessary header files for Host1x and NVMAP. Update the include search paths to use 'srctree.nvidia' when building NVGPU as an OOT module. Bug 3817518 Change-Id: I63066e4331c66a0f47ada83fde3e63402faaf38a Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2785910 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-07 03:10:10 -07:00
Tejal Kudav	724f49f6eb	gpu: nvgpu: Remove dependency on DGPU CONFIG The error injection code was enabled only when CONFIG_NVGPU_DGPU = n so that the dGPUs do not attempt any error injection callback function registration. But, this introduced dependency on DGPU config when needs to be explicitly set to n for error injection to be enabled. Remove the dependency by moving the error injection callback registration and deregistration to a HAL which is enabled only on GA10b. Bug 3819160 Change-Id: I4f4eb99189b1af3502d719536a91cc5e5d866bce Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2787202 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-06 17:25:52 -07:00
rmylavarapu	30e7a5e5ed	gpu: nvgpu: gsp sched: create and enable gsp virtual memory access Changes - Initialize virtual memory for gsp. This space is used for creating queues for ctrl fifo. Also can be used to ro map sync-pt to this instance where gsp firmware can poll the sync-pt with sync-pt id. - Enabled gsp context interface and written the instance block pointer to nxtctx register for the gsp firmware to access created virtual memory. - Added required gsp registers for this feature. NVGPU-8730 Bug 3770916 Change-Id: If538f615eca3f9b7840ffe2787826528b4808886 Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2764649 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-10-06 17:16:21 -07:00
Martin Radev	6249220e09	gpu: nvgpu: fix nvgpu_css_allocate_perfmon_ids This patch fixes nvgpu_css_allocate_perfmon_ids which leads to a buffer overflow if the allocation of perfmon ids does not succeed. If the allocation of perfmon ids cannot be satisfied, bitmap_find... would return CSS_MAX_PERFMON_IDS and nvgpu_bitmap_set would still be called with start after the bitmap array. This results into a buffer overflow. Bug 3814963 Change-Id: I4caff36cf0c920b4445e1841d16ba2b4c3d19aaa Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2786747 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 20:13:27 -07:00
Sagar Kamble	b8d8d621b9	gpu: nvgpu: allow re-registering TSG events With TSG shared across devices/processes, it is necessary to allow all clients to registers for the events. Bug 3677982 Change-Id: I3cde10665e481fcc58759066e4b70de1ff792e79 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2784666 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: Richard Zhao <rizhao@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 20:07:19 -07:00
Austin Tajiri	3761c468ad	gpu: nvgpu: add channel.get_vmid gops Add a channel.get_vmid gops so that we can pass the proper VMID to gr.fecs_trace.bind_channel in virtualized environments. Jira GVSCI-14708 Change-Id: Ifc4e6aafa33fa7274bdeb000e8c0fd1a7fc849c7 Signed-off-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2780108 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 20:03:53 -07:00
vivekku	5bb56723be	gpu: nvgpu: gsp: Create functions to pass nvs data to gsp firmware Changes: - created functions to populate gsp interface data from nvs and runlist structures. - Handled both user domains and shadow domains. - Provided support for four engines from two. NVGPU-8531 Signed-off-by: vivekku <vivekku@nvidia.com> Change-Id: I1d9ec9ded8a9b47a5b2a00c44dacbab22e3b04b1 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2743596 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 06:18:18 -07:00
vivekku	12b539aa69	gpu: nvgpu: gsp: create nvgpu gsp control fifo interface Changes: - control fifo file and its build support is done - Interface to containing control fifo info to be passed to gsp created - command and function to send fifo info to GSP NVGPU-8686 NVGPU-8688 NVGPU-8692 Change-Id: I96c59b621ca299f0f4b71e16bd15cad03e719192 Signed-off-by: vivekku <vivekku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2756560 GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com>	2022-09-29 19:37:51 -07:00
ht	125cc72c39	gpu: nvgpu: Fix devg_nvgpu_igpu process crash-2. As part of the negative test case we replace the ACR binaries with corrupted one(by editing the binary in hex editor). The expectaion is that the process should log the error and exit properly but instead the process crashed. The root cause was because NVGPU driver was trying to pause the thread using nvgpu_nvs_worker_pause but the but NVS isn't initialized at that point. NVS is initialized after acr init. Mitigated this failure by adding a checking condition in nvgpu_nvs_worker_pause. Bug 3670576 Change-Id: Ibfe66b253be034e7ca2c3ed298dc28d27e1d6de9 Signed-off-by: ht <ht@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2782937 Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-29 15:06:46 -07:00
Sagar Kamble	b48892ea33	gpu: nvgpu: update l2 sector promotion logic L2 sector promotion setup in cfg2_vidmem and cfg3_sysmem registers was verified by comparing full register values after writing. However that fails as some of the bits like VIDMEM_SP2_256B_PROMOTE_ON_SECT0 in cfg2 and SYSMEM_PROMOTE_ENABLE, FETCH_PARTIAL_CATOM_32B in cfg3 are set on setting promotion. Just compare the promotions bits for L1 and T1 in the cfg registers. Bug 3634348 Change-Id: I53c0a0a7bbe776a000a386524759d7277a015054 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2779619 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-28 22:48:31 -07:00
vivekku	4315132e7d	gpu: nvgpu: nvs: fixed nvgpu buffer alloc null ptr Changes - initialized g inside sched which was throwing null pointer issue. JIRA NVGPU-8692 Change-Id: I3a278ecb87ce2c4933297e04ab68a7183f40c67b Signed-off-by: vivekku <vivekku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2767830 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-09-28 22:44:58 -07:00
Debarshi Dutta	fb8bfb90c3	gpu: nvgpu: allow custom header include stdint.h is not included as part of the kernel build file for linux resulting in build failures when using this header as it is. Modified this interface to remove the restriction for using <stdint.h>. Custom build environments can include their own correct header for type definitions Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Ida7c327a5ac4a5c7a0ed18f792a58a17dcbc36b2 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2767310 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-28 12:41:07 -07:00
Divya	44587840e2	gpu: nvgpu: Update the error code for tpc_pg_mask - nvpmodel service used to expect a return value of -ENODEV from the underlying tpc_pg_mask_store() when the golden image size was initialized. - With the current implementation, the return value is -EINVAL due to which write for new tpc_pg_mask was not successful. - Update the return value to -EBUSY for the case where golden image is already initialized. Bug 3765637 Change-Id: I5a1a38cce035ea245db5d72c9f5db210d3bb95f1 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2778855 (cherry picked from commit `1274f25dda`) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2780005 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: Yi-Wei Wang <yiweiw@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Tested-by: Yi-Wei Wang <yiweiw@nvidia.com>	2022-09-28 12:40:37 -07:00
Debarshi Dutta	667867a199	gpu: nvgpu: Resolve failed cond init. Following changes are added to fix the issue. 1) Threads having higher priority e.g. RT may preempt threads with sched-normal priority. As a consequence, higher priority threads might not still see initialization of data in another thread resulting in failures such as accessing a condition value before initialization. Any initialization in the parent thread must be accompanied by a barrier to make it visible in other thread. Added appropriate barriers to prevent reordering of the initialization in the thread construction path. 2) There is a race condition between nvgpu_cond_signal() and nvgpu_cond_destroy() in the asynchronous submit code and corresponding worker thread's process_item callback for NVS. This may lead to data corruption and resulting in the above errors as well. Fixed that by adding a refcount based mechanism for ownership sharing of the struct nvgpu_nvs_worker_item between the two threads. Bug 3778235 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Ie9b9ba57bc1dcbb8780801be79863adc39690f72 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2771535 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Ketan Patil <ketanp@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-27 23:25:55 -07:00
Divya	038005986e	gpu: nvgpu: ga10b: enable AELPG Enable AELPG supoort for ga10b JIRA NVGPU-7182 Change-Id: Ifcd9930cd4382b55fbcaecefa62c916649dc21a7 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2732015 (cherry picked from commit 64efb1067e1fd258397bf4ae0eeb164a0282b322) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2734634 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-26 15:59:13 -07:00
Mahantesh Kumbar	8c36750fd8	gpu: nvgpu: cleanup the seq for railgate seq - Perfmon cmds are non-blocking calls and response may/may-not come during railgate sequence for the perfmon command sent as part of nvgpu_pmu_destroy call. - if response is missed then payload allocated will not be freed and allocation info will be present as part seq data structure. - This will be carried forward for multiple railgate/ rail-ungate sequence and that will cause the memleak when new allocation request is made for same seq-id. - Cleanup the sequence data struct as part of nvgpu_pmu_destroy call by freeing the memory if cb_params is not NULL. Bug 3747586 Bug 3722721 Change-Id: I1a0f192197769acec12993ae575277e38c9ca9ca Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2763054 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Divya Singhatwaria <dsinghatwari@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Tested-by: Divya Singhatwaria <dsinghatwari@nvidia.com>	2022-09-21 01:08:54 -07:00
Dinesh T	dabf933944	gpu: nvgpu: Decrease the channel to 128 As the number of supported syncpoints is 128 in SAFETY config, this is decreasing the number of channels supported in SAFETY to 128. Bug 3644504 Change-Id: If62f0c5489e4ad83abbc0e5b9ed9d698ea97967f Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2773429 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-09-14 23:32:55 -07:00
prsethi	9fc3010344	gpu:nvgpu: Remove BUSY_IDLE_SUPPORT config flag gk20a_busy ref count is needed to track if there no one has taken ref and processing something to avoid any ongoing activity during suspend. Patch enables the ref count support by removing the config flag. Jira NVGPU-8506 Change-Id: Ic9389ad42be34e2357858ea79adf05f30e85efbf Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2769479 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Deepak Goyal <dgoyal@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-14 23:28:12 -07:00
Debarshi Dutta	50f95f789c	gpu: nvgpu: improvements to NVS code Fix the bug in NVS worker initialization code. Ensure main thread waits for NVS worker to start. Jira NVGPU-8619 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I2a719bad691099881f3ac4468d32f9e81ece3800 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2773376 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com>	2022-09-14 09:40:16 -07:00
mkumbar	71065d8613	gpu: nvgpu: FW load flag update -Move FW load flag settings before chips specific SW init in ACR unit init. This helps to alter the flag later based on the chip requirment if required. Bug 3765772 Change-Id: I4639d85c0d4ffce06a172acef42891011125b322 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2773059 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Mayur Poojary <mpoojary@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-14 09:39:00 -07:00
atanand	f43897c940	gpu: nvgpu: GA10X_NEXT pulling GR1 out of reset This patch is to enable GR1 before resetting GR0 which is not visible to the driver. Bug 3690950 Change-Id: I8a1907349f5a4354c6b7f95f9904b52738f51f00 Signed-off-by: atanand <atanand@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2758161 (cherry picked from commit 48d925cacf373a97dbdb031a109b83be3bfe2972) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2765635 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:05:01 -07:00
Sagar Kamble	cfc663a65d	gpu: nvgpu: add unit test to check class, veid and pbdma for channels Add unit test to validate the class, veid and pbdma assignment of the channels. Bug 3677982 Change-Id: I35fda0a35fec2939209d0e4380b0628f65ea774e Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2772062 Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:41 -07:00
Sagar Kamble	f1896e0a64	gpu: nvgpu: acquire tsg ctx_init_lock when changing ctx state GR context associated with channel is updated in various driver paths. Sequence to do the same is disable the TSG, preempt the TSG, update the GR context or instance block and then enable the TSG. These operations and runlist updates for channel have to be done under TSG specific ctx_init_lock to avoid the race. suspend_contexts and resume_contexts needs special handling which is not covered in this patch. Bug 3677982 Change-Id: I837257fe9d9ef3eb6f69f5d7e0707e0bb6d4ea72 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2720222 Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:36 -07:00
Sagar Kamble	ef99d9f010	gpu: nvgpu: implement scg, pbdma and cilp rules Only certain combination of channels of GFX/Compute object classes can be assigned to particular pbdma and/or VEID. CILP can be enabled only in certain configs. Implement checks for the configurations verified during alloc_obj_ctx and/or setting preemption mode. Bug 3677982 Change-Id: Ie7026cbb240819c1727b3736ed34044d7138d3cd Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2719995 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:30 -07:00
Sagar Kamble	06410ba862	gpu: nvgpu: add unit test to check subctx programming in inst blocks Add unit test to validate the subcontext programming in the channel instance blocks on creating and closing the channels. Bug 3677982 Change-Id: I82cdc7d2f341381b2a143f300238f6390cfe3114 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2771035 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:25 -07:00
Sagar Kamble	693305c0fd	gpu: nvgpu: subcontext add/remove support Subcontext PDBs and valid mask in the instance blocks of the channels in various subcontexts has to be updated when new subcontext is created or a subcontext is removed. Replayable fault state is cached in the channel structure. Replayable fault state for subcontext is set based on first channel's bind parameter. It was earlier programmed in function channel_setup_ramfc. init_inst_block_core is updated to setup TSG level pdb map and mask. Added new hal gv11b_channel_bind to enable the subcontext on channel bind. Bug 3677982 Change-Id: I58156c5b3ab6309b6a4b8e72b0e798d6a39c1bee Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2719994 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:20 -07:00
Sagar Kamble	269e853fc5	gpu: nvgpu: add unit test to check gr ctx buffer mappings for multi as Add unit test to validate the gr ctx buffer mappings when subcontext channels are created with multiple address spaces. Bug 3677982 Change-Id: I369c2e7099bfb41d92d8e63ece27cc56fd2da420 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2771034 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:15 -07:00

1 2 3 4 5 ...

9661 Commits