linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Debarshi Dutta	42beb7f4db	gpu: nvgpu: simplify the runlist update sequence Following changes are added here to simplify the overall sequence. 1) Remove deferred update for runlists. NVS worker thread shall submit the updated runlist. 2) Moved Runlist mem swap inside update itself. Protect the swap() and hw_submit() path with a spinlock. This is temporary till GSP. 3) Enable Control-Fifo mode from nvgpu driver. Jira NVGPU-8609 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Icc52e5d8ccec9d3653c9bc1cf40400fc01a08fde Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2757406 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-08-20 23:33:45 -07:00
Debarshi Dutta	1d4b7b1c5d	gpu: nvgpu: modify priority of NVS worker thread In linux threaded interrupts run with a Realtime priority of 50. This bumps up the priority of bottom-half handlers over regular kernel/User threads even during process context. In the current implementation scheduler thread still runs in normal kernel thread priority. In order to allow a seamless scheduling experience, the worker thread is now created with a Realtime priority of 1. This allows for the Worker thread to work at a priority lower than interrupt handlers but higher than the regular kernel threads. Linux kernel allows setting priority with the help of sched_set_fifo() API. Only two modes are supported i.e. sched_set_fifo() and sched_set_fifo_low(). For more reference, refer to this article https://lwn.net/Articles/818388/. Added an implementation of nvgpu_thread_create_priority() for linux thread using the above two APIs. Jira NVGPU-860 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I0a5a611bf0e0a5b9bb51354c6ff0a99e42e76e2f Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2751736 Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-08-20 23:33:34 -07:00
Debarshi Dutta	13699c4c15	gpu: nvgpu: ensure worker thread is disabled during rg A previous commit ID `44b6bfbc1` added a hack to prevent the worker thread from calling nvgpu_runlist_tick() in post_process if the next domain matches the previous. This could potentially still face issues with multi-domains in future. A better way is to synchronize the thread to suspend/resume alongwith the device's common OS agnostic suspend/resume operations. This shall emulate the GSP as well. This shall also take care of the power constraints i.e. the worker thread can be expected to always work with the power enabled and thus we can get rid of the complex gk20a_busy() lock here for good. Implemented a state-machine based approach for suspending/ resuming the NVS worker thread from the existing callbacks. Remove support for NVS worker thread creation for VGPU. hw_submit method is currently set to NULL for VGPU. VGPU instead submits its updates via the runlist.reload() method. Jira NVGPU-8609 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I51a20669e02bf6328dfe5baa122d5bfb75862ea2 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2750403 Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-08-20 23:33:29 -07:00
Debarshi Dutta	21ab579341	gpu: nvgpu: don't skip setting same clk in arbiter In the current setting, clock arbiter skips setting the clock if its already set previously. The value set by the arbiter is stored in "struct nvgpu_clk_arb->actual" whenever the clock is updated via the arbiter. However, DVFS might also update the clock and the updates are not synchronized with the arbiter. Hence, ensure that any clock requests are always updated i.e. the requested rate is set even if the previous rate remains the same. In the devfreq scale() part, scale emc when clk_arb is active and skip setting of clocks. Bug 3666615 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I32bf4dbf81b19fdd6fa0bdec3a6c9a9312b78eca Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2727787 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-08-19 21:28:46 -07:00
Prasun Kumar	4194c35e17	gpu: nvgpu: Err injection utility support Update callback registration with error inject utility. Add callback de-registration with error injection utility when CIC_MON is removed. Bug 3413214 Signed-off-by: Prasun Kumar <prasunk@nvidia.com> Change-Id: Iab682cd522a96fd6af136485c4f3b73f81f723b8 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2755178 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-08-17 19:22:17 -07:00
Dinesh T	4e78d478c3	gpu: nvgpu: Add mutex lock to synchronise isr threads This is adding a mutex lock to synchronise between stall isr threads. Orin(t234) has three interrupt lines and three ISR threads to handle bottom half of the ISR. The threads sharing same data between them without proper synchronization. When multiple interrupts trigger simeltaneously, causing the threads running in parallell like below traces #0 nvgpu_cic_mon_intr_stall_isr (g=g@entry=0x5ed62a9318) at /home/dt/automotive-dev-main-20220802T015100095/kernel/nvgpu/drivers/gpu/nvgpu/common/cic/mon/mon_intr.c:158 #1 0x00000013758cae30 in nvgpu_intr_stall (arg=0x5ed62a9120) at /home/dt/automotive-dev-main-20220802T015100095/qnx/src/resmgrs/nvrm/nvgpu_rmos/os/intr.c:140 #2 0x00000013758ec090 in nvgpu_posix_thread_wrapper (data=<optimized out>) at /home/dt/automotive-dev-main-20220802T015100095/kernel/nvgpu/drivers/gpu/nvgpu/os/posix/thread.c:77 #3 0x0000001375b01000 in pthread_attr_setdetachstate () from /home/dt/automotive-dev-main-20220802T015100095/out/embedded-qnx-t186ref-debug-none/target_rootfs/lib/libc.so.5 Backtrace stopped: previous frame identical to this frame (corrupt stack?) This is causing some race in shared data access and causing multiple issues. Bug 3647988 Change-Id: If40e581635b52cce288d8f4b00af6a040f7f9a6e Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2755874 Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-08-13 05:18:56 -07:00
Dinesh T	68976fbd22	gpu: nvgpu: gv11b+: set live pes mask This change is reading the live pes from the register "gr_gpc0_gpm_pd_live_physical_pes_r" and set it to "gr_gpc0_swdx_pes_mask_r". Every PES needs at least a TPC to work. If any of the TPCs are floorswept,the live PES mask is read from "gr_gpc0_gpm_pd_live_physical_pes_r" and the corresponding active PES mask is updated in "gr_gpc0_swdx_pes_mask_r". Bug 3677421 Change-Id: I899ac41c4a82beb3ce75c84ad57dcad262a49ba1 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2736560 (cherry picked from commit 85f2ceb3db6eeef925b49553f445d8cc31ec39da) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2759135 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-08-12 11:05:35 -07:00
Sagar Kamble	7085934303	gpu: nvgpu: update channel TSG unbind fail paths On tsg.unbind_channel hal failure, channel teardown was being done again that was done already as part of the function nvgpu_tsg_unbind_channel_common. Just abort the TSG and return err in that case. Also, decrement the TSG ch_count in the fail path. Bug 3677982 Change-Id: I466f2b2c693d43ed64dc531b08bf740bf00f28a6 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2749970 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com>	2022-08-10 15:42:34 -07:00
Dinesh T	b1d7c77d8e	gpu: nvgpu: Create unique share id This is fixing a race in address space identifier creation by making atomic variable increment. Bug 3684734 Change-Id: I864e8f61257569e35f926822c2a5260532d41360 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2742206 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Richard Zhao <rizhao@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-08-09 21:56:46 -07:00
atanand	eae4593343	gpu: nvgpu: add ioctl to configure implicit ERRBAR Add ioctl support to configure implicit ERRBAR by setting/unsetting NV_PGRAPH_PRI_GPCS_TPCS_SM_SCH_MACRO_SCHED register. Add gpu characteritics flag: NVGPU_SCHED_EXIT_WAIT_FOR_ERRBAR_SUPPORTED to allow userspace driver to determine if implicit ERRBAR ioctl is supported. Bug: 200782861 Change-Id: I530a4cf73bc5c844e8d73094d3e23949568fe335 Signed-off-by: atanand <atanand@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2718672 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-08-05 23:10:18 -07:00
Debarshi Dutta	3d95f2b803	gpu: nvgpu: add support for event queue observers Add support for accessing Event Queue for non-exclusive users. Allows, non-exclusive users to open Event Queues before exclusive users. Non-Exclusive users can only use the Event Queue in a read-only mode. Add VM_SHARED for Event Queues across all users instead of just Read-Only users. Event queues are shared with multiple processes and as such require VM_SHARED across all users(exclusive and observers). Jira NVGPU-8608 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Id9733c2511ded6f06dd9feea880005bdc92e51a0 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2745083 Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-29 00:04:51 -07:00
Richard Zhao	1f9fbc85fe	gpu: nvgpu: add nvgpu_hvrtos build - nvgpu_hvrtos disabled bellow configs: - CONFIG_NVGPU_IVM_BUILD - CONFIG_NVGPU_TRACE - CONFIG_NVGPU_SYSFS - CONFIG_NVGPU_DGPU - CONFIG_NVGPU_IGPU_VIRT - CONFIG_NVGPU_NVLINK - CONFIG_NVGPU_CLK_ARB - CONFIG_NVGPU_MIG - nvgpu_hvrtos re-uses posix bitmap.c - add nvgpu_hvrtos specific headers - add static check of vgpu ivc frame - fix build errors caused by new CFLAGS Jira GVSCI-9976 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I5f65fda9444d0cbfe6008ac4ab8262b44d1a4f56 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2653745 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-28 23:59:10 -07:00
Debarshi Dutta	44b6bfbc1d	gpu: nvgpu: skip post_process() operations for single domain. There is a possible deadlock that gets triggered when device is being resumed() and NVS worker thread tries to submit the data as part of the post_process() operation. The NVS worker thread works asynchronously in the post_process() part w.r.t the USER threads and thus an initial implementation requires acquiring the busy lock() arriving at a deadlock scenario. This quick change shall disallow post_process() from executing during the case where we have only one scheduling domain present(legacy) Any submits meant to be updated are handled via the synchronous wakeup_process_item() callback. This implementation is being modified to allow the worker thread to be suspended/resumed during GPU railgate/unrailgate in upcoming releases and currently is in a state of flux. Bug 3723127 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I318cda0fbdd5651884cf21f748c86687679e6fdb Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2750293 Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-25 18:41:43 -07:00
Divya	ee5053f7be	gpu: nvgpu: ga10b: Add new RPC for AELPG - Add AP_INIT RPC to initialize the AELPG feature. - Add AP_CTRL_INIT_AND_ENABLE RPC to program some APCTRL values for Adaptive ELPG. - Add AP_CTRL_ENABLE and AP_CTRL_DISABLE RPCs to send AELPG enable/disable request to PMU via sysfs node. - Re-structure the rpc handler based on PG_LOADING and PG unit id. This is needed to handle different types of new RPCs from PMU. JIRA NVGPU-7182 Change-Id: If00b00730507f17ff1883a67094f7e16da5b81ea Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2728286 (cherry picked from commit fffb58703bd718600e8c983dcd1c81d9abe83802) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2603161 Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-19 21:51:15 -07:00
Sagar Kamble	3fb2a2e209	gpu: nvgpu: track gr_ctx init state On successful obj_ctx allocation, set ctx_initialized member in gr_ctx to true and when it is true then only invoke free_gr_ctx. With this we can get rid of tsg->vm check while calling free_gr_ctx. tsg->vm will go away with multiple address spaces support in TSG. Bug 3677982 Change-Id: I4a64842411ce4ab157010808e4e8e4d5cd254a7f Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2746803 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-19 10:32:35 -07:00
shashank singh	b1802085f2	gpu: nvgpu: reduce CCM for "nvgpu_rc_pbdma_fault" - This API has CCM of 12, it should be <= 10. JIRA NVGPU-7057 Change-Id: I42f1972c29aaa179760a4fb7c6d1f7112456520e Signed-off-by: shashank singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2602678 (cherry picked from commit c601f8b6bb07a504e01b57c42c1964b885744cbf) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2745517 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-18 11:07:46 -07:00
Debarshi Dutta	1bdca92c50	gpu: nvgpu: modify rl_domain member KMD needs to send the domain id and GPU_VA corresponding to the struct runlist_domains to GSP. In the current implementation, struct nvgpu_runlist_domain contains the domain name instead of domain id. This requires an additional search by name everytime an update is needed to be submitted to the GSP. Modify the struct nvgpu_runlist_domain to store domain id instead of domain name. This simplifies the flow and avoids unnecessary search. Removed the conditional check for existence of shadow domain as its a deadcode. Shadow Domain is not searchable in the list of domains inside the struct nvgpu_runlist. Jira NVGPU-8610 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I0d67cfa93d89186240290e933aa750702b14f4f0 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2744890 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-15 15:15:30 -07:00
rmylavarapu	00fd453830	gpu: nvgpu: pmu: pmgr: fix taint parameter using nvgpu_memcpy() instead of downcasting the parameter to assign. CID 10074974 Change-Id: I5ce333735dca5fc4e34dd42c9fca432223989bf2 Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2744418 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-15 15:14:50 -07:00
Sagar Kamble	d75473a115	gpu: nvgpu: fix unit test traceability issues Some of the functions with no traceability to unit tests are already covered by callee API functions. Skip these functions in SWVR by skipping doxygen for them. Some of the functions are non-fusa like those in profile.h and bsearch.h. Those were included as the header was included in Doxygen sources. Mark then non-safe. Some of the nvgpu functions were not added to Targets entries for respective tests. Fix those. JIRA NVGPU-7211 Change-Id: Iacf22dccdd9340100cf93814566d3979734c455d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2612982 (cherry picked from commit a40f62654747102cc8ef53ddbd9f953c21c2b745) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2737672 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-15 07:15:34 -07:00
rmylavarapu	bd5ade293d	gpu: nvgpu: gsp: msg: remove event handle support Changes: - Currently we are not using any event handling between NVGPU and GSP. So removing the event handler function and directly calling response handler. - This change will fix the respective coverity issue. CID 10138010 Change-Id: I12ddb0012ad9f54bf99f707ab2123598065561d8 Signed-off-by: rmylavarapu <rmylavarapu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2732628 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-15 07:12:52 -07:00
Sagar Kamble	f95cb5f4f8	gpu: nvgpu: maintain ctx buffers mappings separately from ctx mems In order to maintain separate mappings of GR TSG and global context buffers for different subcontexts, we need to separate the memory struct and the mapping struct for the buffers. This patch moves the mappings of all GR ctx buffers to new structure nvgpu_gr_ctx_mappings. This will be instantiated per subcontext in the upcoming patches. Summary of changes: 1. Various context buffers were allocated and mapped separately. All TSG context buffers are now stored in gr_ctx->mem[] array since allocation and mapping is unified for them. 2. Mapping/unmapping and querying the GPU VA of the context buffers is now handled in ctx_mappings unit. Structure nvgpu_gr_ctx_mappings in nvgpu_gr_ctx holds the maps. On ALLOC_OBJ_CTX this struct is instantiated and deleted on free_gr_ctx. 3. Introduce mapping flags for TSG and global context buffers. This is to map different buffers with different caching attribute. Map all buffers as cacheable except PRIV_ACCESS_MAP, RTV_CIRCULAR_BUFFER, FECS_TRACE, GR CTX and PATCH ctx buffers. Map all buffers as privileged. 4. Wherever VM or GPU VA is passed in the obj_ctx allocation functions, they are now replaced by nvgpu_gr_ctx_mappings. 5. free_gr_ctx API need not accept the VM as mappings struct will hold the VM. mappings struct will be kept in gr_ctx. 6. Move preemption buffers allocation logic out of nvgpu_gr_obj_ctx_set_graphics_preemption_mode. 7. set_preemption_mode and gr_gk20a_update_hwpm_ctxsw_mode functions need update to ensure buffers are allocated and mapped. 8. Keep the unit tests and documentation updated. With these changes there is clear seggregation of allocation and mapping of GR context buffers. This will simplify further change to add multiple address spaces support. With multiple address spaces in a TSG, subcontexts created after first subcontext just need to map the buffers. Bug 3677982 Change-Id: I3cd5f1311dd85aad1cf547da8fa45293fb7a7cb3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2712222 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-15 07:10:11 -07:00
Sagar Kamble	931e5f8220	gpu: nvgpu: update gr_ctx patch and pm setup functions set_patch_addr parameter to nvgpu_gr_ctx_set_patch_ctx was redundant. Remove it. Prepare new functions nvgpu_gr_ctx_set_hwpm_pm_mode to set PM mode, nvgpu_gr_ctx_set_hwpm_ptr to set PM ptr in gr_ctx. Rename subctx function to nvgpu_gr_subctx_set_hwpm_ptr. This simplifies the logic in gr_gk20a_update_hwpm_ctxsw_mode to set the PM mode and PM ptr. Channel loop is needed only for subcontexts. Bug 3677982 Change-Id: I44acb09f6296ba8d510e278910188864f39e7157 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2743724 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-15 07:10:00 -07:00
Debarshi Dutta	7a956cf5a2	gpu: nvgpu: implement domain scheduler characteristics ioctl Added the NVGPU_GPU_QUERY_CTRL_FIFO_SCHEDULER_CHARACTERISTICS ioctl as part of the ctrl device node. Jira NVGPU-8129 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I651bd1958b6a27dc17687dee663bb93c2f807b68 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2723871 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-15 07:08:37 -07:00
Debarshi Dutta	e7f9de6567	gpu: nvgpu: add control-fifo queues Added implementation for following IOCTLs NVGPU_NVS_CTRL_FIFO_CREATE_QUEUE NVGPU_NVS_CTRL_FIFO_RELEASE_QUEUE The above ioctls are supported only for users with R/W permissions. 1) NVGPU_NVS_CTRL_FIFO_CREATE_QUEUE constructs a memory region via the nvgpu_dma_alloc_sys() API and creates the corresponding GPU and kernel mappings. Upon successful creation, KMD exports this buffer to the userspace via a dmabuf fd that the UMD can use to mmap it into its process address space. 2) Added plumbing to store VMA's corresponding to different users for event queue in future. 3) Added necessary validation checks for the IOCTLs 4) NVGPU_NVS_CTRL_FIFO_RELEASE_QUEUE is used to clear the queues. 5) Using a global queue lock to protect access to the queues. This could be modified to be more fine-grained in future when there is more clarity on GSP's implementation and access of queues. 6) Added plumbing to enable user subscription to queues. NVGPU_NVS_CTRL_FIFO_RELEASE_QUEUE is used to unsubscribe the user from the queue. Once, the last user is deleted, all the queues will be cleared. User must ensure that any mappings are removed before calling release queue. 7) Set the default queue_size for event queues to PAGE_SIZE. This can be modified later. For event queues, UMD shall fetch the queue_size. Jira NVGPU-8129 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I31633174e960ec6feb77caede9d143b3b3c145d7 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2723198 Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-15 07:08:32 -07:00
Debarshi Dutta	62c03dfaef	gpu: nvgpu: add support for nvs control_fifo Add a device node for management of nvs control fifo buffers for scheduling domains. The current design consists of a master structure struct nvgpu_nvs_domain_sched_ctrl for management of users as well as control queues. Initially all users are added as non-exclusive users. Subsequent changes will add support for IOCTLS to manage opening of Send/Receive and Event buffers, querying characteristics etc. In subsequent changes, a user that tries to open a Send/Receive queue will first try to reserve itself as an exclusive user and only if that succeeds can proceed with creation of both Send/Receive queues. Exclusive users will be reset to non-exclusive users just before they close their device node handle. Jira NVGPU-8128 Change-Id: I15a83f70cd49c685510a9fd5ea4476ebb3544378 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2691404 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-15 07:08:22 -07:00
Sagar Kamble	04587333ca	gpu: nvgpu: fix MISRA Rule 10.3 and 10.4 violations BVEC changes for nvgpu_rc_pbdma_fault and nvgpu_rc_mmu_fault started reporting below MISRA issue. kernel/nvgpu/drivers/gpu/nvgpu/common/fifo/tsg.c:522: 1. misra_c_2012_rule_10_4_violation: Essential type of the left hand operand "error_notifier" (unsigned) is not the same as that of the right operand "NVGPU_ERR_NOTIFIER_INVAL"(enum). kernel/nvgpu/drivers/gpu/nvgpu/common/fifo/tsg.c:541: 1. misra_c_2012_rule_10_3_violation: Implicit conversion of "NVGPU_ERR_NOTIFIER_FIFO_ERROR_MMU_ERR_FLT" from essential type "anonymous enum" to different or narrower essential type "unsigned 32-bit int". Change the enum nvgpu_err_notif values to u32 values declared using the #define macro. JIRA NVGPU-6772 Change-Id: Icac7f567cea52cde07ca200b21eb3e7dd2b9e645 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2584153 (cherry picked from commit 2f073f341bd55242c857c6c6d35d6015495025e2) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623634 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-14 08:58:42 -07:00
Sagar Kamble	bcbccbe083	gpu: nvgpu: add BVEC test for nvgpu_rc_mmu_fault Update nvgpu_rc_mmu_fault to return error on invalid params and add BVEC test for it. JIRA NVGPU-6772 Change-Id: If44d80888c665ca3b528c9937de8a66ccce29f57 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2551618 (cherry picked from commit 229727512a1facc33ef9f16cc1831405e960ab2a) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623626 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-14 08:58:36 -07:00
Sagar Kamble	80efe558b1	gpu: nvgpu: add BVEC test for nvgpu_rc_pbdma_fault Update nvgpu_rc_pbdma_fault with invalid checks and add BVEC test for it. Make ga10b_fifo_pbdma_isr static. NVGPU-6772 Change-Id: I5485760c53e1fff1278557a5b25659a1fc0e4eaf Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2551617 (cherry picked from commit e917042d395d07cb902580bad3d5a7d0096cc303) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623625 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-14 08:58:31 -07:00
Debarshi Dutta	d8e8eb65d3	nvgpu: gpu: separate runlist submit from construction This patch primary separates runlist modification from runlist submits. Instead of submitting the runlist(domain) immediately after modification, a worker thread interface is now being used to synchronously schedule runlist submits. If the runlist being scheduled is currently active, the submit happens instantly, otherwise, it will happen in the next iteration when the nvs thread will schedule the domain. This external interface uses a condition variable to wait for the completion of the synchronous submits. A pending_update variable is used to synchronize domain memory swaps just before being submitted. To facilitate faster scheduling via the NVS thread, nvgpu_dom itself contains an array of rl_domain pointers. This can then be used to select the appropriate rl_domain directly for scheduling as against the earlier approach of maintaining nvs domains and rl domains in sync everytime. Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I1725c7cf56407cca2e3d2589833d1c0b66a7ad7b Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2739795 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-07-13 16:36:19 -07:00
Dinesh T	fb466b5b25	gpu: nvgpu: Enable ptimer This is enabling ptimer in mme_config and mme_fe1_config by setting the corresponding field. Debugger is expected to make use of ptimer. So this is required for nvgpu to enable ptimer in the register. Bug 3637441 Change-Id: Id596a87081753bcaf945e54444a8abbd025b3f76 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2710632 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-07 07:30:52 -07:00
Scott Long	ac4d8b9bff	gpu: nvgpu: fix remap page size flag handling When destroying a virtual memory pool the associated page size must be set in the nvgpu_vm_remap_op structure. This patch adds a new nvgpu_vm_remap_page_size_flag() routine that converts the page size derived from the vm/vm_area structs to the corresponding NVGPU_VM_REMAP_OP_FLAGS_PAGESIZE bit. Bug 3669908 Change-Id: Idca77cc36d353777b399c872f68a1f5231ddb8dd Signed-off-by: Scott Long <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2734822 Tested-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit (cherry picked from commit `868b723b16`) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2740035 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com>	2022-07-07 01:25:25 -07:00
Ramesh Mylavarapu	951ad46819	gpu: nvgpu: gsp: sched: domain management apis Changes: - Added Domain management APIs with interfaces to communicate with GSP scheduler. - Domain creation shall be done inside NVGPU and respective Domain and runlist info are sent to GSP for scheduling. Design: https://confluence.nvidia.com/display/TGS/GSP+Scheduler+Interface+Specifications NVGPU-7371 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: Icba7f1ed3b9b2f409aac346084dd9a123c9d3779 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2682686 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-05 14:27:00 -07:00
Tejal Kudav	494dc19ee8	gpu: nvgpu: Err injection utility support The HSI error injection utility is an on-bench debug and test utility which can be used by customers and SQA to test end-to-end error detection and reporting path. Inplement callback function to integrate with this utility and allow injecting GPU HSI related errors. As part of callback function hsierrrpt_inj(), invoke the driver's error-reporting logic which uses the EPD MISC_EC APIs. In future, we can enhance the callback function to trigger driver's error handling logic incrementally for different errors. Bug 3413214 Change-Id: I2d050b6c850d6151b40095f243a6733b4ba74f47 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2727198 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-01 08:11:45 -07:00
Sagar Kamble	5b55088970	gpu: nvgpu: skip subctx pdb init during as-channel bind While creating a new channel, ioctls are called in the below sequence: 1. GPU_IOCTL_OPEN_CHANNEL 2. AS_IOCTL_BIND_CHANNEL 3. TSG_IOCTL_BIND_CHANNEL_EX 4. CHANNEL_ALLOC_GPFIFO_EX 5. CHANNEL_ALLOC_OBJ_CTX. subctx pdbs and valid mask are programmed in the channel instance block in the channel ioctls AS_IOCTL_BIND_CHANNEL & CHANNEL_ALLOC_GPFIFO_EX. Programming them in the ioctl AS_IOCTL_BIND_CHANNEL is redundant. Remove related hal g->ops.mm.init_inst_block_for_subctxs. The hal init_inst_block will program context pdb and big page size. The hal init_inst_block_core will program context pdb, big page size and subctx 0 pdb. This is used by h/w units (fecs, pmu, hwpm, bar1, bar2, sec2, gsp, perfbuf etc.). For user channels, subctx pdbs are programmed as part of ramfc setup. Bug 3677982 Change-Id: I6656b002d513404c1fd7c3d349933e80cca7e604 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2680907 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-06-28 23:33:31 -07:00
Sagar Kamble	bfa20f62c6	gpu: nvgpu: add/remove l2 cache flush when updating the ctx buffers gr ctx buffer in non-cacheable hence there is no need to do L2 cache flush when updating the buffer. Remove the flushes. pm ctx buffer is cacheable hence add l2 flush in the function nvgpu_profiler_quiesce_hwpm_streamout_non_resident since it updates the buffer. Bug 3677982 Change-Id: I0c15ec7a7f8fa250af1d25891122acc24443a872 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2713916 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-06-24 12:08:54 -07:00
Sagar Kamble	65e7baf856	gpu: nvgpu: s/NVGPU_GR_CTX__VA/NVGPU_GR_GLOBAL_CTX__VA Indices for global ctx buffer virtual address array were named with prefix GR_CTX and defined in ctx.h. Prefix those with GR_GLOBAL_CTX and move to global_ctx.h Also remove the flag global_ctx_buffer_mapped as it is not used. Bug 3677982 Change-Id: I9042e1c2bd8e8e10e97893484daeff0f97a96ea0 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2704855 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-06-24 12:08:33 -07:00
Sagar Kamble	7fa6976a98	gpu: nvgpu: remove dead code nvgpu_gr_subctx_set_patch_ctx was earlier used in the HAL gops.gr.ctx_patch_smpc. Usage was removed since that HAL applies to only gm20b that doesn't support subcontexts. Remove that function. gp10b_gr_init_commit_global_attrib_cb is also not used by any chip, so remove that also. Bug 3677982 Change-Id: Ief1c1a4038d3eed1cba3a71d83a2a438158f15f3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2704854 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-06-24 12:08:20 -07:00
vivekku	1116d90d32	gpu: nvgpu: gsp: enable gsp scheduler debug prints Changes: - created gsp debug info mask enabled with GSP flag. - defined a macro to display gsp debug info instead of using nvgpu_log_fn. - replaced nvgpu_log_fn with gsp_dbg_info inside gsp_scheduler. NVGPU-8529 Change-Id: I98f0e470d7f056958a64579fa64c76de5691aefb Signed-off-by: vivekku <vivekku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2727812 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-06-17 02:00:58 -07:00
Sagar Kadamati	fdba1eef10	gpu: nvgpu: add FLCG support for PERFMON Add FLCG register programming for PERFMON Jira NVGPU-7228 Change-Id: Ia1b3b2976c65c44f718789bcfbef4cad7e0718b3 Signed-off-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2712095 Tested-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-06-15 04:25:56 -07:00
Jinesh Parakh	b8b90f85ee	gpu: nvgpu: Fix CERT-C Violations Fix the following CERT-C Violations: gsp_runlist.c : CERT EXP34-C channel_sync_syncpt.c : CERT ERR33-C grmgr_ga100.c : CERT ERR33-C grmgr_ga10b.c : CERT ERR33-C debug.c : CERT ERR33-C debug_fecs_trace.c : CERT EXP34-C ioctl.c : CERT ERR33-C CID 495110 CID 141061 CID 222881 CID 222890 CID 450994 CID 366644 CID 466529 Bug 3512546 Signed-off-by: Jinesh Parakh <jparakh@nvidia.com> Change-Id: I318a27a6fcb8ea8f6d5d6c1f65d940c48d6f8dfc Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2723008 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-06-03 12:05:26 -07:00
Jinesh Parakh	658f83ca48	gpu: nvgpu: Fix Explicit null dereference Fix the following Coverity Defect: pwrpolicy.c : Explicit null dereference CID 10059138 Bug 3460991 Signed-off-by: Jinesh Parakh <jparakh@nvidia.com> Change-Id: Ie572e0608d0b07d5023e7cca878d16087cfc284f Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2717978 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-30 12:49:04 -07:00
prsethi	697215afd3	gpu: nvpgu: configure static ZBC table Patch defines a ZBC static table and configure it at sw layer. Later existing API read this sw configuration and program it to hw. This is applicable only for ga10b safety build and for other chips/ configuration it will be supported in the legacy way. Bug 3585766 Change-Id: I00d79162c0b096616e3f555da965e82e47c014d1 Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2713821 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-05-29 10:56:58 -07:00
Krishna Reddy	961925be02	Revert "gpu: nvgpu: correct usage for gk20a_busy_noresume" This reverts commit `c1ea9e3955`. Reason for revert: ap_vulkan, ap_opengles, ap_mods tests failures Bug 3661058 Bug 3661080 Bug 3659004 Change-Id: I929b5675a4fb0ddc8cbf3eeefc982b4ba04ddc59 Signed-off-by: Krishna Reddy <vdumpa@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2718996 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com>	2022-05-27 14:49:26 -07:00
Jinesh Parakh	bb73cf9597	gpu: nvgpu: Fixed out-of-bounds Coverity Defects Fix following Coverity Defects: clk_mon_tu104.c : Out-of-bounds read and Out-of-bounds access CID 10061400 CID 10061401 Bug 3460991 Changed the datatype of domain_mask from u32 to unsigned long to solve the out-of-bounds defect. Signed-off-by: Jinesh Parakh <jparakh@nvidia.com> Change-Id: I1c43bd90053264ee4104ca8c3a33d9ea07f04045 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2708765 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-05-25 11:44:59 -07:00
Debarshi Dutta	c1ea9e3955	gpu: nvgpu: correct usage for gk20a_busy_noresume Background: In case of a deferred suspend implemented by gk20a_idle, the device waits for a delay before suspending and invoking power gating callbacks. This helps minimize resume latency for any resume calls(gk20a_busy) that occur before the delay. Now, some APIs spread across the driver requires that if the device is powered on, then they can proceed with register writes, but if its powered off, then it must return. Examples of such APIs include l2_flush, fb_flush and even nvs_thread. We have relied on some hacks to ensure the device is kept powered on to prevent any such delayed suspension to proceed. However, this still raced for some calls like ioctl l2_flush, so gk20a_busy() was added (Refer to commit Id dd341e7ecbaf65843cb8059f9d57a8be58952f63) Upstream linux kernel has introduced the API pm_runtime_get_if_active specifically to handle the corner case for locking the state during the event of a deferred suspend. According to the Linux kernel docs, invoking the API with ign_usage_count parameter set to true, prevents an incoming suspend if it has not already suspended. With this, there is no longer a need to check whether nvgpu_is_powered_off(). Changed the behavior of gk20a_busy_noresume() to return bool. It returns true, iff it managed to prevent an imminent suspend, else returns false. For cases where PM runtime is disabled, the code follows the existing implementation. Added missing gk20a_busy_noresume() calls to tlb_invalidate. Also, moved gk20a_pm_deinit to after nvgpu_quiesce() in the module removal path. This is done to prevent regs access after registers are locked out at the end of nvgpu_quiesce. This can happen as some free function calls post quiesce might still have l2_flush, fb_flush deep inside their stack, hence invoke gk20a_pm_deinit to disable pm_runtime immediately after quiesce. Kept the legacy implementation same for VGPU and older kernels Jira NVGPU-8487 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I972f9afe577b670c44fc09e3177a5ce8a44ca338 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2715654 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-25 04:59:46 -07:00
Sagar Kamble	a0b0acad05	gpu: nvgpu: pass pmu rpc struct as char pointer nvgpu_pmu_rpc_execute takes pmu rpc header address and dereferences it at address past header based on rpc struct that the header is part of. This usage of pointer is not right and confuses CERT checker. Instead, pass the rpc struct address as char pointer and use as header or rpc struct as per need. CID 17141 CID 154223 CID 17557 CID 154226 CID 153904 CID 153926 CID 153929 CID 153925 CID 153925 CID 225346 CID 225355 CID 225356 CID 225360 CID 225361 CID 225365 CID 225367 CID 296735 CID 330244 CID 17557 Bug 3512546 Change-Id: I93b154d4321e75c0d2b41f43d7c2b701682962a3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2710224 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-24 04:43:35 -07:00
mkumbar	5339bd3466	gpu: nvgpu: Add extra delay for ACR commands in non-silicon platforms Increase delay for non-silicon platforms between ACR commands and before polling to skip incorrect reading of IRQSTAT register and generate false PMU external interrupt. Bug 3596273 Change-Id: I0163cddbaa1919ac949467f65c74e06f85817aec Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2699396 Reviewed-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Tested-by: Divya Singhatwaria <dsinghatwari@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-20 10:36:04 -07:00
Dinesh T	6e4c3275bf	gpu: nvgpu: Set max_ways_evict_cache to maximum This is setting evict_max_ways for L2 cache to the maximum supported value for safety. In normal build L2 cache MAX_EVICT_LAST is configure via KMD and RegOps. RegOps is enabled only on standard build with CONFIG_DEBUGGER flag. This method we cant use it for safety build. Safety we can make use of the patch buffer to patch the register while creating the context. JIRA NVGPU-8227 Change-Id: Iec5d73197239b9cad31c6b593ca2b87c224aad5e Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2708702 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-05-18 22:57:54 -07:00
Richard Zhao	802aadf263	nvgpu: move nvgpu_falcon_copy_from/to_emem out of CONFIG_NVGPU_DGPU nvgpu_falcon_copy_from/to_emem are also used by iGPU in engine_emem_queue. Jira GVSCI-9976 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ia36a38521807714eb5ad52b6e81c9f31ecc8fda6 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2708509 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-18 00:59:10 -07:00
Richard Zhao	db4a1713cb	gpu: nvgpu: gr: move .load_sw_bundle64() out of CONFIG_NVGPU_DGPU .load_sw_bundle64 is also used by ga10b. Jira GVSCI-9976 Change-Id: Ife46dd5bf40a9e143cf119a64dd0d2adcb1ae81c Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2708393 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-18 00:58:54 -07:00

1 2 3 4 5 ...

3269 Commits