linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 10:34:43 +03:00

Author	SHA1	Message	Date
Lakshmanan M	7d473f4dcc	gpu: nvgpu: Expose logical mask for MIG 1) Expose logical mask instead of physical mask when MIG is enabled. For legacy, NvGpu exposes the physical mask. 2) Added fb related info in struct nvgpu_gpu_instance(). 3) Added utility api to get the logical id for a given local id nvgpu_grmgr_get_gr_gpc_logical_id() 4) Added grmgr api to get max_gpc_count nvgpu_grmgr_get_max_gpc_count(). 5) Added grmgr's fbp api to get num_fbps and its enable masks. nvgpu_grmgr_get_num_fbps() nvgpu_grmgr_get_fbp_en_mask() nvgpu_grmgr_get_fbp_rop_l2_en_mask() 6) Used grmgr's fbp apis in ioctl_ctrl.c 7) Moved fbp_init_support() in nvgpu_early_init() 8) Added nvgpu_assert handling in grmgr.c 9) Added vgpu hal for get_max_gpc_count(). JIRA NVGPU-5656 Change-Id: I90ac2ad99be608001e7d5d754f6242ad26c70cdb Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538508 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-10 03:05:21 -07:00
Richard Zhao	e2d8bdc38d	gpu: nvgpu: unify nvgpu_get_gpfifo_entry_size moved nvgpu_get_gpfifo_entry_size implementation to common code. Jira GVSCI-10880 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ia6ccee5e26836662f7c2196ff41658ff41e3a570 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541575 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-09 19:27:25 -07:00
Deepak Nibade	67399a1892	gpu: nvgpu: unit: BVEC test for common.class unit class_validate_setup is already testing for valid/invalid boundary values for common.class APIs. Append the valid/invalid list with BVEC test values. Fix obsolete gops_class doxygen documentation. Jira NVGPU-6403 Change-Id: Id713db614919842324f6d655b36dd57043958919 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539797 (cherry picked from commit 6aed159f9f3eeea553a442af37e3bcc840152154) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539795 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-09 14:06:07 -07:00
Richard Zhao	9ac7550f35	gpu: nvgpu: unify NV_READ_ONCE and NV_WRITE_ONCE Implemented NV_READ_ONCE and NV_WRITE_ONCE in common code. Jira GVSCI-10879 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I5465b4bd1cd44fc7bc1592da01d6be455b1fcdcc Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541559 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-09 03:15:39 -07:00
Seshendra Gadagottu	5ec1e0cc21	gpu: nvgpu: make gp10b_tegra_acquire_platform_clocks public Made gp10b_tegra_acquire_platform_clocks as public function so that each gpu architecture can supply different number of clock list. Jira NVGPU-6707 Change-Id: Iad2156a63e00913374ce5fa4274c95e7488fdb31 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2511795 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Sivaram Nair <sivaramn@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-08 21:54:23 -07:00
Deepak Nibade	6fb2b892ce	gpu: nvgpu: set/check mmu nack flags only for GPC exceptions gv11b_mm_mmu_fault_handle_mmu_fault_refch() right now checks/sets mmu_nack_handled flag for MMU faults from all clients (i.e. GPC/HUB). Handling of MMU nack in MMU fault handling path is only needed if MMU nack exception is triggered by SM in GPC. Hence set and check this flag only if source client is GPC. In certain cases it is possible that CE engine triggers back to back MMU faults on same channel. When this happens, and because of above incorrect mmu_nack_handled flag handling, mmu_nack_handled flag is set while handling second MMU fault from CE. And because of this gv11b_mm_mmu_fault_handle_mmu_fault_refch() function could end up dropping extra channel refcounts and trigger access after free scenarios on that channel. Bug 3315942 Change-Id: I28d8311edf34a041364dddedb5fc3a5b83132f85 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2540497 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-08 06:48:56 -07:00
Richard Zhao	1685a2404f	gpu: nvgpu: vgpu: add b0cc profiler support - added new commands to bind/unbind hwpm/hwpm_streamout/smpc - added new command to update get/put for PMA buffer - tune function nvgpu_perfbuf_update_get_put so it could be reused on server side. - enable profiler v2 device for gv11b Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I4226c89ec3040e53dee5381ac8a30c9fd598e5ef Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537683 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:30:03 -07:00
Richard Zhao	a3c4236574	gpu: nvgpu: profiler: create bind/unbind hals - created gops_profiler - added HALs for bind/unbind hwpm/hwpm_streamout/smpc - it helps enable b0cc on vgpu Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I9fd30b134d54a92d1ce8108172aa77237c702bc0 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537682 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:57 -07:00
Richard Zhao	4ea92a530b	gpu: nvgpu: profiler: remove profiler obj from hwpm bind/unbind It helps the hwpm bind/unbind functions to be reused on server side. Server side does not track profiler object. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ib692c686e940b8123c934b5bb6ba843e09a27246 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537681 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:52 -07:00
Richard Zhao	7664bee12f	gpu: nvgpu: profiler: remove profiler obj from smpc bind/unbind It helps the smpc bind/unbind functions to be reused on server side. Server side does not track profiler object. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I5e62901cabb56cb2f2d40d51a249b1404b292f5a Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537680 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:46 -07:00
Richard Zhao	9b66fca165	gpu: nvgpu: move .exec_regops to only execute regops HAL .exec_regops used to first validate regops then execute it, now moving it to only execute the regops. - It helps B0CC on HV. On server side it does not track profiler object, but regops validation uses the profiler, so moving validation to client side. - The change also removes ctx_buffer_offset checking in validate_reg_op_offset. The offset is already checked against whitelists, which have been verified when the whitelist was updated. Also vgpu does not have information of ctx and golden image. - Added function nvgpu_regops_exec to cover both regops validation and execution. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I434e027290e263a8a64a25a55500f7294038c9c4 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2534252 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:40 -07:00
Lakshmanan M	08cd42093d	gpu: nvgpu: Add multi gr l2_evict support 1) Added l2_evict support for multi gr 2) Added multi gr handling for the following apis, nvgpu_gr_get_cilp_preempt_pending_chid nvgpu_gr_clear_cilp_preempt_pending_chid JIRA NVGPU-5656 Change-Id: Iee6142a49b9a569f2b440077762164af8aee9fb3 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539734 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-07 13:46:40 -07:00
Lakshmanan M	df87591b7d	gpu: nvgpu: Add multi gr handling for debugger and profiler 1) Added multi gr handling for dbg_ioctl apis. 2) Added nvgpu_assert() in gr_instances.h (for legacy mode). 3) Added multi gr handling for prof_ioctl apis. 4) Added multi gr handling for profiler. 5) Added multi gr handling for ctxsw enable/disable apis. 6) Updated update_hwpm_ctxsw_mode() HAL for multi gr handling. JIRA NVGPU-5656 Change-Id: I3024d5e6d39bba7a1ae54c5e88c061ce9133e710 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538761 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-04 18:07:47 -07:00
Sagar Kamble	1dd3e0761c	gpu: nvgpu: fix the usermode mappings deadlock during railgate and munmap The following locking sequence leads to a deadlock: 1. gk20a_pm_prepare_poweroff (alter_usermode_mappings): ctrl_privs_lock -> mmap_lock 2. __do_munmap (usermode_vma_close): mmap_lock -> ctrl_privs_lock This lock contention can be resolved by releasing ctrl_privs_lock and retrying the usermode mapping alteration after a while, so that munmap can proceed. Below is the kernel panic log with the deadlock. [] INFO: task kworker/1:1:116 blocked for more than 120 seconds. [] Tainted: G W 5.10.17-tegra #1 [] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [] task:kworker/1:1 state:D stack: 0 pid: 116 ppid: 2 flags:0x00000028 [] Workqueue: pm pm_runtime_work [] Call trace: [] __switch_to+0x104/0x160 [] __schedule+0x3d4/0x900 [] schedule+0x74/0x100 [] rwsem_down_write_slowpath+0x250/0x4b0 [] down_write+0x6c/0x80 [] alter_usermode_mappings+0xb4/0x160 [nvgpu] [] nvgpu_hide_usermode_for_poweroff+0x24/0x30 [nvgpu] [] gk20a_pm_prepare_poweroff+0xe8/0x140 [nvgpu] [] gk20a_pm_runtime_suspend+0x78/0xf0 [nvgpu] [] pm_generic_runtime_suspend+0x3c/0x60 [] genpd_runtime_suspend+0xb0/0x2c0 [] __rpm_callback+0x90/0x150 [] rpm_callback+0x34/0xa0 [] rpm_suspend+0xe0/0x5e0 [] pm_runtime_work+0xbc/0xc0 [] process_one_work+0x1c0/0x4a0 [] worker_thread+0x11c/0x430 [] kthread+0x148/0x170 [] ret_from_fork+0x10/0x18 [] INFO: task nvrm_gpu_tests:1273 blocked for more than 121 seconds. [] Tainted: G W 5.10.17-tegra #1 [] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. 
[] task:nvrm_gpu_tests state:D stack: 0 pid: 1273 ppid: 1245 flags:0x00000000 [] Call trace: [] __switch_to+0x104/0x160 [] __schedule+0x3d4/0x900 [] schedule+0x74/0x100 [] schedule_preempt_disabled+0x28/0x40 [] __mutex_lock.isra.0+0x184/0x5c0 [] __mutex_lock_slowpath+0x24/0x30 [] mutex_lock+0x5c/0x70 [] usermode_vma_close+0x30/0x50 [nvgpu] [] remove_vma+0x34/0x60 [] __do_munmap+0x1f4/0x4a0 [] __vm_munmap+0x74/0xd0 [] __arm64_sys_munmap+0x3c/0x50 [] el0_svc_common.constprop.0+0x7c/0x1a0 [] do_el0_svc+0x34/0xa0 [] el0_svc+0x1c/0x30 [] el0_sync_handler+0xa8/0xb0 [] el0_sync+0x160/0x180 [] ---[ end Kernel panic - not syncing: hung_task: blocked tasks ]--- Bug 200703921 Change-Id: Ie7f017c92f20061d3bf891079f7fc7fe390f7cf7 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2533853 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-04 18:06:11 -07:00
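The back-off described in the deadlock fix above can be sketched in portable C. This is a minimal userspace illustration under stated assumptions, not the driver's code: every `sketch_` name is hypothetical, C11 `atomic_flag` try-locks stand in for the kernel mutex and mmap lock, and the retry loop omits the delay a real driver would insert between attempts.

```c
#include <assert.h>
#include <errno.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-ins for the two locks named in the commit message. */
struct sketch_locks {
	atomic_flag ctrl_privs_lock;
	atomic_flag mmap_lock;
};

/* Returns true if the lock was free and is now held by the caller. */
static bool sketch_try_lock(atomic_flag *lock)
{
	return !atomic_flag_test_and_set_explicit(lock, memory_order_acquire);
}

static void sketch_unlock(atomic_flag *lock)
{
	atomic_flag_clear_explicit(lock, memory_order_release);
}

/* One attempt in ctrl_privs_lock -> mmap_lock order, never blocking. */
static bool sketch_alter_once(struct sketch_locks *l, int *state, int new_state)
{
	if (!sketch_try_lock(&l->ctrl_privs_lock))
		return false;
	if (!sketch_try_lock(&l->mmap_lock)) {
		/*
		 * The munmap path may hold mmap_lock while waiting for
		 * ctrl_privs_lock. Drop ctrl_privs_lock so it can finish.
		 */
		sketch_unlock(&l->ctrl_privs_lock);
		return false;
	}
	*state = new_state;
	sketch_unlock(&l->mmap_lock);
	sketch_unlock(&l->ctrl_privs_lock);
	return true;
}

/* Retry a bounded number of times; a real driver would sleep between tries. */
static int sketch_alter_with_retry(struct sketch_locks *l, int *state,
				   int new_state, int max_attempts)
{
	assert(max_attempts > 0);
	for (int i = 0; i < max_attempts; i++) {
		if (sketch_alter_once(l, state, new_state))
			return 0;
	}
	return -EBUSY;
}
```

The point of the sketch is that the poweroff path never waits for the second lock while holding the first, so the opposite acquisition order in the munmap path can always make progress.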
Deepak Nibade	419a65965b	gpu: nvgpu: add mutex for gr_ctx initialization If user calls IOCTL to allocate object context for two channels in same TSG in parallel, nvgpu_gr_setup_alloc_obj_ctx() could end up racing and trying to allocate object context for both channels at the same time. This could result in corrupting object context. Fix this by introducing per-TSG mutex ctx_init_lock to serialize context initialization for all channels within TSG. In ideal scenario nvrm_gpu is the only caller of all the IOCTLs, and nvrm_gpu makes sure to initialize object context for each channel in serial order. Because of this new lock does not cause any contention. Jira NVGPU-6431 Change-Id: Ibb1cbb4878748929bb7f23e8666c283c39ecbf5a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538333 (cherry picked from commit 8be447838dc1ecbd5637eb6bd13b8f338eaf33cd) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538773 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-03 15:59:43 -07:00
dt	5e82717c96	gpu: nvgpu: Add powernode support to vgpu As the normal gpu is powered on by writing one to power-node, the patch is adding power node for vgpu. Change-Id: I08fbbe8694e02c826a0d5692f5a4c0f4efd396ff Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537053 Reviewed-by: Richard Zhao <rizhao@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-02 19:40:39 -07:00
Sagar Kamble	ac30c4cb65	gpu: nvgpu: change acr bootstrap completion info message Following information message was printed unconditionally. Often, it is not useful. nvgpu_acr_wait_for_completion:100 [INFO] flcn-0: sctl reg 7021 cpuctl reg 50 It is okay to move this to nvgpu_acr_dbg. bug 200734207 Change-Id: Ie66caf20d0e2eb692532e26bf89417342a054cf8 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2536471 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-02 19:40:26 -07:00
Lakshmanan M	edbcf5cfc6	gpu: nvgpu: add multi gr handling for debugger Added multi gr handling for debugger apis. Replaced g->gr with nvgpu_gr_get_cur_instance_ptr(g). JIRA NVGPU-5656 Change-Id: I010eff39b1ebec231b4dbdd53caffc25e1cd54c4 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537784 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-01 12:50:48 -07:00
Debarshi Dutta	11d27743f8	gpu: nvgpu: add NULL checks before freeing ZBC and ZCULL Disabling NVGPU_SUPPORT_MIG in suspend path leads to inconsistencies. During driver removal without the flag set, the driver still tries to free structures that might not have been allocated in the first place. e.g. nvgpu_gr_zbc_deinit, nvgpu_gr_zcull_deinit. Added NULL checks for ZBC and ZCULL structures before freeing them as a solution. Jira NVGPU-6832 Change-Id: I8a0c64ca982d11fee55542abd3c5bce5a51b4007 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2535101 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-01 07:58:57 -07:00
Tejal Kudav	9f43914933	gpu: nvgpu: Move Intr handling common code to CIC CIC (Central Interrupt controller) will be responsible for the interrupt handling. common.cic unit is the placeholder for all interrupt related code. Move interrupt related defines and Public APIs present in common.mc to common.cic. Note: The common.mc interrupts related struct definitions are not moved as part of this patch. Adapt the code to use interrupt handling related defines and public APIs migrated from common.mc to common.cic JIRA NVGPU-6899 Change-Id: I747e2b556c0dd66d58d74ee5bb36768b9370d276 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2535618 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-31 19:37:31 -07:00
Deepak Nibade	9034b1676e	gpu: nvgpu: compile out GFxP support in safety GFxP preemption for graphics contexts is not supported in safety. But the support was enabled along with CONFIG_NVGPU_GRAPHICS since GFxP preemption was protected under same config. Add a separate config CONFIG_NVGPU_GFXP to protect all GFxP specific code, enum values, and HALs. Disable the config in safety profile. Jira NVGPU-6893 Change-Id: Iebb5f754a1025dfa6e05a94704bdb8a7123b599a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2534986 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-28 15:17:36 -07:00
dt	c1b302652e	gpu: nvgpu: Add fix for dev_node leak This is adding fix for dev_node leak when user_deinit called. The dev_nodes in linux are created in two phases. In first phase the power dev_nodes(one for legacy and other for v2) are created. The second phase other dev_nodes are created. While creating the dev_nodes the power cdev_region overwritten by cdev_region. This is fixed by introducing new cdev_region and updating respective nodes. JIRA NVGPU-6721 Change-Id: Iec78db8e5fe40cc0b14fb3fecc35b8881dff716f Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2535265 Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-28 11:39:58 -07:00
Antony Clince Alex	5c80999ec3	gpu: nvgpu: gm20b: update priv ring init sequence Update priv ring init sequence to poll and validate enumerate command completion. With this approach it is no longer required to configure the chiplets to holdoff priv transactions when the ring has not been initialized. Hence, the write to pri_ringstation_sys_decode_config_r register is removed. Bug 3307879 Change-Id: I3f9ede95dea2814f279955884621fd4c028d722f Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527924 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-28 11:36:46 -07:00
Sami Kiminki	5f6ff29aea	gpu: nvgpu: report number of syncpoints in nvgpu_as_get_sync_ro_map_arg Add reporting for the number of syncpoints when mapping the RO shim. This allows the userspace to perform boundary condition checks when computing the GPU VA for a syncpoint. JIRA GCSS-1579 Change-Id: Ia6c9eee917d2c1e08f9905701e03f2b09e01ba60 Signed-off-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2533981 Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-27 21:19:38 -07:00
Martin Radev	8834275906	gpu: nvgpu: Validate PMA buffer size The original code would only truncate the size to 32 bits and later write the value to a hw register. Let's check that the user-provided buffer is large enough. Bug 2510974 Change-Id: I8b14a07a46d30c0b8c7ea63e5bdef53fbd19ec6f Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527148 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-25 14:30:35 -07:00
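The kind of check the PMA buffer size commit describes can be sketched as follows. This is an illustration only: the function name, its parameters, and the exact conditions are assumptions, since the commit message states only that the size was previously truncated to 32 bits and that the user-provided buffer is now checked for being large enough.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper: reject a requested size that would be silently
 * truncated when written to a 32-bit register, or that exceeds the
 * buffer actually backing the mapping.
 */
static int sketch_pma_size_to_reg(uint64_t requested_size,
				  uint64_t backing_size,
				  uint32_t *reg_value)
{
	assert(reg_value != NULL);

	if (requested_size == 0u || requested_size > UINT32_MAX)
		return -EINVAL;	/* would not fit in the 32-bit register */
	if (backing_size < requested_size)
		return -EINVAL;	/* user buffer is too small */

	*reg_value = (uint32_t)requested_size;	/* cast is now lossless */
	return 0;
}
```

Validating before the narrowing cast means the value written to hardware always matches what the caller asked for, and the output is left untouched on failure.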
Martin Radev	04ce9faf04	gpu: nvgpu: Minor fixes in ioctl handling Fixes: 1) gk20a_sched_dev_ioctl allocates a buffer with size CTXSW_IOCTL_MAX_ARG_SIZE but then sanitizes IOC_SIZE against SCHED_IOCTL_MAX_ARG_SIZE. No big deal here since both are of size 0x20 but may lead to issues in the future. 2) nvgpu_clk_arb_ioctl_event_dev would BUG_ON if IOC_SIZE is larger than expected. Let's instead sanitize and return error. Jira VFND-1586 Jira VQRM-3741 Change-Id: I9e00796a2b2f4a83c3a04194c34eb4c006b937d3 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2525753 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-25 14:30:30 -07:00
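Both ioctl fixes above come down to comparing the encoded argument size against the buffer that will actually receive it, and returning an error instead of asserting. A minimal sketch, assuming a hypothetical helper and using `memcpy` in place of the kernel's `copy_from_user()`; the 0x20 limit mirrors the value quoted in the commit message:

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical limit; the commit notes both real limits are 0x20. */
#define SKETCH_IOCTL_MAX_ARG_SIZE 0x20u

/*
 * Copy an ioctl argument of ioc_size bytes into buf. The size is checked
 * against the size of the destination buffer itself, not against some
 * other device's limit, and an oversized request returns -EINVAL.
 */
static int sketch_fetch_ioctl_arg(const void *user_arg, size_t ioc_size,
				  unsigned char *buf, size_t buf_size)
{
	assert(buf != NULL);

	if (user_arg == NULL || ioc_size > buf_size)
		return -EINVAL;

	memcpy(buf, user_arg, ioc_size);	/* stands in for copy_from_user() */
	return 0;
}
```

Passing the destination size alongside the destination keeps the check and the allocation from drifting apart if one of the limits changes later.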
Tejal Kudav	e0a1fcf5f5	gpu: nvgpu: Add Central Intr Controller unit Add a new Central Interrupt Controller (CIC) unit in common code. The interrupt handling is done in a distributed manner currently. The error handling policy for different errors resides in each unit's ISR code. The goal is to converge this data under one central place - the CIC unit. This patch creates framework for CIC unit and moves the gv11b QNX safety LUT to CIC unit. All the error reporting APIs from different units are also moved to CIC. New APIs are exposed by CIC unit to access its internal data like: 1. Struct err_desc - the static err handling/injection data per error id 2. Num_hw_modules - the number of error reporting HW units supported by CIC Init and deinit of CIC unit: 1. CIC unit should be initialized early on during boot so that it is available for any interrupt handling. 2. Initialize CIC just before the interrupts are enabled during boot. 3. Similarly, CIC is disabled late during deinit cycle; right after the interrupts are masked. LUT: 1. LUT is currently used only for reporting error to safety services in gv11b QNX safety build. 2. This error handling policy LUT currently has only two levels of handling - correctable and quiesce. 3. Once the error handling policy decision is moved from leaf unit nodes to CIC, LUT will be updated to have additional levels like fast recovery and full recovery. 4. Also, then a separate LUT will be added for each platform/build. 5. In current framework, the LUT is set to NULL for all configurations except gv11b. report_err() ops is added to report error to safety services. This ops is only effective for gv11b qnx build; and set to NULL for other configurations. 
NVGPU-6521 NVGPU-6523 NVGPU-6750 NVGPU-6758 NVGPU-6760 NVGPU-6754 Change-Id: I24be7836a96d787741e37b732e19863ed8014635 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2518683 Reviewed-by: Ajesh K V <akv@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-25 14:28:04 -07:00
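The error-policy lookup table (LUT) described in the CIC commit above might be organized along these lines. This is a sketch under assumptions: all `sketch_` types, fields, and table entries are invented for illustration and do not reflect the actual nvgpu definitions. It models only what the message states: one descriptor per error id, a count of hardware modules, two policy levels, and a NULL table on configurations without a LUT.

```c
#include <assert.h>
#include <stddef.h>

/* The two handling levels named in the commit message. */
enum sketch_policy { SKETCH_CORRECTABLE, SKETCH_QUIESCE };

/* Hypothetical per-error descriptor. */
struct sketch_err_desc {
	unsigned int err_id;
	enum sketch_policy policy;
};

/* Hypothetical per-module table of error descriptors. */
struct sketch_hw_module {
	const struct sketch_err_desc *errs;
	size_t num_errs;
};

/* Invented example data for a single module. */
static const struct sketch_err_desc sketch_unit0_errs[] = {
	{ 0u, SKETCH_CORRECTABLE },
	{ 1u, SKETCH_QUIESCE },
};

static const struct sketch_hw_module sketch_lut[] = {
	{ sketch_unit0_errs,
	  sizeof(sketch_unit0_errs) / sizeof(sketch_unit0_errs[0]) },
};

static const size_t sketch_num_hw_modules =
	sizeof(sketch_lut) / sizeof(sketch_lut[0]);

/* Returns the descriptor, or NULL if there is no LUT or no matching entry. */
static const struct sketch_err_desc *
sketch_lookup(const struct sketch_hw_module *lut, size_t num_modules,
	      size_t module, unsigned int err_id)
{
	if (lut == NULL || module >= num_modules)
		return NULL;	/* configurations without a LUT pass NULL */

	assert(lut[module].errs != NULL || lut[module].num_errs == 0u);
	for (size_t i = 0; i < lut[module].num_errs; i++) {
		if (lut[module].errs[i].err_id == err_id)
			return &lut[module].errs[i];
	}
	return NULL;
}
```

Keeping the policy in static data rather than in each unit's ISR is what would allow further levels, such as the fast and full recovery mentioned in the message, to be added without touching the interrupt handlers.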
Tejal Kudav	bced5c5785	gpu: nvgpu: Add CIC specific debug logging API Add gpu_dbg_cic bit to log_mask to enable/disable Central Interrupt Controller debug logs. Define CIC specific debug print API with "CIC \|" prefix to help grep CIC related logs. NVGPU-6521 Change-Id: I86deee761ad9125001cd48d94b43bb2979174d42 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2518692 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-25 14:27:58 -07:00
Prateek sethi	84534a050f	gpu:nvgpu: Update doxygen range for io APIs Patch updates the access range to 0 to SIZE-4. Jira NVGPU-6229 Change-Id: I98606e1310c45e4b7343f739524bd77674080c3a Signed-off-by: Prateek sethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2521643 (cherry picked from commit b01a8689c470c67d32855981b115edba7954f451) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2530175 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-20 06:09:51 -07:00
mkumbar	f3c2c4e730	gpu: nvgpu: Update the FALCON/NVRISCV define's Update the FALCON/NVRISCV define's Bug 200728965 Change-Id: I2b45c216cc274e097d6bc99831b934eb29840dc9 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2531635 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-20 06:09:41 -07:00
Shashank Singh	57089a1b34	gpu: nvgpu: address comments from common.rc code inspection CR review - Move unnecessary headers under recovery flag. - Update doxygen documentation of one API to match the code. Jira NVGPU-6372 Change-Id: I9cf744c8014ea92f18cc10824e9fcaed9aa7d5de Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527118 (cherry picked from commit cb4b03a3b00321a4c07b3d9cc2768f7183e99c45) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2531583 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-19 07:45:35 -07:00
Seshendra Gadagottu	85efe929ca	gpu: nvgpu: prod programming for slcg timer unit Added init function for common.ptimer unit and called this init function during nvgpu early init. int nvgpu_ptimer_init(struct gk20a *g); Added following helper function for programming prod values for slcg timer unit: void nvgpu_cg_slcg_timer_load_enable(struct gk20a *g); Invoked prod programming for slcg timer unit from nvgpu_ptimer_init. Jira NVGPU-6026 Change-Id: I29e32380a4d05ec8276d7ebe59bc2733917f8184 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2524037 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-19 04:06:43 -07:00
ajesh	b15bd97c08	gpu: nvgpu: fix misra violation in bug unit Modify the callback interface from bug to quiesce unit to remove a possible cyclic dependency in the bug unit. Make the list of callbacks from bug unit, UT specific. The quiesce callback function and argument are kept in separate variables, and in a normal run the only callback that bug unit would invoke will be the quiesce specific function. These changes will fix the violation of Rule 17.2 in bug unit. JIRA NVGPU-6537 Change-Id: Icb6bc92077f8d26c87425768b09a7194a98e015d Signed-off-by: ajesh <akv@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527207 (cherry picked from commit 7696565648c5dd573a03be19ba9525856b781ea6) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2530900 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-18 18:20:18 -07:00
Martin Radev	d1983f5cfa	gpu: nvgpu: Decrement CSS dmabuf ref cnt before ret The function gk20a_channel_cycle_stats does not decrement the dmabuf refcnt if vmapping it fails. This patch fixes it by decrementing the ref cnt before returning. NVGPU-397 NVGPU-415 Change-Id: Iae01ada710adb04fd4e4ba0371eccec5f8765254 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527190 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-18 18:18:25 -07:00
mkumbar	d2349b32ec	gpu: nvgpu: update SSMD array size -Update SSMD array size to hold all supported super-surface members -Handle the error and report if invalid SSMD ID is found. issue: At present SSMD array size set to 32 but overall 33 super-surface members are supported, when 33rd member accessed system crash happened due to overflow access, so fixing it by setting the SSMD array size to actual number of super-surface members supported Bug 200721968 Bug 200721966 Change-Id: I5ba1084a661d7497056f13a053d2fc79d50f595c Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2528569 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-17 12:56:39 -07:00
Vedashree Vidwans	2514814851	gpu: nvgpu: common.ce fix MISRA 5.7 errors Rule 5.7 doesn't allow an identifier to be reused. This patch renames identifier "ops" to resolve this violation. Jira NVGPU-6281 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Change-Id: I02da8db6406ccc44b7d8c3037dfd2b062250878f (cherry-picked from commit 659e54c96d5c8db8ab2f76cd110a11f1e2270c36) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527279 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-17 12:56:19 -07:00
Lakshmanan M	ede8215ca8	gpu: nvgpu: Add NVGPU_SUPPORT_ROP_IN_GPC flag Added new flag to enable/disable the NVGPU_SUPPORT_ROP_IN_GPC JIRA NVGPU-5656 Change-Id: Icbcb63a879c4ae4de0701742319eb02e98f66ca6 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2529121 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-14 21:00:44 -07:00
Vedashree Vidwans	2b0b2e9b70	gpu: nvgpu: update hw headers Update gm20b, gv11b, tu104 hw headers to resolve kernel checkpatch warnings as below: Rule kernel_checkpatch: Fail ERROR: need consistent spacing around '+' (ctx:WxV) #89: FILE: drivers/gpu/nvgpu/include/nvgpu/hw/tu104/hw_fb_tu104.h:74: + ((U32(0x0U) << (16U +((i)*1U)))) Bug 3139301 Change-Id: Ib5a9bff0a6711355f6d2923be8184b7f243af24f Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2524534 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-14 20:58:52 -07:00
mkumbar	636a70790b	gpu: nvgpu: add NEXT check for falcon functions add NEXT check for falcon functions JIRA NVGPU-6369 Change-Id: I4ec3063f1c109aa9e41dae1837e83e6a061552c2 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2528545 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-13 11:54:52 -07:00
Ramesh Mylavarapu	7d0bd72fde	gpu: nvgpu: add clk arbiter check Check for NVGPU_CLK_ARB_ENABLED flag before initiating clk arbiter session which shouldn't be initiated in absence of clk arbiter. Bug 3236519 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I945203164063cec35fbab2256b3c7cb983e520ea Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2528551 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-13 06:32:01 -07:00
Lakshmanan M	d956938d3f	gpu: nvgpu: Add load_timestamp_prod in grmgr init 1) Moved load_timestamp_prod handling in nvgpu_init_gr_manager(). 2) Moved fifo.reset_enable_hw in nvgpu_early_init() - In simulation/emulation/GPU standalone platform, XBAR, L2 and HUB are enabled during g->ops.fifo.reset_enable_hw(). This introduces a dependency to get the MIG map conf information. (if nvgpu_is_bpmp_running() == false treated as simulation/emulation/GPU standalone platform). Bug 3307879 JIRA NVGPU-6633 Change-Id: I4cba3a527de4723a6500f9658ec1dcadc23b37e3 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2528174 Tested-by: Antony Clince Alex <aalex@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-12 16:09:52 -07:00
Lakshmanan M	5de20ab81b	gpu: nvgpu: Add gops for enable/disable PMASYS legacy mode Added a gops for enable/disable the PMASYS legacy mode. JIRA NVGPU-5656 Change-Id: I7f9d52d96d41ab38b201f857c492f10806453a7c Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2528423 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-12 16:07:26 -07:00
Alex Waterman	c55f7d624c	gpu: nvgpu: Use runlist struct in construction APIs Use a struct nvgpu_runlist pointer for the runlist update and construction APIs. This gets rid of the runlist ID being passed into the runlist code for most of the normal APIs. Some recovery and suspect APIs still use runlist ID masks since they may work with multiple runlists at a time. These will be updated in the future. Jira NVGPU-6425 Change-Id: Ib8d7a6aad0201af62267099cd993d130504478e8 Signed-off-by: Alex Waterman <alexw@nvidia.com> Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2470307 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-12 11:24:37 -07:00
Richard Zhao	9ff5b779c9	gpu: nvgpu: vgpu: add L2 evict last support Added two ivc commands to support get/set max ways of L2 cache evict last. Jira GVSCI-10422 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ic4539d2ff716e57b02696550312634c0393994f0 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2527485 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-12 03:39:54 -07:00
dt	a741347ead	gpu: nvgpu: Compute the proper gr_config before read any information This is added to compute proper gr_config to get the correct information like number of sm etc. This is added to fix the failure when running "NvRmGpuTest_TSG_ReadSmErrorState_Exists" on MIG instance. JIRA NVGPU-6833 Change-Id: I274720e31cde3636b3282fec586b161f884bc73d Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2526911 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-11 08:26:16 -07:00
mkumbar	e06eca9b17	gpu: nvgpu: FALCON/FALCON2 core selection based on fuse -Read the PMU and GSP fuse to select the FALCON/FALCON2 core -FUSE read based on FALCON id is done in FUSE unit -Core selection and info dump based on fuse is done in FALCON unit JIRA NVGPU-6369 Change-Id: I0747f7383c60f546bbce94eb89c0a8bd41fa7471 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2465808 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-11 08:25:11 -07:00
srajum	573f02e68d	gpu: nvgpu: Fixing MISRA 21.1 violation. - "misra_c_2012_rule_21_1_violation" Defining or undefining a reserved name "__NVGPU_SAVE_KALLOC_STACK_TRACES", which is an identifier or macro name beginning with an underscore. Change-Id: If89ce68fb6dc76e5ffcdd2dc436dddcbe9ba96ee Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2525631 (cherry picked from commit a84c9e0d6987b22e24d777c5ac632c4072cbbb58) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2526776 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-10 10:08:13 -07:00
Debarshi Dutta	096f4ef055	gpu: nvgpu: fix l2_flush errors during rmmod The function gk20a_mm_l2_flush incorrectly returns an error value when it skips l2_flush when hardware is powered off. This causes the following prints to occur even when the behavior is expected. gv11b_mm_l2_flush:43 [ERR] gk20a_mm_l2_flush failed nvgpu_gmmu_unmap_locked:1043 [ERR] gk20a_mm_l2_flush[1] failed The above errors occur from the following paths 1) gk20a_remove -> gk20a_free_cb -> gk20a_remove_support -> nvgpu_pmu_remove_support -> nvgpu_pmu_pg_deinit -> nvgpu_dma_unmap_free 2) gk20a_remove -> gk20a_free_cb -> gk20a_remove_support -> nvgpu_remove_mm_support -> gv11b_mm_mmu_fault_info_mem_destroy -> nvgpu_dma_unmap_free Since these do not belong in the Poweron/Poweroff path, it's okay to skip flushing them when the hardware has powered off. Fixed the userspace tests by allocating g->mm.bar1.vm to prevent NULL access in gv11b_mm_l2_flush->tlb_invalidate. Jira LS-77 Change-Id: I3ca71f5118daf4b2eeacfe5bf83d94317f29d446 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2523751 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-10 10:06:24 -07:00
srajum	74deaae0bf	gpu: nvgpu: use GPLV2 license for files in os/linux JIRA NVGPU-6452 Change-Id: Iac22c3bf52c541a9fd3ba7e59cf4e78ce92ecd71 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2526346 Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-05-10 02:53:39 -07:00
Deepak Nibade	cebefd7ea2	gpu: nvgpu: move RTV CB code to GRAPHICS config Some of the RTV circular buffer programming is under GRAPHICS config and some is under DGPU config. For nvgpu-next, RTV circular buffer is required even for iGPU so keeping the code under DGPU config does not make sense. Move all the code from DGPU config to GRAPHICS config. Bug 3159973 Change-Id: I8438cc0e25354d27701df2fe44762306a731d8cd Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2524897 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-06 06:10:58 -07:00

8436 Commits