linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 10:34:43 +03:00

Author	SHA1	Message	Date
Deepak Nibade	cebefd7ea2	gpu: nvgpu: move RTV CB code to GRAPHICS config Some of the RTV circular buffer programming is under GRAPHICS config and some is under DGPU config. For nvgpu-next, RTV circular buffer is required even for iGPU so keeping the code under DGPU config does not make sense. Move all the code from DGPU config to GRAPHICS config. Bug 3159973 Change-Id: I8438cc0e25354d27701df2fe44762306a731d8cd Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2524897 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-06 06:10:58 -07:00
Vedashree Vidwans	86cb03d2f1	gpu: nvgpu: Replace WAR keyword with "fix" Replace/remove "WAR" keyword in the comments in nvgpu driver with "fix". Rename below functions and corresponding gops to replace "war" word with "errata" word: - g.pdb_cache_war_mem - ramin.init_pdb_cache_war - ramin.deinit_pdb_cache_war - tu104_ramin_init_pdb_cache_war - tu104_ramin_deinit_pdb_cache_war - fb.apply_pdb_cache_war - tu104_fb_apply_pdb_cache_war - nvgpu_init_mm_pdb_cache_war - nvlink.set_sw_war - gv100_nvlink_set_sw_war Jira NVGPU-6680 Change-Id: Ieaad2441fac87e4544eddbca3624b82076b2ee73 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2515700 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-28 19:14:49 -07:00
Vedashree Vidwans	aba26fa082	gpu: nvgpu: handle chip specific erratas Currently, there are few chip specific erratas present in nvgpu code. For better traceability of the erratas and corresponding fixes, introduce flags to indicate existing erratas on a chip. These flags decide if a corresponding solution is applied to the chip(s). This patch introduces below functions to handle errata flags: - nvgpu_init_errata_flags - nvgpu_set_errata - nvgpu_is_errata_present - nvgpu_print_errata_flags - nvgpu_free_errata_flags nvgpu_print_errata_flags: print below details of erratas present in chip 1. errata flag name 2. chip where the errata was first discovered 3. short description of the errata Flags corresponding to erratas present in a chip are set during chip hal init sequence. JIRA NVGPU-6510 Change-Id: Id5a8fb627222ac0a585aba071af052950f4de965 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2498095 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-28 19:14:44 -07:00
Seshendra Gadagottu	21e1328ea1	gpu: nvgpu: add fb gops for set_atomic_mode Separated set_atomic_mode functionality from init_fs_state/enable_nvlink and created new fb gops for set_atomic_mode. In gpu init sequence, set_atomic_mode is called after acr_construct_execute to take care of design changes required for nvgpu-next architectures. Updated fb_gv11b_init_test to use set_atomic_mode gops along with init_fs_state. Bug 3268664 Change-Id: I1ab9eb21cc4cce77f3325c4e8821a75b6e85fba2 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2508095 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-04-22 14:58:36 -07:00
absalam	3ec369d60a	gpu: nvgpu: Disable Clock Arbitor for TU104 This patch is to disable the clock arbitor for TU104. TU104 is not a POR for Drive 6.0 so disabling it to easy migration of clk arb for GA100. As a first step all the NVRM Clock tests will be skipped by setting NVGPU_SUPPORT_CLOCK_CONTROLS to false for TU104. Then clk arbitor will be rewritten for GA100 and enabled back. This patch implements by adding a new flag NVGPU_CLK_ARB_ENABLED which holds the status of clk arbitor for each platform and disables them for TU104 Bug 200699763 Change-Id: I51cd5c7821bdc0b48080c17a70735925b278ddf5 Signed-off-by: absalam <absalam@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2515086 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-20 07:47:38 -07:00
Antony Clince Alex	95bfa039f5	gpu: nvgpu: tu104: implement l2 sector promotion Introduce new HAL gops_ltc.set_l2_sector_promotion to configure L2 sector promotion policy. The follow three promotion settings are support: - NVGPU_GPU_IOCTL_TSG_L2_SECTOR_PROMOTE_FLAG_NONE - NVGPU_GPU_IOCTL_TSG_L2_SECTOR_PROMOTE_FLAG_64B - NVGPU_GPU_IOCTL_TSG_L2_SECTOR_PROMOTE_FLAG_128B Add ioctl "NVGPU_TSG_IOCTL_SET_L2_SECTOR_PROMOTION" to the gpu tsg node to support l2 sector promotion. On chips which do not support sector promotion, the ioctl returns 0. Bug 200656177 Change-Id: Iad835a5c954d3b10da436cfafb388aaaa04f44c7 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460553 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-16 03:35:57 -07:00
Antony Clince Alex	5517e14e57	gpu: nvgpu: tu104: support regops to lts_tstg_cfg2/3 registers In-order to support L2 sector promotion, lts_tstg_cfg2,3 registers were added to the SYS priv save segment of the ctxsw'ed image. Update gops_gr.decode_priv_addr HAL to include regops support to the above two registers. Introduce HAL ops gops_ltc.pri_is_lts_tstg_addr to detect lts_tstg addresses. Bug 200656177 Change-Id: I0f6c24d802edf8ac72917ed099d7ae153f6b4219 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2510281 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-16 03:35:52 -07:00
Mayur Poojary	6277d57936	gpu: nvgpu: Add new api for setting longer timeslice on dbg node Add new ioctl api for setting longer timeslice and get timeslice inside 'dbg' dev node. Update ioctl gpu_get_characteristic to pass the max timeslice value Add debugfs to access and change the max timeslice value Bug 1842244 Change-Id: I7e80f59162cf5d90496f9752fc128f5fa8dcc7d2 Signed-off-by: Mayur Poojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2471569 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-04-06 04:37:38 -07:00
Rajesh Devaraj	0a713d366a	gpu: nvgpu: add doxygen for macros This patch adds doxygen for macros related to SDL unit. Also, it removes macros related to unused service IDs. LTC_RSTG is not present in GV11B. So, the error injection should not be supported for LTC_RSTG. This patch moves ltc_gv11b_debug_fusa as part of non-safety build. JIRA NVGPU-6181 Change-Id: Iede1612f1c85e2fad80e22bcc9d10c4552c73a92 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> (cherry picked from commit 6bdd4781d8311613eebaf1cccead01823a45084e) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2506140 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-30 07:51:36 -07:00
Antony Clince Alex	f41e5975d8	gpu: nvgpu: add ioctl to configure l2 max_ways_evict_last Add ioctl support to configure and read the max number of lines/ways in a L2 cache set that can be marked as EVICT_LAST. This is accomplished through two new ltc hals: set_l2_max_ways_evict_last, get_l2_max_ways_evict_last. These hals will only be set for nvgpu-next chips. Incase of legacy chips, the IOCTLs will return error -ENOSYS. Generate following litter constants to get the number of sets in a l2 slice and the number of ways in each set: - GPU_LIT_NUM_LTC_LTS_SETS - GPU_LIT_NUM_LTC_LTS_WAYS Add gpu characteritics flag: NVGPU_L2_MAX_WAYS_EVICT_LAST_ENABLED to allow userspace driver to determine if L2_MAX_WAYS_EVICT_LAST ioctl is supported. Bug 200605474 Change-Id: Id3180f891399f5e128500f3835d762aee59953e0 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2445884 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-12 04:36:22 -08:00
Vedashree Vidwans	26fc64fb0b	gpu: nvgpu: update common.mc function and docs - Update documentation for common.mc and gops_mc functions. - Rename test_setup_env and test_free_env to test_mc_setup_env and test_mc_free_env respectively. This will make sure that mc test has independent setup and free functions. - Add doxygen comments for mc.enable and mc.disable. - Modify MC unit test description. Jira NVGPU-6240 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Change-Id: I87291ee5f90b8e3c29c475c00a78c7855de5740e Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2457183 (cherry picked from commit c62ff36f87878a8a7513bef06e111117d96c61c8) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2480602 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-04 11:04:15 -08:00
Lakshmanan M	edf03baedd	gpu: nvgpu: Enable SCG flag * Enabled NVGPU_SUPPORT_SCG for tu104. * Enabled NVGPU_SUPPORT_SCG if graphics support is enabled. JIRA NVGPU-6532 Change-Id: I22175de6906a496127fef464f70a6521b2ad2ad2 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2485632 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-02-18 17:32:35 -08:00
Seshendra Gadagottu	ca477efc1e	gpu: nvgpu: t21x: enable CTX_MMU_DEBUG_MODE Enable support for NVGPU_SUPPORT_SET_CTX_MMU_DEBUG_MODE, since latest gm20b firmware has support for MMU_DEBUG_CTRL. Bug 2586406 Change-Id: I126c9ea516a8c60d4c66964dc1c8857a708f16a2 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2477047 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-02-01 05:55:36 -08:00
Alex Waterman	77c0b9ffdc	gpu: nvgpu: Update runlist_update() to take runlist ptr Update the nvgpu_runlist_update_for_channel() function: - Rename it to nvgpu_runlist_update() - Have it take a pointer to the runlist to update instead of a runlist ID. For the most part this makes the code better but there's a few places where it's worse (for now). This starts the slow and painful process of moving away from the non-runlist code using runlist IDs in many places it should not. Most of this patch is just fixing compilation problems with the minor header updates. JIRA NVGPU-6425 Change-Id: Id9885fe655d1d750625a1c8aceda9e67a2cbdb7a Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2470304 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-29 09:51:44 -08:00
Sagar Kamble	cf287a4ef5	gpu: nvgpu: retry tsg unbind if NEXT is set The NEXT bit can remain set for the channel if timeslice expires before scheduler clears it. Due to this nvgpu fails TSG unbind and in turn nvrm_gpu fails channel close. In this case, checking the channel hw state after some time can help see NEXT bit cleared by scheduler. Reenable the tsg and return -EAGAIN to nvrm_gpu for it to retry again. Bug 3144960 Change-Id: I35f417f02270e371a4e632986b73a00f8a4f921a Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2468391 Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-18 23:11:57 -08:00
Deepak Nibade	cae88e7451	gpu: nvgpu: initialize cau data while binding HWPM in global mode Add CAU initialization data in const array hwpm_cau_init_data[]. Add HAL API gops.gr.get_hwpm_cau_init_data() to retrieve this data and implement it for TU104. Add new HAL API gops.gr.init_cau() that uses above data and initializes all cau units. Implement this HAL only for TU104. Invoke above sequence from nvgpu_profiler_bind_hwpm() in case of global HWPM mode. Jira NVGPU-5360 Change-Id: I1c7a380e9d04d6cd45fb7f746c0a79fc56675244 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2463854 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-01-05 12:39:54 -08:00
Deepak Nibade	7158db453c	gpu: nvgpu: add test offsets to allowlist Add ptimer register offsets to regops allowlist for testing. New allowlist restricts regops only to reserved resources, this makes it difficult to test the interface since only HWPM registers can be accessed and that could have side effects on system. Having ptimer registers as test offsets has advantage that the offsets do not change across chips, registers are read-only, and values are always incrementing so a test can verify read regops and test various flags of interface. Add gops.ptimer.get_timer_reg_offsets() HAL to return timer offsets. Add static function add_test_range_to_map() that adds timer offsets to allowlist always. In nvgpu_profiler_validate_regops_allowlist() return success if timer offsets are hit in range search. Bug 2510974 Jira NVGPU-5360 Change-Id: I8b51bb92e43e8b1bbe903c874a429341659ef603 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460002 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-05 12:38:12 -08:00
Deepak Nibade	869735cda4	gpu: nvgpu: add dynamic allowlist support Add gv11b and tu104 HALs to get allowed HWPM resource register ranges, offsets, and stride meta data. Add new enum nvgpu_pm_resource_hwpm_register_type for HWPM register type. Add new struct nvgpu_pm_resource_register_range_map to store all the register ranges for HWPM resources. Add pointer of map in struct nvgpu_profiler_object along with map entry count. Add new API nvgpu_profiler_build_regops_allowlist() to build the regops allowlist dynamically while binding the resources. Map entry count is received with get_pm_resource_register_range_map_entry_count() and only those resource ranges are added for which resource is reserved by profiler object. Add nvgpu_profiler_destroy_regops_allowlist() to destroy the allowlist while unbinding the resources. Add static functions allowlist_range_search() to search a register offset in HWPM resource ranges. Add another static function allowlist_offset_search() to search the offset in per-resource offset list. Add nvgpu_profiler_validate_regops_allowlist() that accepts an offset value, checks if it is in allowed ranges using allowlist_range_search() and then checks if offset is in allowlist using allowlist_offset_search(). Update gops.regops.exec_regops() to receive profiler object pointer as a parameter. Invoke nvgpu_profiler_validate_regops_allowlist() from validate_reg_ops() if prof pointer is not-null. This will be true only for new profiler stack and not legacy profilers. In gr_exec_ctx_ops(), skip regops execution if offset is invalid. Bug 2510974 Jira NVGPU-5360 Change-Id: I40acb91cc37508629c83106ea15b062250bba473 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460001 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-05 12:38:06 -08:00
Deepak Nibade	9221b01968	gpu: nvgpu: implement HWPM streamout teardown sequence Implement below functions: - nvgpu_profiler_quiesce_hwpm_streamout_resident Teardown sequence when context is resident or in case profiling session is a device level session. - nvgpu_profiler_quiesce_hwpm_streamout_non_resident Teardown sequence when context is non resident - nvgpu_profiler_quiesce_hwpm_streamout Generic sequence to call either of above API based on whether context is resident or not. Trigger HWPM streamout teardown sequence while unbinding resources in nvgpu_profiler_unbind_hwpm_streamout() Add a new HAL gops.gr.is_tsg_ctx_resident to call gk20a_is_tsg_ctx_resident() from common code. Implement below supporting HALs for resident teardown sequence: - gops.perf.pma_stream_enable() - gops.perf.disable_all_perfmons() - gops.perf.wait_for_idle_pmm_routers() - gops.perf.wait_for_idle_pma() - gops.gr.disable_cau() - gops.gr.disable_smpc() Jira NVGPU-5360 Change-Id: I304ea25d296fae0146937b15228ea21edc091e16 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2461333 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-18 15:26:21 -08:00
mkumbar	c62cfa2efb	gpu: nvgpu: get PMU NEXT core irqmask -Add new PMU ops to get NEXT core irq mask -Add support to handle NEXT core interrupt request. Bug 200659053 Bug 3199589 Change-Id: I78738f074a425f8934bbba28bf6996eeec7ab05a Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2457077 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Joshua Widen	60f44506a3	Revert "gpu: nvgpu: get PMU NEXT core irqmask" This reverts commit 4ff427c51619cecdcc74fdbb388d82421cf45655. Reason for revert: Testing for regression seen in GVS. Bug 3198736 Change-Id: If12da341c3e13907bdcbb778c8fb4118cd5e3803 Signed-off-by: jwiden <jwiden@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2456791 Reviewed-by: svcguardwords <svcguardwords@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
mkumbar	8284832300	gpu: nvgpu: get PMU NEXT core irqmask -Add new PMU ops to get NEXT core irq mask -Add support to handle NEXT core interrupt request. Bug 200659053 Change-Id: I8b1c9b9d74ed59b4130fea712f970b4a31a8b4fe Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2429042 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Deepak Nibade	b23a114c63	gpu: nvgpu: ensure all perfmon writes are complete after reset gr_gv100_reset_hwpm_pmm_registers() writes a bunch of registers in sys/gpc/fbp chiplets to reset perfmons. To ensure all the writes have completed it is necessary to readback each chiplet's PRI fence register. Add and use new HAL g->ops.priv_ring.read_pri_fence() to achieve this. Implement the HAL for gv11b in new source code file hal/priv_ring/priv_ring_gv11b.c. Bug 2510974 Jira NVGPU-5360 Change-Id: If4dd61cb4265422e8c2d16884790eb0fe7f2c103 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2453631 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
tkudav	2ca4f145e4	gpu: nvgpu: Fix HAL checker pointed mismatches Add new HALs for register field definition/value changes in GV11B as compared to Pascal. Update the HALs for recent chips too if applicable. Bug 200604892 Change-Id: I14ee9440859007e86a1ffa937df399a31e2628bd Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2437564 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
tkudav	e962ec3fa0	gpu: nvgpu: Set PC sampling HAL to NULL for GP10b+ Pascal+ chips do not support updating PC sampling using register NV_CTXSW_MAIN_IMAGE_PM (Unlike GM20B, bit 6 = PC_SAMPLING is not present on GP10b, GV11b and TU104). To correct this in NVGPU, we are setting the set_pc_sampling HAL to NULL. We need to make sure devtools also does not call into these APIs. Until the devtools team updates their code, we would return success(0) from update_pc_sampling API even if the HAL is set to NULL. Filed http://nvbugs/200671026 for devtools team. Bug 200604892 Bug 200671026 Change-Id: I6334d4b2a84d7a0f676d7e2faad4befde5f76310 Signed-off-by: tkudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2437002 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
tkudav	8303e93a60	gpu: nvgpu: Fix HAL checker mismatches for GV11B Add missing register definitions and set few HALs to NULL as they are not relevant on GV11B. Bug 200604892 Change-Id: I41aa87f50652eb1d0e99729838a58310cf586546 Signed-off-by: tkudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2430348 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Prateek sethi	223baa5883	gpu: nvgpu: add support for ACB SLCG on gv11b Register list for ACB SLCG is auto generated with scripts. Add HAL operations to enable/disable ACB clock gating. Bug 200647909 Change-Id: I4be4c14cc072fcccd91031a5a40321f5ff11f549 Signed-off-by: Prateek sethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2420355 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	94bc3a8135	gpu: nvgpu: rearch zbc code and update hals Update nvgpu_gr_zbc as: struct nvgpu_gr_zbc { struct nvgpu_mutex zbc_lock; /* Lock to access zbc table / struct zbc_color_table zbc_col_tbl; /* SW zbc color table pointer / struct zbc_depth_table zbc_dep_tbl; /* SW zbc depth table pointer / struct zbc_stencil_table zbc_s_tbl; /* SW zbc stencil table pointer / u32 min_color_index; / Minimum valid color table index / u32 min_depth_index; / Minimum valid depth table index / u32 min_stencil_index; / Minimum valid stencil table index / u32 max_color_index; / Maximum valid color table index / u32 max_depth_index; / Maximum valid depth table index / u32 max_stencil_index; / Maximum valid stencil table index / u32 max_used_color_index; / Max used color table index / u32 max_used_depth_index; / Max used depth table index / u32 max_used_stencil_index; / Max used stencil table index / }; Add global struct nvgpu_gr_zbc_table_indices struct nvgpu_gr_zbc_table_indices { u32 min_color_index; u32 min_depth_index; u32 min_stencil_index; u32 max_color_index; u32 max_depth_index; u32 max_stencil_index; }; Currently, hw zbc table registers are written during both gr_init_setup_sw() and gr_init_setup_hw(). - Modify nvgpu_gr_zbc_load_default_table() to nvgpu_gr_zbc_load_default_sw_table() to only update sw copy of zbc table during gr_init_setup_sw(). - Modify nvgpu_gr_zbc_load_table() to write zbc values stored in sw zbc table to hw registers. Re-structure zbc function as per zbc type i.e. color, depth and stencil. Add gr.zbc.init_table_indices() hal to initialize zbc indices. Valid ZBC table indices start from 1. HW indices start from 0 for color, depth and stencil tables. Note that the corresponding format registers follow ZBC index range starting at 1. - void (init_table_indices)(struct gk20a g, struct nvgpu_gr_zbc_table_indices zbc_indices); - Add corresponding functions for legacy chips - Add zbc color, depth and stencil table size hw defines - Remove ltc.zbc_table_size() hal - Update ltc.set_zbc_s_entry(), ltc.set_zbc_color_entry and ltc.set_zbc_depth_entry() accordingly. Bug 3122410 Bug 3122649 Change-Id: Ib799991ad35c6613534c0a6eb07f3bf24e600dc5 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2417620 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	673cd507a8	gpu: nvgpu: add mm gops to get default va size Currently, default va aperture size, user size and kernel size are defined as fixed macros. However, max va bits can be chip specific. Add below mm gops API to obtain default aperture, user and/or kernel virtual memory size. void (get_default_va_sizes)(u64 aperture_size, u64 user_size, u64 kernel_size); JIRA NVGPU-5302 Change-Id: Ie0c60ca08ecff6613ce44184153bda066803d7d9 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2414840 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	c0e2dc5b74	gpu: nvgpu: Add subctx programming for MIG This CL covers the following code changes, 1) Added api to init inst_block for more than one subctxs. 2) Added logic to limit the subctx bind based on max. VEID count allocated to a gr instance. 3) Renamed nvgpu_grmgr_get_gr_runlist_id. JIRA NVGPU-5647 Change-Id: Ifec8164a9e5f46fbd0538c3dd50e19ee63667a54 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2418463 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	dd9298c959	gpu: nvgpu: move perf unit accesses to common.perf unit Below HALs are implemented in common.gr unit, but they really belong to common.perf unit since they access registers from perf unit. gops.gr.init_hwpm_pmm_register() gops.gr.get_num_hwpm_perfmon() gops.gr.set_pmm_register() gops.gr.reset_hwpm_pmm_registers() Move them to common.perf unit, and update all the code accordingly gops.perf.init_hwpm_pmm_register() gops.perf.get_num_hwpm_perfmon() gops.perf.set_pmm_register() gops.perf.reset_hwpm_pmm_registers() Add new HAL gops.gr.get_pm_ctx_buffer_offsets() and set it to gr_gk20a_get_pm_ctx_buffer_offsets() for all chips. Bug 2510974 Jira NVGPU-5360 Change-Id: Ib5e84ed5c8b6e72cc6923161e55fc2c3a6a4070e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2418306 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	9652764b65	gpu: nvgpu: reset HWPM regs while binding HWPM in global mode Add new HAL g->ops.gr.reset_hwpm_pmm_registers() to reset all HWPM regs while binding HWPM in global mode in nvgpu_profiler_bind_hwpm() Add below new HALs to get sys/gpc/fbp register list and count g->ops.perf.get_hwpm_sys_perfmon_regs() g->ops.perf.get_hwpm_gpc_perfmon_regs() g->ops.perf.get_hwpm_fbp_perfmon_regs() Auto generate all the HWPM regs in below arrays for gv11b/tu104 static const u32 hwpm_sys_perfmon_regs[] static const u32 hwpm_gpc_perfmon_regs[] static const u32 hwpm_fbp_perfmon_regs[] Bug 2510974 Jira NVGPU-5360 Change-Id: I2ca5c04ed75c7b30ae942807bf018a24551d7ba0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2414934 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	df4d5f3c57	gpu: nvgpu: replace CONFIG_NVGPU_SUPPORT_TURING usage with CONFIG_NVGPU_DGPU Also update the config check in pci_power.c for definitions of stubs for pcie_attach\|detach_controller callbacks. Bug 200658918 Bug 200609273 Change-Id: Ie3f3b4de4cbcd520e54a3eb0590699c1a433e82d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2414959 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	7b7f42bd33	gpu: nvgpu: add gr ops find_priv_offset_in_buffer Convert gr_gk20a_find_priv_offset_in_buffer into hal function gops.gr.find_priv_offset_in_buffer. This is done in-order to facilitate nvgpu-next to transition into a new ctxsw buffer layout. Bug 2761598 Jira NVGPU-6008 Change-Id: Id294be628944daad7f9afa68214d98d87bbbf68c Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2403708 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	221475f753	gpu: nvgpu: add profiler apis to manage PMA stream Support new IOCTL to manage PMA stream meta data by adding below API nvgpu_prof_ioctl_pma_stream_update_get_put() Add nvgpu_perfbuf_update_get_put() to handle all the updates coming from userspace and to pass all required information. Add gops.perf.update_get_put() to handle all HW accesses required in perf HW unit. Add gops.perf.bind_mem_bytes_buffer_addr() to bind the available bytes buffer while binding HWPM streamout. Bug 2510974 Jira NVGPU-5360 Change-Id: Ibacc2299b845e47776babc081759dfc4afde34fe Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406484 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	db20451d0d	gpu: nvgpu: fix pmm chiplet offsets gr_gv100_init_hwpm_pmm_register() and gr_gv100_set_pmm_register() right now assume common chiplet stride for all sys/fbp/gpc and use common API g->ops.perf.get_pmm_per_chiplet_offset() to get the stride. Chiplet strides are same for all partitions only by chance, and future chip might change that. Hence add and use below 3 separate HALs to get appropriate strides. g->ops.perf.get_pmmsys_per_chiplet_offset() g->ops.perf.get_pmmgpc_per_chiplet_offset() g->ops.perf.get_pmmfbp_per_chiplet_offset() Also store sys/fbp/gpc perfmon count in struct gk20a after first query instead of querying them again and again. Querying the counts from HW is time consuming. Bug 2510974 Jira NVGPU-5360 Change-Id: I186009221009780d561617c0cd6f535854db585f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2413108 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	e0dd79cd43	gpu: nvgpu: rearch mc reset and enable hals Remove current mc hals - mc.reset() - mc.enable() - mc.disable() - mc.reset_mask() - mc.reset_engine() - mc.reset_engine_enable() Add new mc hals - mc.enable_units(g, units, enable) > enable/disable given unit(s) - mc.enable_dev(g, dev, enable) > enable/disable engine represented by given device pointer - mc.enable_devtype(g, devtype) > enable/disable all engines of given devtype Move common mc intr functions to common/mc/mc_intr.c. Add below common mc functions - nvgpu_mc_reset_units(g, units) > reset given logical OR of nvgpu unit bitmap - nvgpu_mc_reset_dev(g, dev) > reset given single engine via dev > if engine is graphics, reset gpcs for nvgpu_next - nvgpu_mc_reset_devtype(g, devtype) > reset all engines of given devtype > if devtype is graphics, reset gpcs for nvgpu_next Bug 200648985 Bug 3109773 Change-Id: Idc67a14a0a7cde83de44fbfbec13007fead3ed5c Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2408523 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	3b746dce0c	gpu: nvgpu: use a falcon flag instead of enabled bit common.gr unit right now makes use of a capability bit NVGPU_PMU_FECS_BOOTSTRAP_DONE to ensure the recovery path hits a different routine. This is actually needless and a common check cannot be used for all GR instances anyways. Delete this capability bit. Add and use a new flag coldboot_bootstrap_done added under struct nvgpu_gr_falcon Jira NVGPU-5648 Change-Id: I46faea6f07cf054f17a3215d4cbbe0fc8a6382ae Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2409533 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Debarshi Dutta	cdacc2e2b2	gpu: nvgpu: reorganize HAL for gm20b, gp10b, tu104 Designated initializers with nested structs should not be used to avoid a known problem in the qnx compiler that results in incorrect values used for some fields. Remove nested structs initialization and instead perform runtime initialization for GM20B, GP10B and TU104 HAL assignments. Change-Id: I6c94f85c7d6f7e279206bff7bd3535f56a377494 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2401399 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Debarshi Dutta	9cb1fa8429	gpu: nvgpu: update HAL file for gv11b Designated initializers with nested structs should not be used to avoid a known problem in the qnx compiler that results in incorrect values used for some fields. 5.1 Disclosure ID: NVGPU_RM-CODE-OIL-06 Remove nested structs initialization and instead perform runtime initialization for GV11B's HAL. Change-Id: Idd964c4e974db8707fc6cc8b1195a1365079c213 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2401398 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	a2809088eb	gpu: nvgpu: remove unnecessary hal gops.gr.gr_enable_hw() gops.gr.gr_enable_hw() is a common function and not referred on vGPU. Remove HAL pointer and directly use nvgpu_gr_enable_hw() instead. Jira NVGPU-5648 Change-Id: Id031024ed01f9d890cffb5902cc433800810b219 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2403548 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	8cccb49bd2	gpu: nvgpu: collapse nvgpu_gr_prepare_sw into nvgpu_gr_alloc common.gr unit exports a separate API nvgpu_gr_prepare_sw to initialize some SW pieces required for nvgpu_gr_enable_hw(). A separate API is really unnecessary since same initialization can be performed in nvgpu_gr_alloc(). Remove nvgpu_gr_prepare_sw() and HAL gops.gr.gr_prepare_sw(). Initialize falcon and interrupt structures in loop from nvgpu_gr_alloc(). Move nvgpu_netlist_init_ctx_vars() from nvgpu_gr_prepare_sw() to common init path since netlist parsing need not be done from common.gr unit. It just needs to happen before nvgpu_gr_enable_hw(). Also, trigger nvgpu_gr_free() from gr_remove_support() instead of OS specific paths. Also remove nvgpu_gr_free() calls from probe error paths since nvgpu_gr_alloc is no longer called in probe path. Move interrupt and falcon data structure free calls to nvgpu_gr_free(). Also remove corresponding unit testing code that tests nvgpu_gr_prepare_sw() specifically. Update some unit tests to initialize ecc counters and netlist. Disable some unit tests that fail for reasons unknown. Jira NVGPU-5648 Change-Id: I82ec8160f76530bc40e0c11a9f26ba1c8f9cf643 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2400166 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Alex Waterman	fba96fdc09	gpu: nvgpu: Replace nvgpu_engine_info with nvgpu_device Delete the struct nvgpu_engine_info as it's essentially identical to struct nvgpu_device. Duplicating data structures is not ideal as it's terribly confusing what does what. Update all uses of nvgpu_engine_info to use struct nvgpu_device. This is often a fairly straight forward replacement. Couple of places though where things got interesting: - The enum_type that engine_info uses is defined in engines.h and has a bit of SW abstraction - in particular the GRCE type. The only place this seemed to be actually relevant (the IOCTL providing device info to userspace) the GRCE engines can be worked out by comparing runlist ID. - Addition of masks based on intr_id and reset_id; those can be computed easily enough using BIT32() but this is an area that could be improved on. This reaches into a lot of extraneous code that traverses the fifo active engines list and dramtically simplifies this. Now, instead of having to go through a table of engine IDs that point to the list of all host engines, the active engine list is just a list of pointers to valid engines. It's now trivial to do a for-all-active-engines type loop. This could even be turned into a generic macro or otherwise abstracted in the future. JIRA NVGPU-5421 Change-Id: I3a810deb55a7dd8c09836fd2dae85d3e28eb23cf Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2319895 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	48f1da4dde	gpu: nvgpu: Add bundle skip sequence in MIG mode In MIG mode, 2D, 3D, I2M and ZBC classes are not supported by GR engine. So skip those bundle programming sequence in MIG mode. JIRA NVGPU-5648 Change-Id: I7ac28a40367e19a3e31e63f3e25991c0ed4d2d8b Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2397912 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Lakshmanan M	2a6fcec078	gpu: nvgpu: add gr manager ops-2 and mig infra-2 This CL covers the code changes related to following support, - Enabled gr manager ops. - Added gr manager init/remove support. - Refactor in gpu instance config infra. - Refactor in gr syspipe gpcs config infra. JIRA NVGPU-5645 JIRA NVGPU-5646 Change-Id: Ib2fab2796d76fe105fc5a08f2c5f9bfa36317f7c Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2393550 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Alex Waterman	7c1c533a4a	gpu: nvgpu: Don't disable coalesce for gv11b+ Stop enabling LG and SU coalesce on gv11b and tu104. This is no longer required. Bug 1951653 Bug 1801194 Change-Id: I412be2caae6b841d5387ae5a153d38e49d3d61bc Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2392901 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	6daa0636d1	gpu: nvgpu: rework regops execution API Rework regops execution API to accomodate below updates for new profiler design - gops.regops.exec_regops() should accept TSG pointer instead of channel pointer. - Remove individual boolean parameters and add one flag field. Below new flags are added to this API : NVGPU_REG_OP_FLAG_MODE_ALL_OR_NONE NVGPU_REG_OP_FLAG_MODE_CONTINUE_ON_ERROR NVGPU_REG_OP_FLAG_ALL_PASSED NVGPU_REG_OP_FLAG_DIRECT_OPS Update other APIs, e.g. gr_gk20a_exec_ctx_ops() and validate_reg_ops() as per new API changes. Add new API gk20a_is_tsg_ctx_resident() to check context residency from TSG pointer. Convert gr_gk20a_ctx_patch_smpc() to a HAL gops.gr.ctx_patch_smpc(). Set this HAL only for gm20b since it is not required for later chips. Also, remove subcontext code from this function since gm20b does not support subcontext. Remove stale comment about missing vGPU support in exec_regops_gk20a() Bug 2510974 Jira NVGPU-5360 Change-Id: I3c25c34277b5ca88484da1e20d459118f15da102 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389733 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	a73b5d3c6f	gpu: nvgpu: use smpc global mode capability check In nvgpu_dbg_gpu_ioctl_smpc_ctxsw_mode(), check if SMPC global mode capability is supported instead of checking for the function pointer. Enable the capability only for Turing since pre-Turing GPUs don't support it. Bug 2510974 Jira NVGPU-5360 Change-Id: I352fb2a91b836cd8ef727966a53a28255d8ea834 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389653 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seema Khowala	9ea21459b4	gpu: nvgpu: pascal+: trigger_suspend, wait_for/resume_from _pause set to NULL - NvRmGpuDeviceSetSmDebugMode uses regops interface. - NvRmGpuDeviceTriggerSuspend, NvRmGpuDeviceWaitForPause, and NvRmGpuDeviceResumeFromPause should return error on Pascal+. Use regops interface to suspend/resume. - On non-cilp devices(Maxwell), NvRmGpuDeviceTriggerSuspend, NvRmGpuDeviceWaitForPause, NvRmGpuDeviceResumeFromPause and NvRmGpuDeviceSetSmDebugMode are used when debugger(including coredump, memcheck) is attached or when CUDA application uses a syscall that requires traphandler(assert, cnp). Bug 2558022 Bug 2559631 Bug 2706068 JIRA NVGPU-5502 Change-Id: I9eb2ab0c8c75c50f53523d8bf39c75f98b34f3f0 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376159 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	c99afa1766	gpu: nvgpu: add gr manager and mig infra This CL covers the code changes related to following support, - Added gr manager infra. - Added grmgr_gops infra. - Added mig infra. - Added log mask for MIG verbose support. JIRA NVGPU-5645 JIRA NVGPU-5646 Change-Id: Iec356e08e6cfee86ad9f59fdf6cfee9c38231359 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2385111 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00

1 2 3 4 5 ...

314 Commits