linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 10:34:43 +03:00

Author	SHA1	Message	Date
Vedashree Vidwans	86cb03d2f1	gpu: nvgpu: Replace WAR keyword with "fix" Replace/remove "WAR" keyword in the comments in nvgpu driver with "fix". Rename below functions and corresponding gops to replace "war" word with "errata" word: - g.pdb_cache_war_mem - ramin.init_pdb_cache_war - ramin.deinit_pdb_cache_war - tu104_ramin_init_pdb_cache_war - tu104_ramin_deinit_pdb_cache_war - fb.apply_pdb_cache_war - tu104_fb_apply_pdb_cache_war - nvgpu_init_mm_pdb_cache_war - nvlink.set_sw_war - gv100_nvlink_set_sw_war Jira NVGPU-6680 Change-Id: Ieaad2441fac87e4544eddbca3624b82076b2ee73 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2515700 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-28 19:14:49 -07:00
Vedashree Vidwans	aba26fa082	gpu: nvgpu: handle chip specific erratas Currently, there are few chip specific erratas present in nvgpu code. For better traceability of the erratas and corresponding fixes, introduce flags to indicate existing erratas on a chip. These flags decide if a corresponding solution is applied to the chip(s). This patch introduces below functions to handle errata flags: - nvgpu_init_errata_flags - nvgpu_set_errata - nvgpu_is_errata_present - nvgpu_print_errata_flags - nvgpu_free_errata_flags nvgpu_print_errata_flags: print below details of erratas present in chip 1. errata flag name 2. chip where the errata was first discovered 3. short description of the errata Flags corresponding to erratas present in a chip are set during chip hal init sequence. JIRA NVGPU-6510 Change-Id: Id5a8fb627222ac0a585aba071af052950f4de965 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2498095 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-28 19:14:44 -07:00
Deepak Nibade	c08719cb0b	gpu: nvgpu: move graphics specific HALs to fusa files All graphics code is under CONFIG_NVGPU_GRAPHICS and all the HALs are in non-fusa files. In order to support graphics in safety, CONFIG_NVGPU_GRAPHICS needs to be enabled. But since most of the HALs are in non-fusa files, this causes huge compilation problem. Fix this by moving all graphics specific HALs used on gv11b to fusa files. Graphics specific HALs not used on gv11b remain in non-fusa files and need not be protected with GRAPHICS config. Protect call to nvgpu_pmu_save_zbc() also with config CONFIG_NVGPU_POWER_PG, since it is implemented under that config. Delete hal/ltc/ltc_gv11b.c since sole function in this file is moved to fusa file. Enable nvgpu_writel_loop() in safety build since it is needed for now. This will be revisited later once requirements are clearer. Move below CTXSW methods under CONFIG_NVGPU_NON_FUSA for now. Safety CTXSW ucode does not support these methods. These too will be revisited later once requirements are clearer. NVGPU_GR_FALCON_METHOD_PREEMPT_IMAGE_SIZE NVGPU_GR_FALCON_METHOD_CTXSW_DISCOVER_ZCULL_IMAGE_SIZE Jira NVGPU-6460 Change-Id: Ia095a04a9ba67126068aa7193f491ea27477f882 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2513675 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-04-22 14:59:38 -07:00
Debarshi Dutta	e9a8fa028e	gpu: nvgpu: disable ssync access when MIG is enabled Disable access to ssync unit when MIG is enabled as ssync is part of GR and not Compute. A runtime check is now added for the below function. gv11b_gr_intr_enable_hww_exceptions The following priv errors are seen. SYS write error: ADR 0x00405a14 WRDAT 0xc0000000 master 0x00000000 [ERR] INFO 0x19400200: (subid 0x00000019 priv_level 0 local_ordering 1) [ERR] CODE 0xbadf1100 Jira NVGPU-6699 Change-Id: I9a08f1b6ab58affdcaa18e8ca314a4a00478a3e5 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2514761 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-20 07:47:19 -07:00
Antony Clince Alex	5517e14e57	gpu: nvgpu: tu104: support regops to lts_tstg_cfg2/3 registers In-order to support L2 sector promotion, lts_tstg_cfg2,3 registers were added to the SYS priv save segment of the ctxsw'ed image. Update gops_gr.decode_priv_addr HAL to include regops support to the above two registers. Introduce HAL ops gops_ltc.pri_is_lts_tstg_addr to detect lts_tstg addresses. Bug 200656177 Change-Id: I0f6c24d802edf8ac72917ed099d7ae153f6b4219 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2510281 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-16 03:35:52 -07:00
Lakshmanan M	d4c33de919	gpu: nvgpu: Skip determine ppc config for MIG Added a logic to skip the query ppc config when MIG is enabled. JIRA NVGPU-5650 Change-Id: Id95d016cd3fd1e7ee283ebd9e7e8c5ee677eafd3 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2510884 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-07 20:16:43 -07:00
Antony Clince Alex	78dbec7f44	gpu: nvgpu: tu104: update CAU hal Update CAU hal tu104_gr_init_cau to use regops.get_cau_register_stride hal function. Jira: NVGPU-5689 Change-Id: I7c6e933630587e2d69b92173fd8c3fa8a7021c1d Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2489388 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-03-23 04:39:14 -07:00
Antony Clince Alex	7072b39783	gpu: nvgpu: gv11b: update prop hww handling Update prop hww handling to read and print additional diagnostic registers. Generate following registers: - gr_gpc0_prop_hww_esr_coord_r - gr_gpc0_prop_hww_esr_format_r - gr_gpc0_prop_hww_esr_state_r - gr_gpc0_prop_hww_esr_state2_r - gr_gpc0_prop_hww_esr_offset_r Rename following registers and associated fields: - pes_hww_esr => gpc0_ppc0_pes_hww_esr - setup_hww_esr = > gpc0_setup_hww_esr - zcull_hww_esr => gpc0_zcull_hww_esr - prop_hww_esr => gpc0_prop_hww_esr Jira NVGPU-6078 Bug 2865015 Change-Id: I131c48d2375ef0a76ac6c57ff1eb019f7c113286 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2472894 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-03-12 04:38:04 -08:00
ajesh	0030dc3eb4	gpu: nvgpu: fix MISRA violations in Posix unit Fix violations of MISRA rule 5.4 in Posix unit. JIRA NVGPU-6534 Change-Id: I9471e5fca913ca8cc19403998fdbe5450fb49879 Signed-off-by: ajesh <akv@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2488184 (cherry picked from commit f9bc21ca8d96e9c531a1b0077cfe1e78502e7ee5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2491855 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-04 00:37:15 -08:00
Thomas Steinle	1b5a9b28ea	gpu: nvgpu: Add gr.ops NULL-ptr check This fix add NULL-ptr checks for some of the user-accessible ioctl. Bug 3240771 Bug 200696704 Change-Id: Ibe7f75b31b2521a530883253a93ba832f010dc80 Signed-off-by: Thomas Steinle <tsteinle@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2483635 (cherry picked from commit `cc717e3145`) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2490126 Tested-by: Dinesh T <dt@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-03-04 00:36:14 -08:00
Deepak Nibade	53cc5be723	gpu: nvgpu: update doxygen for common.gr unit Update common.gr doxygen based on review comments from design verification. - Add error return values to some APIs. - Remove redundant description lines from some APIs. Update gm20b_gr_falcon_status_check_ctx_wait_ucode() to return actual error codes instead of -1. Jira NVGPU-6494 Change-Id: Ieb3f5acd27c30cd50049b114ddd8847b1b376ca3 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2490213 (cherry picked from commit 3694fd1bdd1ac36f8c91b1fbaab47cadd8ba1868) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2490211 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-02 19:34:13 -08:00
Deepak Nibade	ce206826f2	gpu: nvgpu: use explicit timeout to wait for SM lock down gv11b_gr_wait_for_sm_lock_down() uses nvgpu_get_poll_timeout() to get timeout value for polling of SM lock down status. nvgpu_get_poll_timeout() returns -1 if timeouts are disabled by debugger, and if SM lock down fails, nvgpu lands in an infinite loop. Use g->poll_timeout_default instead of nvgpu_get_poll_timeout() so that explicit timeout value is always used. This also means that timeout value of ULONG_MAX will still be used on simulation platforms. Bug 200676073 Change-Id: I5777e98efcd63f24ade244384cf7b157dcea991d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2478255 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mikhail Filimonov <mfilimonov@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-02-01 19:20:34 -08:00
Alex Waterman	77c0b9ffdc	gpu: nvgpu: Update runlist_update() to take runlist ptr Update the nvgpu_runlist_update_for_channel() function: - Rename it to nvgpu_runlist_update() - Have it take a pointer to the runlist to update instead of a runlist ID. For the most part this makes the code better but there's a few places where it's worse (for now). This starts the slow and painful process of moving away from the non-runlist code using runlist IDs in many places it should not. Most of this patch is just fixing compilation problems with the minor header updates. JIRA NVGPU-6425 Change-Id: Id9885fe655d1d750625a1c8aceda9e67a2cbdb7a Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2470304 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-29 09:51:44 -08:00
Vedashree Vidwans	df1c9c4640	gpu: nvgpu: remove trivial operations before BUG Corrected ECC errors are not applicable to GV11B. Remove unnecesary lines of code before invoking BUG(). Jira NVGPU-6272 Change-Id: I410d6463efd39584efdff7939ad66fae8ee63afc Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2473098 Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-01-22 07:05:31 -08:00
Deepak Nibade	cae88e7451	gpu: nvgpu: initialize cau data while binding HWPM in global mode Add CAU initialization data in const array hwpm_cau_init_data[]. Add HAL API gops.gr.get_hwpm_cau_init_data() to retrieve this data and implement it for TU104. Add new HAL API gops.gr.init_cau() that uses above data and initializes all cau units. Implement this HAL only for TU104. Invoke above sequence from nvgpu_profiler_bind_hwpm() in case of global HWPM mode. Jira NVGPU-5360 Change-Id: I1c7a380e9d04d6cd45fb7f746c0a79fc56675244 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2463854 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-01-05 12:39:54 -08:00
Deepak Nibade	869735cda4	gpu: nvgpu: add dynamic allowlist support Add gv11b and tu104 HALs to get allowed HWPM resource register ranges, offsets, and stride meta data. Add new enum nvgpu_pm_resource_hwpm_register_type for HWPM register type. Add new struct nvgpu_pm_resource_register_range_map to store all the register ranges for HWPM resources. Add pointer of map in struct nvgpu_profiler_object along with map entry count. Add new API nvgpu_profiler_build_regops_allowlist() to build the regops allowlist dynamically while binding the resources. Map entry count is received with get_pm_resource_register_range_map_entry_count() and only those resource ranges are added for which resource is reserved by profiler object. Add nvgpu_profiler_destroy_regops_allowlist() to destroy the allowlist while unbinding the resources. Add static functions allowlist_range_search() to search a register offset in HWPM resource ranges. Add another static function allowlist_offset_search() to search the offset in per-resource offset list. Add nvgpu_profiler_validate_regops_allowlist() that accepts an offset value, checks if it is in allowed ranges using allowlist_range_search() and then checks if offset is in allowlist using allowlist_offset_search(). Update gops.regops.exec_regops() to receive profiler object pointer as a parameter. Invoke nvgpu_profiler_validate_regops_allowlist() from validate_reg_ops() if prof pointer is not-null. This will be true only for new profiler stack and not legacy profilers. In gr_exec_ctx_ops(), skip regops execution if offset is invalid. Bug 2510974 Jira NVGPU-5360 Change-Id: I40acb91cc37508629c83106ea15b062250bba473 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460001 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-05 12:38:06 -08:00
Deepak Nibade	9221b01968	gpu: nvgpu: implement HWPM streamout teardown sequence Implement below functions: - nvgpu_profiler_quiesce_hwpm_streamout_resident Teardown sequence when context is resident or in case profiling session is a device level session. - nvgpu_profiler_quiesce_hwpm_streamout_non_resident Teardown sequence when context is non resident - nvgpu_profiler_quiesce_hwpm_streamout Generic sequence to call either of above API based on whether context is resident or not. Trigger HWPM streamout teardown sequence while unbinding resources in nvgpu_profiler_unbind_hwpm_streamout() Add a new HAL gops.gr.is_tsg_ctx_resident to call gk20a_is_tsg_ctx_resident() from common code. Implement below supporting HALs for resident teardown sequence: - gops.perf.pma_stream_enable() - gops.perf.disable_all_perfmons() - gops.perf.wait_for_idle_pmm_routers() - gops.perf.wait_for_idle_pma() - gops.gr.disable_cau() - gops.gr.disable_smpc() Jira NVGPU-5360 Change-Id: I304ea25d296fae0146937b15228ea21edc091e16 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2461333 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-18 15:26:21 -08:00
Antony Clince Alex	b285fb33ee	gpu: nvgpu: update regops ctxsw address types Update the below two regops ctxsw address types to fix misnomers: - CTXSW_ADDR_TYPE_ROP: This address type is used to access the PMM config registers and does not belong to the ROP unit. Hence, rename it to CTXSW_ADDR_TYPE_PMM_FBPGS_ROP. - CTXSW_ADDR_TYPE_BE: This address type is used to access registers exclusively in ROP unit and not the entire BE unit. Hence, its more appropriate to rename it to CTXSW_ADDR_TYPE_ROP. In addition, rename the following functions: - pri_is_be_addr_shared => pri_is_rop_addr_shared - pri_be_shared_addr => pri_rop_shared_addr - pri_is_be_addr => pri_is_rop_addr - pri_get_be_num => pri_get_rop_num Bug 3146324 Change-Id: I8613f0972936699b2ef8f7dbe3de78582af2a35f Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2429885 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Seshendra Gadagottu	722ee1cbc5	gpu: nvgpu: t18x+: return error for set_pc_sampling API Return error for set_pc_sampling API, if this HAL is not supported(for pascal+) by gpu version. nvrm driver updated to handle this error gracefully/skip this API call to nvgpu driver for pascal+ gpu versions. Bug 200671026 Bug 2916124 Change-Id: I7e323367ff03868814541c7c2b423e1476331ebb Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2453899 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Seeta Rama Raju	82ed6cbec8	gpu: nvgpu: Fix for MISRA 8.6 violation - MISRA scan reports violation that these functions are declared but never defined. - Here function definitions are under conditional compilation but not function declaration. So keeping these declarations under conditional comilation. JIRA NVGPU-6053 Change-Id: Ic5fcdd321276cfadcff103cd46c31903fd236e7e Signed-off-by: Seeta Rama Raju <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2448753 (cherry picked from commit 9ce8fbd39fc12c709295cef0e7ecddaf2bea4e31) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2449718 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Lakshmanan M	87e988aa24	gpu: nvgpu: Skip graphics unit access during MIG This CL covers the following code changes, * Skipped pd mapping. * Skipped ZCULL netlist handling. * Skipped gfxp programming sequence. JIRA NVGPU-5650 JIRA NVGPU-5653 Change-Id: I73ee63f9399c47ca4afe3d4320698d0bd61e371e Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2444562 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
tkudav	2ca4f145e4	gpu: nvgpu: Fix HAL checker pointed mismatches Add new HALs for register field definition/value changes in GV11B as compared to Pascal. Update the HALs for recent chips too if applicable. Bug 200604892 Change-Id: I14ee9440859007e86a1ffa937df399a31e2628bd Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2437564 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
tkudav	e962ec3fa0	gpu: nvgpu: Set PC sampling HAL to NULL for GP10b+ Pascal+ chips do not support updating PC sampling using register NV_CTXSW_MAIN_IMAGE_PM (Unlike GM20B, bit 6 = PC_SAMPLING is not present on GP10b, GV11b and TU104). To correct this in NVGPU, we are setting the set_pc_sampling HAL to NULL. We need to make sure devtools also does not call into these APIs. Until the devtools team updates their code, we would return success(0) from update_pc_sampling API even if the HAL is set to NULL. Filed http://nvbugs/200671026 for devtools team. Bug 200604892 Bug 200671026 Change-Id: I6334d4b2a84d7a0f676d7e2faad4befde5f76310 Signed-off-by: tkudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2437002 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	2531107818	gpu: nvgpu: add zbc debug flag and prints Add debug prints in zbc table functions and add zbc debug flag to enable manageable and modular debug prints related to zbc. Bug 3156369 Change-Id: I0fd532ba6e4fd8dba125a2270ea70aaafdb2ed8e Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2434170 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	94bc3a8135	gpu: nvgpu: rearch zbc code and update hals Update nvgpu_gr_zbc as: struct nvgpu_gr_zbc { struct nvgpu_mutex zbc_lock; /* Lock to access zbc table / struct zbc_color_table zbc_col_tbl; /* SW zbc color table pointer / struct zbc_depth_table zbc_dep_tbl; /* SW zbc depth table pointer / struct zbc_stencil_table zbc_s_tbl; /* SW zbc stencil table pointer / u32 min_color_index; / Minimum valid color table index / u32 min_depth_index; / Minimum valid depth table index / u32 min_stencil_index; / Minimum valid stencil table index / u32 max_color_index; / Maximum valid color table index / u32 max_depth_index; / Maximum valid depth table index / u32 max_stencil_index; / Maximum valid stencil table index / u32 max_used_color_index; / Max used color table index / u32 max_used_depth_index; / Max used depth table index / u32 max_used_stencil_index; / Max used stencil table index / }; Add global struct nvgpu_gr_zbc_table_indices struct nvgpu_gr_zbc_table_indices { u32 min_color_index; u32 min_depth_index; u32 min_stencil_index; u32 max_color_index; u32 max_depth_index; u32 max_stencil_index; }; Currently, hw zbc table registers are written during both gr_init_setup_sw() and gr_init_setup_hw(). - Modify nvgpu_gr_zbc_load_default_table() to nvgpu_gr_zbc_load_default_sw_table() to only update sw copy of zbc table during gr_init_setup_sw(). - Modify nvgpu_gr_zbc_load_table() to write zbc values stored in sw zbc table to hw registers. Re-structure zbc function as per zbc type i.e. color, depth and stencil. Add gr.zbc.init_table_indices() hal to initialize zbc indices. Valid ZBC table indices start from 1. HW indices start from 0 for color, depth and stencil tables. Note that the corresponding format registers follow ZBC index range starting at 1. - void (init_table_indices)(struct gk20a g, struct nvgpu_gr_zbc_table_indices zbc_indices); - Add corresponding functions for legacy chips - Add zbc color, depth and stencil table size hw defines - Remove ltc.zbc_table_size() hal - Update ltc.set_zbc_s_entry(), ltc.set_zbc_color_entry and ltc.set_zbc_depth_entry() accordingly. Bug 3122410 Bug 3122649 Change-Id: Ib799991ad35c6613534c0a6eb07f3bf24e600dc5 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2417620 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	1a914b3699	gpu: nvgpu: support preemption mode API for specific GR instance Get current GR instance pointer with nvgpu_gr_get_cur_instance_ptr() in nvgpu_gr_setup_set_preemption_mode() and refer to other GR engine specific data structures using this pointer. Add/update debug prints to include gpu_dbg_gr flag. Jira NVGPU-5648 Change-Id: I38f49b80c4969e9ae20ba1516898fa152786a984 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2419035 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	dd9298c959	gpu: nvgpu: move perf unit accesses to common.perf unit Below HALs are implemented in common.gr unit, but they really belong to common.perf unit since they access registers from perf unit. gops.gr.init_hwpm_pmm_register() gops.gr.get_num_hwpm_perfmon() gops.gr.set_pmm_register() gops.gr.reset_hwpm_pmm_registers() Move them to common.perf unit, and update all the code accordingly gops.perf.init_hwpm_pmm_register() gops.perf.get_num_hwpm_perfmon() gops.perf.set_pmm_register() gops.perf.reset_hwpm_pmm_registers() Add new HAL gops.gr.get_pm_ctx_buffer_offsets() and set it to gr_gk20a_get_pm_ctx_buffer_offsets() for all chips. Bug 2510974 Jira NVGPU-5360 Change-Id: Ib5e84ed5c8b6e72cc6923161e55fc2c3a6a4070e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2418306 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	9652764b65	gpu: nvgpu: reset HWPM regs while binding HWPM in global mode Add new HAL g->ops.gr.reset_hwpm_pmm_registers() to reset all HWPM regs while binding HWPM in global mode in nvgpu_profiler_bind_hwpm() Add below new HALs to get sys/gpc/fbp register list and count g->ops.perf.get_hwpm_sys_perfmon_regs() g->ops.perf.get_hwpm_gpc_perfmon_regs() g->ops.perf.get_hwpm_fbp_perfmon_regs() Auto generate all the HWPM regs in below arrays for gv11b/tu104 static const u32 hwpm_sys_perfmon_regs[] static const u32 hwpm_gpc_perfmon_regs[] static const u32 hwpm_fbp_perfmon_regs[] Bug 2510974 Jira NVGPU-5360 Change-Id: I2ca5c04ed75c7b30ae942807bf018a24551d7ba0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2414934 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	054fcf5635	gpu: nvgpu: Add gr VEID programming for MIG This CL covers the following code changes, 1) Added api to get max VEID count per gpu/gr instance. 2) Added logic to limit the SW VEID bundle programming based on max. VEID count allocated to a gr instance. JIRA NVGPU-5647 Change-Id: I5cbe98c505f81eaf29cc96707782f6350694e4c3 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2417800 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	96dc116eed	gpu: nvgpu: support context creation for specific GR instance Get current GR instance pointer with nvgpu_gr_get_cur_instance_ptr() in nvgpu_gr_setup_alloc_obj_ctx() and update all the code in this function to use this GR instance pointer instead of globally accessing g->gr->* data structures. Add lots of GR engine specific debug prints in context creation path. Jira NVGPU-5648 Change-Id: Ia8681d115ee88c5848621854f23e1cce4ff3deb2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2415239 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Lakshmanan M <lm@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	7b7f42bd33	gpu: nvgpu: add gr ops find_priv_offset_in_buffer Convert gr_gk20a_find_priv_offset_in_buffer into hal function gops.gr.find_priv_offset_in_buffer. This is done in-order to facilitate nvgpu-next to transition into a new ctxsw buffer layout. Bug 2761598 Jira NVGPU-6008 Change-Id: Id294be628944daad7f9afa68214d98d87bbbf68c Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2403708 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	db20451d0d	gpu: nvgpu: fix pmm chiplet offsets gr_gv100_init_hwpm_pmm_register() and gr_gv100_set_pmm_register() right now assume common chiplet stride for all sys/fbp/gpc and use common API g->ops.perf.get_pmm_per_chiplet_offset() to get the stride. Chiplet strides are same for all partitions only by chance, and future chip might change that. Hence add and use below 3 separate HALs to get appropriate strides. g->ops.perf.get_pmmsys_per_chiplet_offset() g->ops.perf.get_pmmgpc_per_chiplet_offset() g->ops.perf.get_pmmfbp_per_chiplet_offset() Also store sys/fbp/gpc perfmon count in struct gk20a after first query instead of querying them again and again. Querying the counts from HW is time consuming. Bug 2510974 Jira NVGPU-5360 Change-Id: I186009221009780d561617c0cd6f535854db585f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2413108 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	6a69ea235e	gpu: nvgpu: disable graphics specific init functions in MIG mode MIG mode does not support graphics, ELPG, and use cases like TPC floorsweeping. Skip all such initialization functions in common.gr unit if MIG mode is enabled. Set can_elpg to false if MIG mode is enabled. Jira NVGPU-5648 Change-Id: I03656dc6289e49a21ec7783430db9c8564c6bf1f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2411741 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	7a937a6190	gpu: nvgpu: add debug logs for common.gr debugging Add separate flag gpu_dbg_gr to enable common.gr specific debugging. Add this flag to all the existing debug logs that use gpu_dbg_fn or gpu_dbg_info for debugging. Also add many other debugging logs that might be helpful in debugging. Removing debug log in gv11b_gr_init_get_nonpes_aware_tpc() as it dumps too much data that does not seem useful. Batch all interrupt enable functions in gr_init_setup_hw() together for readability. Jira NVGPU-5648 Change-Id: I0b857650122cdb1f974b452d28c26e7f142baf61 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2411740 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	bafeea3530	gpu: nvgpu: setup HW for each GR instance Get number of SMs from GR instance specific nvgpu_gr_config pointer instead of global SM count in below functions : nvgpu_gr_fs_state_init() gv11b_gr_init_sm_id_config() Update nvgpu_gr_config_get_gpc_skip_mask() to return 0 in case gpc_index is greater than available gpc_count. This is not MIG specific, but based on code review possible even today for existing chips. See gm20b_gr_init_pd_skip_table_gpc() Update nvgpu_gr_get_override_ecc_val() to return GR instance specific value. Execute gr_init_setup_hw() for each GR instance. Disable below failing unit tests: nvgpu_gr_fs_state.test_gr_fs_state_error_injection nvgpu_gr_init.test_gr_init_hal_config_error_injection Jira NVGPU-5648 Change-Id: Ie8f1c0c304c634756786d85facf336a5c9ae8195 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2410702 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Debarshi Dutta	38ce6fa717	gpu: nvgpu: change unnamed structs to named structs Following changes are made in this patch. 1) Change unnamed structs within gpu_ops to named structs with the prefix gops_. 2) Each named struct gops_ are moved into a separate gops specific file under include/nvgpu/gops/ 3) struct gpu_ops is moved into a separate file include/nvgpu/gpu_ops.h and all other dependent struct gops_ are included in this header. 4) Direct references to include/nvgpu/gops are removed from files as its enough to include gk20a.h. Change-Id: Ieb22cb853be567e3bef14f5f8a04674eebd902ea Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2398776 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Lakshmanan M	48f1da4dde	gpu: nvgpu: Add bundle skip sequence in MIG mode In MIG mode, 2D, 3D, I2M and ZBC classes are not supported by GR engine. So skip those bundle programming sequence in MIG mode. JIRA NVGPU-5648 Change-Id: I7ac28a40367e19a3e31e63f3e25991c0ed4d2d8b Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2397912 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	2012a6b558	gpu: nvgpu: add profiler api to execute regops Implement new API nvgpu_prof_ioctl_exec_reg_ops() to support regops on new profiler objects. Add two new staging buffers to hold regops copied from userspace, and to convert and execute regops in common code. Buffers are allocated and released along with the profiler object. New API will implements this : - copy regops data in chunks of 4K from userspace - store them in staging buffer - convert the new regop struct into common regop struct and also copy the content into second staging buffer - trigger gops.regops.exec_regops() with second staging buffer as operation pointer - convert common regop struct back into new regop struct and copy back to userspace Export bunch of helper functions from ioctl_dbg.h. e.g. nvgpu_get_regops_op_values_common() Update regop execution code to skip regop execution if regop status is not valid. This is only possible when userspace requests for CONTINUE_ON_ERROR mode. Add more documentation to some of the fields in UAPI header. Note that maximum atomic operations reported by new API are same as legacy API and are incorrect. This will be fixed up in upcoming patches. Bug 2510974 Jira NVGPU-5360 Change-Id: I9f82052b22143aec33f6e778c0784386744b699e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2394208 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	2a6fcec078	gpu: nvgpu: add gr manager ops-2 and mig infra-2 This CL covers the code changes related to following support, - Enabled gr manager ops. - Added gr manager init/remove support. - Refactor in gpu instance config infra. - Refactor in gr syspipe gpcs config infra. JIRA NVGPU-5645 JIRA NVGPU-5646 Change-Id: Ib2fab2796d76fe105fc5a08f2c5f9bfa36317f7c Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2393550 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	6df58938ad	gpu: nvgpu: gp10b: add beta cb default size define Currently, get_attrib_cb_default_size() return value is hardcoded with recommended beta cb default size value. Add a macro for the fixed buffer size and add description. JIRA NVGPU-5302 Change-Id: If415e8bc6bc15b2d2ed6875a49a1a23bbe3c740a Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375623 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	e02ea5456b	gpu: nvgpu: tu104: update offset calculation of gpccs ctxsw'ed priregs The ctxsw'ed registers have been moved to a separate list starting from nvgpu_next chip onwards. Hence, update gr_tu104_get_offset_in_gpccs_segment function to account for ctxsw'ed registers in nvgpu_next. Introduce functions: nvgpu_netlist_get_gpc_ctxsw_regs_count to compute the number of ctxsw'ed gpc registers. Bug 2916121 Change-Id: I69fcd8df883af62999d0fa8d1f9a398f8f5d7454 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2394684 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	6daa0636d1	gpu: nvgpu: rework regops execution API Rework regops execution API to accomodate below updates for new profiler design - gops.regops.exec_regops() should accept TSG pointer instead of channel pointer. - Remove individual boolean parameters and add one flag field. Below new flags are added to this API : NVGPU_REG_OP_FLAG_MODE_ALL_OR_NONE NVGPU_REG_OP_FLAG_MODE_CONTINUE_ON_ERROR NVGPU_REG_OP_FLAG_ALL_PASSED NVGPU_REG_OP_FLAG_DIRECT_OPS Update other APIs, e.g. gr_gk20a_exec_ctx_ops() and validate_reg_ops() as per new API changes. Add new API gk20a_is_tsg_ctx_resident() to check context residency from TSG pointer. Convert gr_gk20a_ctx_patch_smpc() to a HAL gops.gr.ctx_patch_smpc(). Set this HAL only for gm20b since it is not required for later chips. Also, remove subcontext code from this function since gm20b does not support subcontext. Remove stale comment about missing vGPU support in exec_regops_gk20a() Bug 2510974 Jira NVGPU-5360 Change-Id: I3c25c34277b5ca88484da1e20d459118f15da102 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389733 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seema Khowala	9ea21459b4	gpu: nvgpu: pascal+: trigger_suspend, wait_for/resume_from _pause set to NULL - NvRmGpuDeviceSetSmDebugMode uses regops interface. - NvRmGpuDeviceTriggerSuspend, NvRmGpuDeviceWaitForPause, and NvRmGpuDeviceResumeFromPause should return error on Pascal+. Use regops interface to suspend/resume. - On non-cilp devices(Maxwell), NvRmGpuDeviceTriggerSuspend, NvRmGpuDeviceWaitForPause, NvRmGpuDeviceResumeFromPause and NvRmGpuDeviceSetSmDebugMode are used when debugger(including coredump, memcheck) is attached or when CUDA application uses a syscall that requires traphandler(assert, cnp). Bug 2558022 Bug 2559631 Bug 2706068 JIRA NVGPU-5502 Change-Id: I9eb2ab0c8c75c50f53523d8bf39c75f98b34f3f0 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376159 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	dd82cdca97	gpu: nvgpu: introduce new ctxsw_addr_type LTS_MAIN The LTS_MAIN will be used by nvgpu-next chips. In addition, update gops_ltc.h to include nvgpu_next_gops_ltc.h and nvgpu_next_gops_ltc_intr.h Jira NVGPU-5352 Bug 200605474 Bug 200608785 Change-Id: Id77ddfc4c1aa2f93e98e05cfd8645f7ffb8f41c8 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2366350 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seema Khowala	23f290a128	gpu: nvgpu: dbgr_control0 value can be different for different SM Do not assume dbgr_control0 register value uniformity as different SM can have different values. JIRA NVGPU-5502 Change-Id: Ib2e1f418f04f142b1948f5713b473df0f9b3ffc3 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2373946 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Seema Khowala	52271d7ab6	gpu: nvgpu: add check for SM debug mode Add check for SM debug mode in trigger_suspend, wait_for_pause and resume_from_pause hals. SMs cannot be suspended/resumed if all SMs are not in debug mode. JIRA NVGPU-5502 Change-Id: I790eb11405155a5e5d327ca048ebf21f9f8d2fab Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2373489 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Antony Clince Alex	8a9acf8a7e	gpu: nvgpu: move set_hww_esr_report_mask to golden context init The driver configures the sm hww global, warp ESR report masks during poweron as part of gops_gr.gr_init_support. However, during golden context init, these are overwritten with default entries from sw_ctx_load list; this leaves the report masks in a state inconsistent with the driver expectation. The driver should configure the sm hww warp, global ESR report masks during golden context init and not before it; Hence, move set_hww_esr_report_mask from power-on path to golden context init. In addition, update set_hww_esr_report_mask to do RMW, so as to retain the values loaded from sw_ctx_load list. Update global ESR report mask to enable all exceptions. Bug 3029888 Bug 2997718 Change-Id: Id7ad4cff5409982143f49695c95c5e1d1c9fdec9 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2367466 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	58ce9156a8	gpu: nvgpu: get gpc/tpc addr width from litter Some chips have GPC/TPC address width exposed through litter value. - Add GPC/TPC address width to litter value. - Update pri_gpccs_addr_width() and pri_tpccs_addr_width() to read value from litter value. JIRA NVGPU-5598 Change-Id: I534fa3188e3412f7e1b7bbf61c8227c966895ea5 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2371425 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Seema Khowala	b91b1f06e1	gpu: nvgpu: check and handle all bits set in fecs_host_intr_status Check all the bits set in fecs_host_intr_status h/w register. Read fecs_host_intr_status before calling handle_fecs_error and store this info in isr_data. JIRA NVGPU-5502 Change-Id: I198b11aa62e394706007d6dc034fe0ac8da2bcb5 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2343684 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	041bedaee9	gpu: nvgpu: fix fecs watchdog method params Use correct condition GR_IS_UCODE_OP_EQUAL and success mailbox value of gr_fecs_ctxsw_mailbox_value_pass_v() instead of using GR_IS_UCODE_OP_SKIP. Continue skipping the ACK from CTXSW on non-silicon platforms. Change-Id: I93b69471b1560acbf06c206ab9bd721d64b7f7d5 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2371275 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00

1 2 3 4 5 ...

361 Commits