linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 18:16:01 +03:00

Author	SHA1	Message	Date
Sagar Kamble	449a4823d4	gpu: nvgpu: compile out non fusa LTC functionality nvgpu_ltc_sync_enabled functionality is used only in the kernel mode submit path and for debugging. en_illegal_compstat functionality is used for debugging . Compile them out under CONFIG_NVGPU_NON_FUSA. JIRA NVGPU-6982 Change-Id: I404d4b74b2e60ba4c2173ba0bfb643b1ecb6ba7c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2605011 (cherry picked from commit f4bcafe73c8f7184b5e125e3ff6e55ceccaf87eb) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632547 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-17 14:36:40 -08:00
Divya	9446cfa320	gpu: nvgpu: update golden image flag for RG seq The flag pmu->pg->golden_image_initialized is set to true during initial GPU context creation and is not cleared while the GPU goes into pm_suspend (during railgate). Hence, when the GPU resumes after un-railgate it retains the previous value which can cause ELPG to kick in immediately. Due to this, when ELPG and Railgating are enabled, IDLE_SNAP is seen for read access of gr_gpc0_tpc0_sm_arch_r reg. To resolve this, if golden image is ready set the pmu->pg->golden_image_initialized to suspend state during railgate, to delay the early enable of ELPG. Add a new pmu_init_golden_img_state hal in the NVGPU_INIT_TABLE_ENTRY. This will be called after all the GR access is done and GPU resumes completely after un-railgate. This hal will then check if golden_image_initialized flag is in suspend state, it will set it to ready state and then re-enable ELPG. Bug 3431798 Change-Id: I1fee83e66e09b6b78d385bbe60529d0724f79e79 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2639188 Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-11 14:06:49 -08:00
Mahantesh Kumbar	ce7d589a4d	gpu: nvgpu: ga10b: add PMU interrupt check hal -GA10B PMU IRQ registers are not accessible when NVRISCV PRIV lockdown is engaged, so need to skip accessing IRQ registers. NVGPU-7061 Change-Id: If5233e502a9bef838839376c412582e08d729a99 Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2636964 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-06 04:37:39 -08:00
Dinesh T	ad09e3e3cc	gpu: nvgpu: Enable sm_l1tag_surface_cut_collector This is enabling sm_l1tag_surface_cut_collector at gpu boot. This is done with adding new hal "set_sm_l1tag_surface_collector" that sets l1tag_surface_cut_collector in the sm_l1tag_ctrl register. Bug 2557724 Change-Id: I869e3bfa563db204259e7a464657229632f182d9 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2634878 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-06 04:36:56 -08:00
dt	e1d6b8af8d	gpu: nvgpu: ga10x: compute gnic_stride GNIC register stride calculation is fixed by adding new hal to compute the stride by getting the difference of gpc1 and gpc0 xbar_gnic strides for ga10x GPUs. Bug 200782045 Change-Id: Iaa84109bd9f1a974ef1af6fee136ca1fcc89bbb1 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2624848 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-01 08:40:36 -08:00
Tejal Kudav	6a1fd53b54	gpu: nvgpu: Mark read_ptimer() HAL as NON_FUSA Remove read_ptimer() API from safety build as GPU_GET_TIME DEVCTL got removed. This functionality is entirely implemented inside nvrm_gpu. Remove related unit-tests. JIRA NVGPU-4922 Change-Id: I3c1d2e16ddf170d4f08d6bf4826ee683ea0d9e19 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2608654 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-01 08:39:27 -08:00
Deepak Nibade	3d9c67a0e7	gpu: nvgpu: enable Orin support in safety build Most of the Orin chip specific code is compiled out of safety build with CONFIG_NVGPU_NON_FUSA and CONFIG_NVGPU_HAL_NON_FUSA. Remove the config protection from Orin/GA10B specific code. Currently all code is enabled. Code not required in safety will be compiled out later in separate activity. Other noteworthy changes in this patch related to safety build: - In ga10b_ce_request_idle(), add a log print to dump num_pce so that compiler does not complain about unused variable num_pce. - In ga10b_fifo_ctxsw_timeout_isr(), protect variables active_eng_id and recover under CONFIG_NVGPU_KERNEL_MODE_SUBMIT to fix compilation errors of unused variables. - Compile out HAL gops.pbdma.force_ce_split() from safety since this HAL is GA100 specific and not required for GA10B. - Compile out gr_ga100_process_context_buffer_priv_segment() with CONFIG_NVGPU_DEBUGGER. - Compile out VAB support with CONFIG_NVGPU_HAL_NON_FUSA. - In ga10b_gr_intr_handle_sw_method(), protect left_shift_by_2 variable with appropriate configs to fix unused variable compilation error. - In ga10b_intr_isr_stall_host2soc_3(), compile ELPG function calls with CONFIG_NVGPU_POWER_PG. - In ga10b_pmu_handle_swgen1_irq(), move whole function body under CONFIG_NVGPU_FALCON_DEBUG to fix unused variable compilation errors. - Add below TU104 specific files in safety build since some of the code in those files is required for GA10B. Unnecessary code will be compiled out later on. hal/gr/init/gr_init_tu104.c hal/class/class_tu104.c hal/mc/mc_tu104.c hal/fifo/usermode_tu104.c hal/gr/falcon/gr_falcon_tu104.c - Compile out GA10B specific debugger/profiler related files from safety build. - Disable CONFIG_NVGPU_FALCON_DEBUG from safety debug build temporarily to work around compilation errors seen with keeping this config enabled. Config will be re-enabled in safety debug build later. Jira NVGPU-7276 Change-Id: I35f2489830ac083d52504ca411c3f1d96e72fc48 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2627048 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-26 08:46:47 -08:00
Seshendra Gadagottu	7194fe5f1f	nvgpu: ga10b: disable errata NVGPU_ERRATA_200601972 To avoid priv access error, disable errata NVGPU_ERRATA_200601972 Enable this errata back once, Bug 3414399 is fixed. Bug 3414399 Change-Id: I3a5277e6f109d319744499bc9898bec4ed292a49 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2620254 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-13 11:29:55 -08:00
Antony Clince Alex	3e7643bb9c	gpu: nvgpu: update gops.mssnvlink Introduce HAL function gops.mssnvlink.get_links, this function retrieves the number of nvlinks supported by the chip along with their base addresses. Update ga10b_mssnvlink_init_soc_credits to call mssnvlink.get_links. Jira NVGPU-6641 Change-Id: I4ff857925f126bf41dc83eebc5723403244f66b0 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2618368 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 07:31:27 -08:00
Antony Clince Alex	1bcc22ab19	gpu: nvgpu: make mssnvlink programming OS agnositc Make ga10b_init_nvlink_soc_credits OS agnostic by replacing OS specific functions with corresponding nvgpu wrappers. This function is now assigned to gops.mssnvlink.init_soc_credits HAL. Introduce nvgpu wrapper, nvgpu_io_map/unmap to map/unmap specified physical address range. Jira NVGPU-6641 Change-Id: I337bc75b8ec36552fe471bf5e42f62c19f67ed4a Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2618237 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 07:31:15 -08:00
Sagar Kamble	83dbb711bb	gpu: nvgpu: make buffer metadata support independent of compression Earlier, buffer metadata support was made dependent on compression. However that is not required. Update the enabled flag NVGPU_SUPPORT_BUFFER_METADATA setup for various hals. Enable it for all from linux characteristics init. Update REGISTER_BUFFER and GET_BUFFER_INFO ioctls to seggregate the compile/runtime compression functionality. If compression is disabled, return error in case comptags are required else don't fail the REGISTER_BUFFER ioctl. Bug 200767700 Change-Id: I3850ccc879f180c97b830fb3d652c094b9d28a5b Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2614378 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 07:30:33 -08:00
Tejal Kudav	243e52a771	gpu: nvgpu: ga10b: Disable compression on Av+L/Q GPU HW expects physically contiguous addresses when clearing the compression bit store in memory. Currently on hypervisor setup, the DMA_ATTR_FORCE_CONTIGUOUS flag ensures contiguous IPA, but it is not possible to ensure contiguous physical memory.Disable compression on virtualized environments until physically contiguous memory is feasible. Buffer Metadata support is dependent on compression support. Move the initialization of NVGPU_SUPPORT_BUFFER_METADATA flag to common code where NVGPU_SUPPORT_COMPRESSION is initialized. Bug 200780546 Change-Id: Id94bffc878e275a80948880f0475162d0bb4ddae Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2607830 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-10-11 17:01:06 -07:00
Divya	af448362eb	gpu: nvgpu: ga10b: set pmu elpg sequencer to NULL - For older chips, nvgpu used to set some registers for ELPG sequencer settings - These writes are no longer required post-Turing as these programming have been updated into the HW default values itself and the register definitions have been changed to help improve security as well. - Set the pmu_setup_elpg HAL to NULL Bug 200766930 Bug 3389932 Change-Id: I3820b14c8491f8180b2feb28cb38e23462546655 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2607599 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-10-11 08:18:12 -07:00
Seshendra Gadagottu	4333bc7faf	gpu: nvgpu: ga10b: patch ctx with rops_crop_debug1_crd_cond_read_disable For ga10b emulate_mode, patch context with rops_crop_debug1_crd_cond_read_disable for required perf setting. Bug 200768322 JIRA NVGPU-6433 Change-Id: Ib1f977ed28e3b18184bce7ac695a0b6a2bae979d Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2602268 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-10-06 18:15:40 -07:00
Ramesh Mylavarapu	d2d59d6206	gpu: nvgpu: add gsp ops to support cmd/msg Added all dependent gsp dependent ops. This include read/write from/into EMEM, get Queue head/tail, engine dependent ops and aperture settings. NVGPU-6784 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: Ic780bfdcd2de593bf2e8f292756e3d1700610ad2 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2590940 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-10-01 09:29:55 -07:00
Mahantesh Kumbar	3880443b53	gpu: nvgpu: t234: Fix PMU IRQ reg PRIV error -GA10B PMU IRQ registers are not accessible when NVRISCV PRIV lockdown is engaged, so need to skip modifying/configuring IRQ registers. -Add new GA10B PMU HAL for PMU IRQ support. -GA10B PMU IRQ HAL checks for PRIV lockdown and if enabled then just enable PMU interrupt from MC, if not enabled then follows legacy chip method to configure the PMU interrupt. Bug 200780546 Change-Id: Idecf460a58b0e334f9ca2301ce8ee33b760b73c0 Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2603245 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-10-01 00:51:54 -07:00
Divya	ae2d561c48	gpu: nvgpu: add platform support for Static PG - Add support for taking static PG config values from DT nodes - Check those values against valid set of values for GPC, TPC and FBP - Store valid values in g->gpc_pg_mask, g->fbp_pg_mask and g->tpc_pg_mask[] array. Bug 200768322 JIRA NVGPU-6433 Change-Id: Ifc87e7d369034b1daa13866bc16a970602514bf6 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2594802 Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-25 15:47:25 -07:00
Seshendra Gadagottu	6bc5e4bf3f	gpu: nvgpu: ga10b: update l2 size using active sets In some configurations, number of active l2 sets may be reduced. Use active sets for reporting actual l2 size. ga10b ltc.determine_L2_size_bytes hal is updated to use active sets during l2 size calculation. Bug 3279344 Change-Id: Icf1cf7ecd751e331a8ec3bd606f7eacb370e9027 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2595566 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-21 20:56:25 -07:00
Divya	9266da636b	gpu: nvgpu: update static pg support for pre-si - On pre-silicon platform, static pg will be done by nvgpu driver. For this, retain structs and HALs of static pg. - Add the static pg support under pre-silicon code. - On silicon, the static pg will be done by BPMP. - Rename variables used in static pg for better readability and consistency Bug 200768322 JIRA NVGPU-6433 Change-Id: Ib31c0f83b751c2b1563a36bd51af78a0bd12a117 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2594801 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-21 13:40:11 -07:00
Sagar Kamble	72c3bce602	gpu: nvgpu: compile out non-safe ctxsw_prog hals Following two hals are non-safe. Compile them under CONFIG_NVGPU_HAL_NON_FUSA: 1. init_ctxsw_hdr_data 2. disable_verif_features JIRA NVGPU-5358 Change-Id: I751c4655dc628f7ab66ed3a779268a6a88f9a1e3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2581361 (cherry picked from commit abf16c6a01109d174879609c10354f06739fb6dc) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2581842 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-21 03:17:12 -07:00
Sagar Kamble	62b04331de	gpu: nvgpu: compile out priv_access_map config/addr hals These hals are non-safe. Compile them out with CONFIG_NVGPU_SET_FALCON_ACCESS_MAP. JIRA NVGPU-5358 Change-Id: I75b46e201fa132e09fee15679a402d24bbf9b2ab Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2581360 (cherry picked from commit d048333ef391019b2618abf7d09c8fe2042f8ee0) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2581841 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-21 03:17:00 -07:00
Tejal Kudav	5a94007725	gpu: nvgpu: Remove redundant HAL from common.fbp common.fbp has two interfaces to initialize FBP: 1. Public API nvgpu_fbp_init_support 2. HAL fbp.fbp_init_support nvgpu_fbp_init_support() is only used to initialize HAL fbp.fbp_init_support. Remove the HAL and use the API directly. JIRA NVGPU-6644 Change-Id: I2c455e09dbcf5e4fb1dc370b284e4f0d5c678b40 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2592047 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-16 05:59:00 -07:00
Seshendra Gadagottu	5f62534127	Revert "gpu: nvgpu: ga10b: add errata for disable CBU ECC" This reverts commit `78d7a7fdde`. Reason for revert: fix is available, so no errata required Bug 200759575 Change-Id: Id46dd3e8ecde1e56fd0e0bca2746dc9c35e07728 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2584855 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-14 16:09:48 -07:00
Vedashree Vidwans	a3e2283cf2	gpu: nvgpu: ga10b: Use active ltcs count for cbc init This patch fixes a bug in the cbc initialization code for ga10b, where it was erroneously assumed that a fixed ltc count of only one should be used for historical reasons. For volta and later, the full ltc count should be used in cbc-related computation. Ensure - CBC base address is 64K aligned - CBC start address lies within CBC allocated memory Check CBC is marked safe only for silicon platform. Bug 3353418 Change-Id: I5edee2a78dc9e8c149e111a9f088a57e0154f5c2 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2585778 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-10 16:00:25 -07:00
dt	152d7c9edd	gpu: nvgpu: Fix for pes_tpc_mask programming After CONFIG_UBSAN kernel compilation flag to know any shifting cause overflow or not enablement ,this is identified. The register "gr_fe_tpc_fs_r(gpc_index)" is read only after Volta. The gops where we are computing the index is not needed. Bug 200727116 Change-Id: Ib2306103389ba9df77fd59d012ec70e775104989 Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2573296 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-07 15:59:48 -07:00
Ramesh Mylavarapu	ffd0d3962f	gpu: nvgpu: gsp: gsp isr and debug trace support - Created GSP NVRISCV interrupt handle and respective functions and register reads. - Created Debug trace support for GSP firmware. NVGPU-7084 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I2728150c4db00403aa6e3c043bc19c51677dd9cf Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2589430 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-07 05:37:51 -07:00
deepak goyal	77d1e765f5	gpu: nvgpu: ga10b: Fix logic for BROM pass status Current code assumes riscv brom passed if it does not times out. This patch explicitly checks for brom pass/fail or timeout. Bug 3361416 Change-Id: I399a6cf9d32be92b24990532f81892642513ba54 Signed-off-by: deepak goyal <dgoyal@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2585786 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-08-31 08:54:35 -07:00
Seshendra Gadagottu	d255c64f50	gpu: nvgpu: ga10x: update pdiv_duration for thermal To keep pdiv_duration at 15usec between steps at 102MHz utilsclk, update stepping duration value from 0xBF4 to 0x5FA for ga10x. Bug 200757274 Change-Id: I333a5b0b35307402a734a7eafc4ab13d20316cd1 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2584539 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-08-30 19:35:54 -07:00
Deepak Nibade	3c97f3b932	gpu: nvgpu: disallow binding more channels than MAX channels supported per TSG There is HW specific limit on number of channel entries that can be added for each TSG entry in runlist. Right now there is no checking to enforce this from SW and hence if User binds more than supported channels to same TSG, invalid TSG formation error interrupts are generated. Fix this by adding appropriate checks in below steps : - Add new field ch_count to struct nvgpu_tsg to keep track of channels bound to TSG. - Define new hal gops.runlist.get_max_channels_per_tsg() to retrieve HW specific maximum channel count per TSG. - Implement the HAL for gk20a and gv11b chips, and assign new HALs for all chips appropriately. - Increment ch_count while binding the channel to TSG and decrement it while unbinding. - While binding channel to TSG, Check if current channel count is already equal to max channel count. If yes, print an error and bail out. Bug 200763991 Change-Id: Ic5f17a52e0fb171d1c020bf4f085f57cdb95f923 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2582095 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-08-25 09:47:47 -07:00
Seshendra Gadagottu	8ed1487860	gpu: nvgpu: ga10b: Enable clock arb support Enable clock arbitration support for silicon. Bug 200764879 Change-Id: I40d47f7f15197a8dd55ca0866e177fd42b8c4e9d Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2579556 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-08-19 14:07:47 -07:00
Seshendra Gadagottu	78d7a7fdde	gpu: nvgpu: ga10b: add errata for disable CBU ECC Add NVGPU_ERRATA_200761358 errata for CBU ECC disable in nvgpu driver. Bug 200761358 Change-Id: I51fcddb47946e84b1cdf39ab908e2185bc112c83 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2574530 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: Antoine Chauveau <achauveau@nvidia.com> Tested-by: V M S Seeta Rama Raju Mudundi <srajum@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-08-12 02:25:43 -07:00
Sagar Kamble	40064ef1ec	gpu: nvgpu: fix ecc counter free ECC counter structures are freed without removing the node from the stats_list. This can lead to invalid access due to dangling pointers. Update the ecc counter free logic to set them to NULL upon free, to remove them from stats_list and free them by validation. Also updated some of the ecc init paths where error was not propa- gated to callers and full ecc counters deallocation was not done. Now, calling unit ecc_free from any context (with counters alloc- ated or not) is harmless as requisite checks are in place. bug 3326612 bug 3345977 Change-Id: I05eb6ed226cff9197ad37776912da9dcb7e0716d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2565264 Tested-by: Ashish Mhetre <amhetre@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-08-11 01:55:08 -07:00
Seshendra Gadagottu	13a77ce843	gpu: nvgpu: ga10b: don't wait for ctxsw wdt ack Currently, ctxsw is not sending watchdog timeout ack that results in GPU timeout and failure on silicon. Bug 3354738 Change-Id: Idc8fbe3bcc8c539a8b391f19c5bfa3207d1a3e45 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2570595 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-08-07 21:23:13 -07:00
Vedashree Vidwans	a7a2e1e263	gpu: nvgpu: ga10b: update cbc divisor and top reg Currently, cbc init and compression tests are failing because MMU marks cbc to be not safe. - Modify cbc.get_base_divisor hal to use ltc_count = 1 for Tegra devices - Update fb.cbc_configure to write compbit_backing_size value to fb_mmu_cbc_top register. - After config confirm that CBC is marked safe. Bug 3353418 Change-Id: I1e9c27f47f7bfcf476f2499231951382e2e8653d Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2570550 Reviewed-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: V M S Seeta Rama Raju Mudundi <srajum@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-08-05 22:33:56 -07:00
mkumbar	87984ea344	gpu: nvgpu: support nvriscv debug feature Enable nvriscv debug buffer feature in NVGPU. Debug buffer is a feature to print the debug log from ucode onto console in real time. Debug buffer feature uses the DMEM, queue and SWGEN1 interrupt to share ucode debug data with NVGPU. Ucode writes debug message to DMEM and updates offset in queue to trigger interrupt to NVGPU. NVGPU copies the debug message from DMEM to local buffer to process and print onto console. Debug buffer feature is added under falcon unit and required engine can utilize the feature by providing required param through public functions. Currently GA10B NVRISCV NS/LS PMU ucode has support for this feature and enabled support on NVGPU side by adding required changes, with this feature enabled, it is now possible to see prints in real time. JIRA NVGPU-6959 Change-Id: I9d46020470285b490b6bc876204f62698055b1ec Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2548951 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-17 12:45:00 -07:00
Ramesh Mylavarapu	d328bff79e	gpu: nvgpu: gsp NVRISCV load and bootstrap Changes: - This change will only init gsp software state, nvgpu_gsp_bootstrap need to be called. - CONFIG_NVGPU_GSP_SCHEDULER flag is created to compile out the gsp scheduler code when needed. - Created GSP engine reset which is needed when ACR completed execution and need to load gsp fw. NVGPU-6783 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I2ce43e512b01df59443559eab621ed39868ad158 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2554267 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-15 17:21:03 -07:00
Divya Singhatwaria	77e3a8c5e4	gpu: nvgpu: ga10b: Add request_idle ce ops Issue observed: - In GA10B, it was observed that after recovery happens ELPG does not engage. - It was because, after CE reset, when nvgpu_submit_twod test was run to engage ELPG, IDLE_FLIPPED_PWR_OFF signal was asserted. - This means that when ELPG was engaged (engine is in PWR_OFF), some idle signal flips (becomes non-idle) and this causes IDLE_SNAP. After IDLE_SNAP is hit, ELPG will not engage further. - After debugging from WAVES, it was observed that: LCE0/LCE1 are not done with the reset sequence. - The state of these LCE is RESET0. A pri request (pri read to NV_CE_PCE_MAP register in CE) is seen that kicks it out of RESET0. After this state, it goes through few states to update some internal states (states RESET1/RESET2/PCE_MAP etc) and then eventually settles down to IDLE state. Solution: - Read ce_pce_map_r register in recovery sequence (after ce reset). - It is observed that when this read is added recovery is complete and post that when nvgpu_submit_two test is executed, ELPG is engaging. - This means that a pri read is needed after CE reset so that it settles to idle state properly and post that ELPG can engage properly. Bug 200734258 Change-Id: I5bb84921ca62a740fde81ffe6c29ccde4ebb341b Signed-off-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2554493 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-15 10:05:02 -07:00
Deepak Nibade	4edf952e3e	gpu: nvgpu: fix rule 5.1 misra violations in common.gr Fix rule 5.1 misra violations in common.gr by renaming below functions : nvgpu_gr_config_get_gpc_tpc_mask_base -> nvgpu_gr_config_get_base_mask_gpc_tpc nvgpu_gr_config_get_gpc_tpc_count_base -> nvgpu_gr_config_get_base_count_gpc_tpc gm20b_ctxsw_prog_set_priv_access_map_config_mode -> gm20b_ctxsw_prog_set_config_mode_priv_access_map gm20b_ctxsw_prog_set_priv_access_map_addr -> gm20b_ctxsw_prog_set_addr_priv_access_map gm20b_gr_falcon_read_fecs_ctxsw_mailbox -> gm20b_gr_falcon_read_mailbox_fecs_ctxsw gm20b_gr_falcon_read_fecs_ctxsw_status0 -> gm20b_gr_falcon_read_status0_fecs_ctxsw gm20b_gr_falcon_read_fecs_ctxsw_status1 -> gm20b_gr_falcon_read_status1_fecs_ctxsw gv11b_gr_intr_get_sm_hww_warp_esr_pc -> gv11b_gr_intr_get_warp_esr_pc_sm_hww gv11b_gr_intr_get_sm_hww_warp_esr -> gv11b_gr_intr_get_warp_esr_sm_hww Jira NVGPU-6779 Change-Id: Icbe23a7b022373785968fc417ee247e2d80cfcc6 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2554521 (cherry picked from commit 1432650774506f2a7e45f70b084f498736d0d0c5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2555330 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-13 09:20:41 -07:00
Antony Clince Alex	f51a43b579	gpu: nvgpu: ga10b: fix fetching of FBP_L2 FS mask On all chips except ga10b, the number of ROP, L2 units per FBP were in sync, hence, their FS masks could be represented by a single fuse register NV_FUSE_STATUS_OPT_ROP_L2_FBP. However, on ga10b, the ROP unit was moved out from FBP to GPC and it no longer matches the number of L2 units, so the previous fuse register was broken into two - NV_FUSE_CTRL_OPT_LTC_FBP, NV_FUSE_CTRL_OPT_ROP_GPC. At present, the driver reads the NV_FUSE_CTRL_OPT_ROP_GPC register and reports incorrect L2 mask. Introduce HAL function ga10b_fuse_status_opt_l2_fbp to fix this. In addition, rename fields and functions to exclusively fetch L2 masks, this should help accommadate ga10b and future chips in which L2 and ROP units are not in same. As part of this, the following functions and fields have been renamed. - nvgpu_fbp_get_rop_l2_en_mask => nvgpu_fbp_get_l2_en_mask - fuse.fuse_status_opt_rop_l2_fbp => fuse.fuse_status_opt_l2_fbp - nvgpu_fbp.fbp_rop_l2_en_mask => nvgpu_fbp.fbp_l2_en_mask The HAL ga10b_fuse_status_opt_rop_gpc is removed as rop mask is not used anywhere in the driver nor exposed to userspace. Bug 200737717 Bug 200747149 Change-Id: If40fe7ecd1f47c23f7683369a60d8dd686590ca4 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2551998 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-07 05:48:56 -07:00
Pekka Jylhä-Ollila	8a72068508	Revert "gpu: nvgpu: gsp NVRISCV load and bootstrap" This reverts commit `aef4b80acb`. Change-Id: I47e02bf97e6a3aaa9acdd7f5eec41518b31ee5dc Signed-off-by: Pekka Jylhä-Ollila <pjylhaollila@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2554105 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com>	2021-07-05 06:01:52 -07:00
Ramesh Mylavarapu	aef4b80acb	gpu: nvgpu: gsp NVRISCV load and bootstrap Changes: - This change will only init gsp software state, nvgpu_gsp_bootstrap need to be called. - CONFIG_NVGPU_GSP_SCHEDULER flag is created to compile out the gsp scheduler code when needed. - Created GSP engine reset which is needed when ACR completed execution and need to load gsp fw. NVGPU-6783 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I26263ee5bae07de056f676ed0fddc1193b5af82d Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2530438 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-04 13:34:51 -07:00
Lakshmanan M	e9872a0d91	gpu: nvgpu: Skip graphics unit access when MIG is enabled This CL covers the following modifications, 1) Added logic to skip the graphics unit specific sw context load register write during context creation when MIG is enabled. 2) Added logic to skip the graphics unit specific sw method register write when MIG is enabled. 3) Added logic to skip the graphics unit specific slcg and blcg gr register write when MIG is enabled. 4) Fixed some priv errors observed during MIG boot. 5) Added MIG Physical support for GPU count < 1. 6) Host clk register access is not allowed for GA100. So skipped to access host clk register. 7) Added utiliy api - nvgpu_gr_exec_with_ret_for_all_instances() 8) Added gr_pri_mme_shadow_ram_index_nvclass_v() reg field to identify the sw method class number. Bug 200649233 Change-Id: Ie434226f007ee5df75a506fedeeb10c3d6e227a3 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2549811 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-07-02 16:41:51 -07:00
Antony Clince Alex	d2919409e9	gpu: nvgpu: rename/collpase nvgpu_next functions and structs Replace all nvgpu_next functions/structs either by 1) collapsing them into nvgpu legacy functions/structs 2) renaming them as follows: - nvgpu_next_() => nvgpu_(ga10b/ga100)_() - nvgpu_next_() => (ga10b/ga100)_() - nvgpu_next_() => nvgpu_() [only if this doesn't cause collision] - nvgpu_next_() = > nvgpu__extra() Create hal.sim unit and move Ampere+ SIM code into it. Jira NVGPU-4771 Change-Id: I215594a0d0df4bd663bd875a0d0db47bcb9ff6a2 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2548056 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-27 05:02:58 -07:00
Sagar Kadamati	3e43f92f21	gpu: nvgpu: add ga10b & ga100 sources Mass copy ga10b & ga100 sources from nvgpu-next repo. TOP COMMIT-ID: 98f530e6924c844a1bf46816933a7fe015f3cce1 Jira NVGPU-4771 Change-Id: Ibf7102e9208133f8ef3bd3a98381138d5396d831 Signed-off-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2524817 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-17 12:56:16 -07:00

44 Commits