linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 18:16:01 +03:00

Author	SHA1	Message	Date
Antony Clince Alex	62d6f753d2	gpu: nvgpu: add support for PES, ROP floorsweeping Volta+ chips supports PES floorsweeping and Ampere+(iGPU) chips supports ROP floorsweeping. At present, the driver isn't aware of PES, ROP floorsweeping, make the driver PES, ROP floorsweeping aware by introducing the following fields in nvgpu_gr_config: - gpc_(rop/pes)_mask: Contains the bit mask of non FSed ROP/PES units per GPC. - gpc_(rop/pes)_logical_id_map: Translates per GPC ROP/PES physical id to logical id. Introduce the following HAL functions to read PES/ROP FS data: - gops_fuse.fuse_status_opt_(pes/rop)_gpc: This fuction gets the FS config from the fuse. - gops_top.get_max_(pes/rop)_per_gpc: Gets the maximum number of PES/ROP units that can be present in a GPC. In addition, introduce the enabled flag NVGPU_SUPPORT_PES_FS to identify chips which support PES floorsweeping, piggyback on NVGPU_SUPPORT_ROP_IN_GPC enabled flag to identify ROP floorsweeping. Bug 3524791 Change-Id: I065bab6c02618fe38892c8c890b069c340b85301 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2679570 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-04-13 02:32:14 -07:00
Antony Clince Alex	19a8adeae1	gpu: nvgpu: prof: add new resource type Add new profiler resource type NVGPU_PROFILER_PM_RESOURCE_TYPE_PC_SAMPLER. Introduce regops HAL get_hwpm_pc_sampler_register_ranges to get allowlist for PC_SAMPLER resources. Re-generate allowlist files to include register ranges for PC_SAMPLER resources. Update uapi header to advertise new resource type NVGPU_PROFILER_PM_RESOURCE_ARG_PC_SAMPLER. Bug 3408536 Change-Id: I7009ef822665771eed727da48ef1e89dcc6b9c4b Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2689057 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-04-12 16:30:52 -07:00
Sagar Kamble	71eee998b1	gpu: nvgpu: sanitize page allocator name strncpy to a string from non-null terminating string without checking the size of the source string can lead to target string become non-null terminating. This can't be passed to strcat as it expects null terminating string. Check the size of the source string "name" in nvgpu_page_allocator_init. CID 81207 Bug 3512546 Change-Id: I4b245a8c2236038c40912cee72d4dbf1ca14a525 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2692604 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-04-07 03:24:15 -07:00
Divya	fb019bf43a	gpu: nvgpu: async cmd resp for gv11b - When DISALLOW cmd is sent from driver to PMU the actual completion of the disallow will be acknowledged by PMU via a PG EVENT: ASYNC_CMD_RESP. - Disallow needs a delayed ACK from PMU in order to disable the ELPG. - If ELPG is already engaged, the DISALLOW cmd will trigger ELPG exit and then transition to PMU_PG_STATE_DISALLOW. - After this whole process is completed, PMU will send DISALLOW_ACK through ASYNC_CMD_RESP msg. - After disallow command is sent from the driver, NvGPU driver waits/polls for disallow command ack. This is sent immediately by msg framework of PMU. - Then, the driver will poll/wait for ASYNC_CMD_RESP event which is the delayed DISALLOW ACK. - The driver captures the ASYNC_CMD_RESP sent from PMU. - set disallow_state to ELPG_OFF. - If the driver does not wait/poll for this delayed disallow ack from PMU, it can result in erros as PMU is still processing DISALLOW cmd but the driver progressed further. Bug 3580271 Change-Id: I332180c05b6a398107f065d54e9718b7038fb1b2 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2689500 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-04-07 03:21:29 -07:00
Antony Clince Alex	9e0fd1a093	gpu: nvgpu: gr: update gr suspend Update GR suspend routine to clear GR falcon "coldboot_bootstrap_done" flag, this is needed because GPU power rails are turned off during suspend cycle due to which GR falcons need to be bootstrapped again during resume. Function "nvgpu_gr_falcon_suspend" is added to clear the above mentioned flag. Bug 3497398 Bug 3514055 Change-Id: If852a2c09f05c096f287b845c56d8b4f335ec8e7 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2670554 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-28 23:47:06 -07:00
Jinesh Parakh	bbaf01590c	gpu: nvgpu: Fix Logically dead code Coverity bugs Fixed following Coverity Defects: ioctl_clk_arb.c : Logically dead code gr_gp10b.c : Logically dead code vfe_var.c : Logically dead code grmgr_ga10b.c : Logically dead code vm_remap.c : Logically dead code falcon_debug.c : Logically dead code CID 1994001 CID 3008644 CID 9870823 CID 10062537 CID 10127915 CID 10128008 Bug 3460991 Signed-off-by: Jinesh Parakh <jparakh@nvidia.com> Change-Id: I711d2ccb480328d8f0a4ba49e877612669f3d41f Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2686362 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-28 07:36:44 -07:00
Jinesh Parakh	d4cb2eb3c0	gpu: nvgpu: Fix Dereference Coverity issues Fixed following Coverity Defects: fw.c : Dereference after null check channel.c : Dereference before null check log.c : Dereference before null check CID 10064128 CID 10056456 CID 10127934 Bug 3460991 Signed-off-by: Jinesh Parakh <jparakh@nvidia.com> Change-Id: I9c075f5c38c2254d5c656af58bb002714bd53396 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2685320 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-28 07:36:10 -07:00
Dinesh T	90d245978f	gpu: nvgpu: Fix for Compression enablement on safety This is removing NON-FUSA code that is needed for compression enablement on safety. The code is needed for comptag update on page table entry used by the GPU. Bug 3582013 Change-Id: Ib4e5c9810fabac5f479e0993184b9abf35df4afb Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2686411 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Martin Radev <mradev@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-25 21:37:15 -07:00
Konsta Hölttä	e9d453806c	gpu: nvgpu: move duplicate timer api to common The high level API for the timer unit is the same across all OSs, so get rid of the slight code duplication by moving the timer init functions under a new file in common code: - nvgpu_timeout_init_cpu_timer - nvgpu_timeout_init_cpu_timer_sw - nvgpu_timeout_init_retry Much of the timer logic is also duplicated, but it is mixed between OS specific current time retrieval. With some refactoring and addition of an OS independent time keeping layer, that logic could also be made shared. Change-Id: I75d02ceb0d32022b0ba7f3bcd9fdb13d47039dbc Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2669510 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-25 21:33:21 -07:00
mkumbar	8cce8dea70	gpu: nvgpu: PMU NVRISCV BR failure HSI - Add PMU NVRISCV BR failure HSI support. - Created a falcon unit function to check for the BR competition status check and called from other units as needed. Bug 3491596 Bug 3366818 Change-Id: I5c3c6a7e6aeaad68f77e6b24f21239e40d9a7f78 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2686370 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-25 13:25:27 -07:00
Antony Clince Alex	f670687441	gpu: nvgpu: move ltc_tstg_mgmt register setup The ltc_ltcs_ltss_tstg_set_mgmt_3 register should only be configured after ACR init, hence move it down the init order from early_init to finalize_poweron after acr is loaded. Bug 3514215 Change-Id: I2462715d25f75b7476ab163cd6c9f73ced5efb6d Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2685547 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-25 13:14:15 -07:00
Rajesh Devaraj	c5822b0d98	gpu: nvgpu: add error prints for errors reported to sdl In Drive 6.0, only error IDs are reported to Safety_Services. The additional debug/error information is printed using nvgpu_err(). JIRA NVGPU-8094 Bug 3491596 Change-Id: Ie90f3e1453e6a796d5c76373c11f8a5a188ac590 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2684289 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-22 17:55:10 -07:00
Rajesh Devaraj	9edbac4494	gpu: nvgpu: add macros related to error reporting This patch does the following: - adds macros related to error reporting - introduces a flag to enable polling for error reporting JIRA NVGPU-8094 Bug 200729736 Change-Id: Ib02e8b7a7765e45eb1b3b3c6dba3720d5421a638 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2683864 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-21 10:51:31 -07:00
Divya	201b5c1c7f	gpu: nvgpu: add SLCG support for GSP and CTRL unit Add SLCG register programming for GSP and CTRL units Bug 3452217 Change-Id: I69e414a82b5c12f26ff3b6626c328b5c0aa9e04c Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678782 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-18 07:54:48 -07:00
mpoojary	7df16ee9c4	gpu: nvgpu: Add support for acr safety binaries Add support to pick ACR safety binaries when in safety for ga10b Jira NVGPU-8108 Change-Id: I3aca5e9d4b6e90af87cc7d8520366304ab579ec3 Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2680710 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-17 12:21:28 -07:00
mpoojary	c1a995403a	gpu: nvgpu: Add ACR error reporting to SDL -Add check for ECC parity errors in IMEM, DMEM, EMEM, DCLS, REG for ACR running in GSP engine. The EXTIRQ3 external interrupt is set from ACR pointing towards host. -Add function to check error type when ACR or Bootrom execution fails and report accordingly to SDL with relevant error codes. This is a part of HSI safety requirements. Bug 3564039 Jira NVGPU-8108 Change-Id: I65407371f7a1d1ba50a10bdf443ef6b903eeaa36 Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678100 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-15 17:33:42 -07:00
Dinesh T	358f62a9d7	gpu: nvgpu: Add compression for safety This is adding compression support for qnx-safety by - Adding the compression related files under FUSA. - Adding new posix contig-pool.c for user space compilation. Bug 3426194 Change-Id: Ib3c8e587409dc12099c1196f55a87858d4dc520e Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2652963 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-15 17:30:57 -07:00
Tejal Kudav	b80b2bdab8	gpu: nvgpu: Add CE interrupt handling a. LAUNCH_ERR - Userspace error. - Triggered due to faulty launch. - Handle using recovery to reset CE engine and teardown the faulty channel. b. An INVALID_CONFIG - - Triggered when LCE is mapped to floorswept PCE. - On iGPU, we use the default PCE 2 LCE HW mapping. The default mapping can be read from NV_CE_PCE2LCE_CONFIG INIT value in CE refmanual. - NvGPU driver configures the mapping on dGPUs (currently only on Turing). - So, this interrupt can only be triggered if there is kernel or HW error - Recovery ( which is killing the context + engine reset) will not help resolve this error. - Trigger Quiesce as part of handling. c. A MTHD_BUFFER_FAULT - - NvGPU driver allocates fault buffers for all TSGs or contexts, maps them in BAR2 VA space and writes the VA into channel instance block. - Can be triggered only due to kernel bug - Recovery will not help, need quiesce d. FBUF_CRC_FAIL - Triggered when the CRC entry read from the method fault buffer does not match the computed CRC from the methods contained in the buffer. - This indicates memory corruption and is a fatal interrupt which at least requires the LCE to be reset before operations can start again, if not the entire GPU. - Better to quiesce on memory corruption CE Engine reset (via recovery) will not help. e. FBUF_MAGIC_CHK_FAIL - Triggered when the MAGIC_NUM entry read from the method fault buf does not match NV_CE_MTHD_BUFFER_GLOBAL_HDR_MAGIC_NUM_VAL - This indicates memory corruption and is a fatal interrupt - Better to quiesce on memory corruption f. STALLING_DEBUG - Only triggered with SW write for debug purposes - Debug interrupt, currently ignored Move launch error handling from GP10b to GV11b HAL as - 1. LAUNCHERR_REPORT errcode METHOD_BUFFER_ACCESS_FAULT is not defined on Pascal 2. We do not support GP10b on dev-main ToT JIRA NVGPU-8102 Change-Id: Idc84119bc23b5e85f3479fe62cc8720e98b627a5 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678893 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-14 17:12:14 -07:00
Deepak Nibade	a1ef716f9d	gpu: nvgpu: set graphics specific PRI values for graphics contexts Add new HAL gops.gr.init.set_default_gfx_regs() to set graphics specific PRI values for graphics contexts in function nvgpu_gr_obj_ctx_alloc(). Add new HAL gops.gr.init.capture_gfx_regs() to capture and save init values for the PRIs. Add new struct nvgpu_gr_obj_ctx_gfx_regs to hold the PRI init values. Define HAL functions gv11b_gr_init_set_default_gfx_regs() and gv11b_gr_init_capture_gfx_regs(). Set the HAL functions for gv11b and ga10b. Register accessors required to set PRIs are auto-generated. Bug 3506078 Change-Id: I4c2843a274f3c924e402541e600e104ed0c9ed1c Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671598 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: Jonathan Mccaffrey <jmccaffrey@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-14 13:17:05 -07:00
Dinesh T	e4cf52123f	gpu: nvgpu: Add ce halt function This is adding CE halt fuction to reset CE properly by setting stall req and waiting for stallack. Bug 200641946 Change-Id: I501ccf68a4f6fe95911e73fa2eb65bde93a9f3e9 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678366 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-11 20:44:38 -08:00
srajum	390df709ca	gpu: nvgpu: fixing static analysis violation - MISRA Rule 17.7 The value returned by a function having non-void return type shall be used JIRA NVGPU-5955 Change-Id: I59539042d05afa9e74272fc8645b2fe1fa8e42aa Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2572085 (cherry picked from commit bd75b6196b7fac67fbf7a458e6bed9e3c7076ee8) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678671 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-10 16:03:28 -08:00
srajum	8381647662	gpu: nvgpu: fixing MISRA violations - MISRA Directive 4.7 Calling function "nvgpu_tsg_unbind_channel(tsg, ch, true)" which returns error information without testing the error information. - MISRA Rule 10.3 Implicit conversion from essential type "unsigned 64-bit int" to different or narrower essential type "unsigned 32-bit int" - MISRA Rule 5.7 A tag name shall be a unique identifier JIRA NVGPU-5955 Change-Id: I109e0c01848c76a0947848e91cc6bb17d4cf7d24 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2572776 (cherry picked from commit 073daafe8a11e86806be966711271be51d99c18e) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678681 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-10 16:01:18 -08:00
srajum	069fe05dca	gpu: nvgpu: remove whitelisting for wrongly reported violations by tool - Earlier we whitelisted wrongly reported static analysis violations by tool, raised coverity tool bugs for these cases. - These bugs are fixed with new version of tool, so no need fo whitelisting. JIRA NVGPU-7119 Change-Id: Ib2341db0d46fa7fac4c0cc9a6c1bdc8704377ef1 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2604365 (cherry picked from commit dc2d8ddaa409aefe0e04e0bacb3a8a977f6dbd64) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2677523 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-10 16:01:06 -08:00
Rajesh Devaraj	2ff9f6b759	gpu: nvgpu: add config for misc_ec and update error id This patch does the following: - Defines a new flag "CONFIG_NVGPU_ENABLE_MISC_EC" when the build is not targeted for RM Server. Since iGPU is in pass-through mode in both safety and standard build, EPL libraries will not be included in RM Server. This is done with the help of the flag "NV_BUILD_CONFIGURATION_IS_VM_SERVER". - Updates error id that will be reported to Safety_Services. The format of the error ID is: - HW_unit_id: (4-bits: bit 0 to 3), - Error_id: (5-bits: bit 4 to 8), - Corrected/Uncorrected error: (1-bit: bit-9), - Remaining 22-bits are unused. - Defines macros that will be used to form error ID. - Defines a macro for SW_ERR_CODE_0 register which is allocated for NvGPU to report errors to Safety_Services via MISC_EC. JIRA NVGPU-8094 Change-Id: I02f37db75ef3b82952ef5f196f4e065d6c5d1a3e Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2677373 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-10 16:00:53 -08:00
Richard Zhao	cf43371073	nvgpu: vgpu: ga10b: enable compression - contiguous mempool has been added on server side. - init cbc support only on compression flag enabled - enable compression flag only on silicon Jira GVSCI-12883 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I339f25b81224b55124928231be65070660e27080 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2676951 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-09 21:08:57 -08:00
Rajesh Devaraj	329807b8f9	gpu: nvgpu: update error ids for pgraph This patch updates PGRAPH related error IDs for ga10b. Since sub error type is not supported in Safety_Services 6.0, dedicated error IDs have been allocated for all sub-errors in PGRAPH. JIRA NVGPU-8094 Change-Id: Ic8de5815c5ea63e290d11ffca598e58812573603 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2678289 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-09 04:42:36 -08:00
Tejal Kudav	3fe70bf86e	gpu: nvgpu: Update CE Intr code as per Orin HSIs Below CE interrupts do not have any users(usecases) on safety build; disable them only on safety build. 1. BLOCKPIPE stall intr: Not used by GFX(VKSC) and CUDA on safety. 2. NONBLOCK_PIPE nonstall intr: Non-stall intrs are not supported on safety build. Also, this one is not used by GFX(VKSC) and CUDA. 3. STALLING_DEBUG intr: Added in Orin tree. It is only needed for debugging. Disable on safety build as there is no current usage in driver. 4. POISON_ERROR intr: Poison is a fault containment and not supported on GA10b. 5. INVALID_CONFIG intr: Floor sweeping not supported on functional safety SKU. Bug 3548082 Change-Id: I8d97ccb38f138b2c04a780e1c255a64d28723405 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671927 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-08 11:41:26 -08:00
srajum	07583dffed	gpu: nvgpu: fix MISRA 5.7 and 10.4 violations - Rule 5.7 doesn't allow an identifier to be reused. This change renames variable "ops" to resolve this violation. - Rule 10.4 says both operands of operators in which arithmetic operations will be do shall be of same type. JIRA NVGPU-6056 Change-Id: Ic88f398c49d122cee206efcf88afd1edf951b042 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2561772 (cherry picked from commit c129465413db2c28bfcb0a039962cb65e2fca1ea) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2677518 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-08 05:31:29 -08:00
Antony Clince Alex	c0f4723339	gpu: nvgpu: perbuf: update PMA buffer mapping The PMA unit can only access GPU VAs within a 4GB window, hence both the user allocated PMA buffer and the kernel allocated bytes available buffer should lie in the same 4GB window. This is accomplished by carving out and reserving a 4GB VA space in perbuf.vm and using fixed GPU VAs to ensure that both buffers are bound within the same 4GB window. In addition, update ALLOC_PMA_STREAM to use pma_buffer_offset, pma_buffer_map_size fields correctly. Bug 3503708 Change-Id: Ic5297a22c2db42b18ff5e676d565d3be3c1cd780 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671637 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-07 15:17:35 -08:00
mpoojary	e7c082aa66	gpu: nvgpu: Enable falcon debug flag for safety debug Falcon safety debug flag was previously disabled for safety debug profile. This patch enables the flag support for safety debug. copy_from_dmem function is required to copy the debug info from dmem debug buffer whenever there's an error generated. Hence, moved copy_from_dmem function to fusa file from non-fusa and added ifdef condition to only enable when non-fusa or falcon debug flag is set. Also, some fixes for type conversion error in falcon_debug.c during compilation. Bug 3482988 Change-Id: Ic0ea32b3227b84d4ba0835e6e1aeb40f58ec7327 Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673900 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-07 06:25:35 -08:00
mkumbar	6609a05683	gpu: nvgpu: Move ACR WPR init region cmd to LSFM Move ACR WPR init region cmd from ISR to LSFM as part of LSF bootstrap request to execute the ACR commands sequentially as well as a blocking call by polling is_wpr_init_done status till set to true. Needed to add dealy after each ACR command for ga10b LSPMU due to nvriscv priv lockdown for ACR commands asynchronously from the nvgpu as detailed below, LSPMU engages priv lockdown whenever ACR commands needs to be processed, and nvgpu polls for interrupt status by polling pwr_falcon_irqstat_r registers once command is sent to PMU to process the ACK message from LSPMU if priv lockdown is not engaged. During NVRISCV priv lockdown couple of register are not accessible including irqstat register, priv lockdown is done by LSPMU upon ACR command receive and its asynchronous to nvgpu which cause nvgpu irqstat read data to be 0xbadf* during polling at corner cases even though priv lockdown check is present and interpreting wrongly the irq stat register. Add delay of 5ms after ACR command sent to LSPMU(LSPMU takes ~3.5msec to complete the command process) and before polling the irqstat register in nvgpu to engage priv lockdown in LSPMU. This additional delay will help to skip reading the irqstat at corner case during the priv lockdown process. Bug 3464141 Bug 3482947 Change-Id: I494493a92f6ede5dcb876aeb0d76d54969f0f59e Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673246 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-02 18:07:03 -08:00
srajum	8e56c73eab	gpu: nvgpu: fixing MISRA Rule 21.2 violation - "va_start", "time" a reserved identifiers or macro names described in Section 7, "Library", of the C standard, shall not be declared. JIRA NVGPU-6536 Change-Id: I868362819dd7178eb7b165f243fb6d36322d8372 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2582291 (cherry picked from commit 29c2c55b184cf16aee51614da895747750217885) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2674867 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-01 06:08:00 -08:00
Konsta Hölttä	2ab6184955	gpu: nvgpu: debug dump tsg domain name Include the scheduling domain name in the channel debug dump. The domain name of a channel is the domain name of its parent TSG, if any. Copy just the name into the dump info to avoid refcounting concerns. While at it, reword the deterministic flag for less ambiguity. Jira NVGPU-6791 Change-Id: I06041277f938e20f23de9aa419cfffbaa028035e Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673101 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-03-01 00:09:12 -08:00
Konsta Hölttä	f10ee4ab0e	gpu: nvgpu: add domain name API Add nvgpu_nvs_domain_get_name() to minimize messing up with nvs internals and to help code organization when nvs is not built in yet. A stub to help compilation returns NULL because no domains can exist when the stub is built in, and thus it won't be used. Jira NVGPU-6788 Change-Id: If663f7c0e8434ef00dd3a3f40f6404a35b477f2b Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673120 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-01 00:09:01 -08:00
Konsta Hölttä	2a8914619d	gpu: nvgpu: bind sched domains as fds Replace id-based lookup with fd-based lookup when binding a TSG to a domain. The device node based domain interface naturally provides access control; this way userspace tools can limit which uid/gid can access each domain. Also, explicitly disallow binding channels to a TSG that has no runlist domain yet. Normally a TSG is in the default domain if nothing else has been specified, but the default domain can be deleted. Jira NVGPU-6788 Change-Id: I2af96dfc002367d894eaf0c175006332f790c27f Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2651165 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-01 00:08:55 -08:00
Konsta Hölttä	3a64fdefc4	gpu: nvgpu: domains as files for access control Create device nodes for user-created scheduling domains. This helps leverage filesystem based access control: domains can be chosen to be available for a limited set of users on a system. The device nodes are dynamic: they can be removed while the driver is running normally. This is a bit different from the nodes that exist until the driver is unloaded, so the devno/domain mapping is stored in a separate list. The usual container_of pattern would suffer from an unavoidable race condition if a domain file was opened while the same domain would get removed. As usual, domain refcounting prevents a domain from being removed. Now the open device files hold refs and thus any open domain files prevent a domain from getting removed, in addition to the userspace-invisible ref that is taken when a TSG is bound to a domain. While at it, make the query ioctl guarded by the sched domain mutex, as domains might technically get added or removed during the querying code. Jira NVGPU-6788 Change-Id: Ief2a09a442c4e70f1f2be8a32359341071d74659 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2651164 Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-01 00:08:49 -08:00
srajum	ad74b20cca	gpu: nvgpu: uniform approach for both quiesce state exit and BUG() - We are using exit() API to exit the process when gpu goes to quiesce state, but MISRA not allowing to use exit() API. JIRA NVGPU-7056 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2588005 (cherry picked from commit 219c172da5b6ae330121009765f7eb090b87e054) Change-Id: Id5b927693cc684931bd4aa56e66543b673d0c493 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2674411 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-27 18:12:32 -08:00
srajum	d1b3a9359a	gpu: nvgpu: fixing MISRA Violations - MISRA C-2012 Rule 10.5 The values of an expression should not be cast to an inappropriate essential type. - MISRA C-2012 Rule 21.18 The size_t argument passed to any function <string.h> shall have an appropriate value. JIRA NVGPU-7056 Change-Id: I6a9f0de0b2e235cef3a9550a318a106d89a9ab49 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2583109 (cherry picked from commit 65c73e71e65849a3af8e6c87811e24134d00dfb5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673651 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-27 18:12:14 -08:00
srajum	f45762d032	gpu: nvgpu: remove whitelisting for wrongly reported violation by tool - Earlier we whitelisted wrongly reported static analysis violations by tool, raised coverity tool bugs for these cases. - These bugs are fixed with new version of tool, so no need fo whitelisting. Change-Id: I9ba19e9907905a71ca0eb06c168a061a5f248f84 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2617012 (cherry picked from commit 4aa2d531d3fd851806b1eb7e2c36cb75923da79c) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673012 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-27 18:12:09 -08:00
Dinesh T	ef2a2be44f	gpu: nvgpu: Add compression support with added contig memory pool This is adding compression support for Ampere gpus by the given contig memory pool. Bug 3426194 Change-Id: I1c2400094296eb5448fe18f76d021a10c33ef861 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673581 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-27 18:10:41 -08:00
shashank singh	fb0ebef0a7	gpu: nvgpu: compile out ununsed code on safety build for common.nvgpu Jira NVGPU-7052 Change-Id: Idab4f9d56c0748f54fd08fc5fd01d96a66f94700 Signed-off-by: shashank singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2581247 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2670885 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-23 11:08:05 -08:00
mpoojary	3da5e1515f	gpu: nvgpu: Add support to pass platform data to ACR This patch passes an extra byte(23:16) value to 'mode' variable of ACR interface, value depends on func 'nvgpu_platform_is_simulation'. This will let ACR ucode to identify whether a platform is simulation(VDK) or not(VSP & Silicon). Change-Id: I78efe9cd748c023dd17e80f498c3b8b34edd18e5 Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2673063 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-22 05:53:46 -08:00
Shashank Singh	5ec241a1d8	gpu: nvgpu: remove non stall intr from top handler for safety On safety nonstall interrupt is not used and should be compiled out to rule out any chance of interference with safety code. Remove top handler support of nonstall interrupt for safety which is currently not applicable to linux. Jira NVGPU-7066 Jira NVGPU-4078 Change-Id: I278efc8da6ddd0f22c128af6630cfd1b20ba4784 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2589006 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671586 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-21 02:31:38 -08:00
Antony Clince Alex	ca27a7d841	gpu: nvgpu: ga10b: move grmgr.load_timestamp_prod HAL The timestamp control register in the SMCARB should be configured to have the NV_PSMCARB_TIMESTAMP_CTRL_DISABLE_TICK field cleared, otherwise the PTIMER ticks will not be sent to GR engine. Hence, remove the pre-processor checks around grmgr.load_timestamp_prod call. Bug 3510460 Bug 3500065 Change-Id: I223cea1aca28a9215287f540eb961a16e3fe6626 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671021 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-17 05:03:01 -08:00
Shashank Singh	19a3b86f06	gpu: nvgpu: remove unused code from common.nvgpu on safety build - remove unused code from common.nvgpu unit on safety build. Also, remove the code which uses them in other places. - document use of compiler intrinsics as mandated in code inspection checklist. Jira NVGPU-6876 Change-Id: Ifd16dd197d297f56a517ca155da4ed145015204c Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2561584 (cherry picked from commit 900391071e9a7d0448cbc1bb6ed57677459712a4) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2561583 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-17 04:58:32 -08:00
Rajesh Devaraj	0699220b85	gpu: nvgpu: compile-out unused apis from safety build This patch does the following changes: - Compiles-out unused error reporting APIs and the related data structures from safety build. For this purpose, it introduces the new flag: CONFIG_NVGPU_INTR_DEBUG - Updates nvgpu_report_err_to_sdl() API with one more argument, hw_unit_id. This aids in finding whether an error to be reported is corrected or uncorrected from LUT. - Triggers SW quiesce, if an uncorrected error is reported to Safety_Services, in safety build. - Renames files in cic folder by replacing gv11b with ga10b, since error reporting for gv11b is not supported in dev-main. JIRA NVGPU-8002 Change-Id: Ic01e73b0208252abba1f615a2c98d770cdf41ca4 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2668466 Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-14 22:00:33 -08:00
Debarshi Dutta	10c3c0ddbb	gpu: nvgpu: add FBP index conversion infra for MIG Add a mapping between local ids and logical ids for FBPs. This is enabled to support conversion for FBP local ids to logical ids when memory partition is enabled for SMC. Bug 200712091 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Iba33327a98bf427b21f37cbf7f2d5ee5619e7ae5 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2651964 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-11 13:29:14 -08:00
Debarshi Dutta	01dccf163d	gpu: nvgpu: add logic for index conversion of GPCs/FBPs For SMC mode, userspace is expected to use local indexing for accessing GPC/FBP specific perf registers where local indexing refers to indexes localized to a given SMC instance. H/W however expects logical id based indexing for these registers. Currently, nvgpu driver maintains a mapping between local <-> logical/physical ids of the GPCs for SMC specific configurations/instances. These register accesses are performed by the Debugger/Profiler interfaces and uses regops for read/writes. In their current state, regops simply validates register addresses and performs the required operation on them. These registers are currently indexed using local ids and there is a need to convert them to use logical ids for supporting SMC modes. For non-SMC case local ids are equivalent to logical ids and hence the conversion would have no effect on them. Following changes are added to facilitate the above conversion from local ids to logical ids in the regops path. 1) nvgpu_profiler_allowlist_range_search is modified to update a nvgpu_pm_resource_register_range_map entry instead of just the type. 2) added two APIs, one meant for profiler V2 based interfaces and the other for legacy profiler interface. The logic for legacy profiler interface extends into the more generic profiler V2 logic to help retain future compatibility. These APIs are added just after the validation stage for nvgpu_exec_regops. 3) The above APIs return an error if the local ids exceed the number of GPCs/FBPs for a particular instance. Bug 200712091 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I060c2408a798f2f4e058aba266fa1ea9cebc2682 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2644956 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-11 13:29:07 -08:00
Debarshi Dutta	7db5f0d339	gpu: nvgpu: add perfmon Hals Add following HALs for Ga100 and Ga10b. These will be used for calculating chiplet offsets corresponding to GPC/FBP perf register. get_pmmgpcrouter_per_chiplet_offset get_pmmfbprouter_per_chiplet_offset get_hwpm_fbp_perfmon_regs_base get_hwpm_gpc_perfmon_regs_base get_hwpm_fbprouter_perfmon_regs_base get_hwpm_gpcrouter_perfmon_regs_base Bug 200712091 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: Iec1a16ef4a3c26dca054c30d95bef991983dc2b7 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2648832 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-11 13:29:02 -08:00
Debarshi Dutta	3d01b89e68	gpu: nvgpu: expose physical masks for GPCS/FBPs for MIG Following changes are added 1) nvgpu_gr_config->gpc_tpc_mask_physical is now indexed by physical gpc id instead of logical id. 2) Removed the conversion of logical fbp ids and replace them with physical ids. 3) nvgpu_gpu_instance->fbp_en_mask now contains the mask of physical fbp ids. 4) gk20a_ctrl_ioctl_gpu_characteristics returns gpu.gpc_mask returns mask of physical ids. Bug 200712091 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I0e066df76e07203ff4a5be5bfff2cef8566b425d Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2648831 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-11 13:28:50 -08:00

... 2 3 4 5 6 ...

3348 Commits