linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 10:34:43 +03:00

Author	SHA1	Message	Date
Ninad Malwade	69ffeeaa26	gpu: nvgpu: fix the tpc_pg_mask read and store return values tpc_pg_mask_read: function returns the value stored in the tpc_pg_mask node. Since, it does not have the terminator it returns garbage value after the mask value and gets troublesome to parse in the userspace. Fix is to add '\n' at the end of the mask value to be consistent with the earlier releases. tpc_pg_mask_store: In this function there is a call to check if gpu is powered on or not. In case when gpu is powered on, the return value is just assigned to the 'err' variable which is never returned to notify the application to follow the next steps. Thus, need to return this value and inform the same to the userspace. Bug 3445617 Change-Id: Ibb6fed88c83c751a5fb73181089274aa27c3f18b Signed-off-by: Ninad Malwade <nmalwade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2630925 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Bibek Basu <bbasu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-30 11:30:16 -08:00
Konsta Hölttä	4a7a6c0839	gpu: nvgpu: evaluate srctree.nvgpu immediately The definition of srctree.nvgpu for external modules is specified as a function of $(MAKEFILE_LIST). This variable gets extended also when files are included, so any include directives in the makefile affect the path because srctree.nvgpu is assigned with "?=" which is lazy like "=". To avoid problems with the lazy init, use ":=" surrounded with an explicit check to initialize srctree.nvgpu only when undefined. Jira NVGPU-6427 Change-Id: Ic9847528d0da73f06bc7e421130c9c54247caadb Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632925 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-30 07:13:28 -08:00
Divya	6a21dd929f	gpu: nvgpu: add a new PMU RPC: ASYNC_CMD_RESP RPC - When DISALLOW cmd is sent from driver to PMU the actual completion of the disallow will be acknowledged by PMU via a new RPC: ASYNC_CMD_RESP. - Disallow needs a delayed ACK from PMU in order to disable the ELPG. - If ELPG is already engaged, the DISALLOW cmd will trigger ELPG exit and then transition to PMU_PG_STATE_DISALLOW. - After this whole process is completed, PMU will send DISALLOW_ACK through ASYNC_CMD_RESP RPC. - After disallow command is sent from the driver, NvGPU driver waits/polls for disallow command ack. This is sent immediately by RPC framework of PMU. - Then, the driver will poll/wait for ASYNC_CMD_RESP event which is the delayed DISALLOW ACK. - The driver captures the ASYNC_CMD_RESP RPC sent from PMU. - set disallow_state to ELPG_OFF. - If the driver does not wait/poll for this delayed disallow ack from PMU, it can result in pmu halt issues as PMU is still processing DISALLOW cmd but the driver progressed further which can result in errors. Bug 3430273 Bug 3439350 Change-Id: If2acf8391d18cd3c6b8b07e3bf6577667ec99eea Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2631214 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-30 07:11:36 -08:00
Sagar Kamble	b6d349dcf6	gpu: nvgpu: init all buffer compbits state members User fence syncpt_id in the buffer compbits state was set to 0 on allocation through PREPARE_COMPRESSIBLE_READ ioctl or MARK_COMPRESSIBLE_WRITE ioctl. In case NVGPU_GPU_COMPBITS_GPU is requested through the ioctl PREPARE_COMPRESSIBLE_READ, CDE conversion command is not submitted and the output fence is cloned from the initial state fence (with syncpt_id=0). NvRmSyncWait on this fence from userspace lead to below error: 13e40000.host1x: nvhost_syncpt_wait_timeout: invalid syncpoint id 0 Initialize the buffer compbits state user fence syncpt_id to NVGPU_INVALID_SYNCPT_ID with nvgpu_user_sync_init() so that the userspace skips the NvRmSyncWait on that fence. While at it, initialize other uninitialized members, valid_compbits and zbc_color. Bug 3360675 Change-Id: Ie2a584546e1ed37841cbdc3472b598794f911e6f Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2631235 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-27 10:08:14 -08:00
Sagar Kamble	41df3e17a7	gpu: nvgpu: fix nvgpu remove sequence While removing the nvgpu module, all gpu unmaps should happen before removing the PMU support as ELPG_MS accesses pmu pg structure and ELPG_MS is disabled/enabled while accessing TLB or cache flush. nvgpu_fb_vab_teardown_hal and mmu_fault.info_mem_destroy do gpu unmaps. They were executed post removal of PMU support. Fix the sequence. Bug 3448630 Change-Id: I44925c313c625a2d0f297d1367d69069b3deacef Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632490 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-26 08:47:52 -08:00
Deepak Nibade	3d9c67a0e7	gpu: nvgpu: enable Orin support in safety build Most of the Orin chip specific code is compiled out of safety build with CONFIG_NVGPU_NON_FUSA and CONFIG_NVGPU_HAL_NON_FUSA. Remove the config protection from Orin/GA10B specific code. Currently all code is enabled. Code not required in safety will be compiled out later in separate activity. Other noteworthy changes in this patch related to safety build: - In ga10b_ce_request_idle(), add a log print to dump num_pce so that compiler does not complain about unused variable num_pce. - In ga10b_fifo_ctxsw_timeout_isr(), protect variables active_eng_id and recover under CONFIG_NVGPU_KERNEL_MODE_SUBMIT to fix compilation errors of unused variables. - Compile out HAL gops.pbdma.force_ce_split() from safety since this HAL is GA100 specific and not required for GA10B. - Compile out gr_ga100_process_context_buffer_priv_segment() with CONFIG_NVGPU_DEBUGGER. - Compile out VAB support with CONFIG_NVGPU_HAL_NON_FUSA. - In ga10b_gr_intr_handle_sw_method(), protect left_shift_by_2 variable with appropriate configs to fix unused variable compilation error. - In ga10b_intr_isr_stall_host2soc_3(), compile ELPG function calls with CONFIG_NVGPU_POWER_PG. - In ga10b_pmu_handle_swgen1_irq(), move whole function body under CONFIG_NVGPU_FALCON_DEBUG to fix unused variable compilation errors. - Add below TU104 specific files in safety build since some of the code in those files is required for GA10B. Unnecessary code will be compiled out later on. hal/gr/init/gr_init_tu104.c hal/class/class_tu104.c hal/mc/mc_tu104.c hal/fifo/usermode_tu104.c hal/gr/falcon/gr_falcon_tu104.c - Compile out GA10B specific debugger/profiler related files from safety build. - Disable CONFIG_NVGPU_FALCON_DEBUG from safety debug build temporarily to work around compilation errors seen with keeping this config enabled. Config will be re-enabled in safety debug build later. Jira NVGPU-7276 Change-Id: I35f2489830ac083d52504ca411c3f1d96e72fc48 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2627048 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-26 08:46:47 -08:00
Rajkumar Kasirajan	71cd434f4f	gpu: nvgpu: add thermal cooling support Add devfeq thermal cooling device for GPU software thermal throttling. Bug 3287074 Change-Id: Ib0b53a58177964dfda3c8993da9c4835e2cb8a6e Signed-off-by: Rajkumar Kasirajan <rkasirajan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2625659 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Srikar Srimath Tirumala <srikars@nvidia.com> Reviewed-by: Bibek Basu <bbasu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-26 08:46:00 -08:00
Seshendra Gadagottu	616a885079	gpu: nvgpu: ga10b: enable frequency scaling Enable GPU frequency scaling for t234 silicon. Bug 3315239 Change-Id: I0b581283d4c9558fb9f7c3b553ea8bee71ba22af Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2617544 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-25 08:16:42 -08:00
Seshendra Gadagottu	fc27aab523	gpu: nvgpu: ga10b: update clock operations ga10b can have 2 GPCs and each GPC is clocked with separate gpc clk. Added ga10b specific set_rate/get_rate operations for gpcclks considering GPC floor-sweeping info. Bug 3315239 Change-Id: I4e2156b4e06a1580a60d832e0d3296ed3dc17887 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2617441 Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Tejal Kudav <tkudav@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-25 08:16:31 -08:00
Jon Hunter	02ab4c3880	gpu: nvgpu: Correct permissions for cbc_ctrl debug node The cbc_ctrl debugfs node is created with both read and write permissions, however, this node only supports being written and not read. Fix this by removing read permissions for this debugfs node. Bug 3402817 Change-Id: I16ffba8285144389758be283121c3096745aae73 Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2630027 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Martin Radev <mradev@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-24 17:55:57 -08:00
Vincent Lu	8c53b54649	gpu: nvgpu: use signed int in binary search In case of the target we want to find is less than all candidates, end = mid - 1U will finally execute with mid = 0, which makes end = 0xFF..FF. We'll have an invalid memory access in this case. Change u32 to int for start, mid and end variables in allowlist_offset_search. This case also applied for allowlist_range_search. Bug 3417343 Change-Id: I30fe90e9439d2ac8bba01c68a8c70b6f6466d68b Signed-off-by: Vincent Lu <canjiangl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2617309 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Antony Clince Alex <aalex@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-24 17:52:23 -08:00
Divya Singhatwaria	09c0813c94	gpu: nvgpu: ga10b: disable ELPG Disable ELPG to avoid pmu halt Bug 3430273 Bug 3439350 Change-Id: I1a7ac388e334010a4a7cf848fa99bb078c5026ca Signed-off-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2628191 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-24 10:53:20 -08:00
Konsta Hölttä	6de716a196	gpu: nvgpu: allow managing runlist domains Add support for adding and deleting domains for all runlists together. The core scheduler logic will control runlist domains. Initially, however, it may be necessary to only actually schedule only the GR runlist, but keeping the runlist code agnostic of such scheduling logic helps isolate the control complexity. NVGPU-6425 Change-Id: Id6039bd37a293a2cf3eaee5ed84d35459e8b89e7 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2628049 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-24 04:47:49 -08:00
Konsta Hölttä	3af33f2454	gpu: nvgpu: swap runlist domains When there are multiple scheduling domains, each runlist pointer has to be switched according to the active scheduling policy. For now implement a trivial round robin policy to loop the domains over, just sufficient for testing. In the future the switching will be owned by the scheduler code, but this helps prepare the design for that. The switching will not do anything if there is only one domain, so current functionality is not affected. For simplicity, all runlists are switched at the same time. In the future, it may be desirable to swap e.g. only the GR runlist and keep others running free, outside scheduler control. Jira NVGPU-6427 Jira NVGPU-6425 Change-Id: Ic68c13e97761bbdc210c74794de8ccb8dbd45587 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2628048 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-24 04:47:37 -08:00
Konsta Hölttä	fe7ae02f5f	gpu: nvgpu: add sched domain bind ioctl Support binding TSGs to some other scheduling domain than the default one. Binding happens by name until a more robust interface appears in the future, as the name is a natural identifier for users. No other domains are actually created anywhere yet; that will happen in next patches. Jira NVGPU-6788 Change-Id: I5abcdea869b525b0a0e9937302f106f7eee94ec2 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2628047 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-24 04:47:30 -08:00
Sagar Kamble	e692cd9913	gpu: nvgpu: fix spurious stall intr log Below sequence leads to the condition where nvgpu logs error about spurious stall intr occurence: 1. PMU interrupt is set after ELPG command is sent to PMU. 2. Stalling irqhandler sees the interrupt and schedules the stalling thread. 3. PMU isr gets executed from nvgpu_pmu_wait_fw_ack_status just before the stalling irq thread is run. Due to this, top level interrupt gets cleared. 4. When stalling irq thread gets executed it sees no interrupt and logs as spurious interrupt. This condition is not actually about "spurious interrupt", hence change error log to gpu_dbg_intr log type and rephrase it. Bug 200780211 Change-Id: Idab62f61007012f7022a836473562795c24821ef Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2628275 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-23 07:04:00 -08:00
Tejal Kudav	6648cfea09	gpu: nvgpu: Disable low power features on AV+L NVGPU does not support low power features on hypervisor based embedded environments.Disable low power features like ELPG, AELPG and railgating on AV+L using the nvgpu_is_hypervisor_mode(). JIRA NVGPU-7283 Change-Id: I930e06ae711ee1485109b7f519a2dacc95b7d74b Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2610056 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-19 13:16:36 -08:00
Debarshi Dutta	5d42343cbf	gpu: nvgpu: use appropriate permissions for debugfs entries SW Profiler's debugfs entry "enable" has both RW permissions set however the corresponding read operation is set to NULL. Similarly, the other entries i.e. "percentiles", "raw_entries" and "basic_stats" only have a read operation defined and write is set to NULL. To enable correct permissions, set permission for "enable" to "W" and set permissions for others as "R" only. Bug 200747304 Change-Id: I296a0ff08a871f752988c9b0a671c080695d5eab Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2566322 (cherry picked from commit 8542dd388da536ee693610acd0bb3a679ff52881) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2628035 Tested-by: Jonathan Hunter <jonathanh@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Jonathan Hunter <jonathanh@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-19 08:08:11 -08:00
Antony Clince Alex	cce1d7ad84	gpu: nvgpu: update device management framework to remove unusable engines On certain platforms, not all copy engine instances are usable. The user shouldn't submit any work to these engines. To enforce this, remove these engines from active/host_engine list, this should ensure that these engines do not get advertised to userspace. In order to accomplish this introduce the following functions: - nvgpu_engine_remove_one_dev: This function removes the specified device entry from following device lists: fifo->host_engines, fifo->active_engines, runlist->rl_dev_list, runlist->eng_bitmask. Replace iteration over LCE device type entries using nvgpu_device_for_each(g, dev, NVGPU_DEVTYPE_LCE), along with this introduce macro nvgpu_device_for_each_safe. Introduce gpu_dbg_ce flag for CE debugging. Bug 3370462 Change-Id: I2e21f18363c6e53630d129da241c8fece106cd33 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2616711 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-18 09:18:55 -08:00
Tejal Kudav	79ee724b04	gpu: nvgpu: Add knob to control dbg/prof support On production boards, there is requirement to disable GPU profiler and debugger support. Add DT property 'support-gpu-tools' which can be modified to enable/disable debugger/profiler support. The default behavior is to enable the debugging features and set 'support-gpu-tools' to 1. This property is chosen to be u32 value to be in sync with GPU vserser property by same name. The debugger/profiler support is disabled by skipping the creation of below nodes under /dev/nvgpu/: 1. ctxsw 2. dbg 3. prof 4. prof-dev 5. prof-ctx Bug 200773450 JIRA NVGPU-7109 Change-Id: I86d72d17fa7f5492e117a4c1cd1144623e9b6132 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2592012 Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-18 03:57:18 -08:00
Seshendra Gadagottu	c2901b6835	gpu: nvgpu: correct debug messages for fecs ecc errors Following error message is getting printed even when there are no fecs ecc errors: nvgpu: 17000000.ga10x gv11b_gr_intr_handle_fecs_ecc_error:114 [ERR] error count corrected: 0, uncorrected 0 To avoid confusion, print error messages only when fecs errors are reported. Bug 3417834 Change-Id: I96317555b11e1976f33add4b1dc8d84c936c26fb Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2625723 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-17 17:31:52 -08:00
Sagar Kamble	430b9bef58	gpu: nvgpu: fix the naming of L2 cache ecc_control register L2 cache ecc_control register wasn't named correctly. Fix it. JIRA NVGPU-7054 Change-Id: I08d76aefb38663cb699c50c07a92af8a756d142d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2603699 (cherry picked from commit 04a5d798f6948c34536dc02391e7da8464f5a7bf) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623623 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-17 12:50:20 -08:00
Prateek sethi	b4528cac93	gpu: nvgpu: fix the return sequence API nvgpu_get_timestamps_zipper() returning with power reference in case of failure. Patch corrects the sequence. Bug 3412554 Change-Id: Id5bd027fd9861d2b04341b5045326278cef5c5d1 Signed-off-by: Prateek sethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2551274 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 13:07:44 -08:00
Vedashree Vidwans	6cf7b9a4b2	gpu: nvgpu: enable dbn_fn only if feature is supported Currently, on non-secureboot with power features disabled, debug logs contain elpg enable/disable function prints. To lower confusion, move elpg enable and disable dbg_fn prints to after can_elpg check. Jira NVGPU-7183 Change-Id: Ib6a1fb93330042a90c6d87b153b26aff7907ab7d Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2624661 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 09:33:05 -08:00
Sagar Kamble	48d17e9c53	gpu: nvgpu: fix the unit test traceability gk20a_tsg_unbind_channel_check_hw_next was not added to Targets in unit test specification. Add it. __attribute__ in debug.h is captured by Doxygen as function with no tests. However it is not really a function and applies to non-fusa function so skip it in Doxygen. JIRA NVGPU-7211 Change-Id: I2adadaebbf4e43768eb408dd10aaa20b1e13eccc Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2615256 (cherry picked from commit e829afb55a17dc0dacf17c71633f5689324171d7) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623629 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 04:24:17 -08:00
Sagar Kamble	f64b5e20b0	gpu: nvgpu: add unit test for gk20a_tsg_unbind_channel_check_hw_next false branch when NEXT bit is not set is not covered. Add unit test for same. JIRA NVGPU-7211 Change-Id: I57725e35971605bf8144e7eaac618f44a38e5b31 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2614209 (cherry picked from commit 2064209f92700dc859d7398e061b3d7dc2725521) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623628 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 04:24:11 -08:00
srajum	b789ba7460	gpu: nvgpu: marking dt.h as non-safe - The nvgpu_dt_read_u32_index() functin of dt.h is used only in non-safe code, so no need of this file to be safe. - Also this is fixing the unit test tracibility issue with nvgpu_dt_read_u32_index() function JIRA NVGPU-7211 Change-Id: I4023915f2ad9872df01f5e75fa21c1d492cc731a Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2614653 (cherry picked from commit 3ac63b3a707736c20a9c53948a689c0a21ecf58c) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623632 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: Sagar Kamble <skamble@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 01:30:31 -08:00
Sagar Kamble	cdfbd4313b	gpu: nvgpu: add BVEC tests for common.mc unit Add BVEC tests for following common.mc unit APIs: 1. nvgpu_mc_intr_stall_unit_config 2. nvgpu_mc_intr_nonstall_unit_config 3. mc.reset_mask Changed the WARN to nvgpu_err in mc.reset_mask. Invalid inputs are handled properly. Updated the MC unit test logic w.r.t mc_intr_en_r, mc_intr_en_set_r and mc_intr_en_clear_r semantics. JIRA NVGPU-6399 Change-Id: I6a3ae42ac37cd6b586f6c71de338595e6cb04a37 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2542591 (cherry picked from commit b9908c979e8964a216141cc6ed475c7de2f2cc0b) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623631 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 01:30:25 -08:00
Sagar Kamble	e8ba95c74b	gpu: nvgpu: address common.bus code inspection issue Fix bus_gk20a.h and bus_gv11b.h header guards as they are not as per conventions. JIRA NVGPU-6901 Change-Id: Ie1197c90ebc4a1c2635bf8a4dadbac08f394c964 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2606515 (cherry picked from commit bcd0e1c910ce146d72f103315313eed431dc2fbd) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623624 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 01:30:20 -08:00
Sagar Kamble	e00eabcbd3	gpu: nvgpu: reset ltc_ltc0_lts0_intr3 register LTC INTR3 register is not reset after handling. Reset it. JIRA NVGPU-6982 Change-Id: I6ab9e6de515e3dd2b45240d1a6953ffef171e1c0 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2586573 (cherry picked from commit bc754e5b0494d3ea2da71f186126f19ef5686c08) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623622 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 01:30:14 -08:00
Martin Radev	64d2e25382	gpu: nvgpu: Expose cbc status and ctrl debug nodes This patch adds two new debug nodes: - cbc_status to retrieve cbc state information - cbc_ctrl to flush and mmap cbc These two nodes are to be used for purpose of testing and debugging. One immediate goal is to use them for verifying if CBC is programmed correctly via a user space test. Bug 3402817 Change-Id: I4046fb45ebfb48cb4644f88621429e1e323972c0 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2620184 Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-14 01:29:53 -08:00
Seshendra Gadagottu	7194fe5f1f	nvgpu: ga10b: disable errata NVGPU_ERRATA_200601972 To avoid priv access error, disable errata NVGPU_ERRATA_200601972 Enable this errata back once, Bug 3414399 is fixed. Bug 3414399 Change-Id: I3a5277e6f109d319744499bc9898bec4ed292a49 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2620254 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-13 11:29:55 -08:00
Seshendra Gadagottu	319f4f6fe1	gpu: nvgpu: t234: update gating registers to avoid priv errors Generated clock gating registers list with updated skip list to avoid priv access violation errors. This change will be reverted, once proper fix in place. Bug 3414399 Change-Id: I0e6c877a98978256096b28b7427663fd59de3d32 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2620191 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-13 11:29:43 -08:00
Sagar Kamble	0394aef90d	gpu: nvgpu: cast bvec test Add BVEC tests for following functions: nvgpu_safe_cast_u64_to_u32, nvgpu_safe_cast_u64_to_u16, nvgpu_safe_cast_u64_to_u8, nvgpu_safe_cast_u64_to_s64, nvgpu_safe_cast_u64_to_s32, nvgpu_safe_cast_s64_to_u64, nvgpu_safe_cast_s64_to_u32, nvgpu_safe_cast_s64_to_s32, nvgpu_safe_cast_u32_to_u16, nvgpu_safe_cast_u32_to_u8, nvgpu_safe_cast_u32_to_s32, nvgpu_safe_cast_u32_to_s8, nvgpu_safe_cast_s32_to_u64, nvgpu_safe_cast_s32_to_u32, nvgpu_safe_cast_s8_to_u8, nvgpu_safe_cast_bool_to_u32 JIRA NVGPU-6412 Change-Id: Ic97e45051570a7133045de6cb4345c5f935cf9f6 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2555469 (cherry picked from commit be2ba5f1a7ead4c062eab74c7587c32797a651df) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623637 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-12 21:37:01 -08:00
Sagar Kamble	a1e75fe9bc	gpu: nvgpu: add unit test for nvgpu_wrapping_add_u32 Add BVEC unit test for the function nvgpu_wrapping_add_u32. JIRA NVGPU-7211 Change-Id: I5c4c870c75b3e7643a771110b2c0d248c1f8cb56 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2614166 (cherry picked from commit f6a2fae67c3dd0d3f11deba2cb943a8c6420fda5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623633 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-12 21:36:55 -08:00
Sagar Kamble	d944313a54	gpu: nvgpu: arithmetic bvec tests Add BVEC tests for following functions: nvgpu_safe_sub_u8, nvgpu_safe_add_u32, nvgpu_safe_add_s32, nvgpu_safe_sub_u32, nvgpu_safe_sub_s32, nvgpu_safe_mult_u32, nvgpu_safe_add_u64, nvgpu_safe_add_s64, nvgpu_safe_sub_u64, nvgpu_safe_sub_s64, nvgpu_safe_mult_u64, nvgpu_safe_mult_s64 JIRA NVGPU-6412 Change-Id: Ie4f1138318314c3f53b1f188e1ca45f681ca895e Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2553170 (cherry picked from commit 74c32f975c181107372957a28aad0cb5278f42b2) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623630 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-12 21:36:49 -08:00
Antony Clince Alex	3e7643bb9c	gpu: nvgpu: update gops.mssnvlink Introduce HAL function gops.mssnvlink.get_links, this function retrieves the number of nvlinks supported by the chip along with their base addresses. Update ga10b_mssnvlink_init_soc_credits to call mssnvlink.get_links. Jira NVGPU-6641 Change-Id: I4ff857925f126bf41dc83eebc5723403244f66b0 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2618368 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 07:31:27 -08:00
Antony Clince Alex	1bcc22ab19	gpu: nvgpu: make mssnvlink programming OS agnositc Make ga10b_init_nvlink_soc_credits OS agnostic by replacing OS specific functions with corresponding nvgpu wrappers. This function is now assigned to gops.mssnvlink.init_soc_credits HAL. Introduce nvgpu wrapper, nvgpu_io_map/unmap to map/unmap specified physical address range. Jira NVGPU-6641 Change-Id: I337bc75b8ec36552fe471bf5e42f62c19f67ed4a Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2618237 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 07:31:15 -08:00
Sagar Kamble	83dbb711bb	gpu: nvgpu: make buffer metadata support independent of compression Earlier, buffer metadata support was made dependent on compression. However that is not required. Update the enabled flag NVGPU_SUPPORT_BUFFER_METADATA setup for various hals. Enable it for all from linux characteristics init. Update REGISTER_BUFFER and GET_BUFFER_INFO ioctls to seggregate the compile/runtime compression functionality. If compression is disabled, return error in case comptags are required else don't fail the REGISTER_BUFFER ioctl. Bug 200767700 Change-Id: I3850ccc879f180c97b830fb3d652c094b9d28a5b Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2614378 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 07:30:33 -08:00
Divya	c347b6e4ff	gpu: nvgpu: print riscv pmu pc trace - To print pmu RISCV PC trace, create a new flag which will be set to true after PMU is initialised. - This flag is then used to used to print RISCV trace buffer when pmu halt occurrs. JIRA NVGPU-7261 Change-Id: Ib3ad2f40efd1458d22b21e99ab151c11cfeb43be Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2624073 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 02:55:56 -08:00
Deepak Nibade	5d51872620	Revert "gpu: nvgpu: support SW methods in safety temporarily" This reverts commit `e0db40c3a5`. CUDA change to stop using SW methods in safety is integrated and this temporary patch can be reverted now. Bug 200748548 Change-Id: Ibdfd42b1fbfbfdf24455426e1b8001ad8b6218d5 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2623433 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-12 02:54:20 -08:00
Jon Hunter	aa44d0e041	gpu: nvgpu: Fix build for Linux v5.16-rc1 Building NVGPU against the current upstream mainline kernel is failing and errors such as the following are seen. ERROR: modpost: module nvgpu uses symbol dma_buf_map_attachment from namespace DMA_BUF, but does not import it. ERROR: modpost: module nvgpu uses symbol dma_buf_detach from namespace DMA_BUF, but does not import it. ERROR: modpost: module nvgpu uses symbol dma_buf_vmap from namespace DMA_BUF, but does not import it. Following upstream commit 16b0314aa746 ("dma-buf: move dma-buf symbols into the DMA_BUF module namespace"), it is now necessary to import the DMA_BUF module namespace into the NVGPU driver to fix this. Change-Id: I901b74cea692a5e0d66a190d01fe74a55aaf4431 Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2621641 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-12 02:51:30 -08:00
Konsta Hölttä	6cff904dc3	gpu: nvgpu: use runlist obj for wait_pending Change the gops_runlist::wait_pending API to take a runlist pointer instead of a runlist ID to better match with the rest of that interface. Jira NVGPU-6425 Change-Id: I96c4f49df8e2613498e0a09cc75a950824828bed Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2621214 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-11 20:39:47 -08:00
Konsta Hölttä	9be8fb80a2	gpu: nvgpu: make tsgs domain aware Start transitioning from an assumption of a single runlist buffer to the domain based approach where a TSG is a participant of a scheduling domain that then owns has a runlist buffer used for hardware scheduling. Concretely, move the concept of a runlist domain up to the users of the runlist code. Modifications to a runlist need to specify which domain is modified. There is still only the default domain that is created at boot. Jira NVGPU-6425 Change-Id: Id9a29cff35c94e0d7e195db382d643e16025282d Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2621213 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-11 20:39:42 -08:00
Konsta Hölttä	c8fa7f57f6	gpu: nvgpu: track runlist domains in list There will be multiple scheduling domains managed dynamically. Move from strictly one domain to a list of domains and still only one default domain in practice. This facilitates future changes on many domains. Jira NVGPU-6425 Change-Id: I6760c651be6c01791708740a821aa564d7a6b7b8 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2621212 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-11 20:39:35 -08:00
Divya	6885071c64	gpu: nvgpu: bring all supported GRs out of reset - The hardware is designed in such a way that if GR engine is not out of reset, it still takes clock. - This causes ELCG feature to not engage correctly. - So for iGPU, SW should bring all supported GR engines out of reset during gpu boot, if MIG feature is not enabled. - This will help low power feature like elcg to engage correctly and improve dynamic power savings. - For dGPU, all GRs are out of reset by default by dev init. Bug 200778542 Change-Id: I5f3519f73b4aaf1804fd112f28fe980f58181cd8 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2613718 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-11 20:37:45 -08:00
Sagar Kamble	688a6aa17d	gpu: nvgpu: change set_err_notifier error message to info type for forced channel reset Below error notifier message is not really warning/error as that is user triggered reset and the notifier value set is already consumed by userspace hence limit this message to INFO type. nvgpu: 17000000.gp10b nvgpu_set_err_notifier_locked:142 [ERR] error notifier set to 43 for ch 460 Bug 3344409 Change-Id: Ia41cc85f30111ef72994f3ce8e5113e881c06b1b Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2620974 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-08 15:08:32 -08:00
Mahantesh Kumbar	7b29872bc4	gpu: nvgpu: swap the sequence of ACR & PERFMON Swap the command sequence of ACR WPR init and PERFMON init sent to PMU ucode upon init message, because perfmon init command read is failing in PMU ucode when ACR WPR init command is processed and accessed WPR info from system during un-rail-gate sequence. And also flushing the FB-Q's for rail-gate and un-rail-gate sequence. Bug 3400166 Change-Id: I23c38588d0ddc4e1621e83a72d5e232cf65371dc Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2617398 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-08 15:08:05 -08:00
Konsta Hölttä	c0473460ea	gpu: nvgpu: don't check ch activity on bind Delete an unnecessary check of the active_channels bitmap when attempting to bind a channel to a TSG. There is already a verification that the channel must not be a part of a TSG; if it's not, it cannot be set in the bitmap. All channels become active via a parent TSG, but the activity check predates this design. A channel is bound to a TSG early before setting up its gpfifo etc. and mandatory membership of a TSG is one of the setup_bind prechecks. Jira NVGPU-6425 Change-Id: Id34686f198db0a0265ffd6a49a0b2e47c37fd5f7 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2621211 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-11-04 12:47:54 -07:00
Konsta Hölttä	3cf796b787	gpu: nvgpu: move active bitmaps to domain Move the active_channels and active_tsgs bitmaps from struct nvgpu_runlist to struct nvgpu_runlist_domain. A TSG and its channels are currently active as part of a runlist; in the future, a runlist may be switched from multiple domains that each are a collection of TSGs. The changes are still internal to the runlist code. Users of runlists need no modifications. Jira NVGPU-6425 Change-Id: I2d0e98e97f04b9716bc3f4890cf881735d0ab664 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2618387 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-11-03 20:55:08 -07:00

1 2 3 4 5 ...

9255 Commits