linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Deepak Nibade	2373a87048	gpu: nvgpu: set compute regs only for compute class In safety build, gops.gr.init.set_default_compute_regs() is invoked in nvgpu_gr_obj_ctx_alloc() for all classes. Before enabling graphics classes in safety this was executed only for compute class. But since graphics classes are supported in safety now this call should be made only for compute classes. Add gops.gpu_class.is_valid_compute() check before calling this function. Bug 3482988 Change-Id: If3722be36e779195122f54925ad122871cf13317 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2667324 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-10 20:36:06 -08:00
Rajesh Devaraj	7dc013d242	gpu: nvgpu: merge error reporting apis In DRIVE 6.0, NvGPU is allowed to report only 32-bit metadata to Safety_Services. So, there is no need to have distinct APIs for reporting errors from units like GR, MM, FIFO to SDL unit. All these error reporting APIs will be replaced with a single API. To meet this objective, this patch does the following changes: - Replaces nvgpu_report__err with nvgpu_report_err_to_sdl. - Removes the reporting of error messages. - Replaces nvgpu_log() with nvgpu_err(), for error reporting. - Removes error reporting to Safety_Services from nvgpu_report__err. However, nvgpu_report_*_err APIs and their related files are not removed. During the creation of nvgpu-mon, they will be moved under nvgpu-rm, in debug builds. Note: - There will be a follow-up patch to fix error IDs. - As discussed in https://nvbugs/3491596 (comment #12), the high level expectation is to report only errors. JIRA NVGPU-7450 Change-Id: I428f2a9043086462754ac36a15edf6094985316f Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2662590 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-09 00:41:18 -08:00
Ramesh Mylavarapu	2a98d20263	nvgpu: ga10b: gsp: implement runlist submit apis - implemented device info cmd to send device info to the gsp for runlist submission. Currently GSP scheduler support only GR engine '0' instance. - implemented runlist submit cmd. GSP firmware will submit the corresponding runlist by writing into submit registers. This command is direct replacement of hw_submit ga10b hal for GR engine. NVGPU-6790 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I5dc573a6ad698fe20b49a3466a8e50b94cae74df Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2608923 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-09 00:38:56 -08:00
rmylavarapu	6c1a77dfa9	gpu: nvgpu: gsp: add cmdq/msgq init check - Instead of waiting for mailbox update waiting for cmdq/msgq initialization request would be the better way to check the communication between NVGPU and GSP before sending any cmd. NVGPU-7342 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I6d20764516cee14ad84da7cc9a06c9370675786f Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2650148 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-09 00:38:45 -08:00
Ramesh Mylavarapu	e5fd0453cf	gpu: nvgpu: gsp: add priv lockdown release check - NVGPU need to check for priv lockdown release before configuring any priv registers. In current GSP bootstrap sequence has irq configuration after GSP engine reset which is causing priv errors. So irq configuration should be done after GSP firmware releases priv lockdown. - Removed clearing irq mask and dest registers before configuring them as GSP firmware would have done partial irq configuration before releasing the priv. NVGPU-7342 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I4b6e83452c051654253e02bfb72330b3d6aec3fd Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2649826 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-09 00:38:32 -08:00
Ramesh Mylavarapu	9302b2efee	gpu: nvgpu: gsp units separation Separated gsp unit into three unit: - GSP unit which holds the core functionality of GSP RISCV core, bootstrap, interrupt, etc. - GSP Scheduler to hold the cmd/msg management, IPC, etc. - GSP Test to hold stress test ucode specific support. NVGPU-7492 Signed-off-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Change-Id: I12340dc776d610502f28c8574843afc7481c0871 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2660619 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-09 00:38:21 -08:00
Chris Johnson	14ed75e857	gpu: nvgpu: fix REMAP to support small/big pages Initially, REMAP only worked with big pages but in some cases only small pages are supported where REMAP functionality is also needed. This cleans up some page size assumptions. In particular, on a remap request, the nvgpu_vm_area is found from the passed in VA, but can only be done from virt_offset_in_pages if we're also told the page size. This now occurs from _PAGESIZE_ flags which are required by both map and unmap operations. Jira NVGPU-6804 Change-Id: I311980a1b5e0e5e1840bdc1123479350a5c9d469 Signed-off-by: Chris Johnson <cwj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2566087 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-09 00:37:33 -08:00
Konsta Hölttä	359e83b45a	gpu: nvgpu: tsg: release default nvs domain ref A reference to the default scheduling domain is taken when a TSG is opened. Although the explicit bind is designed to support only one bind, the TSG is bound to the default one implicitly at that point. Release the reference to avoid leaking it. The domain might be null at that point if the default domain has been removed; in that case there's just no domain to put back. Change-Id: I7db43f7bbb2a8c86c391280eb7fa32431c8982da Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2663420 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-06 10:09:34 -08:00
Konsta Hölttä	8736c0d467	gpu: nvgpu: add and use sw-only timers The nvgpu timeout API has an internal override for presilicon mode by default: in presi simulation environments the timeouts never trigger. This behaviour is intended in the original usecase of the timer unit with hardware polling loops. In pure software logic though, the timer must trigger after the specified timeout even in presi mode so add a new init function to produce a timer for software logic. Use this new kind of timer in channel and scheduling worker threads. The channel worker currently times out for just the purpose of the channel watchdog timer which has its own internal timer. Although that's just software, the general expectation is that the watchdog does not trigger in presilicon tests that run slower than usual. The internal watchdog timer thus keeps the non-sw mode. Bug 3521828 Change-Id: I48ae8522c7ce2346a930e766528d8b64195f81d8 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2662541 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-04 22:02:33 -08:00
Richard Zhao	621417bc73	gpu: nvgpu: pmu: move a few units to dgpu specific Move below units to CONFIG_NVGPU_DGPU: - boardobj - clk - volt - perf - pmgr - therm - volt Jira GVSCI-9976 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I759d1c51c4c811bb39ca6b7a6b75b12891a23bf0 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2663188 Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-04 05:49:10 -08:00
Antony Clince Alex	a6e5b76cbf	gpu: nvgpu: profiler: update reservation policy Update profiler object reservation policy to reject any subsequent reserve request made after the intial reserve->bind stage. Bug 3480919 Change-Id: I3e25f22d907d7e06f4cf73347e7bd07e2f675749 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2662360 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-02 21:47:21 -08:00
Dinesh T	e33bdceb8b	gpu: nvgpu: Unify ivm mempool CBC contig allocation requires mempool node in DT and the node can be used for contig allocations. The code duplication can be avoided by unifying the code from vgpu. Change-Id: I6eaa1d0c9db47b158602bf0ba68ce4e09cf487a7 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2650459 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Richard Zhao <rizhao@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-02-01 09:50:45 -08:00
Antony Clince Alex	40397ac0c4	gpu: nvgpu: update CBC init sequence At present, for each resume cycle the driver sends the "nvgpu_cbc_op_clear" command to L2 cache controller, this causes the contents of the compression bit backing store to be cleared, and results in corrupting the metadata for all the compressible surfaces already allocated. Fix this by updating cbc.init function to be aware of resume state and not clear the compression bit backing store, instead issue "nvgpu_cbc_op_invalide" command, this should leave the backing store in a consistent state across suspend/resume cycles. The updated cbc.init HAL for gv11b is reusable acrosss multiple chips, hence remove unnecessary chip specific cbc.init HALs. Bug 3483688 Change-Id: I2de848a083436bc085ee98e438874214cb61261f Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2660075 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-02-01 06:03:33 -08:00
Sagar Kamble	29a0a146ac	gpu: nvgpu: fix coverity defects Fix following coverity defects: ioctl_prof.c resource leak ioctl_dbg.c logically dead code global_ctx.c identical code for branches therm_dev.c resource leak pmu_pstate.c unused value nvgpu_mem.c dead default in switch tsg.c Dereference before null check nvlink_gv100.c logically dead code nvlink.c Out-of-bounds write fifo_vgpu.c Dereference null return value pmu_pg.c Dereference before null check fw_ver_ops.c Identical code for different branches boardobjgrp.c Dereference after null check boardobjgrp.c Dereference before null check boardobjgrp.c Dereference after null check engines.c Dereference before null check nvgpu_init.c Unused value CID 10127875 CID 10127820 CID 10063535 CID 10059311 CID 10127863 CID 9875900 CID 9865875 CID 9858045 CID 9852644 CID 9852635 CID 9852232 CID 9847593 CID 9847051 CID 9846056 CID 9846055 CID 9846054 CID 9842821 Bug 3460991 Change-Id: I91c215a545d07eb0e5b236849d5a8440ed6fe18d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2657444 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-28 04:50:12 -08:00
Richard Zhao	a3f3249c76	nvgpu: move .load_timestamp_prod to NON_FUSA and MIG .load_timestamp_prod was defined protected by CONFIG_NVGPU_HAL_NON_FUSA and CONFIG_NVGPU_MIG. This patch moves the implementation of .load_timestamp_prod to the same macros. Jira GVSCI-9976 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I3204f3e7085d4098be2ab73e3b5300214ef04cfa Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2659002 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-27 07:51:51 -08:00
Debarshi Dutta	1b3ae7eb92	gpu: nvgpu: fix ecc issues Fixed memory leaks within the ltc ecc code. Memory leak occurs as some of the stats are not free'd during Rmmod. Add a common API to handle the same. Bug 3364181 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Change-Id: I1ec5a7d7e57580bc75b7679c922d1e3af8418f6b Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2652684 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-27 07:50:52 -08:00
Rajesh Devaraj	878235e914	gpu: nvgpu: remove report error callback In DRIVE 6.0, NvGPU needs to support error reporting in QNX-Safety, QNX-Standard, and Linux. To support error reporting in all these platform variants, SDL unit will be moved from QNX to common code. As part of this refactoring activity, this patch removes ops assignment for report error. Also, it removes API calls that are used to take time-stamp for stall interrupt thread. This time-stamp APIs will be brought back later, if required to support periodic diagnostics. JIRA NVGPU-7353 Change-Id: I38536019dc7165e6a97674863b37d009854af948 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2655958 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-24 02:06:24 -08:00
Seshendra Gadagottu	a7c1052024	gpu: nvgpu: program ltc cg prod values after acr boot Separate nvgpu_cg_blcg/slcg_fb_ltc_load_enable function into nvgpu_cg_blcg/slcg_fb_load_enable and nvgpu_cg_blcg/slcg_ltc_load_enable. Program fb slcg/blcg prod values during fb init and program ltc slcg/blcg prod values after acr boot to have correct privilege for ltc cg programming. Update unit tests to have sperate blcg/slcg hal for fb and ltc programming. Bug 3423549 Change-Id: Icdb45528abe1a3ab68a47f689310dee9a4fe9366 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2646039 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-15 06:08:21 -08:00
Richard Zhao	9ab1271269	gpu: nvgpu: common: fix compile error of new compile flags It's preparing to add bellow CFLAGS: -Werror -Wall -Wextra \ -Wmissing-braces -Wpointer-arith -Wundef \ -Wconversion -Wsign-conversion \ -Wformat-security \ -Wmissing-declarations -Wredundant-decls -Wimplicit-fallthrough Jira GVSCI-11640 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ia8f508c65071aa4775d71b8ee5dbf88a33b5cbd5 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2555056 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-13 12:36:14 -08:00
Richard Zhao	851666b632	gpu: nvgpu: common/pmu: fix compile error of new compile flags It's preparing to add bellow CFLAGS: -Werror -Wall -Wextra \ -Wmissing-braces -Wpointer-arith -Wundef \ -Wconversion -Wsign-conversion \ -Wformat-security \ -Wmissing-declarations -Wredundant-decls -Wimplicit-fallthrough Jira GVSCI-11640 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ide3ab484924bd5be976a9f335b55b136575ce428 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2555055 Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-13 12:36:02 -08:00
Sagar Kadamati	a3ed73a57c	gpu: nvgpu: add tegra_raw support * This change adds NVGPU_AS_MAP_BUFFER_FLAGS_TEGRA_RAW flag to control buffer format * Add NVGPU_SUPPORT_TEGRA_RAW enabled flag to indicate if feature is enabled for a given chip. * Update gv11b_gpu_phys_addr function to set TEGRA_RAW bit Jira NVGPU-6640 Bug 3489827 Change-Id: I959c22bef906bb9c6dcdc8d5f5e9951ad9937a60 Signed-off-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2545128 Reviewed-by: Martin Radev <mradev@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: Seema Khowala <seemaj@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-13 12:35:36 -08:00
Seshendra Gadagottu	03b1a81ab1	gpu: nvgpu: gr: ignore second zcull request to ctx All channels in TSG will share same zcull context. Any attempt to add a second zcull buffer will be ignored. Bug 3364302 Change-Id: I04e18dfe8e5fac4ca131c3b625755aa90a23180d Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2616677 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-13 10:30:23 -08:00
Sagar Kamble	535a27411a	gpu: nvgpu: fix allocator debugfs deinit Allocator (bitmap, buddy, page) debugfs files are not cleaned up when the allocators are destroyed. This leads to warning logs from nvgpu like below: [21073.493000] debugfs: File 'gk20a_as_17' in directory 'allocators' already present! [21073.493026] debugfs: File 'gk20a_as_17-sys' in directory 'allocators' already present! Remove the per-allocator debugfs node when destroying an allocator in runtime. While at this, add missing nvgpu_allocator locking to the function nvgpu_bitmap_alloc_destroy. And create nop functions for the functions nvgpu_init_alloc_debug and nvgpu_fini_alloc_debug when CONFIG_DEBUG_FS is not defined to avoid adding the CONFIG checks at multiple places. Move gk20a_debug_deinit to the end of gk20a_free_cb called in nvgpu_put as that tears down all debugfs entries. Allocator destroy happens as part of nvgpu_put call and it can lead to invalid debugfs dentry access if gk20a_debug_deinit is called before it. Bug 3481097 Change-Id: I8a66bcf6ade7e5707f9207c78a54d12d7bd94c02 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2648012 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-07 06:28:53 -08:00
mkumbar	2431b832e7	gpu: nvgpu: CONFIG_NVGPU_NON_FUSA cleanup for ga10b acr some part of ga10b acr blob creation code under CONFIG_NVGPU_NON_FUSA check which fails to create blob correctly for ga10b safety build. Bug 3456240 Change-Id: If246e2142daa8dac28ac9ce35f4562119a3b30aa Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2645647 (cherry picked from commit 4c52b59820804ed630836fbef9cf3e7e1a18a013) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2642679 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Shashank Singh <shashsingh@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-06 17:19:54 -08:00
mkumbar	61c6aeec41	gpu: nvgpu: disable LSPMU for ga10b safety Bug 3456240 Change-Id: I0bb9581d2df46e5cb2dea57794ee0c918394eb66 Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2645646 (cherry picked from commit aab86a06485c94546e11809cdeeefe9906e9b680) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2629878 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Shashank Singh <shashsingh@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-06 17:19:46 -08:00
Shashank Singh	a372ec9a38	gpu: nvgpu: disable golden context image verification - Disable golden context image verification until ctxsw fw for orin safety is ready for this feature. - Make NULL check for hal set_default_compute_regs else it causes crash for orin safety. Bug 3456240 Change-Id: I1f6ca9d78f22cc6776bb0b3a9e05f22171095c7f Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2645666 (cherry picked from commit 3907d1b315e1247243632fefdcbce69d58090681) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2644533 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-06 11:40:46 -08:00
Dinesh T	a47ce8eafe	gpu: nvgpu: add ipa-pa cache for qnx This is adding ipa-pa cache for HV-qnx by making the code as OS independant. NVGPU-7329 Change-Id: If003ddf323124ba0899d7ead5db5c5478ddfc6e0 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2645771 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-31 05:04:50 -08:00
Sagar Kamble	d424598b7b	gpu: nvgpu: stop nvs thread during unload nvs worker thread is created on each resume and deinitialized on every suspend. nvgpu can be resumed when process is getting killed. Thread creation can fail when the process is getting killed. That will lead to driver resume failure. To avoid the issue above, don't stop the nvs worker thread in suspend and let the first created thread handle the nvs work always. Deinitialize the nvs worker thread during nvgpu unload. Also, log the error returned by nvgpu_thread_create in the function nvgpu_worker_start. bug 3480192 Change-Id: I8d5d9e7716a950b162cc3c2d9fcfde07c4edfcf6 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2646218 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-29 09:35:03 -08:00
Martin Radev	b67a3cd053	gpu: nvgpu: ga10b: Correct VAB implementation This patch performs the following improvements for VAB: 1) It avoids an infinite loop when collecting VAB information. Previously, nvgpu incorrectly assumed that the valid bit would be eventually set for the checker when polling. It may not be set if a VAB-related fault has occurred. 2) It handles the VAB_ERROR mmu fault which may be caused for various reasons: invalid vab buffer address, tracking in protected mode, etc. The recovery sequence is to set the vab buffer size to 0 and then to the original size. This clears the VAB_ERROR bit. After reseting, the old register values are again set in the recovery code sequence. 3) Use correct number of VAB buffers. There's only one VAB buffer on ga10b, not two. 4) Simplify logic. Bug 3374805 Bug 3465734 Bug 3473147 Change-Id: I716f460ef37cb848ddc56a64c6f83024c4bb9811 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2621290 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-22 08:22:13 -08:00
Seshendra Gadagottu	b2db6a8453	gpu: nvgpu: ga10b: set-up vab buffer during gpu re-init During gpu re-initialization(rail gate exit/sc7 exit), vab buffers needs to programmed in hw by writing vab buffer address to NV_PFB_PRI_MMU_VIDMEM_ACCESS_BIT_BUFFER_LO(HI)_ADDR and setting vab entries to NV_PFB_PRI_MMU_VIDMEM_ACCESS_BIT_BUFFER_SIZE_VAL. For this moved nvgpu nvgpu_fb_vab_init_hal from nvgpu_init_mm_setup_sw to nvgpu_init_mm_support. nvgpu_init_mm_setup_sw is skipping during re-init, because sw_ready is set during first boot. Bug 3468562 Change-Id: Iee2bd4bc5165397ea4f9cca0ba6744eaf361a342 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2643244 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Martin Radev <mradev@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-21 17:15:23 -08:00
Divya	744782e088	gpu: nvgpu: add IDLE_SNAP RPC Add support for IDLE_SNAP RPC sent from PMU. This RPC event is received when ELPG is engaged and some register, which lies in powergated region, is accessed for read/write. This RPC sends information like reason for idle_snap and cached value of idle status registers. JIRA NVGPU-7327 Change-Id: I289505c43f0d4246ee1379804b575cd8902050d3 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2642951 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-21 17:14:42 -08:00
shashank singh	c22ff25097	gpu: nvgpu: move GSPLITE outside DGPU flag Jira NVGPU-7276 Change-Id: Ic69443f66f42ab5e981ce865c24fcca63f783a76 Signed-off-by: shashank singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632687 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-21 17:09:45 -08:00
Sagar Kamble	6a6562cd4d	gpu: nvgpu: ga10x: fix LTC ecc handling Notable differences from GV11B are below: 1. RSTG/TSTG uncorrected errors are supported. 2. PLTS_INTR doesn't report SEC/DED errors. Instead, PLTS_INTR3 will indicate the SEC/DED errors through CORRECTED_ERR_DSTG and UNCORRECTED_ERR_DSTG fields respectively. 3. DSTG_ECC_ADDRESS and DSTG_ECC_REPORT are deprecated. Bug 3446731 Change-Id: I60018d1b3825adcbb287dea05bc96a87f559c969 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2633959 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-17 14:36:51 -08:00
Sagar Kamble	c463810bcd	gpu: nvgpu: fix ltc isr, unit tests LTC isr doesn't handle ECC errors correctly. INTR3 reports only parity ECC errors and INTR reports SEC/DED ECC errors. nvgpu managed both these errors with same counters. Fix it as per Volta ECC HW Functional Description. JIRA NVGPU-6982 Change-Id: I6ddaab55f7e1354ad9b832672a9006b7e58df9f7 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2605012 (cherry picked from commit 5f92651e921b17cb61bbbb8954128c787cd89238) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632548 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-17 14:36:45 -08:00
Sagar Kamble	449a4823d4	gpu: nvgpu: compile out non fusa LTC functionality nvgpu_ltc_sync_enabled functionality is used only in the kernel mode submit path and for debugging. en_illegal_compstat functionality is used for debugging . Compile them out under CONFIG_NVGPU_NON_FUSA. JIRA NVGPU-6982 Change-Id: I404d4b74b2e60ba4c2173ba0bfb643b1ecb6ba7c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2605011 (cherry picked from commit f4bcafe73c8f7184b5e125e3ff6e55ceccaf87eb) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632547 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-17 14:36:40 -08:00
mkumbar	4d1fa62dd9	gpu: nvgpu: ga10b: RPC for ELPG statistics data Fetch the ELPG statistics data using RPC NV_PMU_RPC_ID_PG_PG_CTRL_STATS_GET Earlier/legacy chips, ELPG stats data is fetched from DMEM directly using the offset got from pg init command but for GA10B RPC is used to fetch the ELPG stats data. Bug 3439350 Change-Id: Ia29d423c41913cd96e44aba9dae41f73fe236dd2 Signed-off-by: Divya <dsinghatwari@nvidia.com> Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2641832 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-15 08:13:24 -08:00
Konsta Hölttä	55afe1ff4c	gpu: nvgpu: improve nvs uapi - Make the domain scheduler timeslice type nanoseconds to future proof the interface - Return -ENOSYS from ioctls if the nvs code is not initialized - Return the number of domains also when user supplied array is present - Use domain id instead of name for TSG binding - Improve documentation in the uapi headers - Verify that reserved fields are zeroed - Extend some internal logging - Release the sched mutex on alloc error - Add file mode checks in the nvs ioctls. The create and remove ioctls require writable file permissions, while the query does not; this allows filesystem based access control on domain management on the single dev node. Jira NVGPU-6788 Change-Id: I668eb5972a0ed1073e84a4ae30e3069bf0b59e16 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2639017 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-15 06:05:25 -08:00
mkumbar	b92e8530fc	gpu: nvgpu: ga10b: slcg and blcg update for PMU Load register configuration for SLCG and BLCG for PMU. Bug 3452217 Change-Id: Ib54077ee00d0b9247db8d792e5ed566fd4ca2efd Signed-off-by: mkumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2641365 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-14 06:28:19 -08:00
Konsta Hölttä	d086c678fd	gpu: nvgpu: add domain scheduler worker Move away from the prototype call in channel wdt worker and create a separate worker thread for the domain scheduler. The details of runlist domains are still encapsulated in the runlist code; the domain scheduler controls when to switch domains. Switching happens based on domain timeslices or when the current domain is deleted. The worker thread is paused on railgate and spun back on poweron. The scheduler data was also left dangling, so fix that by deinitializing all nvs-related when gk20a_remove_support() is called. The runlist domains already get freed as part of fifo removal. Jira NVGPU-6427 Change-Id: I64f42498f8789448d9becdd209b7878ef0fdb124 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632579 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-14 06:26:16 -08:00
Divya	9446cfa320	gpu: nvgpu: update golden image flag for RG seq The flag pmu->pg->golden_image_initialized is set to true during initial GPU context creation and is not cleared while the GPU goes into pm_suspend (during railgate). Hence, when the GPU resumes after un-railgate it retains the previous value which can cause ELPG to kick in immediately. Due to this, when ELPG and Railgating are enabled, IDLE_SNAP is seen for read access of gr_gpc0_tpc0_sm_arch_r reg. To resolve this, if golden image is ready set the pmu->pg->golden_image_initialized to suspend state during railgate, to delay the early enable of ELPG. Add a new pmu_init_golden_img_state hal in the NVGPU_INIT_TABLE_ENTRY. This will be called after all the GR access is done and GPU resumes completely after un-railgate. This hal will then check if golden_image_initialized flag is in suspend state, it will set it to ready state and then re-enable ELPG. Bug 3431798 Change-Id: I1fee83e66e09b6b78d385bbe60529d0724f79e79 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2639188 Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-11 14:06:49 -08:00
Debarshi Dutta	9e3566a35b	gpu: nvgpu: move netlist_defs.h to include/ According to nvgpu coding guidelines, common headers should be put in include/ directory. Updated accordingly Change-Id: I448c562734616cb6b7ff5496094a3abb65e0d7df Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2599899 (cherry picked from commit 80f12d84015a433bbca2580f300d77c39d69097a) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2633417 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-10 13:23:59 -08:00
Konsta Hölttä	c6f50ee42e	gpu: nvgpu: use correct id for rl domain deletion The index for active_runlists is meaningless outside the active_runlists array, and may break on more complex GPUs. Use runlist->id. Jira NVGPU-6425 Change-Id: Ida9d53bd5180f4e5a9fa490b5b957e3b68aa410f Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2637930 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-07 21:41:09 -08:00
Konsta Hölttä	0e8184b976	gpu: nvgpu: print call stack on no regs nvgpu_warn_on_no_regs() only logs a warning message, but that does not explain what operation attempted to access the unmapped GPU registers. Call also WARN_ON() to produce a standard big loud kernel warning with the call trace to help debug mistaken HW accesses. Also print the accessed register address. Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Change-Id: I1c70ad2273c2e162193052436e64879d996f4572 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2634860 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-07 07:10:40 -08:00
Konsta Hölttä	632644b44a	gpu: nvgpu: couple runlist domains and nvs Now that the main nvsched code exists in the nvgpu build, make it control the runlist domains. As a new nvs domain is created, create the relevant runlist data too. To support the default domain, create a default nvs domain at boot. The scheduling domain code owns the responsibility of domain lifetime, and runlist domains exist to serve that logic although the RL domains are directly used by channel and TSG logic. Add refcounting to the scheduler uapi level to make sure that busy domains (that still have TSG participants) do not get removed too early. Adjust error injection sensitive unit tests to match the updated logic. Jira NVGPU-6425 Jira NVGPU-6427 Change-Id: I1beec97c54c60ad334165b1c0acb5e827c24f2ac Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2632287 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-07 07:07:12 -08:00
Konsta Hölttä	1d14a4412f	gpu: nvgpu: scheduler management uapi Add ioctls for creating, removing and querying scheduling domains and interface with the "nvsched" entity that will be the core scheduler. Include the scheduler in the Linux build. The core scheduler code will ultimately hold data on and control what gets scheduled, but this intermediate layer in nvgpu-rm needs a bit of bookeeping to manage the userspace interface. To keep changes isolated, this does not touch the internal runlist domains yet. The core scheduler logic will eventually control the runlist domains. Jira NVGPU-6788 Change-Id: I7b4064edb6205acbac2d8c593dad019d517243ce Signed-off-by: Alex Waterman <alexw@nvidia.com> Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2463625 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-07 07:07:01 -08:00
Dinesh T	ad09e3e3cc	gpu: nvgpu: Enable sm_l1tag_surface_cut_collector This is enabling sm_l1tag_surface_cut_collector at gpu boot. This is done with adding new hal "set_sm_l1tag_surface_collector" that sets l1tag_surface_cut_collector in the sm_l1tag_ctrl register. Bug 2557724 Change-Id: I869e3bfa563db204259e7a464657229632f182d9 Signed-off-by: Dinesh T <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2634878 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-06 04:36:56 -08:00
mpoojary	41b7299201	gpu: nvgpu: zero blob size support for rail-gating. Add support to pass ucode blob size as '0' while rail-gating. Bug 200776471 Change-Id: Ib178bc2f8881a1e49c874be346b0e712d4aca923 Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2613466 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-04 11:59:44 -08:00
Konsta Hölttä	4a7e5056a5	gpu: nvgpu: disable gsp isr on suspend nvgpu_gsp_sw_deinit() is called so late that the GPU HW is not expected to be available, so it must not call nvgpu_gsp_isr_support(). Move that call to nvgpu_prepare_poweroff(). The gsp isr is still enabled in gsp bootstrap as before. Change-Id: I84276ad377158a5fdb11931bd188e6d82bafc3df Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2635681 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-04 11:13:29 -08:00
Konsta Hölttä	23f6da4fe5	gpu: nvgpu: add should stop condition to workers As the docs of nvgpu_thread say, each thread (which the worker loop is) should wake up and check also nvgpu_thread_should_stop() to manage graceful and quick exit as requested. The loop does have that check already, but the workqueue condition does not, so the cond wait might end up waiting until its timeout hits. It's not robust to trust the worker users to have a swift timeout for exiting the thread, so read the should-stop flag in the wakeup condition too. Simplify the clk arb worker ops now that calling nvgpu_worker_should_stop from there is no longer necessary. (Other worker users did not have those, so they were technically buggy.) Change-Id: I5409b8037564d4b6445a15cdbd4f1f3d616c4083 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2635808 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-12-03 08:47:16 -08:00
Deepak Nibade	9f55801a15	gpu: nvgpu: move local golden context memory allocation to poweorn - Separate out local golden context memory allocation from nvgpu_gr_global_ctx_init_local_golden_image() into a new function nvgpu_gr_global_ctx_alloc_local_golden_image(). - Add a new member local_golden_image_copy to struct nvgpu_gr_obj_ctx_golden_image to store copy used for context verification. - Allocate local golden context memory from nvgpu_gr_obj_ctx_init() which is called during poweron path. - Remove memory allocation from nvgpu_gr_obj_ctx_save_golden_ctx(). - Disable test test_gr_obj_ctx_error_injection since it needs rework to accomodate the new changes. - Fix below tests to allocate local golden context memory : test_gr_global_ctx_local_ctx_error_injection test_gr_setup_alloc_obj_ctx Bug 3307637 Change-Id: I2f760d524881fd328346838ea9ce0234358f8e51 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2633713 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-12-01 08:44:30 -08:00

... 3 4 5 6 7 ...

3348 Commits