linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 09:12:24 +03:00

Author	SHA1	Message	Date
Kishan Palankar	2eabcdb8a4	gpu: nvgpu: Guard profiler_objects list operations with a lock Both profiler and debugger device nodes access and update the list, g->profiler_objects. List operations were previously not guarded by a lock, leading to synchronisation issues. Stress-ng test attempts to trigger repeated random open close sessions on all the device nodes exposed by gpu. This results in kernel panic at random stages of test. Failure signature - Profiler node receives a release call and as part of it, nvgpu_profiler_free attempts to delete the prof_obj_entry and free the prof memory. Simultaneously debugger node also receives a release call and as part of gk20a_dbg_gpu_dev_release, nvgpu attempts to access g->profiler_objects to check for any profiling sessions associated with debugger node. There is a race to access the list which results in kernel panic for address 0x8 because nvgpu tries to access prof_obj->session_id which is at offset 0x8. As part of this change, g->profiler_objects list access/update is guarded with a mutex lock. Bug 4858627 Change-Id: I1e2cf8d27d195bbc9c012cf511029de9eaadb038 Signed-off-by: Kishan Palankar <kpalankar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/3239897 GVS: buildbot_gerritrpt <buildbot_gerritrpt@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com>	2024-11-07 08:53:58 -08:00
Martin Radev	81d95456b9	gpu: nvgpu: Rename PLATFORM_ATOMIC to SYSTEM_COHERENT To support current and future usecases, it would be beneficial to select the SYSTEM_COHERENT aperture explicitly. The benefits are: - platform atomic code is cleaned-up. - userspace can select the SYSTEM_COHERENT aperture for any specific usecases. Bug 3959027 Change-Id: I6489ebe87fa75cc760930277bad5e0cacca80eb6 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2864177 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-21 11:31:53 -07:00
srajum	02834f8739	gpu: nvgpu: fix CERT-C issues - CID 10165014 Dereference before null check - CID 10166579 Unused value Bug 3952896 Change-Id: I6a7f2b97b4a6519272607e560d09c138048bd665 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2872276 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-03-21 02:37:08 -07:00
Sagar Kamble	f1896e0a64	gpu: nvgpu: acquire tsg ctx_init_lock when changing ctx state GR context associated with channel is updated in various driver paths. Sequence to do the same is disable the TSG, preempt the TSG, update the GR context or instance block and then enable the TSG. These operations and runlist updates for channel have to be done under TSG specific ctx_init_lock to avoid the race. suspend_contexts and resume_contexts needs special handling which is not covered in this patch. Bug 3677982 Change-Id: I837257fe9d9ef3eb6f69f5d7e0707e0bb6d4ea72 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2720222 Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:36 -07:00
atanand	eae4593343	gpu: nvgpu: add ioctl to configure implicit ERRBAR Add ioctl support to configure implicit ERRBAR by setting/unsetting NV_PGRAPH_PRI_GPCS_TPCS_SM_SCH_MACRO_SCHED register. Add gpu characteristics flag: NVGPU_SCHED_EXIT_WAIT_FOR_ERRBAR_SUPPORTED to allow userspace driver to determine if implicit ERRBAR ioctl is supported. Bug: 200782861 Change-Id: I530a4cf73bc5c844e8d73094d3e23949568fe335 Signed-off-by: atanand <atanand@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2718672 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-08-05 23:10:18 -07:00
Sagar Kamble	f95cb5f4f8	gpu: nvgpu: maintain ctx buffers mappings separately from ctx mems In order to maintain separate mappings of GR TSG and global context buffers for different subcontexts, we need to separate the memory struct and the mapping struct for the buffers. This patch moves the mappings of all GR ctx buffers to new structure nvgpu_gr_ctx_mappings. This will be instantiated per subcontext in the upcoming patches. Summary of changes: 1. Various context buffers were allocated and mapped separately. All TSG context buffers are now stored in gr_ctx->mem[] array since allocation and mapping is unified for them. 2. Mapping/unmapping and querying the GPU VA of the context buffers is now handled in ctx_mappings unit. Structure nvgpu_gr_ctx_mappings in nvgpu_gr_ctx holds the maps. On ALLOC_OBJ_CTX this struct is instantiated and deleted on free_gr_ctx. 3. Introduce mapping flags for TSG and global context buffers. This is to map different buffers with different caching attribute. Map all buffers as cacheable except PRIV_ACCESS_MAP, RTV_CIRCULAR_BUFFER, FECS_TRACE, GR CTX and PATCH ctx buffers. Map all buffers as privileged. 4. Wherever VM or GPU VA is passed in the obj_ctx allocation functions, they are now replaced by nvgpu_gr_ctx_mappings. 5. free_gr_ctx API need not accept the VM as mappings struct will hold the VM. mappings struct will be kept in gr_ctx. 6. Move preemption buffers allocation logic out of nvgpu_gr_obj_ctx_set_graphics_preemption_mode. 7. set_preemption_mode and gr_gk20a_update_hwpm_ctxsw_mode functions need update to ensure buffers are allocated and mapped. 8. Keep the unit tests and documentation updated. With these changes there is clear segregation of allocation and mapping of GR context buffers. This will simplify further change to add multiple address spaces support. With multiple address spaces in a TSG, subcontexts created after first subcontext just need to map the buffers. Bug 3677982 Change-Id: I3cd5f1311dd85aad1cf547da8fa45293fb7a7cb3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2712222 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-07-15 07:10:11 -07:00
Sagar Kamble	bfa20f62c6	gpu: nvgpu: add/remove l2 cache flush when updating the ctx buffers gr ctx buffer is non-cacheable hence there is no need to do L2 cache flush when updating the buffer. Remove the flushes. pm ctx buffer is cacheable hence add l2 flush in the function nvgpu_profiler_quiesce_hwpm_streamout_non_resident since it updates the buffer. Bug 3677982 Change-Id: I0c15ec7a7f8fa250af1d25891122acc24443a872 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2713916 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-06-24 12:08:54 -07:00
Sagar Kamble	e80d74b810	gpu: nvgpu: validate return of nvgpu_tsg_get_sm_error_state nvgpu_tsg_get_sm_error_state already checks the sm_id and tsg->sm_error_state. No need to check these before calling nvgpu_tsg_get_sm_error_state. CID 484927 CID 299106 Bug 3512546 Change-Id: I02a05d8686cf7027cfc271f470198e7985dc4e16 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2722470 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-06-13 10:58:31 -07:00
Sagar Kamble	45c6aed68d	gpu: nvgpu: fix CERT violations in nvgpu_dbg_gpu_access_gpu_va Update nvgpu_dbg_gpu_access_gpu_va to: 1. Ensure that integer conversions do not result in lost or misinterpreted data. 2. Do not dereference null pointers. CID 436748 CID 473585 CID 254272 CID 490303 Bug 3512546 Change-Id: I551484b671aa48175a8cea119885eac478c2731c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2707019 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-05-07 23:24:44 -07:00
Jon Hunter	86c0a696ed	gpu: nvgpu: Fix build for Linux v5.18 Upstream commit 7938f4218168 ("dma-buf-map: Rename to iosys-map") renames 'struct dma_buf_map' to 'struct iosys_map' and breaks building the NVGPU driver with Linux v5.18-rc1. In the NVGPU driver there are many places where 'dma_buf_map' is used and so to clean-up the code and minimise the impact of this change, add a gk20a_dmabuf_vmap() and a gk20a_dmabuf_vunmap() helper function. These new functions support all kernel versions and eliminate a lot of the KERNEL_VERSION ifdefs. Bug 3598986 Change-Id: Id0f904ec0662f20f3d699b74efd9542d12344228 Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2693970 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-04-12 16:34:10 -07:00
prsethi	3651d1150d	gpu: nvgpu: update kmdi interfaces Patch updates/fixes the following issues. - Updates nvgpu_dbg_gpu_get_mappings_entry.size to u64 to address >4G limitations. - Removes offset from original cpuva and unmaps only original mapped address. - Call nvgpu_vm_find_mapped_buf_range() in place of nvgpu_vm_find_mapped_buf() to find the addresses which are not page aligned. - Update logic to parse the gpuva while trying to find gpu mappings so that gpuva which are more than the mapped buffer base address can also be considered. Bug 200722275 Change-Id: If33d85db37a9f03a662984c212544a8b2ade471c Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2612129 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-17 10:15:04 -07:00
Antony Clince Alex	c0f4723339	gpu: nvgpu: perbuf: update PMA buffer mapping The PMA unit can only access GPU VAs within a 4GB window, hence both the user allocated PMA buffer and the kernel allocated bytes available buffer should lie in the same 4GB window. This is accomplished by carving out and reserving a 4GB VA space in perbuf.vm and using fixed GPU VAs to ensure that both buffers are bound within the same 4GB window. In addition, update ALLOC_PMA_STREAM to use pma_buffer_offset, pma_buffer_map_size fields correctly. Bug 3503708 Change-Id: Ic5297a22c2db42b18ff5e676d565d3be3c1cd780 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671637 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-07 15:17:35 -08:00
Sagar Kamble	29a0a146ac	gpu: nvgpu: fix coverity defects Fix following coverity defects: ioctl_prof.c resource leak ioctl_dbg.c logically dead code global_ctx.c identical code for branches therm_dev.c resource leak pmu_pstate.c unused value nvgpu_mem.c dead default in switch tsg.c Dereference before null check nvlink_gv100.c logically dead code nvlink.c Out-of-bounds write fifo_vgpu.c Dereference null return value pmu_pg.c Dereference before null check fw_ver_ops.c Identical code for different branches boardobjgrp.c Dereference after null check boardobjgrp.c Dereference before null check boardobjgrp.c Dereference after null check engines.c Dereference before null check nvgpu_init.c Unused value CID 10127875 CID 10127820 CID 10063535 CID 10059311 CID 10127863 CID 9875900 CID 9865875 CID 9858045 CID 9852644 CID 9852635 CID 9852232 CID 9847593 CID 9847051 CID 9846056 CID 9846055 CID 9846054 CID 9842821 Bug 3460991 Change-Id: I91c215a545d07eb0e5b236849d5a8440ed6fe18d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2657444 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-01-28 04:50:12 -08:00
Divya	d538737ba1	gpu: nvgpu: Add ELPG_MS protected call for L2 flush - if L2 flush is done when ELPG_MS feature is engaged then it can cause some of the signals to go non-idle. This can cause idle snap in ELPG_MS. - To avoid the idle snap, add elpg_ms protected call before L2 flush operation Bug 200763448 Change-Id: I651875bc051c3b7d26d2bb0b593083512a5765b2 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2599459 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-10-22 06:20:13 -07:00
prsethi	dd94573e55	gpu: nvgpu: Update KMDI mapping interface Finding gpu va mappings inside a given range is a two-step process: the first step queries the number of mappings, and the second step queries all the continuous mapping ranges for that given gpu va range. The mapping interface should count and return the number of mappings if the input count is 0, instead of failing. This patch implements the two-step process: the first step returns only the count, and the second step returns the continuous memory ranges. The patch also replaces nvgpu_zalloc with nvgpu_big_zalloc to handle bigger size allocations. Bug 200722275 Change-Id: I56428deafa560ac8471c78f102bb1f9dbe20cabc Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2591043 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-09-15 09:16:06 -07:00
Debarshi Dutta	791dc18666	gpu: nvgpu: bvec for struct nvgpu_tsg_sm_error_state fields Add Setter and Getter methods for accessing tsg->sm_error_states. Getter returns a constant pointer for struct nvgpu_tsg_sm_error_state. This renders it unnecessary to add BVEC for above fields for the struct in multiple locations. The current design ensures that only a constant pointer is obtained from the owner unit i.e. FIFO. The following new methods are added. Both unit tests and BVEC tests are added for them as well. nvgpu_tsg_store_sm_error_state nvgpu_tsg_get_sm_error_state Jira NVGPU-6947 Change-Id: I82c22a2774862c8579baa41b6fb8292fa164704a Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> (cherry picked from commit 79574638671a0c6efe41cd3423668fcd1bd96826) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2556938 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-09-13 20:57:09 -07:00
Richard Zhao	77f0ab6583	gpu: nvgpu: remove gpu_va update_hwpm_ctxsw_mode Since gpu server can now allocate va itself, update_hwpm_ctxsw_mode does not need to fixed map pm ctx anymore. Jira GVSCI-10977 Change-Id: If592c8a2eb6dbfd7d922c79c87871162e9d8d8a4 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2546192 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-28 18:10:18 -07:00
Richard Zhao	9b66fca165	gpu: nvgpu: move .exec_regops to only execute regops HAL .exec_regops used to first validate regops then execute it, now moving it to only execute the regops. - It helps B0CC on HV. On server side it does not track profiler object, but regops validation uses the profiler, so moving validation to client side. - The change also removes ctx_buffer_offset checking in validate_reg_op_offset. The offset is already checked against whitelists, which are verified when the whitelist is updated. Also vgpu does not have information of ctx and golden image. - Added function nvgpu_regops_exec to cover both regops validation and execution. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I434e027290e263a8a64a25a55500f7294038c9c4 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2534252 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:40 -07:00
Lakshmanan M	df87591b7d	gpu: nvgpu: Add multi gr handling for debugger and profiler 1) Added multi gr handling for dbg_ioctl apis. 2) Added nvgpu_assert() in gr_instances.h (for legacy mode). 3) Added multi gr handling for prof_ioctl apis. 4) Added multi gr handling for profiler. 5) Added multi gr handling for ctxsw enable/disable apis. 6) Updated update_hwpm_ctxsw_mode() HAL for multi gr handling. JIRA NVGPU-5656 Change-Id: I3024d5e6d39bba7a1ae54c5e88c061ce9133e710 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538761 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-04 18:07:47 -07:00
Vedashree Vidwans	86cb03d2f1	gpu: nvgpu: Replace WAR keyword with "fix" Replace/remove "WAR" keyword in the comments in nvgpu driver with "fix". Rename below functions and corresponding gops to replace "war" word with "errata" word: - g.pdb_cache_war_mem - ramin.init_pdb_cache_war - ramin.deinit_pdb_cache_war - tu104_ramin_init_pdb_cache_war - tu104_ramin_deinit_pdb_cache_war - fb.apply_pdb_cache_war - tu104_fb_apply_pdb_cache_war - nvgpu_init_mm_pdb_cache_war - nvlink.set_sw_war - gv100_nvlink_set_sw_war Jira NVGPU-6680 Change-Id: Ieaad2441fac87e4544eddbca3624b82076b2ee73 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2515700 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-28 19:14:49 -07:00
Prateek sethi	d6d1b03496	gpu: nvgpu: implement ioctls to access GPU VA ranges Patch adds below two ioctls to access GPU VA. - NVGPU_DBG_GPU_IOCTL_GET_MAPPINGS - NVGPU_DBG_GPU_IOCTL_ACCESS_GPU_VA Bug 2108651 Bug 2543387 Change-Id: Iebcfa777c1a623eda070a866aed069ca9b3ec49d Signed-off-by: Prateek sethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2383317 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-04-10 13:43:40 -07:00
Mayur Poojary	6277d57936	gpu: nvgpu: Add new api for setting longer timeslice on dbg node Add new ioctl api for setting longer timeslice and get timeslice inside 'dbg' dev node. Update ioctl gpu_get_characteristic to pass the max timeslice value Add debugfs to access and change the max timeslice value Bug 1842244 Change-Id: I7e80f59162cf5d90496f9752fc128f5fa8dcc7d2 Signed-off-by: Mayur Poojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2471569 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-04-06 04:37:38 -07:00
Deepak Nibade	869735cda4	gpu: nvgpu: add dynamic allowlist support Add gv11b and tu104 HALs to get allowed HWPM resource register ranges, offsets, and stride meta data. Add new enum nvgpu_pm_resource_hwpm_register_type for HWPM register type. Add new struct nvgpu_pm_resource_register_range_map to store all the register ranges for HWPM resources. Add pointer of map in struct nvgpu_profiler_object along with map entry count. Add new API nvgpu_profiler_build_regops_allowlist() to build the regops allowlist dynamically while binding the resources. Map entry count is received with get_pm_resource_register_range_map_entry_count() and only those resource ranges are added for which resource is reserved by profiler object. Add nvgpu_profiler_destroy_regops_allowlist() to destroy the allowlist while unbinding the resources. Add static functions allowlist_range_search() to search a register offset in HWPM resource ranges. Add another static function allowlist_offset_search() to search the offset in per-resource offset list. Add nvgpu_profiler_validate_regops_allowlist() that accepts an offset value, checks if it is in allowed ranges using allowlist_range_search() and then checks if offset is in allowlist using allowlist_offset_search(). Update gops.regops.exec_regops() to receive profiler object pointer as a parameter. Invoke nvgpu_profiler_validate_regops_allowlist() from validate_reg_ops() if prof pointer is not-null. This will be true only for new profiler stack and not legacy profilers. In gr_exec_ctx_ops(), skip regops execution if offset is invalid. Bug 2510974 Jira NVGPU-5360 Change-Id: I40acb91cc37508629c83106ea15b062250bba473 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460001 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-05 12:38:06 -08:00
Lili Sang	3f0ea98b73	gpu: nvgpu: Add get_gr_context support for Linux. Implement the feature of retrieving gr context contents for all chips. Two IOCTLs, NVGPU_DBG_GPU_IOCTL_GET_GR_CONTEXT_SIZE and _GET_GR_CONTEXT, are added. Bug 3102903 Change-Id: If11006f4e294f190785a2c3159ca491b9f3b5187 Signed-off-by: Lili Sang <lilis@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2449183 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Chris Johnson <cwj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Deepak Nibade	be9271d721	gpu: nvgpu: add API to extract gk20a pointer from cdev Add new API nvgpu_get_gk20a_from_cdev() that extracts gk20a pointer from cdev pointer. This helps in keeping cdev related implementation details in ioctl.c and away from other device ioctl files. Also move struct nvgpu_cdev, nvgpu_class, and nvgpu_cdev_class_priv_data from os_linux.h to ioctl.h since all of these structures are more IOCTL related and better to keep them in ioctl specific header. Jira NVGPU-5648 Change-Id: Ifad8454fd727ae2389ccf3d1ba492551ef1613ac Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435466 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	a3e39c685d	gpu: nvgpu: track dev nodes using dynamic linked list Remove static dev node meta data from struct nvgpu_os_linux and replace it by a dynamic list. Struct nvgpu_os_linux will only keep track of list head and number of entries. Add new structure nvgpu_cdev to store meta data of each dev node and create/setup it dynamically in gk20a_user_init(). Once done, add the new node under list head maintained in nvgpu_os_linux. Add a static list dev_node_list[] that contains list of dev node names and file operations. This static list is used to create nvgpu_cdev data structures and to register new device nodes. Update all dev node open file operations (e.g. gk20a_as_dev_open()) to extract struct gk20a pointer from device pointer of dev node. gk20a device is the parent of dev node device. Jira NVGPU-5648 Change-Id: If070c3428afd6215e45b4919335d9f43e04c36f9 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428500 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	d020778c55	gpu: nvgpu: reserve pma stream for legacy profiler Legacy profiler does not reserve PMA stream resource with PM reservation system. Also, HWPM system reset is separately implemented in membuf disable path. And it does not even restore perf unit SLCG prod values. Allocate a dummy profiler object for debug session in perfbuf map path. Free it in perfbuf unmap path. This has advantage of synchronizing PMA stream reservation with new profiler stack. And this also leverages HWPM system reset and SLCG handling code during resource reservation. Remove explicit HWPM reset from gops.perf.membuf_reset_streaming() HALs Bug 2510974 Jira NVGPU-5360 Change-Id: I54c5202b6251dea3d80a4dfc011e8a296339e07f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2399595 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	2012a6b558	gpu: nvgpu: add profiler api to execute regops Implement new API nvgpu_prof_ioctl_exec_reg_ops() to support regops on new profiler objects. Add two new staging buffers to hold regops copied from userspace, and to convert and execute regops in common code. Buffers are allocated and released along with the profiler object. New API implements this : - copy regops data in chunks of 4K from userspace - store them in staging buffer - convert the new regop struct into common regop struct and also copy the content into second staging buffer - trigger gops.regops.exec_regops() with second staging buffer as operation pointer - convert common regop struct back into new regop struct and copy back to userspace Export a bunch of helper functions from ioctl_dbg.h. e.g. nvgpu_get_regops_op_values_common() Update regop execution code to skip regop execution if regop status is not valid. This is only possible when userspace requests for CONTINUE_ON_ERROR mode. Add more documentation to some of the fields in UAPI header. Note that maximum atomic operations reported by new API are same as legacy API and are incorrect. This will be fixed up in upcoming patches. Bug 2510974 Jira NVGPU-5360 Change-Id: I9f82052b22143aec33f6e778c0784386744b699e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2394208 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	6daa0636d1	gpu: nvgpu: rework regops execution API Rework regops execution API to accommodate below updates for new profiler design - gops.regops.exec_regops() should accept TSG pointer instead of channel pointer. - Remove individual boolean parameters and add one flag field. Below new flags are added to this API : NVGPU_REG_OP_FLAG_MODE_ALL_OR_NONE NVGPU_REG_OP_FLAG_MODE_CONTINUE_ON_ERROR NVGPU_REG_OP_FLAG_ALL_PASSED NVGPU_REG_OP_FLAG_DIRECT_OPS Update other APIs, e.g. gr_gk20a_exec_ctx_ops() and validate_reg_ops() as per new API changes. Add new API gk20a_is_tsg_ctx_resident() to check context residency from TSG pointer. Convert gr_gk20a_ctx_patch_smpc() to a HAL gops.gr.ctx_patch_smpc(). Set this HAL only for gm20b since it is not required for later chips. Also, remove subcontext code from this function since gm20b does not support subcontext. Remove stale comment about missing vGPU support in exec_regops_gk20a() Bug 2510974 Jira NVGPU-5360 Change-Id: I3c25c34277b5ca88484da1e20d459118f15da102 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389733 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	a73b5d3c6f	gpu: nvgpu: use smpc global mode capability check In nvgpu_dbg_gpu_ioctl_smpc_ctxsw_mode(), check if SMPC global mode capability is supported instead of checking for the function pointer. Enable the capability only for Turing since pre-Turing GPUs don't support it. Bug 2510974 Jira NVGPU-5360 Change-Id: I352fb2a91b836cd8ef727966a53a28255d8ea834 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2389653 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	f34711d3de	gpu: nvgpu: split perfbuf initialization gk20a_perfbuf_map() allocates perfbuf VM, maps the user buffer into new VM, and then triggers gops.perfbuf.perfbuf_enable(). This HAL then does following : - Allocate perfbuf instance block - Initialize perfbuf instance block - Reset stream buffer - Program instance block address in PMA registers - Program user buffer address into PMA registers New profiler interface will have its own API to set up the PMA stream, and it requires above setup to be done in two phases of perfbuf initialization and then user buffer setup. Split above functionalities into below functions - nvgpu_perfbuf_init_vm() - Allocate perfbuf VM - Call gops.perfbuf.init_inst_block() to initialize perfbuf instance block - gops.perfbuf.init_inst_block() - Allocate perfbuf instance block - Initialize perfbuf instance block - Program instance block address in PMA registers using gops.perf.init_inst_block() - In case of vGPU, trigger TEGRA_VGPU_CMD_PERFBUF_INST_BLOCK_MGT command to gpu server - gops.perf.init_inst_block() - Reset stream buffer - Program user buffer address into PMA registers Also add corresponding cleanup functions as below : gops.perf.deinit_inst_block() gops.perfbuf.deinit_inst_block() nvgpu_perfbuf_deinit_vm() Bug 2510974 Jira NVGPU-5360 Change-Id: I486370f21012cbb7fea84fe46fb16db95bc16790 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372984 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	39a3854584	gpu: nvgpu: support SMPC global mode Add tu104 specific HAL tu104_gr_falcon_ctrl_ctxsw() that processes below CTXSW methods to start/stop SMPC global mode : NVGPU_GR_FALCON_METHOD_START_SMPC_GLOBAL_MODE NVGPU_GR_FALCON_METHOD_STOP_SMPC_GLOBAL_MODE Add new tu104 specific HAL tu104_gr_update_smpc_global_mode() to trigger SMPC global mode start/stop using gops.gr.falcon.ctrl_ctxsw(). Update nvgpu_dbg_gpu_ioctl_smpc_ctxsw_mode() to enable/disable SMPC global mode if channel is not bound to debug session. Bug 2510974 Bug 2257799 Jira NVGPU-5360 Change-Id: I1f9d8f2a2d30a4738f291db3fc72c400d24f4048 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2368696 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	08308bc936	gpu: nvgpu: rework pm resource reservation system Current PM resource reservation system is limited to HWPM resources only. And reservation tracking is done using boolean variables. New upcoming profiler support requires reservation for all the PM resources like SMPC and PMA stream. Using boolean variables is not scalable and confusing. Plus the variables have to be replicated on gpu server in case of virtualization. Remove flag tracking mechanism and use list based approach to track all PM reservations. Also, current HALs are defined on debugger object. Implement new HALs in new pm_reservation object since it is really an independent functionality. Add new source file common/profiler/pm_reservation.c which implements functions to reserve/release resources and to check if any resource is reserved or not. Add common/vgpu/pm_reservation_vgpu.c for vGPU which simply forwards the request to gpu server. Define new HAL object gops.pm_reservation and assign above functions to below respective HALs : g->ops.pm_reservation.acquire() g->ops.pm_reservation.release() g->ops.pm_reservation.release_all_per_vmid() Last HAL above is only used for gpu server cleanup of guest OS. Add below new common profiler functions that act as APIs to reserve/ release resources for rest of the units in nvgpu. nvgpu_profiler_pm_resource_reserve() nvgpu_profiler_pm_resource_release() Initialize the meta data required for reservation system in nvgpu_pm_reservation_init() and call it during nvgpu_finalize_poweron. Clean up the meta data before releasing struct gk20a.
Delete below HALs : g->ops.debugger.check_and_set_global_reservation() g->ops.debugger.check_and_set_context_reservation() g->ops.debugger.release_profiler_reservation() Bug 2510974 Jira NVGPU-5360 Change-Id: I4d9f89c58c791b3b2e63099a8a603462e5319222 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2367224 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	7466369a58	gpu: nvgpu: update hwpm/smpc ctxsw mode API to accept TSG Below APIs to update hwpm/smpc ctxsw mode take a channel pointer as a parameter. APIs then extract corresponding TSG from channel and perform various operations on context stored in TSG. g->ops.gr.update_smpc_ctxsw_mode() g->ops.gr.update_hwpm_ctxsw_mode() Update both above APIs to accept TSG pointer instead of a channel. This is a refactor work to support new profiler design where a profiler object is bound to TSG and keeps track of TSG only. Bug 2510974 Jira NVGPU-5360 Change-Id: Ia4cefda503d8420f2bd32d07c57534924f0f557a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2366122 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	d869040d7a	gpu: nvgpu: rename profiler object structure Rename profiler object structure from struct dbg_profiler_object_data to struct nvgpu_profiler_object. Annotate the structure members appropriately. Bug 2510974 Change-Id: I9454388f8ad143b39daca6bbc2b12511ffa3fd95 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2365675 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	e4e6be85ea	gpu: nvgpu: move profiler alloc/free APIs to separate file Move profiler object allocation/free APIs to separate profiler specific file common/profiler.c. Store struct gk20a pointer in struct dbg_profiler_object_data for convenience of accessing global struct pointer. Update profiler object to store TSG pointer instead of channel pointer, since the expectation is to have one profiler object per context/TSG. nvgpu_profiler_reserve_acquire() has a case to check if resource reservation is acquired by some other channel in TSG. But now since we keep track of TSG itself, this case becomes redundant and can be removed. All the support is compiled out of safety build with compile flag CONFIG_NVGPU_PROFILER. Linux will always compile the support. Bug 2510974 Change-Id: I197bbd67a9cdd1fbea42f1effd1b74b15a6068e5 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2365674 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	630eaa46cb	gpu: nvgpu: update the config options & makefile Added dependency between the Kconfig options as follows where '->' indicates 'depends on' relation: SUPPORT_CDE -> COMPRESSION -> DMABUF_HAS_DRVDATA DGPU -> GK20A_PCI Defined Kconfig option for VPR and for DGPU that is dependent on GK20A_PCI as well. DGPU related sources are now compiled under config flag DGPU. Also update conditional compilation of the driver paths w.r.t DGPU, VPR and COMPRESSION flags. Bug 2834141 Change-Id: Ia0a39d6d4cf8b36e7f955b7355a5ab41783f821c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2299627 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Thomas Fleury	f422aee393	gpu: nvgpu: use refcnt for ch mmu_debug_mode Replaced ch->mmu_debug_mode_enabled with ch->mmu_debug_mode_refcnt. If channel is enabled multiple times by userspace, then ref count is updated accordingly. There is an expectation that enable/disable calls are balanced for setting channel's mmu debug mode. When unbinding the channel, decrease refcnt for the channel until it reaches 0. Also, removed tsg parameter from nvgpu_tsg_set_mmu_debug_mode as it can be retrieved from ch. Bug 2515097 Change-Id: If334e374a55bd14ae219edbfd3b1fce5ff25c226 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2184702 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-08-28 16:54:51 -07:00
Thomas Fleury	8057514a9f	gpu: nvgpu: set FB/HSMMU debug mode Set NV_PFB_HSMMU_PRI_MMU_DEBUG_CTRL and NV_PFB_PRI_MMU_DEBUG_CTRL in addition to NV_PGRAPH_PRI_GPCS_MMU_DEBUG_CTRL, in NVGPU_DBG_GPU_IOCTL_SET_CTX_MMU_DEBUG_MODE Bug 2515097 Change-Id: I1763b43e79fac3edb68a35980683d58bfa89519f Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2115785 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-08-28 16:54:26 -07:00
Philip Elcan	91187b6db2	gpu: nvgpu: init: rename init functions Rename init functions that still carry the gk20a moniker to use the more appropriate nvgpu name instead. JIRA NVGPU-2385 Change-Id: I5d40cd72943272c8b5f16b97d9a786d9c41496d4 Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2156220 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-07-23 13:27:18 -07:00
Philip Elcan	9705c86b98	gpu: nvgpu: init: move functions from gk20a.h to own header This moves the nvgpu.common.init function prototypes from gk20a.h to a new unit-specific header nvgpu_init.h JIRA NVGPU-2385 Change-Id: I48c0b0e02a8064be0eda89f26cf55189ffd55803 Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2133845 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-07-23 13:26:12 -07:00
Deepak Nibade	0755b25231	gpu: nvgpu: remove reset and enable/disable ctxsw hals Remove below hals since the corresponding functions are same on all platforms and they are h/w independent g->ops.gr.enable_ctxsw() g->ops.gr.disable_ctxsw() g->ops.gr.reset() Call the functions directly at all places Remove CONFIG_NVGPU_DEBUGGER from places where these functions are called since they are not debugger dependent This also helps to disable CONFIG_NVGPU_DEBUGGER and to keep recovery sequence intact Jira NVGPU-3506 Change-Id: Id2b208ca23dc4667e78edcd8ad242a8558e0ff64 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2137255 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-18 01:39:20 -07:00
Sagar Kamble	3f08cf8a48	gpu: nvgpu: rename feature Make and C flags Name the Make and C flag variables consistently with syntax: CONFIG_NVGPU_<feature name> s/NVGPU_DEBUGGER/CONFIG_NVGPU_DEBUGGER s/NVGPU_CYCLESTATS/CONFIG_NVGPU_CYCLESTATS s/NVGPU_USERD/CONFIG_NVGPU_USERD s/NVGPU_CHANNEL_WDT/CONFIG_NVGPU_CHANNEL_WDT s/NVGPU_FEATURE_CE/CONFIG_NVGPU_CE s/NVGPU_GRAPHICS/CONFIG_NVGPU_GRAPHICS s/NVGPU_ENGINE/CONFIG_NVGPU_FIFO_ENGINE_ACTIVITY s/NVGPU_FEATURE_CHANNEL_TSG_SCHED/CONFIG_NVGPU_CHANNEL_TSG_SCHED s/NVGPU_FEATURE_CHANNEL_TSG_CONTROL/CONFIG_NVGPU_CHANNEL_TSG_CONTROL s/NVGPU_FEATURE_ENGINE_QUEUE/CONFIG_NVGPU_ENGINE_QUEUE s/GK20A_CTXSW_TRACE/CONFIG_NVGPU_FECS_TRACE s/IGPU_VIRT_SUPPORT/CONFIG_NVGPU_IGPU_VIRT s/CONFIG_TEGRA_NVLINK/CONFIG_NVGPU_NVLINK s/NVGPU_DGPU_SUPPORT/CONFIG_NVGPU_DGPU s/NVGPU_VPR/CONFIG_NVGPU_VPR s/NVGPU_REPLAYABLE_FAULT/CONFIG_NVGPU_REPLAYABLE_FAULT s/NVGPU_FEATURE_LS_PMU/CONFIG_NVGPU_LS_PMU s/NVGPU_FEATURE_POWER_PG/CONFIG_NVGPU_POWER_PG JIRA NVGPU-3624 Change-Id: I8b2492b085095fc6ee95926d8f8c3929702a1773 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2130290 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-11 09:46:24 -07:00
Deepak Nibade	455b0da253	gpu: nvgpu: add debugger flag for regops support Add NVGPU_DEBUGGER flag for regops API and hals Jira NVGPU-3505 Change-Id: I9f2b850c881bf05f8ba5b6ef1f59f0d73a948cde Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2130146 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-06 16:27:58 -07:00
Thomas Fleury	97762279b7	gpu: nvgpu: make nvgpu_init_mutex return void Make the nvgpu_init_mutex function return void. In linux case, this doesn't affect anything since mutex_init returns void. For posix, we assert() and die if pthread_mutex_init fails. This alleviates the need to error inject for _every_ nvgpu_mutex_init function in the driver. Jira NVGPU-3476 Change-Id: Ibc801116dc82cdfcedcba2c352785f2640b7d54f Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2130538 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-05 10:25:52 -07:00
Thomas Fleury	a1248d87fe	gpu: nvgpu: add refcounting for MMU debug mode GPC MMU debug mode should be set if at least one channel in the TSG has requested it. Add refcounting for MMU debug mode, to make sure debug mode is disabled only when no channel in the TSG is using it. Bug 2515097 Change-Id: Ic5530f93523a9ec2cd3bfebc97adf7b7000531e0 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2123017 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-01 06:36:14 -07:00
Debarshi Dutta	f39a5c4ead	gpu: nvgpu: rename gk20a_channel_* APIs Renamed gk20a_channel_* APIs to nvgpu_channel_* APIs. Removed unused channel API int gk20a_wait_channel_idle Renamed nvgpu_channel_free_usermode_buffers in os/linux-channel.c to nvgpu_os_channel_free_usermode_buffers to avoid conflicts with the API with the same name in channel unit. Jira NVGPU-3248 Change-Id: I21379bd79e64da7e987ddaf5d19ff3804348acca Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2121902 Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-21 09:26:16 -07:00
Debarshi Dutta	4c30bd599f	gpu: nvgpu: rename tsg_gk20a/gk20a_tsg functions. rename the functions with the prefixes tsg_gk20a/gk20a_tsg to nvgpu_tsg_* Jira NVGPU-3248 Change-Id: I9f5f601040d994cd7798fe76813cc86c8df126dc Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2120165 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-17 01:49:27 -07:00
Thomas Fleury	af2ccb811d	gpu: nvgpu: add SET_CTX_MMU_DEBUG_MODE ioctl Added NVGPU_DBG_GPU_IOCTL_SET_CTX_MMU_DEBUG_MODE ioctl to set MMU debug mode for a given context. Added gr.set_mmu_debug_mode HAL to change NV_PGPC_PRI_MMU_DEBUG_CTRL for a given channel. HAL implementation for native case is gm20b_gr_set_mmu_debug_mode. It internally uses regops, which directly writes to the register if the context is resident, or writes to gr context otherwise. Added NVGPU_SUPPORT_SET_CTX_MMU_DEBUG_MODE to enable the feature. NV_PGPC_PRI_MMU_DEBUG_CTRL has to be context switched in FECS ucode, so the feature is only enabled on TU104 for now. Bug 2515097 Change-Id: Ib4efaf06fc47a8539b4474f94c68c20ce225263f Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2110720 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-16 15:15:18 -07:00
Seema Khowala	671f1c8a36	gpu: nvgpu: channel MISRA fix for Rule 21.2 Rename _gk20a_channel_get -> nvgpu_channel_get__func gk20a_channel_get -> nvgpu_channel_get _gk20a_channel_put -> nvgpu_channel_put__func gk20a_channel_put -> nvgpu_channel_put trace_gk20a_channel_get -> trace_nvgpu_channel_get trace_gk20a_channel_put -> trace_nvgpu_channel_put JIRA NVGPU-3388 Change-Id: I4e37adddbb5ce14aa18132722719ca2f73f1ba52 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2114118 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-09 04:39:34 -07:00
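
The MMU debug mode refcounting described in commits a1248d87fe and f422aee393 can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the driver's code: the `sketch_*` types and functions are hypothetical stand-ins, and the `hw_debug_mode` flag merely models the debug control register that the real HAL would program.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, simplified stand-ins; not the real nvgpu structures. */
struct sketch_tsg {
	uint32_t mmu_debug_mode_refcnt; /* enable requests from all channels in the TSG */
	bool hw_debug_mode;             /* models the MMU debug control register state */
};

struct sketch_channel {
	struct sketch_tsg *tsg;
	uint32_t mmu_debug_mode_refcnt; /* outstanding enable calls on this channel */
};

/* Returns 0 on success, -1 if a disable has no matching enable. */
int sketch_set_mmu_debug_mode(struct sketch_channel *ch, bool enable)
{
	struct sketch_tsg *tsg = ch->tsg;

	/* The TSG count is the sum of its channels' counts. */
	assert(tsg->mmu_debug_mode_refcnt >= ch->mmu_debug_mode_refcnt);

	if (enable) {
		ch->mmu_debug_mode_refcnt++;
		tsg->mmu_debug_mode_refcnt++;
	} else {
		if (ch->mmu_debug_mode_refcnt == 0U) {
			return -1;
		}
		ch->mmu_debug_mode_refcnt--;
		tsg->mmu_debug_mode_refcnt--;
	}

	/* Debug mode stays on while at least one request is outstanding. */
	tsg->hw_debug_mode = (tsg->mmu_debug_mode_refcnt > 0U);
	return 0;
}

/* On unbind, drop every reference the channel still holds. */
void sketch_channel_unbind(struct sketch_channel *ch)
{
	while (ch->mmu_debug_mode_refcnt > 0U) {
		(void)sketch_set_mmu_debug_mode(ch, false);
	}
}
```

In this sketch, unbinding one channel leaves debug mode enabled as long as another channel in the same TSG still holds a reference, which is the behaviour the two commit messages describe.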

101 Commits