linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 17:36:20 +03:00

Author	SHA1	Message	Date
Seema Khowala	1f54ea09e3	gpu: nvgpu: rename has_timedout and make it thread safe Currently has_timedout variable is protected by wmb at places where it is being set and there is no correspoding rmb whenever has_timedout variable is read. This is prone to errors for concurrent execution. This change is supposed to fix this issue. Rename has_timedout variable of channel struct to ch_timedout. Also to avoid rmb every time ch_timedout is read, ch_timedout_spinlock is added to protect ch_timedout variable for taking care of concurrent execution. Bug 2404865 Bug 2092051 Change-Id: I0bee9f50af0a48720aa8b54cbc3af97ef9f6df00 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1930935 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 15:35:57 -08:00
smadhavan	503b897b45	gpu: nvgpu: Fix MISRA rule 8.3 violations MISRA rule 8.3 requires that all declarations of a function shall use the same parameter names and type qualifiers. There are cases where the parameter names do not match between function prototype and declaration. This patch will fix some of these violations by renaming the prototype parameter. JIRA NVGPU-847 Change-Id: I980ca7ba8adc853de9c1b6f6c7e7b3e4ac12f88e Signed-off-by: smadhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1926980 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 15:35:47 -08:00
Anup Mahindre	a6138b7810	gpu: nvgpu: Add a characteristics flag to denote FECS tracing support Add a flag to nvgpu_gpu_characteristics to expose FECS tracing capability to userspace. This is required for adding nvrm_gpu APIs for CTXSW set of IOCTLs which were requested in several bugs. nvrm_gpu APIs would query this flag to check the availability of IOCTLs. Bug 2169678 Bug 2169677 Bug 2169675 Bug 2169674 Bug 2169673 Bug 2168342 Change-Id: Ie6ba80a4144637546b97fa93baae67b8d0c4d425 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1950559 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 02:53:39 -08:00
Scott Long	e24df49765	gpu: nvgpu: nvgpu_memcpy changes to linux os code MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs to qualified/unqualified types. To circumvent this issue we've introduced a new MISRA-compliant nvgpu_memcpy() function. While linux os code does not need to be MISRA-compliant this change switches over all memcpy() uses to nvgpu_memcpy() with appropriate casts applied to maintain consistency within the nvgpu source base. JIRA NVGPU-849 Change-Id: I2c21a7845df5709dafa19508c121f8afa27cc4fc Signed-off-by: Scott Long <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1950995 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 21:44:35 -08:00
Terje Bergstrom	2dc48ceba1	gpu: nvgpu: Remove pmgr.h dependency from gk20a.h gk20a.h depends on definition of struct pmgr_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: I7ced14d6629e033b0ccef3a93a3dbf099e43ba4c Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1946662 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:34:06 -08:00
Terje Bergstrom	07760eb9a1	gpu: nvgpu: Remove clk.h dependency from gk20a.h gk20a.h depends on definition of struct clk_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: Iafe7d72a6fd31543653e0e10e2d2e552b6c3514b Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1945286 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:33:38 -08:00
Terje Bergstrom	bca27e31e3	gpu: nvgpu: Fix clk_gp106.h and clk_gv100.h headers clk_gp106.h and clk_gv100.h define conflicting symbols, which prevent including them both at the same time. One of the conflicting structs is namemap_cfg, which has different definitions in clk_gp106.h and include/nvgpu/clk.h. Move all constants used only by clk_*.c to be defined there, delete the extra namemap_cfg structure definition, and modify code to cope with the unified namemap_cfg. JIRa NVGPU-596 Change-Id: Id68919da4567ec1507eda0cfaa19bf047a7bfc59 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1945285 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:33:29 -08:00
Richard Zhao	f2cb8c5d2e	gpu: nvgpu: vgpu: unify fecs trace move fecs_trace_vgpu.c to be common, leaving only few functions os specific. struct gk20a_fecs_trace_header was moved to header, to share with os specific code. Jira EVLR-3275 Change-Id: I372aeb539cbca3abb87e997c9e35e6d682f9cb96 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1831991 GVS: Gerrit_Virtual_Submit Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: Nirav Patel <nipatel@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-13 19:13:37 -08:00
Sai Nikhil	94e00ab6ad	gpu: nvgpu: gk20a: fix MISRA 10.4 Violations [1/2] MISRA Rule 10.4 only allows the usage of arithmetic operations on operands of the same essential type category. Adding "U" at the end of the integer literals to have same type of operands when an arithmetic operation is performed. This fixes violation where an arithmetic operation is performed on signed and unsigned int types. JIRA NVGPU-992 Change-Id: Ifb8cb992a5cb9b04440f162918a8ed2ae17ec928 Signed-off-by: Sai Nikhil <snikhil@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1822587 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-09 13:27:08 -08:00
Terje Bergstrom	88e374d5eb	gpu: nvgpu: Move gk20a.c to os/linux gk20a.c is used only in Linux build. It's in theory common code, but in practice implements OS specific policies. Also implement os/posix/gk20a.c to implement gk20a_init_gpu_characteristics(), gk20a_get() and gk20a_put() which are called from common code. JIRA NVGPU-596 Change-Id: I6a6079ca6d4c6a225f0dd0e1cd7c439333a704bf Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1944884 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:44:18 -08:00
Alex Waterman	032b37bee5	gpu: nvgpu: Update debug crash dump Update the debug crash dump to be clearer, more concise and avoid many of the misformatting issues that have crept in over the last couple years. This also changes the debug prints to move from pr_err() in the Linux kernel to nvgpu_err(). This makes it easier to filter all nvgpu messages in a log file with a single grep command. Change-Id: I00ca9e6c32da7a79c8f6903a139bf6b43e89618a Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1940515 GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:42:38 -08:00
Alex Waterman	ac5763eb0c	gpu: nvgpu: Re-order the debug output Originally the order for output was: 1. Dump platform deps (sync-points/host1x stuff) 2. Dump PBDMA status 3. Dump engine status 4. Dump channel status The updated ordering is: 1. Dump channel status 2. Dump PBDMA status 3. Dump engine status 4. Dump platform deps (sync-points/host1x stuff) The purpose of this is to put the useful information first and relegate the less useful info to later in the dump. We naturally scan downwards and treat stuff at the top as most important. The end goal is to make the debug dump as useful in as little time as possible. So instead of making an engineer dig through a complex jumble of information to find the useful stuff the hope is that the useful stuff is immediately available. Change-Id: I9d2b755676b7e5dc2f8949f14dc36f3d337e2a3f Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1940514 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:42:34 -08:00
Alex Waterman	7222826680	gpu: nvgpu: Return bool from nvgpu_log_mask_enabled This function returns a boolean describing if a given log mask is enabled for a given GPU. Previously this returned and int but the bool type is far better suited for this. Also implement this function in posix, as it may be useful to have implemented there if any common code chooses to use this function. Change-Id: I7382e73df83282763df1bdbccbbb219c9f3e6f1b Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1938341 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:42:14 -08:00
Terje Bergstrom	7525c1337b	gpu: nvgpu: Remove the GPU-NEXT conditional Remove build conditional for GPU-NEXT. It was used for including code for tu104, but now it's part of main nvgpu. Leave a TURING conditional to not need Turing code in other builds. JIRA NVGPU-961 Change-Id: I74177863c451d78b6db6165249561f15eadc3cc3 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1936803 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 19:35:09 -08:00
Amurthyreddy	710aab6ba4	gpu: nvgpu: MISRA 14.4 boolean fixes MISRA rule 14.4 doesn't allow the usage of non-boolean variable as boolean in the controlling expression of an if statement or an iteration statement. Fix violations where a non-boolean variable is used as a boolean in the controlling expression of if and loop statements. JIRA NVGPU-1022 Change-Id: I957f8ca1fa0eb00928c476960da1e6e420781c09 Signed-off-by: Amurthyreddy <amurthyreddy@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1941002 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-07 10:35:13 -08:00
Konsta Holtta	513cb21f26	gpu: nvgpu: move doorbell token number to HAL Add a fifo HAL for querying the doorbell token of a specific channel and call it instead of doing the calculation directly. For Volta the token is just the channel id plus the possible base number. Bug 200145225 Change-Id: Ifbb150191575fdc72e413a14c799cab7e52d8c14 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1849639 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-06 21:56:26 -08:00
Srirangan Madhavan	ef5fdac7a6	gpu: nvgpu: Fix MISRA rule 15.6 violations MISRA Rule-15.6 requires that all if-else blocks and loop blocks be enclosed in braces, including single statement blocks. Fix errors due to single statement if-else and loop blocks without braces by introducing the braces. JIRA NVGPU-775 Change-Id: Ib70621d39735abae3fd2eb7ccf77f36125e2d7b7 Signed-off-by: Srirangan Madhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928745 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-05 22:13:16 -08:00
Nicolas Benech	b9e7ea65e1	gpu: nvgpu: Fix LibC MISRA 17.7 in os/linux MISRA Rule-17.7 requires the return value of all functions to be used. Fix is either to use the return value or change the function to return void. This patch contains fix for all 17.7 violations instandard C functions in OS/Linux interface. JIRA NVGPU-1036 Change-Id: I39b20f1d0e1a1da56d452f2c3d5ee049666cefe8 Signed-off-by: Nicolas Benech <nbenech@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1929900 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-31 15:25:23 -07:00
Konsta Holtta	b08c613402	gpu: nvgpu: make gr_ctx a pointer in tsg Remove a dependency to a graphics type in tsg header by adding a pointer indirection. Jira NVGPU-967 Jira NVGPU-1149 Change-Id: I9177e6eedf08bfe4a3b981b67fa8d4d734f9e50f Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1822023 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-30 05:54:10 -07:00
Konsta Holtta	37659f5c8e	gpu: nvgpu: mark usermode submit supported for gv11b Mark usermode submit supported in gv11b and add the characteristics flag to expose the capability to userspace. Bug 200145225 Change-Id: Id9dcb0c71c020bd509fbdbffb94a756c69377f20 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1795822 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:47 -07:00
Konsta Holtta	99b1c6dcdf	gpu: nvgpu: support usermode submit buffers Import userd and gpfifo buffers from userspace if provided via NVGPU_IOCTL_CHANNEL_ALLOC_GPFIFO_EX. Also supply the work submit token (i.e., the hw channel id) to userspace. To keep the buffers alive, store their dmabuf and attachment/sgt handles in nvgpu_channel_linux. Our nvgpu_mem doesn't provide such data for buffers that are mainly in kernel use. The buffers are freed via a new API in the os_channel interface. Fix a bug in gk20a_channel_free_usermode_buffers: also unmap the usermode gpfifo buffer. Bug 200145225 Change-Id: I8416af7085c91b044ac8ccd9faa38e2a6d0c3946 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1795821 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:43 -07:00
Konsta Holtta	9de6d20abb	gpu: nvgpu: add FOREIGN_SGT mem flag Add an internal flag NVGPU_MEM_FLAG_FOREIGN_SGT to specify that the sgt member of an nvgpu_mem must not be freed when the nvgpu_mem is freed. Bug 200145225 Change-Id: I044fb91a5f9d148f38fb0cbf63d0cdfd64a070ce Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1819801 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:34 -07:00
Konsta Holtta	f33935f426	gpu: nvgpu: provide usermode region via mmap Add a mmap callback on the control device node for mapping the usermode register region to userspace. Each such mapping is removed when the GPU railgates, and brought back again on unrailgate. The mapping offset must be 0 and its size must be 4 KB. Bug 200145225 Change-Id: Ie8d3758da745b958376292691d7d1d02a24e7815 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1795819 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:25 -07:00
Konsta Holtta	d53495400e	gpu: nvgpu: track opened Linux ctrl files An upcoming patch will need to enumerate opened ctrl nodes; track them in a list, protected by a mutex. Bug 200145225 Change-Id: I50dc15056832a3bb53fbdd7bd2bffcdaecc7b21c Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1811840 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:16 -07:00
Konsta Holtta	38c11db264	gpu: nvgpu: store bus addr of gpu regs Usermode submit needs to access the usermode region of registers from userspace. Store the start address of register resource in struct nvgpu_os_linux to be used in remap to userspace. Bug 200145225 Change-Id: I3796b6bf67942af0cc16c86accb82a013032bfc8 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1811838 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:03 -07:00
Deepak Nibade	0d065df144	gpu: nvgpu: tu104: enable SLCG/BLCG Enable SLCG/BLCG for TU104 device by setting corresponding flags in platform data Jira NVGPUT-108 Bug 200456693 Change-Id: I47e097f96c9056dcd0747897614fc316073291ad Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1934326 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-26 02:13:21 -07:00
Debarshi Dutta	6fe9bb835b	gpu: nvgpu: access channel_sync via public API struct nvgpu_channel_sync is moved to a private header i.e. channel_sync_priv.h present in common/sync/. All accesses to callback functions inside the struct nvgpu_channel_sync in NVGPU driver is replaced by the public channel_sync specific APIs. Jira NVGPU-1093 Change-Id: I52d57b3d458993203a3ac6b160fb569effbe5a66 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1929783 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-26 02:12:23 -07:00
Nitin Kumbhar	84e13ce30c	gpu: nvgpu: add shutdown callback for dgpu The nvlink needs to be de-initialized as part of system shutdown or reboot. Add the shutdown callback of pci driver and use it to trigger nvlink de-initialization. Bug 200422323 Change-Id: Iec8193d9665bc77ddbf3680ea130dfa4c1b3b0ad Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928821 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-26 01:37:43 -07:00
aalex	80d03f34f7	gpu: nvgpu: Fix IPA to PA translation Background: In Hypervisor mode dGPU device is configured in pass through mode for the Guest (QNX/Linux). GMMU programming is handled by the guest which converts a mapped buffer's GVA into SGLes in IPA (Intermediate/Guest Physical address) which is then translated into PA (Acutual Physical address) and programs the GMMU PTEes with correct GVA to PA mapping. Incase of the vgpu this work is delegated to the RM server which takes care of the GMMU programming and IPA to PA conversion. Problem: The current GMMU mapping logic in the guest assumes that PA range is continuous over a given IPA range. Hence, it doesn't account for holes being present in the PA range. But this is not the case, a continous IPA range can be mapped to dis-contiguous PA ranges. In this situation the mapping logic sets up GMMU PTEes ignoring the holes in physical memory and creates GVA => PA mapping which intrudes into the PA ranges which are reserved. This results in memory being corrupted. This change takes into account holes being present in a given PA range and for a given IPA range it also identifies the discontiguous PA ranges and sets up the PTE's appropriately. Bug 200451447 Jira VQRM-5069 Change-Id: I354d984f6c44482e4576a173fce1e90ab52283ac Signed-off-by: aalex <aalex@nvidia.com> Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1850972 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-24 23:16:20 -07:00
Mahantesh Kumbar	09652f1ebf	gpu: nvgpu: remove sec2 as part of gk20a_remove_support -Add code to remove/free sec2 data as part of gk20a_remove_support() by calling sec2->remove_support() JIRA NVGPUT-77 Change-Id: Id0804d929e2fe866a0e2a93eff8d8dac6b69bc6b Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1921518 GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-24 17:00:18 -07:00
Deepak Nibade	1b2a0833e0	gpu: nvgpu: add separate unit for debugger Rename gk20a/dbg_gpu_gk20a.c to common/debugger.c and make it a separate common unit Also rename gk20a/dbg_gpu_gk20a.h to include/nvgpu/debugger.h We had two different HALs for debugger - gops.debugger and gops.dbg_session_ops Combine them into one single HAL gops.debugger and remove gops.dbg_session_ops Rename all exported APIs from debugger.h to be in the form of nvgpu_*() Jira NVGPU-1013 Change-Id: I136dc7786e3b2065921eb03b99f16049212f3cd2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1920075 Reviewed-by: Sachin Jadhav <sachinj@nvidia.com> Tested-by: Sachin Jadhav <sachinj@nvidia.com>	2018-10-24 00:30:19 -07:00
Konsta Holtta	e0c8a16c8d	gpu: nvgpu: Add CHANNEL_SETUP_BIND IOCTL For a long time now, the ALLOC_GPFIFO_EX channel IOCTL has done much more than just gpfifo allocation, and its signature does not match support that's needed soon. Add a new one called SETUP_BIND to hopefully cover our future needs and deprecate ALLOC_GPFIFO_EX. Change nvgpu internals to match this new naming as well. Bug 200145225 Change-Id: I766f9283a064e140656f6004b2b766db70bd6cad Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1835186 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-19 17:24:49 -07:00
Nicolin Chen	0fd9c84f87	gpu: nvgpu: Define functions static if DEBUG_FS=n When turning off CONFIG_DEBUG_FS, there are build errors: drivers/gpu/nvgpu/os/linux/os_ops_gp106.o: In function `nvgpu_fecs_trace_init_debugfs': os_ops_gp106.c:(.text+0x8): multiple definition of `nvgpu_fecs_trace_init_debugfs' drivers/gpu/nvgpu/os/linux/os_ops_gp10b.o:os_ops_gp10b.c:(.text+0x0): first defined here drivers/gpu/nvgpu/os/linux/os_ops_gv100.o: In function `gp106_therm_init_debugfs': os_ops_gv100.c:(.text+0x0): multiple definition of `gp106_therm_init_debugfs' drivers/gpu/nvgpu/os/linux/os_ops_gp106.o:os_ops_gp106.c:(.text+0x0): first defined here drivers/gpu/nvgpu/os/linux/os_ops_tu104.o: In function `gv100_clk_init_debugfs': os_ops_tu104.c:(.text+0x0): multiple definition of `gv100_clk_init_debugfs' drivers/gpu/nvgpu/os/linux/os_ops_gv100.o:os_ops_gv100.c:(.text+0x10): first defined here This is because those functions aren't marked as static. So this patch just simply fixes the bug. Bug 2284925 Change-Id: I1da39345c653dfb50c509adb0c822b4657646c56 Signed-off-by: Nicolin Chen <nicolinc@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1929355 Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-19 08:39:43 -07:00
Philip Elcan	1c7bb9b538	gpu: nvgpu: channel: make chid u32 The chid member of the channel_gk20a struct was being used as a unsigned value. By being declared as an int, it was causing MISRA 10.3 violations for implicit assignment of different types. JIRA NVGPU-647 Change-Id: I7477fad6f0c837cf7ede1dba803158b1dda717af Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1918470 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 16:47:17 -07:00
Philip Elcan	f5cac144a0	gpu: nvgpu: make tsgid a consistent type Different units were declaring tsgid as int or u32. This makes everyone use u32. This change resolves MISRA 10.3 violations for implicit assingment to different types. JIRA NVGPU-647 Change-Id: I78660e737acb0dad76dd538e5dd37f4527cf5acd Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1918469 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 16:47:07 -07:00
ddutta	80b5e2b8d6	gpu: nvgpu: remove os_fence dependency from channel_sync Move the wait_cmd_buffer programming for channel_sync->wait_fd to channel_sync.c. nvgpu_os_fence->program_waits interface is now removed. channel_sync can directly retrieve syncpt/semaphore from the interfaces of struct nvgpu_os_fence_syncpt and struct nvgpu_os_fence_sema and use it for the wait programming. Also, change int to u32 for some variables such as num_fences, max_wait_size and wait_cmd_size. Jira NVGPU-1093 Change-Id: I19c1b10d676caff49ce57861091f7f0ea65e7676 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1829719 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 15:34:13 -07:00
ddutta	9f948ed07f	gpu: nvgpu: add accessor methods to underlying objects of nvgpu_os_fence channel_sync->wait_fd depends upon nvgpu_os_fence->program_waits which invokes a channel_sync method and this leads to a circular dependency. In order to resolve the above, constructed struct nvgpu_os_fence_sema and struct nvgpu_os_fence_syncpt with interfaces that support conversion between struct nvgpu_os_fence to above. Also, added the following interfaces for retrieving syncpts and semaphore from the above structs respectively. void nvgpu_os_fence_sema_extract_nth_semaphore(...) int nvgpu_os_fence_sema_get_num_semaphores(...) void nvgpu_os_fence_syncpt_extract_nth_syncpt(...) int nvgpu_os_fence_syncpt_get_num_syncpoints(...) These enable channel_sync code to directly program the cmd_bufs based on the syncpts and semaphore received using the above APIs instead of the current state of doing the wait programming from within nvgpu_os_fence's interfaces. Jira NVGPU-1093 Change-Id: Ie411f0ba60bca38f66a0024f5dfca03ef0b836eb Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1827475 Reviewed-by: Konsta Holtta <kholtta@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 15:34:09 -07:00
Karl Ding	ee0a987dfd	gpu: nvgpu: vgpu: properly set dma mask Properly set the dma_mask and coherent_dma_mask for vgpu instead of using the default 32-bit mask. This fixes the dma_capable check that was previously failing. Bug 2412352 Change-Id: If1d5d74333f86855f8041cc199a04b4b8eb521b5 Signed-off-by: Karl Ding <kding@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1924967 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Nirav Patel <nipatel@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 05:55:02 -07:00
Debarshi Dutta	435892a784	gpu: nvgpu: initialize boolean to prevent UBScan bugs UBSan flags the error "load of value 255 is not a valid value for type '_Bool'". This is caused due to unitialized boolean value as given in the UBSan specification i.e. the following check -fsanitize=bool: Load of a bool value which is neither true nor false. Bug 200452078 Change-Id: I262320fd72960b41951f6b9c99f64400457d9790 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1923241 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Ashish Mhetre <amhetre@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:12 +05:30
Nitin Kumbhar	dcb2a34200	gpu: nvgpu: fix circular dep of ce2 and gk20a headers struct gk20a from gk20a.h needs defination of struct gk20a_ce_app and ce2_gk20a.h needs defination of struct gk20a. This creates a circular dependency. Fix this by making gk20a_ce_app a pointer to skip knowing the complete type details and using forward declarations for struct gk20a_ce_app and struct gk20a. The gk20a_ce_app pointer is alloc'ed in gk20a_init_ce_support() and free'ed in gk20a_ce_destroy. JIRA NVGPU-611 Change-Id: I4d62d5f2b2d1492db73bae69f90a1fe5586fba76 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1917945 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:11 +05:30
Deepak Nibade	92c1949392	gpu: nvgpu: add separate unit for cyclestats_snapshot Add new separate unit common/perf/cyclestats_snapshot.c and add corresponding header file include/nvgpu/cyclestats_snapshot.h This unit is h/w independent and simply calls gops.perf.* HALs exposed by perf unit to do the h/w configurations Also remove gv11b/css_gr_gv11b.* files as h/w specific sequence implemented in them is already moved to perf unit Rename all cyclestats_snapshot HALs in the form nvgpu_css_*() Jira NVGPU-1103 Change-Id: I303f6becb313ac918e06c495a5fe299947a1f0b1 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1916652 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:11 +05:30
aalex	e1a4bc8401	Revert "Revert "gpu: nvgpu: refactor SET_SM_EXCEPTION_MASK ioctl"" This patch was reverted as the "set_sm_exception_type_mask" HAL assignment for gp10b was missing causing regression on Pascal platform. Added missing gp10b HAL assignment for setting SM exception mask. Bug 200447406 This reverts commit `ce5228e094`. Change-Id: Ic48f4661fd4b6100310f8b4d23d902847e31f5df Signed-off-by: aalex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1837653 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> Tested-by: Sandarbh Jain <sanjain@nvidia.com> Reviewed-by: Nirav Patel <nipatel@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:10 +05:30
Nitin Kumbhar	ff3cafa134	gpu: nvgpu: add nvgpu power off/on sysfs nodes Add sysfs nodes to manage power of dGPU. Writing pci dev name to poweroff/poweron sysfs node powers off/on dGPU. The format of pci dev name is DDDD:BB:DD.F i.e. domain:bus:device.function echo 0001:01:00.0 > /sys/bus/pci/drivers/nvgpu/poweroff echo 0001:01:00.0 > /sys/bus/pci/drivers/nvgpu/poweron The permissions of nodes are set such that only root user can write to the sysfs node to control dGPU power state. JIRA NVGPU-1100 Change-Id: I904881cab58c5f553e94510a3a10000194238433 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1749848 Reviewed-by: Deepak Nibade <dnibade@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Bharat Nihalani <bnihalani@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:10 +05:30
Nitin Kumbhar	8c7b542810	gpu: nvgpu: capture stats for pci gpu power off/on Use a debugfs node to export statistics of dgpu power on and power off events. The stats capture number of powerons and pwoeroffs, min/max/avg poweron and poweroff latency. JIRA NVGPU-1100 Change-Id: I7d8f9d6a5102478ec179d77f7072185ad32dda9b Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1833306 Reviewed-by: Deepak Nibade <dnibade@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Bharat Nihalani <bnihalani@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:10 +05:30
Nitin Kumbhar	237af3ef86	gpu: nvgpu: add interface to power on-off gpu The power rail of dGPU is managed with help of a set of GPIOs. Using those GPIOs add an interface to power off and power on dGPU. Before dGPU is powered off, new work is blocked by setting NVGPU_DRIVER_IS_DYING and current jobs are allowed to finish by waiting for gpu to be idle. The tegra PCIe controller driver provided APIs tegra_pcie_attach_controller() and tegra_pcie_detach_controller() are used to manage PCIe link shutdown, PCIe refclk management and PCIe rescan. JIRA NVGPU-1100 Change-Id: Ifae5b81535f40dceca5292a987d3daf6984f3210 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1749847 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:10 +05:30
Vaibhav Kachore	5d26d84ad5	gpu: nvgpu: fix memory leak in fecs ring setup - If fecs ring buffer is already allocated, and then if user calls fecs ring buffer ioctl, memory leak will occur. This patch fixes it. Bug 2293018 Change-Id: I4204b80a1b2b7891efdcb7f5a48485cc2f01ea43 Signed-off-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1850961 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:09 +05:30
Terje Bergstrom	3bda3a0678	Revert "Revert "gpu: nvgpu: add turing support"" This reverts commit 278842d6ff4e15467e0b8761c6e1b2a05f926f91. Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Change-Id: I37f47c137c048ddc3a728e143b6f30be525de120 Reviewed-on: https://git-master.nvidia.com/r/1918622	2018-10-12 17:35:09 +05:30
David Gilhooley	b74a4dbd26	Revert "gpu: nvgpu: add turing support" This reverts commit 27686d8b56316c7ad772dd91548e91516d59f3b1. Change-Id: Iebda705858edbd58c10ca3024a4ad060401485b6 Signed-off-by: David Gilhooley <dgilhooley@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1918612	2018-10-12 17:35:09 +05:30
Deepak Nibade	51244d6112	gpu: nvgpu: add turing support Add Turing specific common, unit, hardware header files Make all the Makefile and Makefile.sources changes to compile all Turing specific code Bug 200454999 Change-Id: I62ebff5c078b4b8817fc83ea0e4ee3cfffe668dc Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1917983 GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>	2018-10-12 17:35:09 +05:30
Thomas Fleury	2b4cd797b4	gpu: nvgpu: require vbios .18 for 0x1eba PCI device Mandate the VBIOS to be at least 90.04.18.00.xx which is the base ROM version for ES VBIOS for 0x1eba PCI device. Bug 200447617 Change-Id: I2387215c7de09186cc7a2daaed3c9444129752a3 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1821563 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30

1 2 3 4

178 Commits