Fix some mistakes from commit 0fbc1a2652 (gpu: nvgpu: avoid recursion in
runlist construction) and commit 998bf379df (gpu: nvgpu: add
runlist_append_tsg) for MISRA rules 10.3 and 10.4.
- cast a sizeof expression to u32 in a calculation so the operand sizes
match (see the sketch below),
- make the NVGPU_FIFO_RUNLIST_INTERLEAVE_LEVEL_* constants unsigned so
that comparisons match in signedness.
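A minimal sketch of both fixes; the variable names are made up, and the
constant values are shown as commonly defined:

  /*
   * sizeof() yields size_t; cast it to u32 so the multiplication is
   * performed in a single essential type.
   */
  u32 runlist_size = num_entries * (u32)sizeof(u32);

  /*
   * Unsigned interleave level constants make comparisons against u32
   * level values match in signedness.
   */
  #define NVGPU_FIFO_RUNLIST_INTERLEAVE_LEVEL_LOW    0U
  #define NVGPU_FIFO_RUNLIST_INTERLEAVE_LEVEL_MEDIUM 1U
  #define NVGPU_FIFO_RUNLIST_INTERLEAVE_LEVEL_HIGH   2U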
Jira NVGPU-1174
Change-Id: I00aa9758ca4352d8eb53a0e8ded42a1ba3b14561
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1938069
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reduce the size of memory allocations in the channel debug dump by
capturing only the necessary values from the instance block. This also
simplifies the allocation path slightly, at the cost of adding a
capture_channel_ram_dump HAL that reads the interesting fields into the
now smaller staging buffer beforehand.
Also rename struct ch_state to struct nvgpu_channel_dump_info.
Jira NVGPU-886
Change-Id: I5d7518d9d474b0b728b183383bc83d89ecf91b98
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1928207
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Sometimes 64-bit values are stored as pairs of low and high 32-bit
words in the same nvgpu_mem. Add a function that reads both words at
the given offsets and combines them with the necessary bit operations.
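A minimal sketch of such a helper, assuming the existing
nvgpu_mem_rd32() word accessor; the pair-reading name is illustrative:

  static inline u64 nvgpu_mem_rd32_pair(struct gk20a *g,
          struct nvgpu_mem *mem, u32 lo_word, u32 hi_word)
  {
      u64 lo = (u64)nvgpu_mem_rd32(g, mem, lo_word);
      u64 hi = (u64)nvgpu_mem_rd32(g, mem, hi_word);

      return lo | (hi << 32U);
  }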
Jira NVGPU-886
Change-Id: I4fe698d7c5a891ef5740ea0fb31549195e2e6c43
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1928206
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA rule 18.7 doesn't allow flexible array members. To work around
that, modify the instance block member in struct ch_state to be an
explicit pointer and allocate it separately for simplicity.
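Illustrative before/after; the surrounding fields are elided:

  /*
   * Before (disallowed by MISRA 18.7):
   *
   *    struct ch_state {
   *        int pid;
   *        u32 inst_block[];    -- tail of the same allocation
   *    };
   */

  /* After: explicit pointer, allocated separately. */
  struct ch_state {
      int pid;             /* other fields elided */
      u32 *inst_block;     /* allocated on its own */
  };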
Jira NVGPU-886
Change-Id: I34299bec79bf7706f9cdfa42dee7fba765c9f312
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1928205
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA rule 10.4 doesn't allow arithmetic conversions on operands of
different essential type category.
Fix violations where an arithmetic conversion is performed on operands
of boolean and non-boolean types.
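A hypothetical example of the pattern being fixed; names are made up:

  static u32 pack_enable_bit(bool enabled, u32 flags)
  {
      /*
       * Violation: "flags | enabled" mixes an essentially-boolean
       * operand with an unsigned one. Fixed by converting the
       * boolean explicitly first:
       */
      return flags | (enabled ? 1U : 0U);
  }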
JIRA NVGPU-994
Change-Id: I2af9937678462b632bb6ec6178e10d02104fc3bc
Signed-off-by: Amurthyreddy <amurthyreddy@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1832337
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
The only declaration needed from gr is struct nvgpu_gr_ctx, and it can
now be forward-declared. Do that and remove the gr_gk20a include; add
that include to gk20a.h where it was missing.
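The pattern, with a hypothetical containing struct:

  /*
   * A forward declaration suffices when the header only uses
   * pointers to the type; no gr_gk20a.h include is needed.
   */
  struct nvgpu_gr_ctx;

  struct example_holder {
      struct nvgpu_gr_ctx *gr_ctx;    /* pointer use is fine */
  };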
Jira NVGPU-967
Change-Id: Idf77e7dc2ca043e9bbde9e9db03306fdd37e3c04
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1822024
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Update the PD cache code to handle 64KB pages. To do this, the
number of partial/full lists is expanded for when we have 64K
pages. Currently only 4K and 64K page sizes are explicitly
supported; other page sizes (16K, for example) will fail compilation
during preprocessing.
This change also cleans up the definitions for some of the
internal structs. They have been moved into pd_cache.c since
they are not used outside of pd_cache.c.
This allows the following functions to be removed from the global
context:
__nvgpu_pd_cache_alloc_direct()
__nvgpu_pd_cache_free_direct()
They have been replaced by calls to nvgpu_pd_{alloc,free}().
The nvgpu_pd_mem_entry alloc_map also had to be expanded into a
real bitmap: 32 or 64 bits is not sufficient for packing 256-byte
PDs into a 64K page (there are 256 PDs per nvgpu_pd_mem_entry
in that case). To avoid doing too many find_first_zero
operations on the bitmap, an 'allocs' field was also added which
tracks how many allocations have been made. We can use this instead
of comparing a mask against the bitmap to determine whether an
nvgpu_pd_mem_entry is full.
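A sketch of the resulting layout and fullness check, sized for the
64K-page worst case described above; names are approximations of the
real ones:

  #define PD_ENTRIES_MAX 256U    /* 256-byte PDs in a 64K page */

  struct nvgpu_pd_mem_entry_sketch {
      unsigned long alloc_map[PD_ENTRIES_MAX / BITS_PER_LONG];
      u32 allocs;    /* live allocations in this entry */
  };

  /* Fullness becomes a counter compare, not a bitmap mask test. */
  static bool pd_mem_entry_is_full(struct nvgpu_pd_mem_entry_sketch *e)
  {
      return e->allocs == PD_ENTRIES_MAX;
  }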
Note: there's still a limitation in the TLB invalidate code:
it simply assumes an nvgpu_mem maps one-to-one to a PDB. This means
we can't invalidate a PDB allocated at an offset greater than 0
in an nvgpu_pd_mem_entry, which in turn means we must always use a
full page size for a context's PDB.
Bug 1977822
Change-Id: I6a7a3a95b7c902bc6487cba05fde58fbc4a25da5
Signed-off-by: Alex Waterman <alexw@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1718755
Reviewed-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs
to qualified/unqualified types.
To circumvent this issue we've introduced a new MISRA-compliant
nvgpu_memcpy() function.
This change switches all offending uses of memcpy() in lpwr/*.c code
over to use nvgpu_memcpy() with appropriate casts applied.
Also changed the bios image table ptrs to type u8 * from u32 *
to avoid unnecessary casts (note this is consistent with other
portions of nvgpu that walk the bios image tables).
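A sketch of the replacement pattern with a hypothetical caller and
example type, assuming nvgpu_memcpy()'s u8-pointer signature:

  static void copy_entry(struct pwr_entry *dst,
          const struct pwr_entry *src)
  {
      /* was: memcpy(dst, src, sizeof(*dst)); */
      nvgpu_memcpy((u8 *)dst, (const u8 *)src, sizeof(*dst));
  }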
JIRA NVGPU-849
Change-Id: I61da0e26872912cf6757ff2b6136ae5ddd84c309
Signed-off-by: Scott Long <scottl@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1936182
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs
to qualified/unqualified types.
To circumvent this issue we've introduced a new MISRA-compliant
nvgpu_memcpy() function.
This change switches all offending uses of memcpy() in pmgr/*.c code
over to use nvgpu_memcpy() with appropriate casts applied.
JIRA NVGPU-849
Change-Id: I949f8cff49755c4cbe328ae628cf71ace18622c7
Signed-off-by: Scott Long <scottl@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1936181
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Nicolas Benech <nbenech@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs
to qualified/unqualified types.
To circumvent this issue we've introduced a new MISRA-compliant
nvgpu_memcpy() function.
This change switches all offending uses of memcpy() in therm/*.c code
over to use nvgpu_memcpy() with appropriate casts applied.
JIRA NVGPU-849
Change-Id: I6ef4010d35c3f6df8a23e8c0241883481880e1c7
Signed-off-by: Scott Long <scottl@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1936180
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Automatic_Commit_Validation_User
GVS: Gerrit_Virtual_Submit
Reviewed-by: Nicolas Benech <nbenech@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs
to qualified/unqualified types.
To circumvent this issue we've introduced a new MISRA-compliant
nvgpu_memcpy() function.
This change switches all offending uses of memcpy() in volt/*.c code
over to use nvgpu_memcpy() with appropriate casts applied.
JIRA NVGPU-849
Change-Id: Ia58f6dddfead31fdd08361a17e6a15e0f0ed29b7
Signed-off-by: Scott Long <scottl@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1936179
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Nicolas Benech <nbenech@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
If the number of in-flight jobs (num_in_flight) is set, use it to
determine the proper size of the priv_cmd_buf. If num_in_flight is
not set, use the original logic: the priv_cmd_buf is sized based on
a worst-case assumption for the GPFIFO.
Also clean up MISRA issues.
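A sketch of the sizing decision; the function name and per-job size
constant are illustrative, not the actual code:

  #define PRIV_CMD_PER_JOB 32U    /* made-up per-job entry count */

  static u32 priv_cmd_buf_entries(u32 num_in_flight, u32 gpfifo_entries)
  {
      if (num_in_flight != 0U) {
          /* userspace told us the job depth: size for that */
          return num_in_flight * PRIV_CMD_PER_JOB;
      }
      /* fall back to the worst case: every gpfifo entry is a job */
      return gpfifo_entries * PRIV_CMD_PER_JOB;
  }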
Bug 2327792
Change-Id: Ie192caeb6cc48fdcac57e5cbb71c534aeaf46011
Signed-off-by: Alex Waterman <alexw@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1831095
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Instead of using num_inflight_jobs alone to determine whether to
pre-allocate resources for a channel, use the c->deterministic flag
together with the number of in-flight jobs. Non-deterministic channels
do not require pre-allocated resources, and deterministic channels with
0 in-flight jobs (i.e., no kernel job tracking, a.k.a. fast path
submits) also do not require pre-allocated resources.
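The new condition, sketched with illustrative names:

  static bool channel_needs_prealloc(bool deterministic,
          u32 num_inflight_jobs)
  {
      /*
       * Only deterministic channels with kernel job tracking
       * (a nonzero in-flight count) pre-allocate resources.
       */
      return deterministic && (num_inflight_jobs != 0U);
  }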
Bug 2327792
Change-Id: I7e8eb0478c22e005ca2c46c555415afa0ded0be1
Signed-off-by: Alex Waterman <alexw@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1850123
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Import userd and gpfifo buffers from userspace if provided via
NVGPU_IOCTL_CHANNEL_ALLOC_GPFIFO_EX. Also supply the work submit token
(i.e., the hw channel id) to userspace.
To keep the buffers alive, store their dmabuf and attachment/sgt handles
in nvgpu_channel_linux. Our nvgpu_mem doesn't provide such data for
buffers that are mainly in kernel use. The buffers are freed via a new
API in the os_channel interface.
Fix a bug in gk20a_channel_free_usermode_buffers: also unmap the
usermode gpfifo buffer.
Bug 200145225
Change-Id: I8416af7085c91b044ac8ccd9faa38e2a6d0c3946
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1795821
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Add an mmap callback on the control device node for mapping the
usermode register region to userspace. Each such mapping is removed
when the GPU railgates and brought back again on unrailgate.
The mapping offset must be 0 and its size must be 4 KB.
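A sketch of the constraint check, assuming the usual Linux
vm_area_struct fields; this is not the actual callback:

  #include <linux/mm.h>
  #include <linux/sizes.h>

  static int usermode_mmap_check(struct vm_area_struct *vma)
  {
      /* only offset 0 and a 4 KB length are accepted */
      if (vma->vm_pgoff != 0UL)
          return -EINVAL;
      if ((vma->vm_end - vma->vm_start) != SZ_4K)
          return -EINVAL;
      return 0;
  }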
Bug 200145225
Change-Id: Ie8d3758da745b958376292691d7d1d02a24e7815
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1795819
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Usermode submit needs to access the usermode region of registers from
userspace. Store the start address of the register resource in struct
nvgpu_os_linux for use when remapping it to userspace.
Bug 200145225
Change-Id: I3796b6bf67942af0cc16c86accb82a013032bfc8
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1811838
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA rule 21.2 doesn't allow the use of macro names which start with
an underscore, so these leading underscores are to be removed from the
macro names. This patch fixes such violations by renaming the macros
to follow the convention 'NVGPU_PARENT-DIR_HEADER-NAME' when there
is no keyword repetition between the file name and its directory, or
'NVGPU_HEADER-NAME' when there is repetition.
Deviate from this convention in the case of clk.h. There are two files
named clk.h: one under the include/nvgpu directory and the other in
the clk/ directory. Under the convention, both would get header guards
named NVGPU_CLK_H. Thus clk/clk.h gets the header guard
NVGPU_CLK_CLK_H and include/nvgpu/clk.h gets the header guard
NVGPU_INCLUDE_CLK_H.
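For example:

  /* clk/clk.h */
  #ifndef NVGPU_CLK_CLK_H
  #define NVGPU_CLK_CLK_H
  /* declarations */
  #endif /* NVGPU_CLK_CLK_H */

  /* include/nvgpu/clk.h */
  #ifndef NVGPU_INCLUDE_CLK_H
  #define NVGPU_INCLUDE_CLK_H
  /* declarations */
  #endif /* NVGPU_INCLUDE_CLK_H */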
JIRA NVGPU-1028
Change-Id: I386d39649f39a98019a9e06bc65c8b623a43629b
Signed-off-by: smadhavan <smadhavan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1923539
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Konsta Holtta <kholtta@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
The context switch timeout works by a hardware timeout triggering at
10 Hz. When handling these events, we check whether a channel has
actually timed out. Currently the timeout limit can be shorter than the
10 Hz interval, which always causes us to recover the channel, but the
same pass would also detect progress if any had been made within the
interval.
Handling both situations at the same time would reuse the
function-local channel pointer after the loop has finished and would
cause memory corruption. Fix this by making the two branches mutually
exclusive, and move the recovery case first because that is how our
tests assume things to work.
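The shape of the fix, with illustrative helper names:

  struct channel_gk20a;
  bool ch_timed_out(struct channel_gk20a *ch);
  bool ch_made_progress(struct channel_gk20a *ch);
  void ch_recover(struct channel_gk20a *ch);
  void ch_rearm_timeout(struct channel_gk20a *ch);

  static void handle_ctxsw_timeout(struct channel_gk20a *ch)
  {
      if (ch_timed_out(ch)) {
          /* recovery first: this is what the tests assume */
          ch_recover(ch);
      } else if (ch_made_progress(ch)) {
          /* progress within the interval: rearm and move on */
          ch_rearm_timeout(ch);
      }
      /* the channel pointer is not reused after this point */
  }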
Jira NVGPU-967
Change-Id: I26aa0fa7fd80ab42a9a1a93a6cca2cd29c9d3f3f
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1932449
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs
to qualified/unqualified types.
To circumvent this issue we've introduced a new MISRA-compliant
nvgpu_memcpy() function.
This change switches all offending uses of memcpy() in bios.c over to
use nvgpu_memcpy() with appropriate casts applied.
JIRA NVGPU-849
Change-Id: I218068dd1f1f2c978ea8ab2ba230b3e5fb3da171
Signed-off-by: Scott Long <scottl@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1936149
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
This fixes a bug where the POSIX dma module was calling malloc with
an incorrect size for dma allocation requests.
Bug found as a result of JIRA NVGPU-1235
Change-Id: Ia2933dc90f12d84a7da91a50eb3eafd6eb0e129e
Signed-off-by: Philip Elcan <pelcan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1930799
GVS: Gerrit_Virtual_Submit
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA rule 21.2 doesn't allow the use of macro names which start with
an underscore. These leading underscores are to be removed from the
macro names. This patch will fix such violations in hw headers
by renaming them to follow the convention, 'NVGPU_HEADER_NAME'.
JIRA NVGPU-1028
Change-Id: Ib335e3e08b5c4e0d28855e4a057c5b6aaa7a35a2
Signed-off-by: smadhavan <smadhavan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1923546
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Konsta Holtta <kholtta@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Adeel Raza <araza@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA rule 8.2 makes it mandatory for all function prototypes
to have named parameters. There were a few instances where parameter
names were omitted from function prototypes; this patch fixes them.
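For instance (hypothetical prototype):

  /* Violation: the parameter is unnamed. */
  int nvgpu_example_init(struct gk20a *);

  /* Fixed: the parameter is named. */
  int nvgpu_example_init(struct gk20a *g);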
JIRA NVGPU-861
Change-Id: I6cb28482becc2938c574b7d8c6f22463d346d27a
Signed-off-by: smadhavan <smadhavan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1917939
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
There are two places where C standard identifiers are redefined,
which is forbidden by MISRA rule 21.2. This patch fixes the two
violations, 'div' and 'exp', by renaming them to 'divisor' and
'exponent' respectively. 'man' is also renamed to 'mantissa' for
uniformity.
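The rename, in short:

  /*
   * Before (clashes with the standard identifiers div() and exp()):
   *
   *    u32 div, exp, man;
   */
  u32 divisor, exponent, mantissa;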
JIRA NVGPU-1035
Change-Id: I0bbd38e5021c0047f9ce646dd6f90a30a3e4f3a5
Signed-off-by: smadhavan <smadhavan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1852429
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
MISRA rule 17.2 forbids recursion as a hazard to stack space. To
comply, and additionally to make the code somewhat more straightforward
to read, rewrite the runlist construction with three explicit functions
that correspond to the three levels of the earlier recursion. These
levels map to the three priority levels of TSGs, and having more than
three is unlikely.
When "runlist interleaving" is enabled, TSGs with higher priorities get
interleaved between the switch of each pair of lower-level priority
TSGs, so that the latency for a job at priority level X is no more than
all jobs' timeslices of priority X and higher, plus at most one job at a
lower level.
This can be illustrated as follows (low, medium, high TSGs 1 and 2):
L1 L2 (only low-priority TSGs)
H1 H2 (only high-priority TSGs)
H1 H2 M1 H1 H2 M2 (no low-priority TSGs)
M1 M2 L1 M1 M2 L2 (no high-priority TSGs)
H1 H2 L1 H1 H2 L2 (no medium-priority TSGs)
H1 H2 M1 H1 H2 M2 H1 H2 L1 H1 H2 M1 H1 H2 M2 H1 H2 L2 (no empty levels)
Without interleaving, the items are simply grouped by priority.
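A self-contained sketch of the construction over plain arrays; all
names and types here are illustrative, not the nvgpu API, but the
ordering reproduces the examples above:

  #include <stddef.h>

  struct levels {
      const int *hi;  size_t n_hi;
      const int *med; size_t n_med;
      const int *lo;  size_t n_lo;
  };

  /* Innermost level: just emit every high-priority item. */
  static size_t append_hi(const struct levels *l, int *out)
  {
      size_t n = 0, i;

      for (i = 0; i < l->n_hi; i++)
          out[n++] = l->hi[i];
      return n;
  }

  /* Before each medium item, interleave a full high pass. */
  static size_t append_med(const struct levels *l, int *out)
  {
      size_t n = 0, i;

      for (i = 0; i < l->n_med; i++) {
          n += append_hi(l, out + n);
          out[n++] = l->med[i];
      }
      return n;
  }

  /*
   * Before each low item, interleave the mediums (with their own
   * high passes) and then one more high pass.
   */
  static size_t append_lo(const struct levels *l, int *out)
  {
      size_t n = 0, i;

      for (i = 0; i < l->n_lo; i++) {
          n += append_med(l, out + n);
          n += append_hi(l, out + n);
          out[n++] = l->lo[i];
      }
      return n;
  }

  /* Start from the lowest non-empty level. */
  static size_t build_runlist(const struct levels *l, int *out)
  {
      if (l->n_lo > 0)
          return append_lo(l, out);
      if (l->n_med > 0)
          return append_med(l, out);
      return append_hi(l, out);
  }

Each function calls only the strictly inner levels, so the call depth
is fixed at three and no recursion remains.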
Jira NVGPU-1174
Change-Id: Ic3b5106945df7105633730ecd1d150af770a5e83
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1918226
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
struct nvgpu_channel_sync is moved to a private header,
channel_sync_priv.h, under common/sync/. All accesses to the callback
functions inside struct nvgpu_channel_sync in the NVGPU driver are
replaced by the public channel_sync APIs.
Jira NVGPU-1093
Change-Id: I52d57b3d458993203a3ac6b160fb569effbe5a66
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1929783
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>