linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 01:50:07 +03:00

Author	SHA1	Message	Date
Konsta Hölttä	0b70fff5db	gpu: nvgpu: fix job count calculation for non-pow2 The CIRC_SPACE and CIRC_CNT macros work as expected when the buffer size is a power of two. The userspace-supplied number of inflight jobs is not necessarily so. Compare the get and put pointers manually. Jira NVGPU-4548 Change-Id: Ifa7bd6d78f82ec8efcac21fcca391053a2f6f311 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328572 (cherry picked from commit 33dffa1cfb142eea0f28474566c31b632eee04f5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331340 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
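The non-power-of-two problem described in the commit above can be illustrated with a minimal sketch. The helper names below are hypothetical and are not taken from nvgpu; the masked variant mirrors how the Linux CIRC_CNT macro computes its result, and the manual variant compares the get and put indices directly, assuming both are always kept in the range [0, size).

```c
#include <stdint.h>

/* Hypothetical helpers for illustration; not the nvgpu implementation. */

/* Occupied slots in a ring of `size` slots, valid for any size > 0.
 * `put` is the next slot to write, `get` the next slot to read. */
static uint32_t ring_count(uint32_t put, uint32_t get, uint32_t size)
{
	if (put >= get) {
		return put - get;
	}
	/* put has wrapped around the end of the ring */
	return size - get + put;
}

/* Free slots, keeping one slot unused so that a full ring and an
 * empty ring remain distinguishable. */
static uint32_t ring_space(uint32_t put, uint32_t get, uint32_t size)
{
	return size - 1U - ring_count(put, get, size);
}

/* Mask-based count in the style of CIRC_CNT: only correct when
 * `size` is a power of two. */
static uint32_t ring_count_masked(uint32_t put, uint32_t get, uint32_t size)
{
	return (put - get) & (size - 1U);
}
```

With size 6, put 1 and get 4, the manual count is 3 while the masked count is 5; with size 8 the two agree.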
Konsta Hölttä	47c3d4582c	gpu: nvgpu: hide priv cmdbuf gva and size Add an accessor function in the priv cmdbuf object for gva and size to be written in a gpfifo entry once the cmdbuf build is finished. This helps in eventually hiding the struct priv_cmd_entry as an implementation detail. Add a sanity check to verify that the buffer has been filled exactly to the requested size. The cmdbufs are used to hold wait and increment commands for syncpoints or gpu semaphores. A prefence buffer can hold a number of wait commands of equal size, and the postfence buffer holds exactly one increment. Jira NVGPU-4548 Change-Id: I83132bf6de52794ecc419e033e9f4599e488fd68 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325102 (cherry picked from commit d1831463a487666017c4c80fab0292a0b85c7d83) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331339 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Dinesh	1c1da3d6b4	gpu: nvgpu: Syncpoint invalid value to ~0. As qnx syncpoint's invalid value is ~0, change the code to handle this. Bug 200603716 Change-Id: I5ec79688cd9e60066725781f1effe57692ec0c27 Signed-off-by: Dinesh <dt@nvidia.com> (cherry picked from commit 705260565a75bc90683841c4c08e4c857bda39f0) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331208 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Debarshi Dutta	c1521a7bba	gpu: nvgpu: change system suspend's implementation Currently, for platforms with canRailgate device characteristics disabled, suspend can block as deterministic channels hold busy references. This patch makes the change to first hold off any new jobs for deterministic channels and then reverts back the busy references taken by those channels. Following this, suspend also waits for the device to get idle by waiting (with timeout) for the nvgpu's internal usage counter to become zero. This ensures there are no further jobs in progress and allows the system to go into a suspend state. Bug 200598228 Bug 2930266 Change-Id: Id02b4d41a9c2dd64303b2e2449dbed48c12aea4c Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328489 (cherry picked from commit `9d1e07ca18`) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2330159 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	e9747d5477	gpu: nvgpu: remove wait_fence_fd from incr_user The wait_fence_fd parameter in nvgpu_channel_sync_incr_user() has not been used since commit `1a4647272f` ("gpu: nvgpu: remove fence dependency tracking") where it was used to save a dependency fd to sema-based post fences. The commit probably should have removed this param; it has no purpose in the current design. Jira NVGPU-4548 Change-Id: Id7e68b24f8e9ba0e43ff01b7af946434580b166e Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326604 (cherry picked from commit f8031142270fb87ac41597ae70a80505078ae6d5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328423 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	aa1322f975	gpu: nvgpu: move syncpt priv cmd allocation channel_sync_syncpt_gen_wait_cmd() is rather simple now and is called from two places where one has the buf preallocated and the other doesn't. Remove the preallocated flag from the function, moving the allocation to the single place where it is needed. Jira NVGPU-4548 Change-Id: I48083f4f6f1093d64b67c63582291392a3481932 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325101 (cherry picked from commit afb566721e2b4c15349ff79d51f5eddc49b66014) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331338 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	4acf78dff3	gpu: nvgpu: guard sync cmd hals properly Make the syncpt and sema wait and incr command HAL ops consistent. Add CONFIG_NVGPU_SW_SEMAPHORE guards for the semaphore ops. The syncpoint ops already have CONFIG_TEGRA_GK20A_NVHOST around them. Delete the dummy syncpt ops. They are not used; the ops are only needed when the real versions exist. Jira NVGPU-4548 Change-Id: I30315a67169b31b1d63a0a1a0a4492688db4a2bc Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325100 (cherry picked from commit ed13b286c5fbdbc008ec59172d98ac79e9f2e733) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331337 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	39844fb27c	gpu: nvgpu: hide priv cmdbuf mem writes Add an API to append data to a priv cmdbuf entry. Hold the write pointer offset internally in the entry instead of having the user keep track of where those words are written to. This helps in eventually hiding struct priv_cmd_entry from users and provides a more consistent interface in general. The wait and incr commands are now slightly easier to read as well when they're just arrays of data. A syncfd-backed prefence may be composed of several individual fences. Some of those (or even a fence backed by just one) may be already expired, and currently the syncfd export design releases and nulls semaphores when expired (see gk20a_sync_pt_has_signaled()) so for those the wait cmdbuf is appended with zeros; the specific function is for this purpose. Jira NVGPU-4548 Change-Id: I1057f98c1b5b407460aa6e1dcba917da9c9aa9c9 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325099 (cherry picked from commit 6a00a65a86d8249cfeb06a05682abb4771949f19) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331336 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
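The append-style interface described in the commit above might look roughly like the following sketch. The struct and function names are hypothetical, chosen for illustration only; the real struct priv_cmd_entry and its accessors are not shown in this log. The entry tracks its own write offset, offers a zero-fill variant for expired fences, and can report whether it was filled exactly to the requested size.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical entry type; not the real struct priv_cmd_entry. */
struct cmd_entry {
	uint32_t *buf;   /* backing memory for this entry */
	uint32_t size;   /* requested size in 32-bit words */
	uint32_t fill;   /* words written so far (internal write offset) */
};

/* Append `words` words of data; returns 0 on success, -1 if it would overflow. */
static int cmd_entry_append(struct cmd_entry *e, const uint32_t *data,
			    uint32_t words)
{
	if (words > e->size - e->fill) {
		return -1;
	}
	memcpy(&e->buf[e->fill], data, (size_t)words * sizeof(uint32_t));
	e->fill += words;
	return 0;
}

/* Append `words` zero words, e.g. in place of a wait on an expired fence. */
static int cmd_entry_append_zeros(struct cmd_entry *e, uint32_t words)
{
	if (words > e->size - e->fill) {
		return -1;
	}
	memset(&e->buf[e->fill], 0, (size_t)words * sizeof(uint32_t));
	e->fill += words;
	return 0;
}

/* Sanity check: nonzero only when filled exactly to the requested size. */
static int cmd_entry_is_complete(const struct cmd_entry *e)
{
	return e->fill == e->size;
}
```

Callers then pass whole command arrays and never compute offsets themselves, which is what allows the entry layout to stay private.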
Tejal Kudav	0c9f589f3f	gpu: nvgpu: Remove TLC error regs from dev_reginit The TLC error registers will be programmed as part of interrupt and error initialization code. This will help move all common.nvlink_turing_intr unit related code together. JIRA NVGPU-4350 Change-Id: I1c291f346eee890ee973889473b44227306d0400 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2327621 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Petlozu Pravareshwar <petlozup@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
tkudav	3856381b43	gpu: nvgpu: Clear nvlink error persistent state Error logging bits within the nvlink blocks like TLC and MIF are persistent through reset, to enable them to be polled following a reset event. That means that they are in an unknown state at cold reset, and may contain error state after a warm reset event. Software is expected to reset them, either by writing ones to the status bits or by writing to the DEBUG_RESET register at the IOCTRL top level, to clear the state out before enabling error reporting. JIRA NVGPU-4352 Change-Id: Iab4e96388fd827c0d694eada61b20f24bbddd1ff Signed-off-by: tkudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2317683 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Tejal Kudav	5af8cedf05	gpu: nvgpu: Nvlink interrupt handling Enable logging and error reporting for MIF, DLPL, and TLC blocks. Configure the NVLIPT and IOCTRL interrupt registers to rollup the MIF and TLC errors on the link-specific fatal line and the DLPL interrupts on link-specific intr_a(fatal) line. Both link_err_fatal and link_intr_a are rolled up to stall interrupt line. In the handling ISR, clear the interrupt status registers and print an error. Move the interrupt handling HAL code to /common/hal. JIRA NVGPU-4350 JIRA NVGPU-4351 JIRA NVGPU-5231 JIRA NVGPU-4354 JIRA NVGPU-4355 JIRA NVGPU-4356 Change-Id: I14812499caf506592f3ae84d6681d857730d31ff Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2313221 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	d58d6ff321	gpu: nvgpu: use job count for priv cmdbuf size Reduce the priv cmdbuf allocation size to match the actual space needed in the worst case when num_in_flight is not specified. Although synchronization may indeed take up to 2/3 of the gpfifo entries, the number of jobs is what matters and it will be the remaining 1/3. Each job uses up at most one wait and incr command from the pre and post fences, so half of the 2/3 will be only wait commands and the other half will be only incr commands. Jira NVGPU-4548 Change-Id: Ib3566a76b97d8f65538d961efb97408ef23ec281 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325233 (cherry picked from commit 515deae4f58fedc7d004988f0f85470a7a894ddf) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328413 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
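The sizing argument in the commit above can be written out as a small worked calculation. This is a sketch under stated assumptions: the function names are hypothetical, integer division is used for the one-third estimate, and the per-command sizes are passed in as parameters (the neighbouring commit quotes 8 and 10 as the up-to-gp10b semaphore wait and incr sizes, in whatever unit the HAL reports). The driver's actual rounding is not shown in this log.

```c
#include <stdint.h>

/* Hypothetical helpers for illustration; not the nvgpu implementation. */

/* Worst case when num_in_flight is not given: sync commands take up to
 * 2/3 of the gpfifo entries, so jobs are the remaining 1/3. */
static uint32_t worst_case_jobs(uint32_t gpfifo_entries)
{
	return gpfifo_entries / 3U;
}

/* Each job needs at most one wait command and one incr command. */
static uint32_t priv_cmdbuf_size(uint32_t num_jobs, uint32_t wait_size,
				 uint32_t incr_size)
{
	return num_jobs * (wait_size + incr_size);
}
```

For a 1024-entry gpfifo this gives 341 jobs, and with sizes 8 and 10 a total of 341 * 18 = 6138 units.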
Konsta Hölttä	116c385089	gpu: nvgpu: alloc priv cmdbuf based on chip The semaphore wait and incr sizes are not 8 and 10 for gv11b onwards. Use the specific HAL API to retrieve their sizes and compute the priv cmdbuf queue based on them instead of the up-to-gp10b values. We haven't run out of space likely for several reasons: 1) userspace may not need both pre and post fences for absolutely each submitted job 2) submitted jobs may consist of more than one gpfifo entry, reducing the relative required sync capacity 3) the queue size is rounded up to the next power of two which leaves some margin for error in this calculation 4) the gpfifo size based num-in-flight guess has been twice as big as it needs to be (fixed in a next patch) Jira NVGPU-4548 Change-Id: I172b5c0d8bb7d2231cc45cbed5e1e8b60ce7c707 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323148 (cherry picked from commit 03fb194d105242c3eb20a9857a22743f5f64b9b9) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328412 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	00203b42f2	gpu: nvgpu: split add_sema_cmd to wait and incr The internal add_sema_cmd() used when making cmd buf entries has so many branches it makes sense to split it at the bool acquire flag into two functions. The wait part doesn't even need the wfi flag, and the incr part doesn't need offset. Jira NVGPU-4548 Change-Id: Iab26b9bc14564e2958935ab7ffda03aa873dd9b1 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323320 (cherry picked from commit 9fe2830aa9ee2b0b165edc959defa74dfb49c6ba) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328410 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	6202ead057	gpu: nvgpu: split sema sync hal to wait and incr Instead of one HAL op with a boolean flag to decide whether to do one thing or another entirely different thing, use two separate HAL ops for filling priv cmd bufs with semaphore wait and semaphore increment commands. It's already two ops for syncpoints, and explicit commands are more readable than boolean flags. Change offset into cmdbuf in sem wait HAL to be relative to the cmdbuf, so the HAL adds the cmdbuf internal offset to it. While at it, modify the syncpoint cmdbuf HAL ops' prototypes to be consistent. Jira NVGPU-4548 Change-Id: Ibac1fc5fe2ef113e4e16b56358ecfa8904464c82 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323319 (cherry picked from commit 08c1fa38c0fe4effe6ff7a992af55f46e03e77d0) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328409 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vinod G	6a7bf6cdc0	gpu: nvgpu: update sm ecc_status_error handling Use gv11b_gr_intr_handle_tpc_sm_ecc_exception function for future chip to avoid code replication. Add sm_ecc_status_errors hal to read the ecc_status_errors Jira NVGPU-5033 Signed-off-by: Vinod G <vinodg@nvidia.com> Change-Id: I4a25837d9b833a48307b9353b82ff6597f985e41 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325537 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	72d01afd0c	gpu: nvgpu: replace dma_buf_kmap with dma_buf_vmap dma_buf_kmap was introduced a decade ago to map a dma_buf partially by the input number of pages, when 32-bit was fairly common. It was added to not exhaust vmalloc space. Starting from kernel 5.6, it is deprecated as vmap calls should succeed with larger available vmalloc space. Use dma_buf_vmap/vunmap instead of dma_buf_kmap/kunmap for handling mapping of notifier memory in gk20a_channel_wait_semaphore. Also update the debug prints and add speculation barrier to the start of gk20a_channel_wait. Bug 2925664 Change-Id: I49078fa81f050a57a5b66a793e62006dd66e3ba3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326513 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Abdul Salam	b029f3b2b0	gpu: nvgpu: Refactor clk_fll unit As a part of refactor move struct nvgpu_avfsfllobjs from public header to private header. This will help to have arch consistency across all units. Use public functions to fetch the data across other units. NVGPU-4690 Change-Id: I73a750695c2ae7d3e46d1d692d10e40f13ec3cb3 Signed-off-by: Abdul Salam <absalam@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/#/c/linux-nvgpu/+/2326675/ (cherry picked from commit 41e374461da5dc9e2b4ac67a0855fd8dd20e1450) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328538 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ramesh Mylavarapu <rmylavarapu@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Sagar Kamble	81b14ef5b1	gpu: nvgpu: fix dbg log and comment in nvgpu_vm_find_mapping Following commit updated the debug message in the function nvgpu_vm_find_mapping w.r.t reuse of mapping. commit 2f00d9adfc4fc91a6b84b14cc513f9b855d39cad Author: Sagar Kamble <skamble@nvidia.com> gpu: nvgpu: fix null pointer access in nvgpu_vm_find_mapping That reuse log is about the mapping and not SGT. Fix the log and add details about different handling of SGT for dmabuf drvdata cases in the comment. Bug 2834141 Change-Id: I3630de1c45a2bf55ff18bdb426f0597efe83f72c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328427 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Konsta Hölttä	1dcd4957f0	gpu: nvgpu: extract job from channel.c Start moving job and job list related functionality out of the big channel.c file. The lowest level job list stuff is moved, as is resource preallocation which is tied to the job list. Adding and cleaning jobs still stays in channel.c for now. The joblist is still owned by the channel as a direct struct field. Jira NVGPU-4548 Change-Id: I2733484d8ce6bd7b1fe0c32a867139c682616dfd Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323149 (cherry picked from commit cbd20803ee10058da9d258e9e8cb91b34d2278d5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328408 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	72151c579f	gpu: nvgpu: hide priv cmd queue type Move struct priv_cmd_queue to priv_cmdbuf.c so that its definition does not need to be visible to all users of channel.h. This also forces it to be separately allocated (during channel init time). While at it, rename the functions to allocate and free priv cmdbuf queues now that they're not in channel.c anymore. A private command buffer queue is a piece of dma memory from which entries for incr and wait command lists are suballocated. As the name implies, it's a queue; allocations and frees of the bufs must happen in certain order. Jira NVGPU-4548 Change-Id: I1b47029f3a478e1942f24292918b7b59a5d91528 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323147 (cherry picked from commit 1fcf9b04275f44638059c0147dc16c1dc6956510) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328407 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	b3d16b23d5	gpu: nvgpu: extract priv cmdbuf from channel.c Move private command buffer related functionality to priv_cmdbuf.c. This is used only for kernel mode submits, so it makes sense to group it out, and the priv cmdbuf stuff is used also by things that don't care about channels. Jira NVGPU-4548 Change-Id: Idbb42e3ed3984e16c654bb9aa2b7564b780048a4 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323146 (cherry picked from commit bb67bfc7ab8e87236f31bc4f6c80dab042609f21) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328406 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
ajesh	7fc3c3822d	gpu: nvgpu: reduce the ccm for thread unit Reduce the code complexity of function nvgpu_thread_create_priority in Thread unit. Jira NVGPU-4987 Change-Id: I85da527c3d8dbbe37c5428e5bded9ed19b299613 Signed-off-by: ajesh <akv@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2327865 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	52835c39ae	gpu: nvgpu: do not skip completed syncpt prefences A corner case has existed since ancient times for syncpoint-backed prefences to not cause a gpu wait if the fence is found to be completed in the submit path. This adds some unnecessary complexity, so don't check for completion in software. Let the gpu "wait" for these known-to-be-trivial waits too. Necessary priv cmdbuf space has been allocated anyway. Originally nvhost had 16-bit fences which would wrap around relatively quickly, so waiting for an old fence could have looked like waiting for a fence that will expire long in the future. With 32-bit thresholds, this hasn't been the case for several Tegra generations anymore, and nvhost doesn't ignore waits like this either. The wait priv cmdbuf in submit path can still be missing even with a prefence supplied because the Android sync framework supports sync fds that contain zero fences inside; this can happen at least when merging fences that have all been expired. In such conditions the wait cmdbuf wouldn't even get allocated. 
[this is squashed with commit 8b3b0cb12d118 (gpu: nvgpu: allow no wait cmd with valid input fence) from https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325677] Jira NVGPU-4548 Change-Id: Ie81fd8735c2614d0fedb7242dc9869d0961610eb Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321762 (cherry picked from commit 8f3dac44934eb727b1bf4fb853f019cf4c15a5cd) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324254 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
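The wraparound concern behind the old 16-bit fences can be sketched with a common wrap-aware comparison idiom. This is an illustration only: the function names are hypothetical, and the log does not confirm that nvhost or nvgpu compares thresholds this way. The conversion of an out-of-range value to a signed type is implementation-defined in C; the sketch assumes the usual two's-complement wrap.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helpers for illustration; not nvhost's or nvgpu's code. */

/* A threshold counts as expired when the counter is at or past it,
 * judged by the signed distance within the counter's width. */
static bool expired16(uint16_t current, uint16_t threshold)
{
	return (int16_t)(uint16_t)(current - threshold) >= 0;
}

static bool expired32(uint32_t current, uint32_t threshold)
{
	return (int32_t)(current - threshold) >= 0;
}
```

If a fence with threshold 100 completed long ago and the counter has since advanced to 40100, the 16-bit distance of 40000 exceeds half the counter range, so the fence looks like one that expires far in the future; the same values in 32 bits are reported as expired.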
Nitin Kumbhar	85949d39e2	gpu: nvgpu: disable GC-OFF feature for all dGPUs Set the can_pci_gc_off platform flag of all dGPUs to false to disable powering on/off dGPU using GC-OFF feature. Bug 2917054 Change-Id: Iffacd134cf52a137bb9c121d69bd0fd0a096c6ff Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2327968 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	23d545f250	gpu: nvgpu: dma_buf_ops map unavailable in upstream This function op is deprecated after kernel version 5.5. Compile the corresponding code under version check. Bug 2925664 Change-Id: Ice1b02a3d6c7942ec63a609c515436d2622c3575 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326512 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	c6908922e5	gpu: nvgpu: move generic preempt hals to common - Move fifo.preempt_runlists_for_rc and fifo.preempt_tsg hals to common source file as nvgpu_fifo_preempt_runlists_for_rc and nvgpu_fifo_preempt_tsg. Jira NVGPU-4881 Change-Id: I31f7973276c075130d8a0ac684c6c99e35be6017 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323866 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	fa73b73ba7	nvgpu: userspace: update priv_ring testcase Init below new hals in test_priv_ring_setup: - priv_ring.isr_handle_0 - priv_ring.isr_handle_1 Jira NVGPU-4669 Change-Id: Ia93a4ffca8335d71a463415d018e0746fc24e07e Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321415 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	5555b6db87	gpu: nvgpu: add isr_handle_0/1 priv_ring gops Add below hals to priv_ring gops. These hals are used from gp10b onwards. - isr_handle_0 - isr_handle_1 Jira: NVGPU-4669 Change-Id: I95aaebfd4c9c292b7b0da98cd34ac2a8472a5e1d Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2318245 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Sagar Kamble	705dd2ad77	gpu: nvgpu: use timespec64 Due to y2038 problem, where timestamps will overflow if logged in 32bit values, upstream linux kernel has removed timespec and friends. Corresponding jiffies conversion functions are also obsolete. Update the notifier timestamp setup code to comply with this. Bug 2925664 Change-Id: I5266e3d748e536175af8eff5111fcc54556332a8 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326514 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Jonathan Hunter <jonathanh@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Konsta Hölttä	0119aedbe5	gpu: nvgpu: try syncpt fdget only if supported Guard nvgpu_os_fence_syncpt_fdget() with an nvgpu_has_syncpoints() check. Even when CONFIG_TEGRA_GK20A_NVHOST is set, the platform data bit can be disabled independently; on Linux we have a runtime flag to disable them, too. If nvgpu doesn't have syncpt support, don't try reading syncpt-based sync files. If a sema-only-backed channel sync is given a syncpoint-based prefence fd, we can't wait for it with the current design that couples waits and increments in one interface. This should eventually be fixed, but for now the extra check at least guards another interesting case. A sync file with a zero fence count can be trivially accepted as either a valid syncpoint fence or a sema fence. If only semas are supported, and the syncpt check that happens first would turn the empty fd into a syncpt-based sync fence, the sema wait layer would wrongly reject it. Jira NVGPU-4548 Change-Id: Ib40c2d9a6a25812c5e24eef52c1d1a4f81eeed83 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325733 (cherry picked from commit 877f99d7c9977dfea14480a1b0488c990b813d1d) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326044 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vinod G	4ffcc14e90	gpu: nvgpu: support gr_exception_mme_fe1 error Add GPU_PGRAPH_MME_FE1_EXCEPTION definition to report the gr_exception_mme_fe1 error. Jira NVGPU-5226 Signed-off-by: Vinod G <vinodg@nvidia.com> Change-Id: I963586f8335ea99159e379b260679f060c2cf3c9 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326228 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Dinesh	8a94781aa9	gpu: nvgpu: Change pramin lock to mutex As spinlock contention will eat cpu cycle, the pramin lock can be changed to mutex. Vidmem allocation is fully protected and vidmem pending is an atomic variable. So the lock acquisition is removed. JIRA NVGPU-4550 Change-Id: I0cecb8f4ee7e840fd698311572aedebbc8f49177 Signed-off-by: Dinesh <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321251 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Antony Clince Alex	ad503f60fd	gpu: nvgpu: gv11b: add missing hw macros Add following missing hw macros for gv11b: - pri_ringstation_gpc_gpc0_priv_error_info_priv_level_v - pri_ringstation_gpc_gpc0_priv_error_info_subid_v - pri_ringstation_sys_priv_error_info_priv_level_v - pri_ringstation_sys_priv_error_info_subid_v Bug 200604892 Change-Id: I37fa33580b689a496ed0a74855a58291d626e341 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325347 Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Konsta Hölttä	32fae6acf4	gpu: nvgpu: linux: read current syncpt in set safe state nvhost_syncpt_read_minval() only reads the min value that nvhost has cached. It makes sense for host managed syncpoints, but the user syncpoint that needs set_safe_state is client managed; its min and max values are not tracked internally. Use nvhost_syncpt_read_ext_check() to read the actual syncpoint value from HW and set the "safe state" (65536 increments) based on that. The safe state is analogous to "set min equal to max" when max is expected to be no more than the current value plus a big number. Using the cached min value would make this safe state lose its meaning when there could have been more than the big number of increments since the syncpoint was allocated. Jira NVGPU-4548 Change-Id: I395be75f1696e48e344f5503420864efeb3621de Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323060 (cherry picked from commit ae571178ca63c4fa3e6bf70a4da0221393e975ee) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326380 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	2d9b839f21	gpu: nvgpu: remove user sync related apis Set safe state and get syncpt address in the kernel submission tracking syncs was implemented for userspace syncs. Now that it's clear that the user sync object provides them, there are no users left for these APIs. Remove them. Jira NVGPU-4548 Change-Id: I58e04162dee55bb8d8547c9252033f40ed908144 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321950 (cherry picked from commit a95c8f7ace562a11ca235d71496d3a7ce150bc7d) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324251 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	60e1cf334a	gpu: nvgpu: include linux/mm.h in utils Linux's mm.h defines at least PAGE_ALIGN which nvgpu uses and provides via utils.h. It's defined internally in posix/utils.h, but in Linux the OS version is relied upon. However, the macro definition has spilled in via the trace events that includes linux/ktime.h. If nvgpu/trace.h isn't included, we wouldn't get that. Jira NVGPU-4548 Change-Id: I9c27cc9db15231714d38d7c623957c8f5fe52789 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321362 (cherry picked from commit 84762ce74847a3b0983901b6cf3f5d3110abd6cc) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324246 GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User	2020-12-15 14:13:28 -06:00
Konsta Hölttä	4f80c6b8a9	gpu: nvgpu: add channel_user_syncpt Refactor user managed syncpoints out of the channel sync infrastructure that deals with jobs submitted via the kernel api. The user syncpt only needs to expose the id and gpu address of the reserved syncpoint. None of the rest (fences, priv cmdbufs) is needed for that, so it hasn't been ideal to couple with the user-allocated syncpts. With user syncpts now provided by channel_user_syncpt, remove the user_managed flag from the kernel sync api. This allows moving all the kernel submit sync code to be conditionally compiled in only when needed, and separates the user sync functionality in a more clear way from the rest with a minimal API. [this is squashed with commit 5111caea601a (gpu: nvgpu: guard user syncpt with nvhost config) from https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325009] Jira NVGPU-4548 Change-Id: I99259fc9cbd30bbd478ed86acffcce12768502d3 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321768 (cherry picked from commit 1095ad353f5f1cf7ca180d0701bc02a607404f5e) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2319629 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	b813adbf49	gpu: nvgpu: require os fence when only supported If the os fence is the only kind that's supported, fail a submit if the user wants fences but doesn't explicitly request sync fences, expecting syncpoints. Syncpoint support is advertised to userspace in the gpu characteristics, so userspace already has the knowledge to request the correct sync type. Do this check at the ioctl level. The in-kernel stuff that needs submits (cde, copyengine) can work without syncpoints and sync fences are used only in userspace. Fail a submit also if CONFIG_SYNC is not set and sync fences are requested. Lack of kernel support doesn't guarantee that userspace would still wrongly want that. Clarify the deferred cleanup requirements. The sync framework is needed only for post sync fences, but deferred cleanup is still always needed with semaphores because the internal tracking is done with dynamically allocated (although small) objects. Jira NVGPU-4548 Change-Id: I2e5a6554930cb413b2bb46ddfe388e41390bc7e4 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321715 (cherry picked from commit d870956170906eae1088846ec05266c859669771) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2318157 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
vinodg	df896cd3c7	gpu: nvgpu: support feature_override for nvgpu-next Jira NVGPU-4667 Signed-off-by: vinodg <vinodg@nvidia.com> Change-Id: I5dc64b874acf691c6a77dcda6c66c119dbc0092c Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324880 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
ajesh	e4285740a4	gpu: nvgpu: modify the prints for return values Modify the prints for certain OS API return values for non safe build. Jira NVGPU-4987 Change-Id: I7c3eba58b5c685fbf654b7093e4d2180e02c49a6 Signed-off-by: ajesh <akv@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324992 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vinod G	0e0b966f0c	gpu: nvgpu: update gr exception hal Make generic gr exception static functions to public functions. Jira NVGPU-5033 Signed-off-by: Vinod G <vinodg@nvidia.com> Change-Id: I9ac4cbc728edda813a487f80af622559a798b319 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324676 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Vedashree Vidwans <vvidwans@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	62955ec7f1	gpu: nvgpu: reorganize gpfifo writes in submit Reduce the number of branches and make the code flow more straightforward by having two complete paths for the gpfifo entry writes: one when job tracking is done and another when not. Although this adds some very minor duplication (of the user gpfifo append call), this way it's easier to read what happens to the job metadata, and when do we even have one. Jira NVGPU-4548 Change-Id: I6be8bc5afaf139e7c49d5e44837e04f642dd5721 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321761 (cherry picked from commit 9a3d3c8d556d563b9d67b370636791d6a1dd57ee) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324253 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	550d45430f	gpu: nvgpu: extract submit prechecks to own function Reduce complexity of the big gpfifo submit function by adding another function to perform channel-global and driver-global sanity checks that don't depend on submit parameters. The nvgpu_channel_check_unserviceable() check was in the middle of the submit function because there used to be a blocking wait just before it when the hw gpfifo would be full. The blocking wait could exit with the channel recovered from a timeout. Now it's ok to check this only once in the beginning because the submit is non-blocking. Jira NVGPU-4548 Change-Id: Idf19a560ca58a4f7da776c420dc9c6299cd7f7e7 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321760 (cherry picked from commit 5359a2180f13505f57c62b9f639344913716370a) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324252 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	8b96f27c45	gpu: nvgpu: delete channel refs in job tracking Each submitted job has held a reference to the channel where the job runs. This is not necessary: all that the refs do is prevent the channel from getting freed before the jobs are done in case the channel file is closed early. However, that is already taken care of, so remove the per-job get/put pair. The channel closure path needs to unbind the channel from its tsg if that hasn't been done by the channel's user. Unbind gets the channel off the runlist and forces all fences to expire, then enqueues the channel for final job cleanup. No jobs can outlive this. Delete also the extra get/put pair in job cleanup. The caller (either the channel worker thread or the submit path in case of deterministic channels) will always hold a reference. Jira NVGPU-4548 Change-Id: I3a01759e1b2caf66c46cff19f6557645489ca8f4 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2322541 (cherry picked from commit 8af6260b8fcfd7bf393f50addb681b5353cbae38) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324255 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
vinodg	9d577b8d9a	gpu: nvgpu: unit: remove gv11b_channel_debug_dump test dump_channel hal is moved to common code. Remove the test_gv11b_channel_debug_dump Jira NVGPU-5109 Signed-off-by: vinodg <vinodg@nvidia.com> Change-Id: If04314c09d9f7f0c789752a8a003e012a629d9be Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323553 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vinod G	340ea241cb	gpu: nvgpu: remove channel debug_dump hal Channel debug_dump hal function does not involve any register related code. Move gv11b_channel_debug_dump hal function to common code nvgpu_channel_info_debug_dump function. Check gpu hw version to limit instance variables dump that differs between socs. Add new hal pointer syncpt_debug_dump for pbdma. Jira NVGPU-5109 Signed-off-by: Vinod G <vinodg@nvidia.com> Change-Id: Icfca837ce8e4117387cffa6fadf6c094c7da5946 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321016 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seshendra Gadagottu	62c06723dd	gpu: nvgpu: sim: defer sim buffers allocation Allocate sim buffers only after chip specific memory properties are enabled. JIRA NVGPU-5281 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Change-Id: I7b64b3a51b8cd66dbefd22a09216b2caaeccacbf Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324083 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Sagar Kamble	c37c1b5474	gpu: nvgpu: fix null pointer access in nvgpu_vm_find_mapping mapped_buffer sgt is deallocated by dma_buf_unmap_attachment in the function nvgpu_vm_find_mapping if it is previously mapped. Debug print referred to the sgt page after deallocating hence move it before deallocation. Fix below null pointer dereference issue: [ 34.692659] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000 ... [ 34.699820] Hardware name: Jetson-AGX (DT) [ 34.709136] pc : nvgpu_vm_find_mapping+0xcc/0x240 [nvgpu] [ 34.714377] lr : nvgpu_vm_find_mapping+0xa4/0x240 [nvgpu] ... [ 34.804523] Call trace: [ 34.807093] nvgpu_vm_find_mapping+0xcc/0x240 [nvgpu] [ 34.812195] nvgpu_vm_map+0xd4/0x480 [nvgpu] [ 34.816168] nvgpu_vm_map_linux+0x15c/0x1c8 [nvgpu] [ 34.821237] nvgpu_vm_map_buffer+0x14c/0x270 [nvgpu] [ 34.825962] gk20a_as_dev_ioctl+0x320/0xfe0 [nvgpu] [ 34.830955] do_vfs_ioctl+0xb8/0xa40 [ 34.834449] ksys_ioctl+0x80/0xb8 [ 34.837430] __arm64_sys_ioctl+0x1c/0x28 [ 34.841630] el0_svc_handler+0x80/0x1a8 [ 34.845129] el0_svc+0x8/0xc [ 34.848106] Code: f9400021 d360ff3a b9406a62 f940007c (f9400034) [ 34.854227] ---[ end trace 1d8c4efb211dc317 ]--- Bug 2834141 Change-Id: I281ac21071fcc2e3e6f3685798c8e77daba6d4a4 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323361 Tested-by: Debarshi Dutta <ddutta@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Debarshi Dutta	f483304238	gpu: nvgpu: add prerequisite for syncpoint-shim support add check for nvgpu_has_syncpoints() before enabling syncpoint-shim and usermode_syncpoint support. Syncpoint shim cannot exist without syncpoint support in the first place. Bug 200551105 Change-Id: I2a9c6d23c72a25bcac4a2a8737ed0bad14cd4d8f Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323208 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00

7732 Commits