linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 01:50:07 +03:00

Author	SHA1	Message	Date
Konsta Hölttä	ff41d97ab5	gpu: nvgpu: always prealloc jobs and fences Unify the job metadata handling by deleting the parts that have handled dynamically allocated job structs and fences. Now a channel can be in one less mode than before which reduces branching in tricky places and makes the submit/cleanup sequence easier to understand. While preallocating all the resources upfront may increase average memory consumption by some kilobytes, users of channels have to supply the worst case numbers anyway and this preallocation has been already done on deterministic channels. Flip the channel_joblist_delete() call in nvgpu_channel_clean_up_jobs() to be done after nvgpu_channel_free_job(). Deleting from the list (which is a ringbuffer) makes it possible to reuse the job again, so the job must be freed before that. The comment about using post_fence is no longer valid; nvgpu_channel_abort() does not use fences. This inverse order has not posed problems before because it's been buggy only for deterministic channels, and such channels do not do the cleanup asynchronously so no races are possible. With preallocated job list for all channels, this would have become a problem. Jira NVGPU-5492 Change-Id: I085066b0c9c2475e38be885a275d7be629725d64 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2346064 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Konsta Hölttä	ca1f93bdd7	gpu: nvgpu: add user fence type Decouple the fence information needed for providing submit postfences to userspace by adding a separate type for that and using it to pass fence data to ioctls. The data in struct nvgpu_fence_type is used in various places: - job tracking needs to know when a post fence is expired - job submitters within the driver (vidmem clears) need to be able to wait for these fences - userspace needs the fence as an id, value pair or as a file descriptor created from an os fence To keep object lifetimes strict, start decoupling the os fence data out of struct nvgpu_fence_type: delete nvgpu_fence_install_fd() and add nvgpu_fence_extract_user() to return a struct nvgpu_user_fence that contains only the necessary information. Storing the os fence in job tracking metadata is legacy code and not useful. Passing the os fence from where it's created through the whole submit path inside this combined fence type has been convenient, though. The internally stored cde job fence in dmabuf compression metadata is still nvgpu_fence_type to keep this patch simple. Jira NVGPU-5248 Change-Id: I75b7da676fb6aa083828f888c55571bbf7645ef3 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2359064 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Alex Waterman	70ce67df2d	gpu: nvgpu: Add a generic profiler Add a generic profiler based on the channel kickoff profiler. This aims to provide a mechanism to allow engineers to (more) easily profile arbitrary software paths within nvgpu. Usage of this profiler is still primarily through debugfs. Next up is a generic debugfs interface for this profiler in the Linux code. The end goal for this is to profile the recovery code and generate interesting statistics. JIRA NVGPU-5606 Signed-off-by: Alex Waterman <alexw@nvidia.com> Change-Id: I99783ec7e5143855845bde4e98760ff43350456d Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2355319 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	6cbc174fc2	gpu: nvgpu: avoid channel wdt ifdefs Implement empty stubs of the channel watchdog functions for when watchdog is disabled from build. Add some forward declarations that were missing. Now most call sites don't need #idefs for the build flag. Add error checks for the wdt alloc failure. Jira NVGPU-5494 Jira NVGPU-5493 Change-Id: I2d42e8ab4c5e045cd280b2e1f254396127bd154b Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2352050 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	16fb7654a5	gpu: nvgpu: isolate channel watchdog unit Move the definition of struct nvgpu_channel_wdt to watchdog.c. Adjust users of it to access it via an unified interface instead of poking directly at the channel internals. Jira NVGPU-5494 Change-Id: Ie11826e6732a8b98e72c4f81dd06bd7e49848121 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2345935 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	d916e85171	gpu: nvgpu: incr sync once submit is ready to go Split out the max value increment and syncpt interrupt registration out of nvgpu_channel_sync_incr*(). This API is called in the submit path to prepare buffers and tracking resources, but later on in the submit path errors can still occur so that the increment wouldn't happen (unless artificially forced by sw). The increment and irq registration cannot easily be undone and it makes more sense to do these at the moment when the prepared job is finally ready, so add a new nvgpu_channel_sync_mark_progress() API to be called later in the submit path to signal that progress shall eventually happen on the sync. Without this, the max value would stay too large after an unsuccessful submit until the channel gets closed. The sync object (syncpt or semaphore) is always exclusively owned by the channel that allocated it, so nonatomically reading the max value first in sync_incr() and incrementing it later in mark_progress() is racefree; all submits per channel are serialized. Change the channel syncpoint to client managed from host managed so that nvhost-exported sync fences behave correctly with the temporary state where the fence threshold is over the max value. Ideally we'd always track nvgpu-owned syncpts' max values internally, but this is enough for now. Jira NVGPU-5491 Change-Id: Idf0bda7ac93d7f2f114cdeb497fe6b5369d21c95 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2340465 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	d44ed9d3a8	gpu: nvgpu: rollback gpfifo on error Submitting new work may fail in the middle of writing the gpfifo entries. Undo the increments on the gp_put shadow pointer in case of error to avoid submitting wrong data during the next submit. Jira NVGPU-5491 Change-Id: I064eaac8773b24da0a56db79ac6bfd07c008da03 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2340464 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	f388b1f596	gpu: nvgpu: simplify cmdbuf construction in submit Split out the wait cmd and incr cmd setup work in submit path to separate functions to minimize cyclomatic complexity and to increase readability. Jira NVGPU-5491 Change-Id: I7dfabd2de287ae10aaae9fb8d4d85d752db8631c Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2340463 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	05df07945a	gpu: nvgpu: avoid channel dependency in priv cmdbuf The priv cmdbuf queue needs only the vm_gk20a of the channel that owns it. Pass the vm to the queue constructor and have the channel code store the queue to itself instead of poking at the channel from the queue code. Adjust the cmdbuf queue api to take the queue, not the channel. Move the inflight job fallback calculation to the channel code. The size of the channel gpfifo isn't needed in the queue; just the job count is. [not part of the cherry-pick: a bunch of MISRA mitigations.] Jira NVGPU-4548 Change-Id: I4277dc67bb50380cb157f3aa3c5d57b162a8f0ba Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2329659 (cherry picked from commit 83b2276f7bea563602eee20ce24b70ce70c8475a) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2332508 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	9bee2fe660	gpu: nvgpu: prealloc priv cmdbuf metadata Move preallocation of priv cmdbuf metadata structs to the priv cmdbuf level and do it always, not only on deterministic channels. This makes job tracking simpler and loosens dependencies from jobs to cmdbuf internals. The underlying dma memory for the cmdbuf data has always been preallocated. Rename the priv cmdbuf functions to have a consistent prefix. Refactor the channel sync wait and incr ops to free any priv cmdbufs they allocate. They have been depending on the caller to free their resources even on error conditions, requiring the caller to know how they work. The error paths that could occur after a priv cmdbuf has been allocated have likely been wrong for a long time. Usually the cmdbuf queue allows allocating only from one end and freeing from only the other end, as that's natural with the hardware job queue. However, in error conditions the just recently allocated entries need to be put back. Improve the interface for this. [not part of the cherry-pick:] Delete the error prints about not enough priv cmd buffer space. That is not an error. When obeying the user-provided job sizes more strictly, momentarily running out of job tracking resources is possible when the job cleanup thread does not catch up quickly enough. In such a case the number of inflight jobs on the hardware could be less than the maximum, but the inflight job count that nvgpu sees via the consumed resources could reach the maximum. Also remove the wrong translation to -EINVAL from err from one call to nvgpu_priv_cmdbuf_alloc() - the -EAGAIN from the failed allocation is important. [not part of the cherry-pick: a bunch of MISRA mitigations.] Jira NVGPU-4548 Change-Id: I09d02bd44d50a5451500d09605f906d74009a8a4 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2329657 (cherry picked from commit 25412412f31436688c6b45684886f7552075da83) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2332506 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	6fc1e41150	gpu: nvgpu: split submit on deterministic Avoid repetitive branching on the c->deterministic flag and on build time flags by breaking the submit function on the runtime flag into two functions of which one gets called. In deterministic mode the job tracking conditions are simpler, there are a few extra prechecks to guarantee deterministic latency and the railgate corner case, and deferred cleanup is never done. In nondeterministic mode job tracking has more conditions, a power reference is taken for the job lifetime, and deferred cleanup is assumed. These two paths still share some common code. Split it to two more functions to act as easy building blocks so that the main logic is apparent. Jira NVGPU-4548 Change-Id: I64f91dcf09acb16f409dc04a12ad1e144d0cce56 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2333728 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	b077c6787d	gpu: nvgpu: split sync and gpfifo work in submit Make the big submit function somewhat shorter by splitting out the work to do job allocation, sync command buffer creation and gpfifo writing out to another function. To emphasize the difference between tracked and fast submits, add two separate functions for those two cases. Jira NVGPU-4548 Change-Id: I97432a3d70dd408dc5d7c520f2eb5aa9c76d5e41 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2333727 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Konsta Hölttä	dd2fb50a1a	gpu: nvgpu: require deferred cleanup for aggressive sync destroy Aggressive sync destroy is used on some platforms where the amount of syncpoints is limited. It can cause sync objects to get allocated and freed in the submit path and when jobs are cleaned up, so require deferred cleanup. Allocations do not belong to job tracking in a deterministic submit path. Although this has been technically allowed before, deterministic channels have likely not been a priority on those old platforms with aggressive sync destroy set. Update virtualized gp10b platform data to match on a gp10b-vgpu compat string instead of gk20a-vgpu. gk20a (Tegra T124) hasn't been supported for a long time. Delete the aggressive sync destroy field from this platform. It's got enough syncpoints to not dynamically allocate them; having this property set for gp10b-vgpu has likely been a mistake. This is not a completely pure cherry-pick: also extend the gpu characteristics to not advertise full deterministic submit support when aggressive sync destroy is off. This platform flag cannot be adjusted by the user unlike many other flags. Jira NVGPU-4548 Change-Id: I283f546d48b79ac94b943d88e5dce55710858330 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2322042 (cherry picked from commit b1ba2b997b2174e365bcb0782ef3e67260ff9e57) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328411 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	47c3d4582c	gpu: nvgpu: hide priv cmdbuf gva and size Add an accessor function in the priv cmdbuf object for gva and size to be written in a gpfifo entry once the cmdbuf build is finished. This helps in eventually hiding the struct priv_cmd_entry as an implementation detail. Add a sanity check to verify that the buffer has been filled exactly to the requested size. The cmdbufs are used to hold wait and increment commands for syncpoints or gpu semaphores. A prefence buffer can hold a number of wait commands of equal size, and the postfence buffer holds exactly one increment. Jira NVGPU-4548 Change-Id: I83132bf6de52794ecc419e033e9f4599e488fd68 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325102 (cherry picked from commit d1831463a487666017c4c80fab0292a0b85c7d83) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2331339 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	e9747d5477	gpu: nvgpu: remove wait_fence_fd from incr_user The wait_fence_fd parameter in nvgpu_channel_sync_incr_user() has not been used since commit `1a4647272f` ("gpu: nvgpu: remove fence dependency tracking") where it was used to save a dependency fd to sema-based post fences. The commit probably should have removed this param; it has no purpose in the current design. Jira NVGPU-4548 Change-Id: Id7e68b24f8e9ba0e43ff01b7af946434580b166e Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2326604 (cherry picked from commit f8031142270fb87ac41597ae70a80505078ae6d5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328423 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	1dcd4957f0	gpu: nvgpu: extract job from channel.c Start moving job and job list related functionality out of the big channel.c file. The lowest level job list stuff is moved, as is resource preallocation which is tied to the job list. Adding and cleaning jobs still stays in channel.c for now. The joblist is still owned by the channel as a direct struct field. Jira NVGPU-4548 Change-Id: I2733484d8ce6bd7b1fe0c32a867139c682616dfd Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323149 (cherry picked from commit cbd20803ee10058da9d258e9e8cb91b34d2278d5) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328408 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	b3d16b23d5	gpu: nvgpu: extract priv cmdbuf from channel.c Move private command buffer related functionality to priv_cmdbuf.c. This is used only for kernel mode submits, so it makes sense to group it out, and the priv cmdbuf stuff is used also by things that don't care about channels. Jira NVGPU-4548 Change-Id: Idbb42e3ed3984e16c654bb9aa2b7564b780048a4 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2323146 (cherry picked from commit bb67bfc7ab8e87236f31bc4f6c80dab042609f21) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2328406 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	52835c39ae	gpu: nvgpu: do not skip completed syncpt prefences A corner case has existed since ancient times for syncpoint-backed prefences to not cause a gpu wait if the fence is found to be completed in the submit path. This adds some unnecessary complexity, so don't check for completion in software. Let the gpu "wait" for these known-to-be-trivial waits too. Necessary priv cmdbuf space has been allocated anyway. Originally nvhost had 16-bit fences which would wrap around relatively quickly, so waiting for an old fence could have looked like waiting for a fence that will expire long in the future. With 32-bit thresholds, this hasn't been the case for several Tegra generations anymore, and nvhost doesn't ignore waits like this either. The wait priv cmdbuf in submit path can still be missing even with a prefence supplied because the Android sync framework supports sync fds that contain zero fences inside; this can happen at least when merging fences that have all been expired. In such conditions the wait cmdbuf wouldn't even get allocated. [this is squashed with commit 8b3b0cb12d118 (gpu: nvgpu: allow no wait cmd with valid input fence) from https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325677] Jira NVGPU-4548 Change-Id: Ie81fd8735c2614d0fedb7242dc9869d0961610eb Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321762 (cherry picked from commit 8f3dac44934eb727b1bf4fb853f019cf4c15a5cd) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324254 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	4f80c6b8a9	gpu: nvgpu: add channel_user_syncpt Refactor user managed syncpoints out of the channel sync infrastructure that deals with jobs submitted via the kernel api. The user syncpt only needs to expose the id and gpu address of the reserved syncpoint. None of the rest (fences, priv cmdbufs) is needed for that, so it hasn't been ideal to couple with the user-allocated syncpts. With user syncpts now provided by channel_user_syncpt, remove the user_managed flag from the kernel sync api. This allows moving all the kernel submit sync code to be conditionally compiled in only when needed, and separates the user sync functionality in a more clear way from the rest with a minimal API. [this is squashed with commit 5111caea601a (gpu: nvgpu: guard user syncpt with nvhost config) from https://git-master.nvidia.com/r/c/linux-nvgpu/+/2325009] Jira NVGPU-4548 Change-Id: I99259fc9cbd30bbd478ed86acffcce12768502d3 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321768 (cherry picked from commit 1095ad353f5f1cf7ca180d0701bc02a607404f5e) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2319629 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	b813adbf49	gpu: nvgpu: require os fence when only supported If the os fence is the only kind that's supported, fail a submit if the user wants fences but doesn't explicitly request sync fences, expecting syncpoints. Syncpoint support is advertised to userspace in the gpu characteristics, so userspace already has the knowledge to request the correct sync type. Do this check at the ioctl level. The in-kernel stuff that needs submits (cde, copyengine) can work without syncpoints and sync fences are used only in userspace. Fail a submit also if CONFIG_SYNC is not set and sync fences are requested. Lack of kernel support doesn't guarantee that userspace would still wrongly want that. Clarify the deferred cleanup requirements. The sync framework is needed only for post sync fences, but deferred cleanup is still always needed with semaphores because the internal tracking is done with dynamically allocated (although small) objects. Jira NVGPU-4548 Change-Id: I2e5a6554930cb413b2bb46ddfe388e41390bc7e4 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321715 (cherry picked from commit d870956170906eae1088846ec05266c859669771) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2318157 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	62955ec7f1	gpu: nvgpu: reorganize gpfifo writes in submit Reduce the number of branches and make the code flow more straightforward by having two complete paths for the gpfifo entry writes: one when job tracking is done and another when not. Although this adds some very minor duplication (of the user gpfifo append call), this way it's easier to read what happens to the job metadata, and when do we even have one. Jira NVGPU-4548 Change-Id: I6be8bc5afaf139e7c49d5e44837e04f642dd5721 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321761 (cherry picked from commit 9a3d3c8d556d563b9d67b370636791d6a1dd57ee) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324253 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Konsta Hölttä	550d45430f	gpu: nvgpu: extract submit prechecks to own function Reduce complexity of the big gpfifo submit function by adding another function to perform channel-global and driver-global sanity checks that don't depend on submit parameters. The nvgpu_channel_check_unserviceable() check was in the middle of the submit function because there used to be a blocking wait just before it when the hw gpfifo would be full. The blocking wait could exit with the channel recovered from a timeout. Now it's ok to check this only once in the beginning because the submit is non-blocking. Jira NVGPU-4548 Change-Id: Idf19a560ca58a4f7da776c420dc9c6299cd7f7e7 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2321760 (cherry picked from commit 5359a2180f13505f57c62b9f639344913716370a) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2324252 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Thomas Fleury	8ec4395e82	gpu: nvgpu: build flag for deterministic channel Add CONFIG_NVGPU_DETERMINISTIC_CHANNELS and fix preprocessor #ifdefs to allow compiling kernel mode submit without deterministic feature enabled. Jira NVGPU-4661 Change-Id: I4aa678715824e8981d39bd8db0c5ae61ef3a675c Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2310325 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vaibhav Kachore	bbb63c0a8c	gpu: nvgpu: remove "trace/events/gk20a.h" from QNX build - "include/trace/events/gk20a.h" file was having GPL2 license (which should not used for QNX code). This file was used for compiling linux userspace driver("libnvgpu-drv.so") and was used for unit testing on QNX. - This patch removes stubs in "include/trace/events/gk20a.h" file. (which were used for linux userspace driver.) - For QNX driver, "nvgpu_rmos/trace/events/gk20a.h" was used. This patch moves that file to "include/nvgpu/posix/trace_gk20a.h" and does relevant license change. This same file will be used for linux userspace driver. - This patch also creates a new file "include/nvgpu/trace.h" which selects proper trace file depending on the config. Bug 2802414 Change-Id: Icdfb251e5698073f986753a969e804161af3ecc5 Signed-off-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2286388 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Scott Long	52a4dd74e2	gpu: nvgpu: fix misra 18.4 violations This change eliminates MISRA Advisory Rule 18.4 violations in the following cases: * nvgpu_submit_append_gpfifo_user_direct() * nvgpu_submit_append_gpfifo_common() - use array-indexing to access gpfifo entry lists * gv11b_gr_intr_record_sm_error_state() - use array-indexing to access sm_error_states table Advisory Rule 18.4 states that the +, -, +=, and -= operators should not be applied to an expression of pointer type. JIRA NVGPU-3798 Change-Id: I736930e4ba09a88888b0ef48f62496c4082ea5a1 Signed-off-by: Scott Long <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2210173 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:05:52 -06:00
Shashank Singh	6fd0d972ae	nvgpu: gpu: include qnx_init unit in doxygen documentation -Include qnx_init unit in doxygen documentation. -Add documentation for gk20a_busy/idle and similar functions. -Remove must_check return value as misra already reports violation for that. Jira NVGPU-2571 Change-Id: I9573cb61865677944809dcc494d92f63cc6e0f58 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2176755 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:05:52 -06:00
Vedashree Vidwans	5fd301c61b	gpu: nvgpu: fix race for channel sync read/write CTS test dEQP-VK.api.object_management.max_concurrent.device_group crashes with invalid userspace memory access. Currently, nvgpu_submit_prepare_syncs() races with nvgpu_channel_clean_up_jobs() and this race condition is exposed when aggressive_sync_destroy_thresh is set to non-zero value. nvgpu_submit_prepare_syncs() gets ref for c->sync to submit job and releases channel sync_lock immediately. Meanwhile, nvgpu_worker_poll_work() triggers nvgpu_channel_clean_up_jobs(), which destroys ref'd c->sync pointer. Channel sync is deleted by nvgpu_channel_clean_up_jobs() only if aggressive_sync_destroy_thresh is non-zero. So, nvgpu_channel_clean_up_jobs() and nvgpu_submit_prepare_syncs() will race only in this scenario. Hence, if aggressive_sync_destroy_thresh value is non-zero, this patch protects channel's sync pointer by holding channel sync_lock during complete execution of nvgpu_submit_prepare_syncs(). Bug 2613870 Change-Id: I030d8df7af10d4ed86f921b5cf60de2b1d60e5d3 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2181360 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-08-28 17:44:15 -07:00
Vedashree Vidwans	83fea157a3	Revert "gpu: nvgpu: fix race for channel sync read/write" This reverts commit `e22d743a20`. Change-Id: I4ea0a8158030d2fb9700ef5b84f8d77e579c1025 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2182350 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-08-28 17:44:00 -07:00
Vedashree Vidwans	7bc3cdcf95	gpu: nvgpu: use vpr resize enabled API This patch adds nvgpu API in linux and posix to query vpr resize. The new API nvgpu_is_vpr_resize_enabled() is used in nvgpu_submit_channel_gpfifo(). Previously, if non-deterministic channel has timeout disabled and GPU cannot railgate on some platform, then channel doesn't power ref count and results in video freeze. To resolve non-determinstic channel job tracking needs to be enabled if vpr resize is supported or if GPU can railgate. Bug 200532122 Change-Id: Icfbff6253762b195b2f5955749343974b1a7a269 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2171093 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-08-28 14:24:19 -07:00
Seema Khowala	e22d743a20	gpu: nvgpu: fix race for channel sync read/write CTS test dEQP-VK.api.object_management.max_concurrent.device_group crashes with invalid userspace memory access. Currently, nvgpu_submit_prepare_syncs() races with nvgpu_channel_clean_up_jobs() and this race condition is exposed when aggressive_sync_destroy_thresh is set to non-zero value. nvgpu_submit_prepare_syncs() gets ref for c->sync to submit job and releases channel sync_lock. Meanwhile, nvgpu_worker_poll_work() triggers nvgpu_channel_clean_up_jobs(), which destroys ref'd c->sync pointer. This patch protects channel's sync pointer by holding channel sync_lock during complete execution of nvgpu_submit_prepare_syncs(). Bug 2613870 Change-Id: I6f3d48aff361d1cb38c30d2ce5de276d0c55fb6f Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2176929 Reviewed-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-08-20 11:46:02 -07:00
Sagar Kamble	a16cc2dde3	gpu: nvgpu: compile out vidmem from safety build Safety build does not support vidmem. This patch compiles out vidmem related changes - vidmem, dma alloc, cbc/acr/pmu alloc based on vidmem and corresponding tests like pramin, page allocator & gmmu_map_unmap_vidmem.. As vidmem is applicable only in case of DGPUs the code is compiled out using CONFIG_NVGPU_DGPU. JIRA NVGPU-3524 Change-Id: Ic623801112484ffc071195e828ab9f290f945d4d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2132773 GVS: Gerrit_Virtual_Submit Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-25 04:37:08 -07:00
Sagar Kadamati	d2444e85ed	gpu: nvgpu: compile-out debug unit debug unit is not need to for safety build, so compile out it JIRA NVGPU-3542 Change-Id: I60cc256a5659e72ae2e647ec4f1a810ba4aa959d Signed-off-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2133419 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-21 04:35:25 -07:00
Sagar Kamble	3f08cf8a48	gpu: nvgpu: rename feature Make and C flags Name the Make and C flag variables consistently wih syntax: CONFIG_NVGPU_<feature name> s/NVGPU_DEBUGGER/CONFIG_NVGPU_DEBUGGER s/NVGPU_CYCLESTATS/CONFIG_NVGPU_CYCLESTATS s/NVGPU_USERD/CONFIG_NVGPU_USERD s/NVGPU_CHANNEL_WDT/CONFIG_NVGPU_CHANNEL_WDT s/NVGPU_FEATURE_CE/CONFIG_NVGPU_CE s/NVGPU_GRAPHICS/CONFIG_NVGPU_GRAPHICS s/NVGPU_ENGINE/CONFIG_NVGPU_FIFO_ENGINE_ACTIVITY s/NVGPU_FEATURE_CHANNEL_TSG_SCHED/CONFIG_NVGPU_CHANNEL_TSG_SCHED s/NVGPU_FEATURE_CHANNEL_TSG_CONTROL/CONFIG_NVGPU_CHANNEL_TSG_CONTROL s/NVGPU_FEATURE_ENGINE_QUEUE/CONFIG_NVGPU_ENGINE_QUEUE s/GK20A_CTXSW_TRACE/CONFIG_NVGPU_FECS_TRACE s/IGPU_VIRT_SUPPORT/CONFIG_NVGPU_IGPU_VIRT s/CONFIG_TEGRA_NVLINK/CONFIG_NVGPU_NVLINK s/NVGPU_DGPU_SUPPORT/CONFIG_NVGPU_DGPU s/NVGPU_VPR/CONFIG_NVGPU_VPR s/NVGPU_REPLAYABLE_FAULT/CONFIG_NVGPU_REPLAYABLE_FAULT s/NVGPU_FEATURE_LS_PMU/CONFIG_NVGPU_LS_PMU s/NVGPU_FEATURE_POWER_PG/CONFIG_NVGPU_POWER_PG JIRA NVGPU-3624 Change-Id: I8b2492b085095fc6ee95926d8f8c3929702a1773 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2130290 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-11 09:46:24 -07:00
Debarshi Dutta	0eb0242bdd	gpu: nvgpu: rename public channel unit APIs Rename the public channel unit APIs to follow the convention of nvgpu_channel_*. gk20a_channel_deterministic_idle -> nvgpu_channel_deterministic_idle gk20a_channel_deterministic_unidle -> nvgpu_channel_deterministic_unidle gk20a_wait_until_counter_is_N -> nvgpu_channel_wait_until_counter_is_N nvgpu_gk20a_alloc_job -> nvgpu_channel_alloc_job Jira NVGPU-3248 Change-Id: I358d63d4e891f6d92c70efe887c07674bc0f9914 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2123398 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-23 02:19:38 -07:00
Debarshi Dutta	f39a5c4ead	gpu: nvgpu: rename gk20a_channel_* APIs Renamed gk20a_channel_* APIs to nvgpu_channel_* APIs. Removed unused channel API int gk20a_wait_channel_idle Renamed nvgpu_channel_free_usermode_buffers in os/linux-channel.c to nvgpu_os_channel_free_usermode_buffers to avoid conflicts with the API with the same name in channel unit. Jira NVGPU-3248 Change-Id: I21379bd79e64da7e987ddaf5d19ff3804348acca Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2121902 Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-21 09:26:16 -07:00
Debarshi Dutta	1dea88c6c7	gpu: nvgpu: Add NVGPU_CHANNEL_WDT flag NVGPU_CHANNEL_WDT feature is embedded within the NVGPU_CHANNEL_WDT flag to allow it to be compiled out for safety builds. Jira NVGPU-3012 Change-Id: I0ca54af9d7b1b8e01f4090442341eaaadca8e339 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2114480 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-16 23:28:13 -07:00
Seema Khowala	334f855ac4	gpu: nvgpu: channel MISRA fix for Rule 17.7 Check return value of below function and add void to ignore the return value update_gp_get Rename nvgpu_get_gp_free_count -> nvgpu_channel_update_gpfifo_get_and_get_free_count nvgpu_gp_free_count -> nvgpu_channel_get_gpfifo_free_count JIRA NVGPU-3388 Change-Id: I6e2265882c1f34e3bb47eaeac7a2c5a9fbe9b4eb Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2115784 Reviewed-by: Thomas Fleury <tfleury@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-15 16:47:14 -07:00
Seema Khowala	c2b3da8e47	gpu: nvgpu: channel MISRA fix for Rule 17.7 Change gk20a_free_priv_cmdbuf from int to void type Rename gk20a_free_priv_cmdbuf -> nvgpu_channel_update_priv_cmd_q_and_free_entry channel_gk20a_free_priv_cmdbuf -> nvgpu_channel_free_priv_cmd_q free_priv_cmdbuf -> nvgpu_channel_free_priv_cmd_entry JIRA NVGPU-3388 JIRA NVGPU-3248 Change-Id: I32bc5686a280f72c7bba4ab2d37782e29117f596 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2114971 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Thomas Fleury <tfleury@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-15 16:46:27 -07:00
Thomas Fleury	44f6c10947	gpu: nvgpu: submit MISRA fixes for Rule 5.7 Renamed local variable - sync_fence -> flag_sync_fence As "sync_fence" is already used to represent a type. Also, renamed for consistency: - fence_wait -> flag_fence_wait - fence_get -> flag_fence_get Jira NVGPU-3384 Change-Id: Ib40d068f0ebda985303a85a385f4123955d3b4c9 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2111613 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-06 17:55:42 -07:00
Thomas Fleury	47b7820cb6	gpu: nvgpu: submit MISRA fixes for Rule 17.7 Check return value of gk20a_channel_add_job, and clean up in case of failure. Jira NVGPU-3384 Change-Id: Ic818d8bcf97fef6360aedd7a2a0a5a7f6f69150f Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2111612 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-06 17:55:28 -07:00
Thomas Fleury	9f5ca49a07	gpu: nvgpu: submit MISRA fixes for Rule 15.7 Refactored if / else statements in nvgpu_submit_channel_gpfifo to avoid "else if" with no terminating "else" statement. Jira NVGPU-3384 Change-Id: If553901f418455d77c372fd1d7113553a21096e1 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2111611 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-06 17:55:13 -07:00
Thomas Fleury	8b32821634	gpu: nvgpu: submit MISRA fixes for Rule 14.3 (!c->deterministic) \|\| (nvgpu_is_enabled(g, NVGPU_CAN_RAILGATE) && !c->deterministic) is equivalent to (!c->deterministic) Remove second condition in nvgpu_submit_channel_gpfifo Jira NVGPU-3384 Change-Id: Icf3e460d4fe9d310d94a21895832bbfae595df28 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2111610 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-06 17:54:59 -07:00
Debarshi Dutta	17486ec1f6	gpu: nvgpu: rename tsg_gk20a and channel_gk20a structs rename struct tsg_gk20a to struct nvgpu_tsg and rename struct channel_gk20a to struct nvgpu_channel Jira NVGPU-3248 Change-Id: I2a227347d249f9eea59223d82f09eae23dfc1306 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2112424 GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-06 02:56:53 -07:00
Seema Khowala	170d7464d6	gpu: nvgpu: move fifo_gk20a.[ch] to hal/fifo Move fifo_gk20a struct to fifo.h Move fifo_gk20a.[ch] to hal/fifo Add missing includes for fifo subunits. JIRA NVGPU-2012 Change-Id: I8bf5402bd5a9f8ff9f6a818cee553b57e117f3bc Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2109012 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-02 23:40:42 -07:00
Seema Khowala	034d44311e	gpu: nvgpu: move profile related struct and func Add include/nvgpu/profile.h Move from fifo_gk20a.h to include/nvgpu/profile.h and rename fifo_profile_gk20a -> nvgpu_profile gk20a_fifo_profile_acquire -> nvgpu_profile_acquire gk20a_fifo_profile_release -> nvgpu_profile_release gk20a_fifo_profile_snapshot -> nvgpu_profile_snapshot JIRA NVGPU-2012 Change-Id: I4f9fde9f0ccdeedec62d1f612046be14db334a89 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2109010 Reviewed-by: Thomas Fleury <tfleury@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-02 23:40:09 -07:00
Thomas Fleury	7fb397b0b3	gpu: nvgpu: add format_gpfifo_entry HAL for pbdma Removed dependency on pbdma hw headers in ce2, cde and submit. Added the following HAL to format gpfifo entries: - pbdma.format_gpfifo_entry Jira NVGPU-1992 Jira NVGPU-1990 Change-Id: I322d6bcd832b0ea5bbe2c2871b8f96b2793d8a65 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2093502 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-19 13:54:56 -07:00
Seema Khowala	312f91f991	gpu: nvgpu: move fence_gk20a to common/fence Move gk20a/fence_gk20a.c to common/fence/fence.c Renamed gk20a_fence_from_semaphore -> nvgpu_fence_from_semaphore gk20a_fence_from_syncpt -> nvgpu_fence_from_syncpt gk20a_alloc_fence_pool -> nvgpu_fence_pool_alloc gk20a_free_fence_pool -> nvgpu_fence_pool_free gk20a_alloc_fence -> nvgpu_fence_alloc gk20a_init_fence -> nvgpu_fence_init gk20a_fence_put -> nvgpu_fence_put gk20a_fence_get -> nvgpu_fence_get gk20a_fence_wait -> nvgpu_fence_wait gk20a_fence_is_expired -> nvgpu_fence_is_expired gk20a_fence_install_fd -> nvgpu_fence_install_fd gk20a_fence_ops struct -> nvgpu_fence_ops struct gk20a_fence struct -> nvgpu_fence_type struct JIRA NVGPU-1982 Change-Id: Ife77b2c3c386ff4368683c78ca02f00c99cddb4b Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2093002 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-10 17:24:52 -07:00
Thomas Fleury	6009662fa5	gpu: nvgpu: move resetup_ramfc to common channel On gp10b, ramfc contains information related to syncpoint protection, which restricts the syncpoint increment operation to a safe set of syncpoints. This information must be updated when a syncpoint is assigned to a channel. Added the following ramfc HALs - ramfc.get_syncpt - ramfc.set_syncpt And replaced - fifo.resetup_ramfc With - channel.set_syncpt Use new ramfc HALs, move resetup_ramfc implementation from fifo to common channel code: - nvgpu_channel_set_syncpt NVGPU-1750 Change-Id: I036a0b7b2d9fd6ccd9f30094ae33e6c38a96e0cc Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075938 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-27 20:35:23 -07:00
Seema Khowala	9393e2a90a	gpu: nvgpu: rename timeout of channel struct to wdt Rename channel_gk20a_timeout to nvgpu_channel_wdt. Rename timeout variable of channel_gk20a struct to wdt. Rename ch_wdt_timeout_ms to ch_wdt_init_limit_ms. Rename gk20a_channel_timeout_* to nvgpu_channel_wdt_* JIRA NVGPU-1312 Change-Id: Ida78426cc007b53f3d407cf85428d15f7fe7518a Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077641 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 22:46:52 -07:00
Thomas Fleury	696d212718	gpu: nvgpu: move userd to separate unit Add userd unit under common/fifo Moved userd setup/cleanup from fifo: - nvgpu_userd_setup_sw - nvgpu_userd_cleanup_sw Moved common userd code from hals: - nvgpu_userd_init_slabs - nvgpu_userd_free_slabs - nvgpu_userd_init_channel Replaced the following hals - fifo.userd_gp_get - fifo.userd_gp_put - fifo.userd_pb_get - fifo.setup_userd - fifo.userd_entry_size With - userd.gp_get - userd.gp_put - userd.pb_get - userd.init_mem - userd.entry_size Also added the following hals - userd.setup_sw: init slabs and reserve userd gpu_va - userd.cleanup_sw: de-init slabs and free gpu_va - userd.setup_hw: setup writeback timeout Jira NVGPU-2713 Change-Id: Ide854a38531a3ce00e61045449ddd010c956bdeb Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2035116 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 06:25:55 -07:00

1 2

71 Commits