linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Philip Elcan	c02bccd6db	gpu: nvgpu: cond: use u32 for COND_WAIT timeout The type for the timeout parameter to the NVGPU_COND_WAIT and NVGPU_COND_WAIT_INTERRUPTIBLE macros was too weak. This updates these macros to require a u32 for the timeout. Users of the macros are updated to be compliant as necessary. This addresses MISRA 10.3 violations for implicit conversions of types of different size or essential type. JIRA NVGPU-1008 Change-Id: I12368dfa81b137c35bd056668c1867f03a73b7aa Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017503 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-21 10:24:24 -08:00
Seema Khowala	13f37f9c70	gpu: nvgpu: remove gk20a_is_channel_marked_as_tsg Use tsg_gk20a_from_ch to get tsg pointer for tsgid of a channel. For invalid tsgid, tsg pointer will be NULL Bug 2092051 Bug 2429295 Bug 2484211 Change-Id: I82cd6a2dc5fab4acb147202af667ca97a2842a73 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2006722 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-21 10:23:50 -08:00
Debarshi Dutta	9767366c60	gpu: nvgpu: add pbdma_status unit A new unit pbdma_status is added. The unit provides a HAL ops function pointer read_pbdma_status_info() to read and produce a struct of type nvgpu_pbdma_status_info. Additionally, the unit provides public APIs to retrieve data from the struct nvgpu_pbdma_status_info. Jira NVGPU-1311 Change-Id: Ic89c78703c3738b91be8d18ba970a591658d4022 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2019976 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-19 04:17:00 -08:00
Debarshi Dutta	061aa66adc	gpu: nvgpu: move engine specific functions to common/fifo The following changes are done in this patch. 1) gk20a_fifo_get_engine_info() is moved to common/fifo/engine.c and is renamed to gk20a_fifo_get_active_engine_info() to reflect accurately the purpose of the function. 2) move the definition of enum fifo_engine to <nvgpu/engines.h> and add the prefix NVGPU_ 3) move the following functions related to engines in fifo_gk20a.c to common/fifo/engines.c and replace their signature by adding the prefix nvgpu_engine and removing gk20a_fifo. gk20a_fifo_get_active_engine_info gk20a_fifo_engine_enum_from_type gk20a_fifo_get_engine_ids gk20a_fifo_is_valid_engine_id gk20a_fifo_get_gr_engine_id gk20a_fifo_act_eng_interrupt_mask gk20a_fifo_engine_interrupt_mask gk20a_fifo_get_all_ce_engine_reset_mask Jira NVGPU-1315 Change-Id: I63d9dcd905a0bebcc9a4c65776cf6ec7a0837acf Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2011298 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-15 09:44:19 -08:00
Konsta Holtta	93e15f9c43	gpu: nvgpu: rename redundant runlist names in HAL Drop the "runlist_" part in the runlist section of the HAL ops. For example: - old: g->ops.runlist.runlist_wait_pending - new: g->ops.runlist.wait_pending At the same time, drop the "fifo_" part from the function names. For example: - old: gk20a_fifo_runlist_wait_pending - new: gk20a_runlist_wait_pending Also rename eng_runlist_base_size to count_max. The size of the eng_runlist_base register array depicts the maximum possible number of runlists in the chip for which count_max is more descriptive. Jira NVGPU-1309 Change-Id: Ie9e94b9f65cd10d3e682d19954f240adb6e311be Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017403 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-14 18:52:29 -08:00
Debarshi Dutta	ddcdf364b7	gpu: nvgpu: use public APIs of engine_status_info unit nvgpu driver presently uses h/w functions to read and process the engine_status registers. H/w headers shouldn't be directly invoked by common code and should be called via HAL layer. This patch replaces the h/w headers with the APIs in the engine_status_info unit. Jira NVGPU-1315 Change-Id: I767a2b116b07cce4f4b587e6da8dd118afa27de5 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2005470 Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-13 14:34:03 -08:00
Debarshi Dutta	e60bae8ec4	gpu: nvgpu: add engine_status_info unit A new unit nvgpu_engine_status_info is added. The unit provides a HAL ops function pointer read_engine_status_info() to read and produce a struct of type nvgpu_engine_status_info. Additionally, the unit provides public APIs to retrieve data from the struct nvgpu_engine_status_info. Jira NVGPU-1315 Change-Id: I6c167c36081bee5c9a8db51d3467c8f5f02c2685 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2003886 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-13 14:34:00 -08:00
Konsta Holtta	38c548a39c	gpu: nvgpu: Add channel.reset_faulted HAL Add a HAL op for resetting the eng_faulted and pbdma_faulted states on a channel. This used to be a local feature in fifo_gv11b.c; the HAL is defined for all chips from gv11b onwards. Jira NVGPU-1307 Change-Id: I120a59c429851cc69e712ddd5b06a4b3d16c06c9 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017269 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:37 -08:00
Konsta Holtta	44e4d69734	gpu: nvgpu: add channel.force_ctx_reload HAL Isolate the write to ccsr_channel_force_ctx_reload behind a HAL op. Jira NVGPU-1307 Change-Id: Iaef7d740f4a89e4a45c7de28f001a7dea98ce066 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017268 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:28 -08:00
Konsta Holtta	9457a5ea91	gpu: nvgpu: add eng_faulted to channel HAL for gv11b+ The ccsr_channel_eng_faulted field exists from Volta onwards. Implement the read_state HAL op for those chips, and store that bit as a boolean in the channel state info. Jira NVGPU-1307 Change-Id: Ie997892f2d3db0725496661a4d3083e7396894cc Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017267 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:18 -08:00
Konsta Holtta	cd4b2f642c	gpu: nvgpu: add HAL for reading ccsr_channel Refactor read accesses to the ccsr_channel register for channel state to be done via a channel HAL op for all chips. A new op called read_state is added for this; information needed by other units is collected in a new struct nvgpu_channel_hw_state. Jira NVGPU-1307 Change-Id: Iff9385c08e17ac086d97f5771a54b56b2727e3c4 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017266 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:09 -08:00
Konsta Holtta	7189630e7c	gpu: nvgpu: drop fifo_ in channel HAL names Now that the moved HAL ops from fifo are in channel, rename the implementations to match. Jira NVGPU-1307 Change-Id: I7b9336f506c9e71bcd0af98886216958bd6695eb Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017264 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:56 -08:00
Konsta Holtta	5cde4c2140	gpu: nvgpu: move chip specific channel reg ops to common Extract out the HAL ops' implementation that now belongs to the channel unit. This unit is responsible for channel register accesses and the like (ccsr_*). Rename channel_gm20b_bind to gm20b_fifo_channel_bind to match with the rest of the naming. Same with channel_gv11b_unbind. Jira NVGPU-1307 Change-Id: I58b9d96dbdaf36bdb163a5729544a41faec828ab Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017262 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:43 -08:00
Konsta Holtta	c330d8fd98	gpu: nvgpu: add channel HAL section for ccsr_* Split out ops that belong to channel unit to a new section called channel. Channel is a broad concept; this includes just the code that accesses channel registers (ccsr_*). This is effectively just renaming; the implementation still stays put. The word "channel" is also dropped from certain HAL entries to avoid redundancy (e.g., channel.disable_channel -> channel.disable). fifo.get_num_fifos gets an entirely new name: channel.count. Jira NVGPU-1307 Change-Id: I9a08103e461bf3ddb743aa37ababee3e0c73c861 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017261 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:34 -08:00
Philip Elcan	fa81cf9000	gpu: nvgpu: fifo: cleanup MISRA 10.3 violations MISRA 10.3 prohibits assigning of objects of different size or essential type. This fixes a number of violations in the common/fifo code. JIRA NVGPU-1008 Change-Id: I138c27eb86f6e0f9481c39a94d6632e2b4360af8 Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2009940 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 12:55:27 -08:00
Konsta Holtta	49506f257e	gpu: nvgpu: split update_runlist HAL API in two A comment for gk20a_fifo_update_runlist() says: /* add/remove a channel from runlist special cases below: runlist->active_channels will NOT be changed. (ch == NULL && !add) means remove all active channels from runlist. (ch == NULL && add) means restore all active channels on runlist. */ Those special cases call for a new function, so add that. Delete the update_runlist HAL op and add update_for_channel (like update_runlist without the special cases) and reload (no channel to add or remove, just the special cases). While at it, rename gk20a_fifo_update_runlist_ids to nvgpu_runlist_reload_ids. It's common across chips and does what the reload HAL does but for a list of several IDs. Jira NVGPU-1922 Change-Id: I9a99ab03a636a1214c021faad359d2b304a9472f Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2013058 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-08 12:56:09 -08:00
Adeel Raza	d828e013db	gpu: nvgpu: common: MISRA rule 15.6 fixes MISRA rule 15.6 requires that all if/else/loop blocks should be enclosed by brackets. This patch adds brackets to single line if/else/loop blocks in the common directory. JIRA NVGPU-775 Change-Id: I0dfb38dbf256d49bc0391d889d9fbe5e21da5641 Signed-off-by: Adeel Raza <araza@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2011655 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-05 19:23:47 -08:00
Deepak Nibade	254253732c	gpu: nvgpu: add new unit for GR subcontext Add new unit common/gr/subctx.c to manage GR subcontext This unit provides interfaces to allocate/free/load GR subcontext Add new header file include/nvgpu/gr/subctx.h to declare all the interfaces. Right now channel_gk20a structure directly includes a nvgpu_mem for context header. Declare a new structure nvgpu_gr_subctx for subcontext and include this from channel_gk20a Make all necessary changes to refer ctx_header from subctx instead of directly referencing it from channel Jira NVGPU-1613 Change-Id: I9eb1ee8f26fa88d2881f9b294935b65e9cbcc9b4 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1990129 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-02 03:03:43 -08:00
Thomas Fleury	13afcc24c3	gpu: nvgpu: non abortable TSG for vidmem-clear When an engine faults due to unbound instance block, all active TSGs are currently aborted. This includes the TSG used by vidmem-clear task to clear vidmem buffers. From this point nvgpu_vidmem_clear cannot submit jobs anymore. Define TSG in MM CE context as non-abortable, and skip it when aborting active TSGs. Bug 2486146 Change-Id: I221259aec468e8ee3a24e80fab8d8fb7ee8607b0 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2008954 (cherry picked from commit 6f2444dc5e128aa2b870796bd1e9dee7853f90af) Reviewed-on: https://git-master.nvidia.com/r/2008942 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-01 14:55:23 -08:00
Konsta Holtta	8854cfafe1	gpu: nvgpu: simplify update_runlist logic gk20a_fifo_update_runlist_locked() and the vgpu counterpart do three things: 1. find out whether there's a channel to add or remove (if not, the whole runlist is just reconstructed or cleared), 2. reconstruct the runlist format for hardware (or the vgpu message), 3. actually update the runlist to hw and maybe wait for finish. Split out the two first operations to separate functions to make the code easier to understand. Now it's also clearer that the "add" parameter behaves completely differently depending on whether the channel pointer is NULL or not. Also ignore (with a warning) channels not bound to a tsg. We shouldn't get runlist updates on such channels. This simplifies the control flow a bit. Jira NVGPU-1309 Jira NVGPU-1922 Change-Id: I478c33eb2bd154c05a6c8c4148e4fea528a39a3e Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2007473 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-01 12:38:26 -08:00
Seema Khowala	013ca60edd	gpu: nvgpu: remove code for ch not bound to tsg - Remove handling for channels that are no more bound to tsg as channel could be referenceable but no more part of a tsg - Use tsg_gk20a_from_ch to get pointer to tsg for a given channel - Clear unhandled gr interrupts Bug 2429295 JIRA NVGPU-1580 Change-Id: I9da43a2bc9a0282c793b9f301eaf8e8604f91d70 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972492 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-01 11:58:57 -08:00
Seema Khowala	aacc33bb47	gpu: nvgpu: do not use raw spinlock for ch->timeout.lock With PREEMPT_RT kernel, regular spinlocks are mapped onto sleeping spinlocks (rt_mutex locks), and raw spinlocks retain their behaviour. Schedule while atomic can occur in gk20a_channel_timeout_start, as it acquires ch->timeout.lock raw spinlock, and then calls functions that acquire ch->ch_timedout_lock regular spinlock. Bug 200484795 Change-Id: Iacc63195d8ee6a2d571c998da1b4b5d396f49439 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2004100 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-28 12:44:00 -08:00
Konsta Holtta	4e85ebc05f	gpu: nvgpu: use channel pointer for update_runlist A naked channel ID does not carry good information about the channel validity and is a very low level construct for an API of this level. Refactor the runlist updating fifo APIs to take a channel pointer. While at it, delete the channel and wait_for_finish parameters from gk20a_fifo_update_runlist_ids() - the only caller is suspend and resume and the parameters were always null for channel and true for wait. Jira NVGPU-1309 Jira NVGPU-1737 Change-Id: Ied350bc8e482d8e311cc708ab0c7afdf315c61cc Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997744 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-25 11:44:47 -08:00
Konsta Holtta	7439449c5c	gpu: nvgpu: move runlist base and entry size hal ops Avoid including the HW headers directly in the HAL listings: add indirection functions for the two ops that were naked: - runlist.eng_runlist_base_size - runlist.runlist_entry_size GV100 gets a new fifo HAL file as base_size is the first one (and currently the only one) of GV100-specific ops. NVGPU-1309 Change-Id: Idf28b5e26c798457132ef595fa55c65bcddb1b31 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997826 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:58 -08:00
Konsta Holtta	cdfa78e91d	gpu: nvgpu: move set_runlist_state declaration The function gk20a_fifo_set_runlist_state was moved to another place some time ago but the declaration didn't follow the implementation move. Move it from fifo_gk20a.h to runlist.h. Jira NVGPU-1309 Change-Id: Ib939a5243cee4be1c1092a553cb81b81adc6e5ce Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997825 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:49 -08:00
Konsta Holtta	237cee5997	gpu: nvgpu: move chip specific runlist code to common Extract out the HAL ops' implementation that now belongs to the runlist unit. Jira NVGPU-1309 Change-Id: I66185de0ddace1728da5f55ae11daa0b752bebf1 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997824 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:40 -08:00
Konsta Holtta	6fda25e958	gpu: nvgpu: move runlist HAL ops to separate section Split out ops that belong to runlist unit to a new section called runlist. This is effectively just renaming; the implementation still stays put. Jira NVGPU-1309 Change-Id: Ib928164f8008f680d9cb13c969e3304ef727abba Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997823 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:31 -08:00
Nicolas Benech	6978943621	gpu: nvgpu: gk20a_disable_tsg to return void gk20a_disable_tsg was always returning 0. This patch changes it to return void, thus fixing a number of MISRA violations. JIRA NVGPU-677 Change-Id: I5be8d1d8eaeb36da44653a60e57259ccffc4fea0 Signed-off-by: Nicolas Benech <nbenech@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1995004 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-23 17:23:57 -08:00
Vinod G	c0a2f356c4	gpu: nvgpu: pmu code fix for VDK dgpu vdk does not have pmu support. pmu variables do not get initialized in fmodel. Add is_pmu_supported check before nvgpu_pmu_mutex_acquire call. JIRA NVGPU-1564 Change-Id: Ieb683d3092b5289a9959c8811c25782074d19804 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1992193 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-15 23:04:42 -08:00
Scott Long	4ba92354c0	gpu: nvgpu: container_of() changes to tsg/fence code The container_of() macro used in nvgpu produces the following set of MISRA required rule violations: * Rule 11.3 : A cast shall not be performed between a pointer to object type and a pointer to a different object type. * Rule 11.8 : A cast shall not remove any const or volatile qualification from the type pointed to be a pointer. * Rule 20.7 : Expressions resulting from the expansion of macro parameters shall be enclosed in parentheses Using the same modified implementation of container_of() as that used in the nvgpu_list_node/nvgpu_rbtree_node routines eliminates the Rule 11.8 and Rule 20.7 violations and exchanges the Rule 11.3 violation with an advisory Rule 11.4 violation. This patch uses that same equivalent implementation in two new (static) functions that are used to replace references to container_of() references in tsg and fence code: * tsg_gk20a_from_ref * gk20a_fence_from_ref It should be noted that replacement functions still contain potentially dangerous (and non-MISRA compliant code) and that it is expected that deviation requests will be filed for the new advisory rule violations accordingly. JIRA NVGPU-782 Change-Id: Ib5f3b8c7b18b92af8237e82ef5ee42d39c0381e5 Signed-off-by: Scott Long <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1993503 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-14 12:42:54 -08:00
Deepak Nibade	1c17ae310c	gpu: nvgpu: add new unit for GR context Add new unit common/gr/ctx.c to manage GR context This unit provides interfaces to allocate/free/map/unmap GR context, patch context, pm context, ctxsw {preempt/spill/betacb/pagepool/rtvcb} buffers. It also provides APIs to set size of above buffers Add new header file include/nvgpu/gr/ctx.h to declare all the interfaces. Move nvgpu_gr_ctx, patch_desc, pm_ctx_desc, zcull_ctx_desc structures to this unit Add new structure nvgpu_gr_ctx_desc to hold context description parameters. For now we add sizes of all the buffers here. Add this structure to gr_gk20a for global reference Remove gr_gp10b_alloc_buffer() since it is no longer used Rename g->ops.gr.alloc_gfxp_rtv_cb() to g->ops.gr.init_gfxp_rtv_cb() since this HAL now only sets the size of rtvcb ctxsw buffer Remove gr->ctx_vars.buffer_size and gr->ctx_vars.buffer_total_size since they were redundant. We already have gr->ctx_vars.golden_image_size to denote golden image size Jira NVGPU-1527 Change-Id: I8847b347f80235209dd5e28d979e79984ab85408 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1987702 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-09 10:46:29 -08:00
Konsta Holtta	d1d1f56c49	gpu: nvgpu: skip nvgpu syncpoint in usermode submits The nvgpu managed syncpoint is not needed for anything if a channel uses usermode submits; in that case the channel would allocate an user-managed syncpoint and use that. Create the channel sync in nvgpu_channel_setup_bind() only if usermode submit is not enabled. Bug 200466905 Change-Id: I976f4b4fd0c3131cb310c72b286329fb16f1f29a Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1990270 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-09 09:35:18 -08:00
Konsta Holtta	8979a97af3	gpu: nvgpu: abstract out timeout rewinding The channel timeout ends up in a strange state during timeout handling for a brief moment; it can become stopped and started again, and the timeout lock is released in the middle. Add a more explicit rewind function to reset the timeout to start if it's active. The active check allows to use this from gk20a_channel_timeout_restart_all_channels(), so that's also modified. Also replace the return statements with more readable control flow in gk20a_channel_timeout_handler(). Change-Id: Ia7d67242dfc149ace1f4f841a837e90b6c985308 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1989327 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-08 08:24:55 -08:00
Sai Nikhil	e824ea0963	gpu: nvgpu: common: MISRA Rule 10.1 fixes MISRA rule 10.1 mandates that the correct data types are used as operands of operators. For example, only unsigned integers can be used as operands of bitwise operators. This patch fixes rule 10.1 vioaltions for drivers/gpu/nvgpu/common. JIRA NVGPU-777 JIRA NVGPU-1006 Change-Id: I53fe750f1b41816a183c595e5beb7bd263c27725 Signed-off-by: Sai Nikhil <snikhil@nvidia.com> Signed-off-by: Adeel Raza <araza@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971221 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-06 19:24:58 -08:00
Adeel Raza	c961b7ed1d	nvgpu: fifo: fix invalid ID macros MISRA rule 10.1 prohibits using signed values with bitwise operators. Make fifo invalid ID macros compliant with this MISRA rule. Also use these macros in source code instead of hardcoded numbers to make the code more readable. JIRA NVGPU-1006 Change-Id: I2f336d1decbc53b08f93587f2e00ea2cce47f72b Signed-off-by: Adeel Raza <araza@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1983700 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-06 19:24:13 -08:00
Konsta Holtta	11c0c1ad89	gpu: nvgpu: unify vgpu runlist init Split out native-specific engine info collection out of nvgpu_init_runlist() so that it only contains common code. Call this common function from vgpu code that ends up being identical. Jira NVGPU-1309 Change-Id: I9e83669c84eb6b145fcadb4fa6e06413b34e1c03 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978060 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:52 -08:00
Konsta Holtta	2f51d7c5ed	gpu: nvgpu: reorder runlist enable/disable Move gk20a_fifo_set_runlist_state() to common and move gk20a_tsg_{enable,disable}_sched() to be part of tsg. Jira NVGPU-1309 Change-Id: I16ffe7f9f97249b5ac0885bba56510847bb6858b Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978059 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:43 -08:00
Konsta Holtta	e05c0d13a0	gpu: nvgpu: add runlist unit to common Extract non-chip-specific code that manages the runlists (init, update, reschedule etc.) to a new file in the common directory. Move the declarations to a new matching runlist.h header. Jira NVGPU-1309 Change-Id: I3c7e0032899516487037f47ddc9a7e7aa4b0b33a Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978058 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:34 -08:00
Seema Khowala	13aed4da44	gpu: nvgpu: remove log_fn prints in _gk20a_channel_from_id Remove nvgpu_log_fn for _gk20a_channel_from_id as enabing log_fn prints during debugging become very noisy due to these prints. Change-Id: I52ef193d13af87924dbde59a55c892e98e95bc85 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1982263 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-02 08:35:28 -08:00
Ranjanikar Nikhil Prabhakarrao	f0762ed483	gpu: nvgpu: add speculative barrier Data can be speculativerly stored and code flow can be hijacked. To mitigate this problem insert a speculation barrier. Bug 200447167 Change-Id: Ia865ff2add8b30de49aa970715625b13e8f71c08 Signed-off-by: Ranjanikar Nikhil Prabhakarrao <rprabhakarra@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972221 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-30 22:26:01 -08:00
Philip Elcan	90024cb73a	gpu: nvgpu: misc MISRA 14.4 fixes This fixes a few lingering MISRA Rule 14.4 violations. Rule 14.4 requires that the condition of an if statement be a boolean. JIRA NVGPU-1022 Change-Id: Ib6293e00e0436fceee9f7bf0ada1b6ac01a82faa Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975424 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-19 11:24:42 -08:00
Debarshi Dutta	0188b93e30	gpu: nvgpu: move gk20a_fifo_recover_tsg into tsg unit gk20a_fifo_recover_tsg does high-level software calls and invokes gk20a_fifo_recover. This function belongs to the tsg unit and is moved to tsg.c file. Also, the function is renamed to nvgpu_tsg_recover. Jira NVGPU-1237 Change-Id: Id1911fb182817b0cfc47b3219065cba6c4ca507a Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970034 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:55:07 -08:00
Debarshi Dutta	fb114f8fda	gpu: nvgpu: move gk20a_fifo_recover_ch to channel unit gk20a_fifo_recover_ch does high-level calls and invokes gk20a_fifo_recover. This function belongs to the channel unit and is moved to the file channel.c. Also, the function is renamed to nvgpu_channel_recover. Jira NVGPU-1237 Change-Id: I31890f85fdb2c42648cc063dd9c4e7e35930dcef Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970033 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:58 -08:00
Debarshi Dutta	7f58347ed9	gpu: nvgpu: move tsg functions to common Any tsg specific functions that does high-level software-centric operations below to the TSG unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/tsg.c and also rename them to use the prefix nvgpu_tsg_* gk20a_fifo_set_ctx_mmu_error_tsg gk20a_fifo_abort_tsg gk20a_fifo_error_tsg gk20a_fifo_check_tsg_ctxsw_timeout Jira NVGPU-1237 Change-Id: I4e3da821a878d4b4a0a0b53fbb7f4c10f135f58d Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1934299 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:26 -08:00
Debarshi Dutta	57f03e3a20	gpu: nvgpu: move channel functions to common Any channel specific functions having high-level software-centric operations belong to the channel unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/channel.c. Also, rename the functions to use the prefix nvgpu_channel_*. gk20a_fifo_set_ctx_mmu_error_ch gk20a_fifo_error_ch gk20a_fifo_check_ch_ctxsw_timeout Jira NVGPU-1237 Change-Id: Id6b6d69bbed193befbfc4c30ecda1b600d846199 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1932358 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:17 -08:00
Richard Zhao	2992990431	gpu: nvgpu: separate common tsg open/release functions The common functions are shared with RM server. When add new variables to struct tsg_gk20a, it won't have to add init code in RM server. Bug 200473570 Change-Id: Ic12337ac8834599e23056d4c8bdb7ece9664f68e Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971838 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 16:05:57 -08:00
Thomas Fleury	7e68e5c83d	gpu: nvgpu: userd slab allocator We had to force allocation of physically contiguous memory for USERD in nvlink case, as a channel's USERD address is computed as an offset from fifo->userd address, and nvlink bypasses SMMU. With 4096 channels, it can become difficult to allocate 2MB of physically contiguous sysmem for USERD on a busy system. PBDMA does not require any sort of packing or contiguous USERD allocation, as each channel has a direct pointer to that channel's 512B USERD region. When BAR1 is supported we only need the GPU VAs to be contiguous, to setup the BAR1 inst block. - Add slab allocator for USERD. - Slabs are allocated in SYSMEM, using PAGE_SIZE for slab size. - Contiguous channels share the same page (16 channels per slab). - ch->userd_mem points to related nvgpu_mem descriptor - ch->userd_offset is the offset from the beginning of the slab - Pre-allocate GPU VAs for the whole BAR1 - Add g->ops.mm.bar1_map() method - gk20a_mm_bar1_map() uses fixed mapping in BAR1 region - vgpu_mm_bar1_map() passes the offset in TEGRA_VGPU_CMD_MAP_BAR1 - TEGRA_VGPU_CMD_MAP_BAR1 is called for each slab. Bug 2422486 Bug 200474793 Change-Id: I202699fe55a454c1fc6d969e7b6196a46256d704 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:10 -08:00
Sai Nikhil	303fc7496c	gpu: nvgpu: common: fix MISRA Rule 10.4 Violations MISRA Rule 10.4 only allows the usage of arithmetic operations on operands of the same essential type category. Adding "U" at the end of the integer literals or casting operands to have same type of operands when an arithmetic operation is performed. This fixes violations where an arithmetic operation is performed on signed and unsigned int types. JIRA NVGPU-992 Change-Id: I27e3e59c3559c377b4bd3cbcfced90fdf90350f2 Signed-off-by: Sai Nikhil <snikhil@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1921459 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 10:26:16 -08:00
Debarshi Dutta	9abe9fe062	gpu: nvgpu: replace input param chid with pointer to channel preempt_channel needs to use the channel to pass it to other public functions, get access to a tsg etc. This qualifies it to take a pointer to a channel as an input parameter instead of a chid. Increment the channel ref counter using the function gk20a_channel_from_id in functions where we get the chid from the h/w registers directly. Once the prempt_channel function call is done, use a gk20a_channel_put on the referenced channel. Jira NVGPU-1461 Change-Id: I6c87c8104cfcb418d468c8c590087fd4aeabf4bd Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1963200 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-05 21:55:10 -08:00
Debarshi Dutta	e5bebd880f	gpu: nvgpu: replace tsgid input variable with pointer to a struct tsg_gk20a replace tsgid with a pointer to a struct tsg_gk20a in the function gk20a_fifo_tsg_abort(). gk20a_fifo_tsg_abort needs to enumerate through all the channels within the tsg as well as pass the tsg pointer to other functions, qualifying the need to use a pointer instead as an input parameter. Jira NVGPU-1461 Change-Id: I59cec05d5d778f733d0c3e9ffadf46e74e249080 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1956567 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 08:14:48 -08:00

... 7 8 9 10 11

541 Commits