Commit Graph

105 Commits

Author SHA1 Message Date
Seema Khowala
5222d0ff4f gpu: nvgpu: do not do timeout_debug_dump for non fifo_error_idle_timeout
Any recovery that goes through the gk20a_fifo_recover path, e.g. a gr
error, an mmu fault, or any recovery that also involves engine recovery,
will still produce the full debug dump. This change only skips the debug
dump for force-reset channels and pbdma interrupts that do not involve
engine recovery. For FIFO_ERROR_IDLE_TIMEOUT error notifiers that
involve tsg recovery only, the debug dump happens only if
timeout_debug_dump is set. timeout_debug_dump defaults to true
but can be changed using NVGPU_IOCTL_CHANNEL_SET_TIMEOUT_EX.
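
A minimal sketch of the intended decision, with a hypothetical helper and
parameter names (not the actual driver code):

    /* Hedged sketch: the full dump is skipped only for idle-timeout
     * notifiers handled by tsg-only recovery, when the channel opted
     * out via NVGPU_IOCTL_CHANNEL_SET_TIMEOUT_EX. */
    static bool recovery_should_debug_dump(bool is_idle_timeout_notifier,
                                           bool timeout_debug_dump)
    {
            if (is_idle_timeout_notifier) {
                    return timeout_debug_dump; /* defaults to true */
            }
            return true; /* engine recovery keeps the full dump */
    }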

Bug 2092051

Change-Id: Ibbf3cd2c44c586d9deb9e61ffbf37945b8d9e428
Signed-off-by: Seema Khowala <seemaj@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2033068
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-07 15:14:24 -08:00
Thomas Fleury
b64ee64fa7 gpu: nvgpu: do not free individual runlists
gk20a_fifo_delete_runlist was invoking nvgpu_kfree for each
runlist, but active runlists are now stored in the
g->active_runlist_info array.

Remove the nvgpu_kfree calls for individual runlists.
Also clear the active_runlist_info and num_runlists fields in
gk20a_fifo_delete_runlist.

Bug 2470115
Bug 2522374

Change-Id: Ie678c16af31ed8345ca0f015c17d61a3965b924d
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2030970
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-07 11:46:25 -08:00
Thomas Fleury
70453a8606 Revert "Revert "gpu: nvgpu: allocate only active runlists""
This reverts commit f67bc51e51.

Currently a fifo_runlist_info_gk20a structure is allocated and
initialized for each possible runlist. But only a few runlists
are actually used.

Skip allocation and initialization of inactive runlists. Info for
active runlists is stored in the active_runlist_info array. If a
runlist is active, then runlist_info[runlist_id] points to one
entry in active_runlist_info. Otherwise, runlist_info[runlist_id]
is NULL.

Operations that used to walk through all runlists are modified
to walk through active runlists only.
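
A rough sketch of the pointer scheme described above; the struct and field
layout here is illustrative, not the exact driver definition:

    struct runlist_info { u32 runlist_id; /* ... */ };

    struct fifo {
            struct runlist_info *active_runlist_info; /* one entry per active runlist */
            struct runlist_info **runlist_info;       /* indexed by runlist_id */
            u32 num_runlists;                         /* number of active runlists */
    };

    static struct runlist_info *runlist_get(struct fifo *f, u32 runlist_id)
    {
            /* NULL when the runlist with this id is not active */
            return f->runlist_info[runlist_id];
    }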

Bug 2470115
Bug 2522374

Change-Id: I98253ebebb4b1ba5957b57329820b94444b9d41b
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2030409
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-07 11:46:15 -08:00
Thomas Fleury
c23738969d Revert "Revert "gpu: nvgpu: array of pointers to runlists""
This reverts commit ade1d50cbe.

Currently a fifo_runlist_info_gk20a structure is allocated and
initialized for each possible runlist. But only a few runlists
are actually used.

Use an array of pointers to runlists in fifo_gk20a. The array
keeps the existing indexing by runlist_id. In this patch a context
is still allocated for each possible runlist, but a follow-up
patch will allow skipping context allocation for inactive
runlists.

Bug 2470115
Bug 2522374

Change-Id: I0deb6981bc6f5152bdf121f0a44429748aa14687
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2030407
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-07 11:45:59 -08:00
Debarshi Dutta
675a2b6858 gpu: nvgpu: added non-functional changes to engines unit
The following changes are made in this patch.

1) The nvgpu driver incorrectly uses u32 to store enum values in some
functions. Replace those with the correct type, enum nvgpu_fifo_engine.

2) Change the parameter type in nvgpu_engine_get_ids from engine_id[]
to *engine_ids.

3) Rename some functions to remove redundant characters and make
the names shorter.

4) Remove the initialization of enum nvgpu_fifo_engine in functions
where a value is assigned before direct access.

Jira NVGPU-1315

Change-Id: Ic65b40c9cb1e90ad278cb36a00e1c9de51724f27
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2020230
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-06 04:45:20 -08:00
Nicolas Benech
ee6ef2a719 gpu: nvgpu: resolve MISRA 17.7 for WARN_ON
MISRA Rule-17.7 requires the return value of all functions to be used.
The fix is either to use the return value or change the function to
return void. This patch ensures that WARN and WARN_ON always return
void, and introduces a new nvgpu_do_assert construct to trigger the
equivalent of WARN_ON(true) so that the stack can be dumped (depending
on OS support).
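
A hedged sketch of a void-returning wrapper built around the new
nvgpu_do_assert (the wrapper name and exact signatures are hypothetical):

    static inline void warn_on_void(bool cond)
    {
            if (cond) {
                    /* equivalent of WARN_ON(true); dumps the stack
                     * where the OS supports it */
                    nvgpu_do_assert();
            }
    }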

JIRA NVGPU-677

Change-Id: Ie2312c5588ceb5b1db825d15a096149b63b69af4
Signed-off-by: Nicolas Benech <nbenech@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2018706
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-05 11:14:46 -08:00
Mahantesh Kumbar
4bb9b0b987 gpu: nvgpu: use support_ls_pmu flag to check LS PMU support
Currently the PMU support check is done with multiple methods,
which makes it hard to know the status of PMU support.

Replace these methods with the support_ls_pmu flag; the flag is
updated at init time based on platform/chip specific settings
to reflect the PMU support status.

Clean up the flag checks against platform-specific PMU members in
multiple places and move the checks into public functions.

JIRA NVGPU-173

Change-Id: Ief2c64250d1f78e3b054203be56499e4d1d9b046
Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2024024
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-03-04 03:33:16 -08:00
Thomas Fleury
ade1d50cbe Revert "gpu: nvgpu: array of pointers to runlists"
This reverts commit 5fdda1b075.

Bug 2522374

Change-Id: Icb5e2181b056dc2247291c7f0e47d46c29095286
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2030293
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: Hoang Pham <hopham@nvidia.com>
2019-02-28 17:51:37 -08:00
Thomas Fleury
f67bc51e51 Revert "gpu: nvgpu: allocate only active runlists"
This reverts commit 45fa0441f7.

Bug 2522374

Change-Id: Icb80b7a31c7588a269850a3768ab0238dbec67b1
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2030292
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Hoang Pham <hopham@nvidia.com>
2019-02-28 17:51:22 -08:00
Thomas Fleury
45fa0441f7 gpu: nvgpu: allocate only active runlists
Currently a fifo_runlist_info_gk20a structure is allocated and
initialized for each possible runlist. But only a few runlists
are actually used.

Skip allocation and initialization of inactive runlists.
Active runlists info is stored in the active_runlist_info array.
If a runlist is active, then runlist_info[runlist_id] points to
one entry in active_runlist_info. Otherwise, runlist_info[runlist_id]
is NULL.

Operations that used to walk through all runlists are modified to
walk through active runlists only.

Bug 2470115

Change-Id: Icd10281dc904bdee581ebc9cfeb662018ecca121
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2025385
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-27 17:54:54 -08:00
Thomas Fleury
5fdda1b075 gpu: nvgpu: array of pointers to runlists
Currently a fifo_runlist_info_gk20a structure is allocated and
initialized for each possible runlist. But only a few runlists
are actually used.

Use an array of pointers to runlists in fifo_gk20a. The array
keeps the existing indexing by runlist_id. In this patch a context
is still allocated for each possible runlist, but a follow-up
patch will allow skipping context allocation for inactive
runlists.

Bug 2470115

Change-Id: I1615043cea84db35a270ade64695d51f85c1193a
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2025203
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-27 17:54:37 -08:00
Debarshi Dutta
8db1955d74 gpu: nvgpu: split semaphore.c file into multiple units
The file semaphore.c is now split into 4 units, namely
semaphore, semaphore_hw, semaphore_pool and semaphore_sea.

Each of the above units now has its own compilation unit under
common/semaphore/. The public APIs corresponding to each unit are
present in include/nvgpu/semaphore.h. The dependency graph of the
units below is as follows, where '->' indicates that the left side
depends on the right side.

semaphore -> semaphore_hw -> semaphore_pool -> semaphore_sea

Some of the other major changes made in this patch are as follows
  i) Renamed some of the functions.
  ii) Some functions are changed from private to public.
  iii) The public header for semaphore contains only declarations of the
       corresponding structs as opaque structures.
  iv) Constructed a private header to contain internal functions common
      to all the units and struct definitions corresponding to each unit.
  v)  Added new functions to provide access to internal members of the
      units.

Jira NVGPU-2076

Change-Id: I6f111647ba9a9a9f8ef9c658f316cd5d6276c703
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2022782
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-27 12:54:15 -08:00
Seema Khowala
2c0933de05 gpu: nvgpu: rename ch_timedout to unserviceable
ch_timedout is not a good variable name for the broken and
unusable state of the channel. Rename ch_timedout to
unserviceable.

Bug 2092051
Bug 2429295

Change-Id: I633eaff61928d5ef9836dcdc162b07e7a5e03881
Signed-off-by: Seema Khowala <seemaj@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1996865
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-22 20:21:37 -08:00
Thomas Fleury
8610ae5fdc gpu: nvgpu: skip buffer allocation for unused runlists
Currently, 2x 1 MB buffers are allocated per possible
runlist (which totals 26 MB in the GV100 case), but only
a few runlists are actually used.

Skip runlist buffer allocation for unused runlists.

Bug 2470115

Change-Id: Ifc9a36c38d302ca758d1fe99d293a1bfbde85ac7
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2024279
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Seema Khowala <seemaj@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-22 15:24:11 -08:00
Philip Elcan
c02bccd6db gpu: nvgpu: cond: use u32 for COND_WAIT timeout
The type for the timeout parameter to the NVGPU_COND_WAIT and
NVGPU_COND_WAIT_INTERRUPTIBLE macros was too weak. This updates these
macros to require a u32 for the timeout.

Users of the macros are updated to be compliant as necessary.

This addresses MISRA 10.3 violations for implicit conversions of types
of different size or essential type.
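
A usage sketch under the new signature; the wait queue and condition are
illustrative, only the u32 timeout requirement comes from this change:

    u32 timeout_ms = 100U; /* the timeout must now be a u32 */
    int err = NVGPU_COND_WAIT(&ch->notifier_wq,
                              ch->notified != 0U, timeout_ms);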

JIRA NVGPU-1008

Change-Id: I12368dfa81b137c35bd056668c1867f03a73b7aa
Signed-off-by: Philip Elcan <pelcan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017503
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-21 10:24:24 -08:00
Seema Khowala
13f37f9c70 gpu: nvgpu: remove gk20a_is_channel_marked_as_tsg
Use tsg_gk20a_from_ch to get the tsg pointer for a channel's tsgid. For
an invalid tsgid, the tsg pointer will be NULL.
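
The resulting call pattern looks roughly like this (the error handling is
illustrative):

    struct tsg_gk20a *tsg = tsg_gk20a_from_ch(ch);

    if (tsg == NULL) {
            /* invalid tsgid: the channel is not bound to a TSG */
            return;
    }
    /* ... operate on tsg ... */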

Bug 2092051
Bug 2429295
Bug 2484211

Change-Id: I82cd6a2dc5fab4acb147202af667ca97a2842a73
Signed-off-by: Seema Khowala <seemaj@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2006722
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-21 10:23:50 -08:00
Debarshi Dutta
9767366c60 gpu: nvgpu: add pbdma_status unit
A new unit pbdma_status is added. The unit provides a HAL
ops function pointer read_pbdma_status_info() to read and produce
a struct of type nvgpu_pbdma_status_info. Additionally, the unit
provides public APIs to retrieve data from the struct
nvgpu_pbdma_status_info.
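
A hedged usage sketch of the new unit; the HAL op placement and the accessor
name are assumptions, only the op and struct names come from the description
above:

    struct nvgpu_pbdma_status_info pbdma_status;

    g->ops.pbdma_status.read_pbdma_status_info(g, pbdma_id, &pbdma_status);
    if (nvgpu_pbdma_status_is_chsw_switch(&pbdma_status)) {
            /* a channel switch is in progress on this pbdma */
    }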

Jira NVGPU-1311

Change-Id: Ic89c78703c3738b91be8d18ba970a591658d4022
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2019976
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-19 04:17:00 -08:00
Debarshi Dutta
061aa66adc gpu: nvgpu: move engine specific functions to common/fifo
The following changes are done in this patch.

1) gk20a_fifo_get_engine_info() is moved to common/fifo/engine.c
and renamed to gk20a_fifo_get_active_engine_info() to accurately
reflect the purpose of the function.

2) Move the definition of enum fifo_engine to <nvgpu/engines.h> and
add the prefix NVGPU_.

3) Move the following functions related to engines from fifo_gk20a.c to
common/fifo/engines.c and rename them, adding the prefix
nvgpu_engine and removing gk20a_fifo.

gk20a_fifo_get_active_engine_info
gk20a_fifo_engine_enum_from_type
gk20a_fifo_get_engine_ids
gk20a_fifo_is_valid_engine_id
gk20a_fifo_get_gr_engine_id
gk20a_fifo_act_eng_interrupt_mask
gk20a_fifo_engine_interrupt_mask
gk20a_fifo_get_all_ce_engine_reset_mask

Jira NVGPU-1315

Change-Id: I63d9dcd905a0bebcc9a4c65776cf6ec7a0837acf
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2011298
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-15 09:44:19 -08:00
Konsta Holtta
93e15f9c43 gpu: nvgpu: rename redundant runlist names in HAL
Drop the "runlist_" part in the runlist section of the HAL ops. For
example:

- old: g->ops.runlist.runlist_wait_pending
- new: g->ops.runlist.wait_pending

At the same time, drop the "fifo_" part from the function names. For
example:

- old: gk20a_fifo_runlist_wait_pending
- new: gk20a_runlist_wait_pending

Also rename eng_runlist_base_size to count_max. The size of the
eng_runlist_base register array depicts the maximum possible number of
runlists in the chip, for which count_max is a more descriptive name.

Jira NVGPU-1309

Change-Id: Ie9e94b9f65cd10d3e682d19954f240adb6e311be
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017403
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-14 18:52:29 -08:00
Debarshi Dutta
ddcdf364b7 gpu: nvgpu: use public APIs of engine_status_info unit
The nvgpu driver presently uses h/w accessor functions to read and
process the engine_status registers. H/w headers shouldn't be used
directly by common code; register accesses should go through the HAL
layer. This patch replaces the h/w header usage with the APIs in the
engine_status_info unit.

Jira NVGPU-1315

Change-Id: I767a2b116b07cce4f4b587e6da8dd118afa27de5
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2005470
Reviewed-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-13 14:34:03 -08:00
Debarshi Dutta
e60bae8ec4 gpu: nvgpu: add engine_status_info unit
A new unit nvgpu_engine_status_info is added. The unit provides a HAL
ops function pointer read_engine_status_info() to read and produce
a struct of type nvgpu_engine_status_info. Additionally, the unit
provides public APIs to retrieve data from the struct
nvgpu_engine_status_info.

Jira NVGPU-1315

Change-Id: I6c167c36081bee5c9a8db51d3467c8f5f02c2685
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2003886
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Konsta Holtta <kholtta@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-13 14:34:00 -08:00
Konsta Holtta
38c548a39c gpu: nvgpu: Add channel.reset_faulted HAL
Add a HAL op for resetting the eng_faulted and pbdma_faulted states on a
channel. This used to be a local feature in fifo_gv11b.c; the HAL is
defined for all chips from gv11b onwards.

Jira NVGPU-1307

Change-Id: I120a59c429851cc69e712ddd5b06a4b3d16c06c9
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017269
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:06:37 -08:00
Konsta Holtta
44e4d69734 gpu: nvgpu: add channel.force_ctx_reload HAL
Isolate the write to ccsr_channel_force_ctx_reload behind a HAL op.

Jira NVGPU-1307

Change-Id: Iaef7d740f4a89e4a45c7de28f001a7dea98ce066
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017268
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:06:28 -08:00
Konsta Holtta
9457a5ea91 gpu: nvgpu: add eng_faulted to channel HAL for gv11b+
The ccsr_channel_eng_faulted field exists from Volta onwards. Implement
the read_state HAL op for those chips, and store that bit as a boolean
in the channel state info.

Jira NVGPU-1307

Change-Id: Ie997892f2d3db0725496661a4d3083e7396894cc
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017267
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:06:18 -08:00
Konsta Holtta
cd4b2f642c gpu: nvgpu: add HAL for reading ccsr_channel
Refactor read accesses to the ccsr_channel register for channel state to
be done via a channel HAL op for all chips. A new op called read_state
is added for this; information needed by other units is collected in a
new struct nvgpu_channel_hw_state.
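
A hedged sketch of the resulting call shape; the field names inside
nvgpu_channel_hw_state are illustrative:

    struct nvgpu_channel_hw_state hw_state;

    g->ops.channel.read_state(g, ch, &hw_state);
    if (hw_state.busy) {
            /* ... */
    }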

Jira NVGPU-1307

Change-Id: Iff9385c08e17ac086d97f5771a54b56b2727e3c4
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017266
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:06:09 -08:00
Konsta Holtta
7189630e7c gpu: nvgpu: drop fifo_ in channel HAL names
Now that the moved HAL ops from fifo are in channel, rename the
implementations to match.

Jira NVGPU-1307

Change-Id: I7b9336f506c9e71bcd0af98886216958bd6695eb
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017264
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:05:56 -08:00
Konsta Holtta
5cde4c2140 gpu: nvgpu: move chip specific channel reg ops to common
Extract out the HAL ops' implementation that now belongs to the channel
unit. This unit is responsible for channel register accesses and the
like (ccsr_*).

Rename channel_gm20b_bind to gm20b_fifo_channel_bind to match the
rest of the naming. Same with channel_gv11b_unbind.

Jira NVGPU-1307

Change-Id: I58b9d96dbdaf36bdb163a5729544a41faec828ab
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017262
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:05:43 -08:00
Konsta Holtta
c330d8fd98 gpu: nvgpu: add channel HAL section for ccsr_*
Split out ops that belong to the channel unit into a new section called
channel. Channel is a broad concept; this includes just the code that
accesses channel registers (ccsr_*). This is effectively just renaming;
the implementation still stays put.

The word "channel" is also dropped from certain HAL entries to avoid
redundancy (e.g., channel.disable_channel -> channel.disable).
fifo.get_num_fifos gets an entirely new name: channel.count.

Jira NVGPU-1307

Change-Id: I9a08103e461bf3ddb743aa37ababee3e0c73c861
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2017261
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-12 17:05:34 -08:00
Philip Elcan
fa81cf9000 gpu: nvgpu: fifo: cleanup MISRA 10.3 violations
MISRA 10.3 prohibits assigning objects of different size or essential
type. This fixes a number of violations in the common/fifo code.

JIRA NVGPU-1008

Change-Id: I138c27eb86f6e0f9481c39a94d6632e2b4360af8
Signed-off-by: Philip Elcan <pelcan@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2009940
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-11 12:55:27 -08:00
Konsta Holtta
49506f257e gpu: nvgpu: split update_runlist HAL API in two
A comment for gk20a_fifo_update_runlist() says:

 /* add/remove a channel from runlist
    special cases below: runlist->active_channels will NOT be changed.
    (ch == NULL && !add) means remove all active channels from runlist.
    (ch == NULL &&  add) means restore all active channels on runlist. */

Those special cases call for a new function, so add that. Delete the
update_runlist HAL op and add update_for_channel (like update_runlist
without the special cases) and reload (no channel to add or remove, just
the special cases).

While at it, rename gk20a_fifo_update_runlist_ids to
nvgpu_runlist_reload_ids. It's common across chips and does what the
reload HAL does but for a list of several IDs.
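
The split call sites would look roughly like this; the op names come from the
description above, the parameter lists are illustrative:

    /* add or remove a single channel */
    err = g->ops.runlist.update_for_channel(g, runlist_id, ch,
                                            add, wait_for_finish);

    /* remove-all / restore-all special cases, no channel involved */
    err = g->ops.runlist.reload(g, runlist_id, add, wait_for_finish);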

Jira NVGPU-1922

Change-Id: I9a99ab03a636a1214c021faad359d2b304a9472f
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2013058
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-08 12:56:09 -08:00
Adeel Raza
d828e013db gpu: nvgpu: common: MISRA rule 15.6 fixes
MISRA rule 15.6 requires that all if/else/loop bodies be enclosed
in braces. This patch adds braces to single-line if/else/loop blocks
in the common directory.
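
A typical before/after for this rule:

    /* before */
    if (err != 0)
            return err;

    /* after */
    if (err != 0) {
            return err;
    }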

JIRA NVGPU-775

Change-Id: I0dfb38dbf256d49bc0391d889d9fbe5e21da5641
Signed-off-by: Adeel Raza <araza@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2011655
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Scott Long <scottl@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-05 19:23:47 -08:00
Deepak Nibade
254253732c gpu: nvgpu: add new unit for GR subcontext
Add a new unit common/gr/subctx.c to manage GR subcontexts.
This unit provides interfaces to allocate/free/load a GR subcontext.

Add a new header file include/nvgpu/gr/subctx.h to declare all the
interfaces.

Right now the channel_gk20a structure directly includes an nvgpu_mem
for the context header.
Declare a new structure nvgpu_gr_subctx for the subcontext and include
it from channel_gk20a.

Make all necessary changes to reference ctx_header from the subctx
instead of directly from the channel.

Jira NVGPU-1613

Change-Id: I9eb1ee8f26fa88d2881f9b294935b65e9cbcc9b4
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1990129
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-02 03:03:43 -08:00
Thomas Fleury
13afcc24c3 gpu: nvgpu: non abortable TSG for vidmem-clear
When an engine faults due to an unbound instance block, all
active TSGs are currently aborted. This includes the TSG
used by the vidmem-clear task to clear vidmem buffers. From
that point on, nvgpu_vidmem_clear cannot submit jobs anymore.

Define the TSG in the MM CE context as non-abortable, and skip it
when aborting active TSGs.

Bug 2486146

Change-Id: I221259aec468e8ee3a24e80fab8d8fb7ee8607b0
Signed-off-by: Thomas Fleury <tfleury@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2008954
(cherry picked from commit 6f2444dc5e128aa2b870796bd1e9dee7853f90af)
Reviewed-on: https://git-master.nvidia.com/r/2008942
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-01 14:55:23 -08:00
Konsta Holtta
8854cfafe1 gpu: nvgpu: simplify update_runlist logic
gk20a_fifo_update_runlist_locked() and the vgpu counterpart do three
things:

1. find out whether there's a channel to add or remove (if not, the
   whole runlist is just reconstructed or cleared),
2. reconstruct the runlist format for hardware (or the vgpu message),
3. actually update the runlist to hw and maybe wait for finish.

Split out the first two operations into separate functions to make the
code easier to understand. Now it's also clearer that the "add"
parameter behaves completely differently depending on whether the
channel pointer is NULL or not.

Also ignore (with a warning) channels not bound to a tsg. We shouldn't
get runlist updates on such channels. This simplifies the control flow a
bit.

Jira NVGPU-1309
Jira NVGPU-1922

Change-Id: I478c33eb2bd154c05a6c8c4148e4fea528a39a3e
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2007473
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-01 12:38:26 -08:00
Seema Khowala
013ca60edd gpu: nvgpu: remove code for ch not bound to tsg
- Remove handling for channels that are no longer bound to a tsg,
  as a channel can be referenceable but no longer part of a tsg
- Use tsg_gk20a_from_ch to get a pointer to the tsg for a given channel
- Clear unhandled gr interrupts

Bug 2429295
JIRA NVGPU-1580

Change-Id: I9da43a2bc9a0282c793b9f301eaf8e8604f91d70
Signed-off-by: Seema Khowala <seemaj@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1972492
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-02-01 11:58:57 -08:00
Seema Khowala
aacc33bb47 gpu: nvgpu: do not use raw spinlock for ch->timeout.lock
With a PREEMPT_RT kernel, regular spinlocks are mapped onto sleeping
spinlocks (rt_mutex locks), while raw spinlocks retain their behaviour.

A scheduling-while-atomic bug can occur in gk20a_channel_timeout_start,
as it acquires the ch->timeout.lock raw spinlock and then calls
functions that acquire the ch->ch_timedout_lock regular spinlock.
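
Schematic of the problematic nesting on PREEMPT_RT (the lock names are from
the description above; the calls are illustrative):

    nvgpu_raw_spinlock_acquire(&ch->timeout.lock); /* stays atomic on RT */
    /* ... */
    nvgpu_spinlock_acquire(&ch->ch_timedout_lock); /* may sleep on RT ->
                                                      scheduling while atomic */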

Bug 200484795

Change-Id: Iacc63195d8ee6a2d571c998da1b4b5d396f49439
Signed-off-by: Seema Khowala <seemaj@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2004100
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-28 12:44:00 -08:00
Konsta Holtta
4e85ebc05f gpu: nvgpu: use channel pointer for update_runlist
A naked channel ID does not carry information about channel
validity and is a very low-level construct for an API at this level.
Refactor the runlist-updating fifo APIs to take a channel pointer.

While at it, delete the channel and wait_for_finish parameters from
gk20a_fifo_update_runlist_ids() - the only caller is suspend/resume,
and the parameters were always NULL for the channel and true for wait.

Jira NVGPU-1309
Jira NVGPU-1737

Change-Id: Ied350bc8e482d8e311cc708ab0c7afdf315c61cc
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1997744
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-25 11:44:47 -08:00
Konsta Holtta
7439449c5c gpu: nvgpu: move runlist base and entry size hal ops
Avoid including the HW headers directly in the HAL listings: add
indirection functions for the two ops that were naked:

- runlist.eng_runlist_base_size
- runlist.runlist_entry_size

GV100 gets a new fifo HAL file, as base_size is the first (and
currently the only) GV100-specific op.

Jira NVGPU-1309

Change-Id: Idf28b5e26c798457132ef595fa55c65bcddb1b31
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1997826
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-24 04:14:58 -08:00
Konsta Holtta
cdfa78e91d gpu: nvgpu: move set_runlist_state declaration
The function gk20a_fifo_set_runlist_state was moved to another place
some time ago but the declaration didn't follow the implementation move.
Move it from fifo_gk20a.h to runlist.h.

Jira NVGPU-1309

Change-Id: Ib939a5243cee4be1c1092a553cb81b81adc6e5ce
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1997825
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-24 04:14:49 -08:00
Konsta Holtta
237cee5997 gpu: nvgpu: move chip specific runlist code to common
Extract out the HAL ops' implementation that now belongs to the runlist
unit.

Jira NVGPU-1309

Change-Id: I66185de0ddace1728da5f55ae11daa0b752bebf1
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1997824
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-24 04:14:40 -08:00
Konsta Holtta
6fda25e958 gpu: nvgpu: move runlist HAL ops to separate section
Split out ops that belong to the runlist unit into a new section called
runlist. This is effectively just renaming; the implementation still
stays put.

Jira NVGPU-1309

Change-Id: Ib928164f8008f680d9cb13c969e3304ef727abba
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1997823
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-24 04:14:31 -08:00
Nicolas Benech
6978943621 gpu: nvgpu: gk20a_disable_tsg to return void
gk20a_disable_tsg was always returning 0. This patch changes
it to return void, thus fixing a number of MISRA violations.

JIRA NVGPU-677

Change-Id: I5be8d1d8eaeb36da44653a60e57259ccffc4fea0
Signed-off-by: Nicolas Benech <nbenech@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1995004
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Adeel Raza <araza@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-23 17:23:57 -08:00
Vinod G
c0a2f356c4 gpu: nvgpu: pmu code fix for VDK
The dGPU VDK does not have PMU support, so PMU variables do not get
initialized in fmodel.

Add an is_pmu_supported check before the nvgpu_pmu_mutex_acquire call.

JIRA NVGPU-1564

Change-Id: Ieb683d3092b5289a9959c8811c25782074d19804
Signed-off-by: Vinod G <vinodg@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1992193
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-15 23:04:42 -08:00
Scott Long
4ba92354c0 gpu: nvgpu: container_of() changes to tsg/fence code
The container_of() macro used in nvgpu produces the following
set of MISRA required rule violations:

* Rule 11.3 : A cast shall not be performed between a pointer to
              object type and a pointer to a different object type.

* Rule 11.8 : A cast shall not remove any const or volatile
              qualification from the type pointed to by a pointer.

* Rule 20.7 : Expressions resulting from the expansion of macro
              parameters shall be enclosed in parentheses

Using the same modified implementation of container_of() as that
used in the nvgpu_list_node/nvgpu_rbtree_node routines eliminates
the Rule 11.8 and Rule 20.7 violations and exchanges the Rule 11.3
violation with an advisory Rule 11.4 violation.

This patch uses that same equivalent implementation in two new
(static) functions that replace container_of() references in tsg
and fence code:

 * tsg_gk20a_from_ref
 * gk20a_fence_from_ref

It should be noted that the replacement functions still contain
potentially dangerous (and non-MISRA-compliant) code and that it is
expected that deviation requests will be filed for the new advisory
rule violations accordingly.
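
A hedged sketch of the replacement pattern for tsg_gk20a_from_ref, following
the same approach as the list/rbtree node helpers (the member name used here
is illustrative):

    static inline struct tsg_gk20a *tsg_gk20a_from_ref(struct nvgpu_ref *ref)
    {
            return (struct tsg_gk20a *)((uintptr_t)ref -
                            offsetof(struct tsg_gk20a, refcount));
    }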

JIRA NVGPU-782

Change-Id: Ib5f3b8c7b18b92af8237e82ef5ee42d39c0381e5
Signed-off-by: Scott Long <scottl@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1993503
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-14 12:42:54 -08:00
Deepak Nibade
1c17ae310c gpu: nvgpu: add new unit for GR context
Add new unit common/gr/ctx.c to manage GR context

This unit provides interfaces to allocate/free/map/unmap GR context,
patch context, pm context, ctxsw {preempt/spill/betacb/pagepool/rtvcb}
buffers.
It also provides APIs to set the sizes of the above buffers.

Add new header file include/nvgpu/gr/ctx.h to declare all the interfaces.

Move nvgpu_gr_ctx, patch_desc, pm_ctx_desc, zcull_ctx_desc structures
to this unit

Add new structure nvgpu_gr_ctx_desc to hold context description
parameters. For now we add sizes of all the buffers here.
Add this structure to gr_gk20a for global reference

Remove gr_gp10b_alloc_buffer() since it is no longer used

Rename g->ops.gr.alloc_gfxp_rtv_cb() to g->ops.gr.init_gfxp_rtv_cb()
since this HAL now only sets the size of rtvcb ctxsw buffer

Remove gr->ctx_vars.buffer_size and gr->ctx_vars.buffer_total_size
since they were redundant. We already have gr->ctx_vars.golden_image_size
to denote golden image size

Jira NVGPU-1527

Change-Id: I8847b347f80235209dd5e28d979e79984ab85408
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1987702
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-09 10:46:29 -08:00
Konsta Holtta
d1d1f56c49 gpu: nvgpu: skip nvgpu syncpoint in usermode submits
The nvgpu-managed syncpoint is not needed for anything if a channel uses
usermode submits; in that case the channel would allocate a
user-managed syncpoint and use that. Create the channel sync in
nvgpu_channel_setup_bind() only if usermode submit is not enabled.

Bug 200466905

Change-Id: I976f4b4fd0c3131cb310c72b286329fb16f1f29a
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1990270
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-09 09:35:18 -08:00
Konsta Holtta
8979a97af3 gpu: nvgpu: abstract out timeout rewinding
The channel timeout ends up in a strange state during timeout handling
for a brief moment; it can become stopped and started again, and the
timeout lock is released in the middle. Add a more explicit rewind
function to reset the timeout to its start if it's active. The active check
makes it possible to use this from gk20a_channel_timeout_restart_all_channels(),
so that's also modified.

Also replace the return statements with more readable control flow in
gk20a_channel_timeout_handler().

Change-Id: Ia7d67242dfc149ace1f4f841a837e90b6c985308
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1989327
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-08 08:24:55 -08:00
Sai Nikhil
e824ea0963 gpu: nvgpu: common: MISRA Rule 10.1 fixes
MISRA rule 10.1 mandates that the correct data types are used as
operands of operators. For example, only unsigned integers can be used
as operands of bitwise operators.

This patch fixes rule 10.1 violations for drivers/gpu/nvgpu/common.
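
A typical example of the kind of fix involved (illustrative):

    /* before: a signed literal used as a shift operand */
    int mask = 1 << engine_id;

    /* after: unsigned operands throughout */
    u32 mask = (u32)1U << engine_id;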

JIRA NVGPU-777
JIRA NVGPU-1006

Change-Id: I53fe750f1b41816a183c595e5beb7bd263c27725
Signed-off-by: Sai Nikhil <snikhil@nvidia.com>
Signed-off-by: Adeel Raza <araza@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1971221
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-06 19:24:58 -08:00
Adeel Raza
c961b7ed1d nvgpu: fifo: fix invalid ID macros
MISRA rule 10.1 prohibits using signed values with bitwise operators.
Make the fifo invalid ID macros compliant with this MISRA rule.

Also use these macros in source code instead of hardcoded numbers to
make the code more readable.

JIRA NVGPU-1006

Change-Id: I2f336d1decbc53b08f93587f2e00ea2cce47f72b
Signed-off-by: Adeel Raza <araza@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1983700
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-06 19:24:13 -08:00
Konsta Holtta
11c0c1ad89 gpu: nvgpu: unify vgpu runlist init
Split native-specific engine info collection out of
nvgpu_init_runlist() so that it only contains common code. Call this
common function from the vgpu code, which ends up being identical.

Jira NVGPU-1309

Change-Id: I9e83669c84eb6b145fcadb4fa6e06413b34e1c03
Signed-off-by: Konsta Holtta <kholtta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1978060
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
2019-01-04 11:15:52 -08:00