linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Thomas Fleury	70453a8606	Revert "Revert "gpu: nvgpu: allocate only active runlists"" This reverts commit `f67bc51e51`. Currently a fifo_runlist_info_gk20a structure is allocated and initialized for each possible runlist. But only a few runlists are actually used. Skip allocation and initialization of inactive runlists. Active runlists info is stored in the active_runlist_info array.If a runlist is active, then runlist_info[runlist_id] points to one entry in active_runlist_info. Otherwise, runlist_info[runlist_id] is NULL. Operations that used to walk through all runlists are modified to walk though active runlists only. Bug 2470115 Bug 2522374 Change-Id: I98253ebebb4b1ba5957b57329820b94444b9d41b Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030409 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-07 11:46:15 -08:00
Thomas Fleury	c23738969d	Revert "Revert "gpu: nvgpu: array of pointers to runlists"" This reverts commit `ade1d50cbe`. Currently a fifo_runlist_info_gk20a structure is allocated and initialized for each possible runlist. But only a few runlists are actually used. Use an array of pointers to runlists in fifo_gk20a. The array keeps existing indexing by runlist_id. In this patch a context is still allocated for each possible runlist, but follow up patch will allow to skip context allocation for inactive runlists. Bug 2470115 Bug 2522374 Change-Id: I0deb6981bc6f5152bdf121f0a44429748aa14687 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030407 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-07 11:45:59 -08:00
Thomas Fleury	ade1d50cbe	Revert "gpu: nvgpu: array of pointers to runlists" This reverts commit `5fdda1b075`. Bug 2522374 Change-Id: Icb5e2181b056dc2247291c7f0e47d46c29095286 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030293 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Hoang Pham <hopham@nvidia.com>	2019-02-28 17:51:37 -08:00
Thomas Fleury	f67bc51e51	Revert "gpu: nvgpu: allocate only active runlists" This reverts commit `45fa0441f7`. Bug 2522374 Change-Id: Icb80b7a31c7588a269850a3768ab0238dbec67b1 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030292 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Hoang Pham <hopham@nvidia.com>	2019-02-28 17:51:22 -08:00
Thomas Fleury	45fa0441f7	gpu: nvgpu: allocate only active runlists Currently a fifo_runlist_info_gk20a structure is allocated and initialized for each possible runlist. But only a few runlists are actually used. Skip allocation and initialization of inactive runlists. Active runlists info is stored in the active_runlist_info array. If a runlist is active, then runlist_info[runlist_id] points to one entry in active_runlist_info. Otherwise, runlist_info[runlist_id] is NULL. Operations that used to walk through all runlists are modified to walk though active runlists only. Bug 2470115 Change-Id: Icd10281dc904bdee581ebc9cfeb662018ecca121 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2025385 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-27 17:54:54 -08:00
Thomas Fleury	5fdda1b075	gpu: nvgpu: array of pointers to runlists Currently a fifo_runlist_info_gk20a structure is allocated and initialized for each possible runlist. But only a few runlists are actually used. Use an array of pointers to runlists in fifo_gk20a. The array keeps existing indexing by runlist_id. In this patch a context is still allocated for each possible runlist, but follow up patch will allow to skip context allocation for inactive runlists. Bug 2470115 Change-Id: I1615043cea84db35a270ade64695d51f85c1193a Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2025203 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-27 17:54:37 -08:00
Seema Khowala	68c13e2f04	gpu: nvgpu: add hal to mask/unmask intr during teardown ctxsw timeout error prevents recovery as it can get triggered periodically. Disable ctxsw timeout interrupt to allow recovery. Bug 2092051 Bug 2429295 Bug 2484211 Bug 1890287 Change-Id: I47470e13968d8b26cdaf519b62fd510bc7ea05d9 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2019645 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-27 12:54:06 -08:00
Debarshi Dutta	061aa66adc	gpu: nvgpu: move engine specific functions to common/fifo The following changes are done in this patch. 1) gk20a_fifo_get_engine_info() is moved to common/fifo/engine.c and is renamed to gk20a_fifo_get_active_engine_info() to reflect accurately the purpose of the function. 2) move the definition of enum fifo_engine to <nvgpu/engines.h> and add the prefix NVGPU_ 3) move the following functions related to engines in fifo_gk20a.c to common/fifo/engines.c and replace their signature by adding the prefix nvgpu_engine and removing gk20a_fifo. gk20a_fifo_get_active_engine_info gk20a_fifo_engine_enum_from_type gk20a_fifo_get_engine_ids gk20a_fifo_is_valid_engine_id gk20a_fifo_get_gr_engine_id gk20a_fifo_act_eng_interrupt_mask gk20a_fifo_engine_interrupt_mask gk20a_fifo_get_all_ce_engine_reset_mask Jira NVGPU-1315 Change-Id: I63d9dcd905a0bebcc9a4c65776cf6ec7a0837acf Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2011298 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-15 09:44:19 -08:00
Konsta Holtta	93e15f9c43	gpu: nvgpu: rename redundant runlist names in HAL Drop the "runlist_" part in the runlist section of the HAL ops. For example: - old: g->ops.runlist.runlist_wait_pending - new: g->ops.runlist.wait_pending At the same time, drop the "fifo_" part from the function names. For example: - old: gk20a_fifo_runlist_wait_pending - new: gk20a_runlist_wait_pending Also rename eng_runlist_base_size to count_max. The size of the eng_runlist_base register array depicts the maximum possible number of runlists in the chip for which count_max is more descriptive. Jira NVGPU-1309 Change-Id: Ie9e94b9f65cd10d3e682d19954f240adb6e311be Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017403 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-14 18:52:29 -08:00
Konsta Holtta	cd4b2f642c	gpu: nvgpu: add HAL for reading ccsr_channel Refactor read accesses to the ccsr_channel register for channel state to be done via a channel HAL op for all chips. A new op called read_state is added for this; information needed by other units is collected in a new struct nvgpu_channel_hw_state. Jira NVGPU-1307 Change-Id: Iff9385c08e17ac086d97f5771a54b56b2727e3c4 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017266 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:09 -08:00
Konsta Holtta	5cde4c2140	gpu: nvgpu: move chip specific channel reg ops to common Extract out the HAL ops' implementation that now belongs to the channel unit. This unit is responsible for channel register accesses and the like (ccsr_*). Rename channel_gm20b_bind to gm20b_fifo_channel_bind to match with the rest of the naming. Same with channel_gv11b_unbind. Jira NVGPU-1307 Change-Id: I58b9d96dbdaf36bdb163a5729544a41faec828ab Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017262 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:43 -08:00
Konsta Holtta	c330d8fd98	gpu: nvgpu: add channel HAL section for ccsr_* Split out ops that belong to channel unit to a new section called channel. Channel is a broad concept; this includes just the code that accesses channel registers (ccsr_*). This is effectively just renaming; the implementation still stays put. The word "channel" is also dropped from certain HAL entries to avoid redundancy (e.g., channel.disable_channel -> channel.disable). fifo.get_num_fifos gets an entirely new name: channel.count. Jira NVGPU-1307 Change-Id: I9a08103e461bf3ddb743aa37ababee3e0c73c861 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017261 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:34 -08:00
Philip Elcan	ab5684ce1b	gpu: nvgpu: channel: use u32 for syncpt id Make the APIs nvgpu_channel_sync_get_syncpt_id() and channel_sync_syncpt_get_id() return u32s rather than converting to ints and back. Also define FIFO_INVAL_SYNCPT_ID to use for invalid syncpt IDs rather than using magic numbers. JIRA NVGPU-1008 Change-Id: I4dde6b15fd3708fb0126b46c6fea8ac1b447c7ce Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2014821 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 12:55:36 -08:00
Debarshi Dutta	20b15e6f40	gpu: nvgpu: move sema specific cmdbuf methods to common/sync/ sema cmdbuf specific functions are only for the sync functionality of nvgpu and do not belong to fifo. construct files sema_cmdbuf_gk20a.h and sema_cmdbuf_gk20a.c under common/sync to contain the syncpt specific cmdbuf functions for arch gk20a. Jira NVGPU-1308 Change-Id: Iebeebe7a3de627f2de08d4ced74bb1aabf1eb53c Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975922 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-25 02:46:06 -08:00
Debarshi Dutta	ebe6fa7fac	gpu: nvgpu: move syncpt specific cmdbuf methods to common/sync/ syncpt cmdbuf specific functions are only for the sync functionality of nvgpu and donot belong to fifo. construct files syncpt_cmdbuf_gk20a.h and syncpt_cmdbuf_gk20a.c under common/sync to contain the syncpt specific cmdbuf functions for arch gk20a. The word 'fifo' is also removed from the name of these functions. Jira NVGPU-1308 Change-Id: I1a1fd1d31f7decd1398f8e2ff625f95cf1f55033 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975920 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-25 02:45:40 -08:00
Konsta Holtta	cdfa78e91d	gpu: nvgpu: move set_runlist_state declaration The function gk20a_fifo_set_runlist_state was moved to another place some time ago but the declaration didn't follow the implementation move. Move it from fifo_gk20a.h to runlist.h. Jira NVGPU-1309 Change-Id: Ib939a5243cee4be1c1092a553cb81b81adc6e5ce Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997825 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:49 -08:00
Konsta Holtta	237cee5997	gpu: nvgpu: move chip specific runlist code to common Extract out the HAL ops' implementation that now belongs to the runlist unit. Jira NVGPU-1309 Change-Id: I66185de0ddace1728da5f55ae11daa0b752bebf1 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997824 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:40 -08:00
Vinod G	1ff12f065e	gpu: nvgpu: Update pbdma data and header reset functions Two new fifo hals are added. read_pbdma_data and reset_pbdma_header. In turing the instruction that caused the interrupt will be stored in NV_PPBDMA_PB_DATA0 register or NV_PPBDMA_HDR_SHADOW register, which is decided based on NV_PPBDMA_PB_COUNT value and PB_HEADER type JIRA NVGPU-1240 Change-Id: I54a92e317a6054335439d2d61bced28aff3eecb7 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1990699 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-17 22:35:06 -08:00
Adeel Raza	c961b7ed1d	nvgpu: fifo: fix invalid ID macros MISRA rule 10.1 prohibits using signed values with bitwise operators. Make fifo invalid ID macros compliant with this MISRA rule. Also use these macros in source code instead of hardcoded numbers to make the code more readable. JIRA NVGPU-1006 Change-Id: I2f336d1decbc53b08f93587f2e00ea2cce47f72b Signed-off-by: Adeel Raza <araza@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1983700 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-06 19:24:13 -08:00
Konsta Holtta	2f51d7c5ed	gpu: nvgpu: reorder runlist enable/disable Move gk20a_fifo_set_runlist_state() to common and move gk20a_tsg_{enable,disable}_sched() to be part of tsg. Jira NVGPU-1309 Change-Id: I16ffe7f9f97249b5ac0885bba56510847bb6858b Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978059 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:43 -08:00
Konsta Holtta	e05c0d13a0	gpu: nvgpu: add runlist unit to common Extract non-chip-specific code that manages the runlists (init, update, reschedule etc.) to a new file in the common directory. Move the declarations to a new matching runlist.h header. Jira NVGPU-1309 Change-Id: I3c7e0032899516487037f47ddc9a7e7aa4b0b33a Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978058 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:34 -08:00
Konsta Holtta	5504d368ec	gpu: nvgpu: add HAL for preempt next The reschedule_preempt_next functionality requires direct access to registers. Move it to be called via a HAL op for chips that have rescheduling support in HAL. Jira NVGPU-1309 Change-Id: I72d87d8e7ebd3fc05f094b83398cc1ab4b4027a5 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978057 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:25 -08:00
tkudav	3267530f22	gpu: nvgpu: Use device_info parsing HAL for Fifo Update the fifo code to use the HALs exposed by "Top" unit to read data from device_info table. The information for GRAPHICS engine in device_info table is now parsed using the get_device_info HAL from "Top" unit. Copy engine(CE) has multiple entries in the device_info table corresponding to each instance of the engine. Prior to Pascal, each instance of an engine was denoted by different engine type. For example in GM20B, there are engine types like COPY_ENGINE0, COPY_ENGINE1 and so on. In Pascal and chips beyond, a new field called "inst_id" is added and the engine_type is kept the same for different instances of an engine. For example in GP10B, all copy engine entries have same engine type i.e ENGINE_LCE, but different inst_ids. So for Pascal and chips beyond, we use a different HAL to get CE information from device_info table. JIRA NVGPU-1053 Change-Id: Ib40a616d903a5dbef5730678c2ebc3454b8e900d Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969400 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-20 09:26:01 -08:00
Debarshi Dutta	0188b93e30	gpu: nvgpu: move gk20a_fifo_recover_tsg into tsg unit gk20a_fifo_recover_tsg does high-level software calls and invokes gk20a_fifo_recover. This function belongs to the tsg unit and is moved to tsg.c file. Also, the function is renamed to nvgpu_tsg_recover. Jira NVGPU-1237 Change-Id: Id1911fb182817b0cfc47b3219065cba6c4ca507a Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970034 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:55:07 -08:00
Debarshi Dutta	fb114f8fda	gpu: nvgpu: move gk20a_fifo_recover_ch to channel unit gk20a_fifo_recover_ch does high-level calls and invokes gk20a_fifo_recover. This function belongs to the channel unit and is moved to the file channel.c. Also, the function is renamed to nvgpu_channel_recover. Jira NVGPU-1237 Change-Id: I31890f85fdb2c42648cc063dd9c4e7e35930dcef Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970033 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:58 -08:00
Debarshi Dutta	fcd216e170	gpu: nvgpu: move gk20a_fifo_engines_on_id to ops struct gk20a_fifo_engines_on_id uses H/W headers to return a valid active engine mask. This qualifies the function to be invoked via a struct gpu_ops function pointer instead. Jira NVGPU-1237 Change-Id: Ice30610ef51cf4471b3750f21d38e6648953e9e2 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:48 -08:00
Debarshi Dutta	ac4c2d4ae0	gpu: nvgpu: move fifo RC_TYPE_* definitions to common header The RC_TYPE_* definitions in fifo_gk20a.h are generic and are moved to a newly constructed common header <nvgpu/fifo.h> Jira NVGPU-1237 Change-Id: Ia1bb80b9b0047675c7abfb6ce6ccd42a2e99f41f Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970031 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:39 -08:00
Debarshi Dutta	7f58347ed9	gpu: nvgpu: move tsg functions to common Any tsg specific functions that does high-level software-centric operations below to the TSG unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/tsg.c and also rename them to use the prefix nvgpu_tsg_* gk20a_fifo_set_ctx_mmu_error_tsg gk20a_fifo_abort_tsg gk20a_fifo_error_tsg gk20a_fifo_check_tsg_ctxsw_timeout Jira NVGPU-1237 Change-Id: I4e3da821a878d4b4a0a0b53fbb7f4c10f135f58d Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1934299 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:26 -08:00
Debarshi Dutta	57f03e3a20	gpu: nvgpu: move channel functions to common Any channel specific functions having high-level software-centric operations belong to the channel unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/channel.c. Also, rename the functions to use the prefix nvgpu_channel_*. gk20a_fifo_set_ctx_mmu_error_ch gk20a_fifo_error_ch gk20a_fifo_check_ch_ctxsw_timeout Jira NVGPU-1237 Change-Id: Id6b6d69bbed193befbfc4c30ecda1b600d846199 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1932358 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:17 -08:00
Konsta Holtta	07993bbbd8	gpu: nvgpu: add runlist_write_state HAL The function gk20a_fifo_sched_disable_rw accesses HW directly. Rename it and add a HAL indirection so that it can be called from chip-independent code. Also fix some trivial MISRA violations in the function. Jira NVGPU-1309 Change-Id: Icf320738d3d1d4baa40257a9da3ca2c6b7fefc0b Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971274 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 12:06:08 -08:00
Thomas Fleury	7e68e5c83d	gpu: nvgpu: userd slab allocator We had to force allocation of physically contiguous memory for USERD in nvlink case, as a channel's USERD address is computed as an offset from fifo->userd address, and nvlink bypasses SMMU. With 4096 channels, it can become difficult to allocate 2MB of physically contiguous sysmem for USERD on a busy system. PBDMA does not require any sort of packing or contiguous USERD allocation, as each channel has a direct pointer to that channel's 512B USERD region. When BAR1 is supported we only need the GPU VAs to be contiguous, to setup the BAR1 inst block. - Add slab allocator for USERD. - Slabs are allocated in SYSMEM, using PAGE_SIZE for slab size. - Contiguous channels share the same page (16 channels per slab). - ch->userd_mem points to related nvgpu_mem descriptor - ch->userd_offset is the offset from the beginning of the slab - Pre-allocate GPU VAs for the whole BAR1 - Add g->ops.mm.bar1_map() method - gk20a_mm_bar1_map() uses fixed mapping in BAR1 region - vgpu_mm_bar1_map() passes the offset in TEGRA_VGPU_CMD_MAP_BAR1 - TEGRA_VGPU_CMD_MAP_BAR1 is called for each slab. Bug 2422486 Bug 200474793 Change-Id: I202699fe55a454c1fc6d969e7b6196a46256d704 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:10 -08:00
Debarshi Dutta	9abe9fe062	gpu: nvgpu: replace input param chid with pointer to channel preempt_channel needs to use the channel to pass it to other public functions, get access to a tsg etc. This qualifies it to take a pointer to a channel as an input parameter instead of a chid. Increment the channel ref counter using the function gk20a_channel_from_id in functions where we get the chid from the h/w registers directly. Once the prempt_channel function call is done, use a gk20a_channel_put on the referenced channel. Jira NVGPU-1461 Change-Id: I6c87c8104cfcb418d468c8c590087fd4aeabf4bd Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1963200 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-05 21:55:10 -08:00
Debarshi Dutta	99acb8011a	gpu: nvgpu: replace input param chid with pointer to channel gk20a_fifo_recover_channel takes a reference to the channel via its chid before passing the channel pointer to other public functions such as gk20a_channel_abort and gk20a_fifo_error_ch. This qualifies the gk20a_fifo_recover_channel to take a pointer to a channel instead of only chid. Jira NVGPU-1461 Change-Id: I338a12a05e5ccee785a202fea7848db5201a3a39 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1963199 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-05 21:55:00 -08:00
Konsta Holtta	94d4a42d10	gpu: nvgpu: add runlist_busy_engines HAL Split out the code to check which engines on a particular runlist are busy from gk20a_fifo_runlist_reset_engines() and make it a HAL op. Resetting engines is common across chips but status is read from registers. Jira NVGPU-1309 Change-Id: I7a63a2942a9e210481822eaf85795fc17dad0dc5 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1961822 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 11:54:27 -08:00
Debarshi Dutta	e19cea7ab3	gpu: nvgpu: replace input parameter tsgid with pointer to struct tsg_gk20a The function gk20a_fifo_recover_tsg has to pass a valid struct tsg to other functions from within. This qualifies it to have a pointer to struct tsg_gk20a as an input parameter. Tsg specific parts of the gk20a_fifo_preempt_timeout_rc are now moved into another function gk20a_fifo_preempt_timeout_rc_tsg that takes a tsg as an input and passes it to gk20a_fifo_recover_tsg. The pointer to a tsg is also used to enumerate channels from within. The function gk20a_fifo_preempt_timeout_rc now contains only channel specific code. Jira NVGPU-1461 Change-Id: Ice0a9921567841fb5586a7e4e010c442ca6cf172 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1961675 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 08:16:09 -08:00
Debarshi Dutta	1e78d47f15	gpu: nvgpu: replace input parameter tsgid with pointer to struct tsg_gk20a gv11b_fifo_preempt_tsg needs to access the runlist_id of the tsg as well as pass the tsg pointer to other public functions such as gk20a_fifo_disable_tsg_sched. This qualifies the preempt_tsg to use a pointer to a struct tsg_gk20a instead of just using the tsgid. Jira NVGPU-1461 Change-Id: I01fbd2370b5746c2a597a0351e0301b0f7d25175 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959068 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 08:15:06 -08:00
Debarshi Dutta	e5bebd880f	gpu: nvgpu: replace tsgid input variable with pointer to a struct tsg_gk20a replace tsgid with a pointer to a struct tsg_gk20a in the function gk20a_fifo_tsg_abort(). gk20a_fifo_tsg_abort needs to enumerate through all the channels within the tsg as well as pass the tsg pointer to other functions, qualifying the need to use a pointer instead as an input parameter. Jira NVGPU-1461 Change-Id: I59cec05d5d778f733d0c3e9ffadf46e74e249080 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1956567 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 08:14:48 -08:00
Konsta Holtta	4a53854a92	gpu: nvgpu: delete raw chid lookup This (dangerous) array lookup with no channel references is now unused. Jira NVGPU-1460 Change-Id: Ic6bdbcf19fc8996bc6ff02a40afe3224bdd5bc27 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1955402 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-27 12:24:56 -08:00
smadhavan	f1747cbcd1	gpu: nvgpu: Fix MISRA rule 8.3 violations MISRA rule 8.3 requires that all declarations of a function shall use the same parameter names and type qualifiers. There are cases where the parameter names do not match between function prototype and declaration. This patch will fix some of these violations by renaming the parameter as required. JIRA NVGPU-847 Change-Id: I3f7280b0e4c21b1c2d70fd7f899cf920075f87a3 Signed-off-by: smadhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1927103 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-12 22:33:18 -08:00
Sai Nikhil	c365698e18	gpu: nvgpu: gk20a: fix MISRA 10.4 Violations [2/2] MISRA Rule 10.4 only allows the usage of arithmetic operations on operands of the same essential type category. Adding "U" at the end of the integer literals to have same type of operands when an arithmetic operation is performed. This fixes violation where an arithmetic operation is performed on signed and unsigned int types. JIRA NVGPU-992 Change-Id: I4c04e2720a3b068909cc4af6847d4718568c13ea Signed-off-by: Sai Nikhil <snikhil@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1822740 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-09 13:27:12 -08:00
Konsta Holtta	9adc7a6542	gpu: nvgpu: fix MISRA errors in runlist Fix some mistakes from commit `0fbc1a2652` (gpu: nvgpu: avoid recursion in runlist construction) and commit `998bf379df` (gpu: nvgpu: add runlist_append_tsg) for MISRA rules 10.3 and 10.4. - cast a sizeof to u32 in a calculation to match in size, - make the NVGPU_FIFO_RUNLIST_INTERLEAVE_LEVEL_* constants unsigned to make comparisons match in signedness. Jira NVGPU-1174 Change-Id: I00aa9758ca4352d8eb53a0e8ded42a1ba3b14561 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1938069 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-30 15:36:49 -07:00
Konsta Holtta	f8188089df	gpu: nvgpu: save only used part of channel ram for dump Reduce the size of memory allocations in the channel debug dump by capturing only the necessary values from the instance block. This also simplifies the allocation path slightly with the downside of having to add a capture_channel_ram_dump HAL for reading the interesting parts explicitly beforehand to the now smaller staging buffer. Also rename struct ch_state to struct nvgpu_channel_dump_info. Jira NVGPU-886 Change-Id: I5d7518d9d474b0b728b183383bc83d89ecf91b98 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928207 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-30 15:35:26 -07:00
Konsta Holtta	439d3eb74f	gpu: nvgpu: use a pointer for ch_state inst mem MISRA rule 18.7 doesn't allow flexible array members. To work around that, modify the instance block member in struct ch_state to be an explicit pointer and allocate it separately for simplicity. Jira NVGPU-886 Change-Id: I34299bec79bf7706f9cdfa42dee7fba765c9f312 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928205 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-30 15:35:02 -07:00
smadhavan	b597a721af	gpu: nvgpu: Fix MISRA 8.2 violations MISRA rule 8.2 makes it mandatory for all function prototypes to have named parameters. There were few instances where parameter name(s) for function prototypes were omitted. This patch will fix the same. JIRA NVGPU-861 Change-Id: I6cb28482becc2938c574b7d8c6f22463d346d27a Signed-off-by: smadhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1917939 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-26 17:28:58 -07:00
Konsta Holtta	0fbc1a2652	gpu: nvgpu: avoid recursion in runlist construction MISRA rule 17.2 forbids recursion as a hazard on the stack space. To comply and additionally to make the code somewhat more straightforward to read, rewrite the runlist construction with three explicit functions that work as the three levels of the earlier recursion. These levels map to the three priority levels of TSGs and having more than that is unlikely. When "runlist interleaving" is enabled, TSGs with higher priorities get interleaved between the switch of each pair of lower-level priority TSGs, so that the latency for a job at priority level X is no more than all jobs' timeslices of priority X and higher, plus at most one job at a lower level. This can be illustrated as follows (low, medium, high TSGs 1 and 2): L1 L2 (only low-priority TSGs) H1 H2 (only high-priority TSGs) H1 H2 M1 H1 H2 M2 (no low-priority TSGs) M1 M2 L1 M1 M2 L2 (no high-priority TSGs) H1 H2 L1 H1 H2 L2 (no medium-priority TSGs) H1 H2 M1 H1 H2 M2 H1 H2 L1 H1 H2 M1 H1 H2 M2 H1 H2 L2 (no empty levels) Without interleaving, the items are simply grouped by priority. Jira NVGPU-1174 Change-Id: Ic3b5106945df7105633730ecd1d150af770a5e83 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1918226 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-26 11:14:19 -07:00
Philip Elcan	a84e69d693	gpu: nvgpu: fifo_gk20: make pbdma_id type the same The use of the pbdma_id value was not consistent. This caused MISRA 10.3 violations due to the assignment between different essential types. JIRA NVGPU-647 Change-Id: I1d25748ee64bacf659bb5c3b65f26e5721c4670c Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1917634 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 16:46:42 -07:00
Philip Elcan	901cf5ffcb	gpu: nvgpu: fifo_gk20a: fix some declaration types This fixes some declarations in fifo_gk20a that resulted in MISRA 10.3 violations. MISRA 10.3 prohibits implicit assignment between types. JIRA NVGPU-647 Change-Id: I28df83a73c5530c37275cdd36c6c56d03a1ccadd Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1917633 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 16:46:33 -07:00
Philip Elcan	1040a3a534	gpu: nvgpu: fix return for engine_enum_from_type() Use an enum instead of an int as a return type for this function. This resolves violations of MISRA 10.3 that prohibits implicit assignment between types. JIRA NVGPU-647 Change-Id: I2a3725b28c6db9c1540da25228df3da184dd2e6d Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1917632 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 16:46:24 -07:00
Deepak Nibade	84a37954fb	gpu: nvgpu: keep runlist submit lock only for submit registers We right now acquire rulist_submit_mutex to submit runlist and also to wait for submit completion But locking is only needed to atomically configure the runlist submit registers, hence move the locking to inside of gk20a_fifo_runlist_hw_submit() where we program the registers Also convert the mutex to spinlock at the same time Note that similar locking is not required for tu104_fifo_runlist_hw_submit() since the runlist submit registers are per-runlist beginning Turing Bug 200452543 Change-Id: I53d6179b80cb066466b64c6efa9393e55e381bfc Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1919058 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:10 +05:30
Deepak Nibade	e8001064ec	gpu: nvgpu: add mutex for runlist submit We right now submit new runlist and wait for submit to complete in gk20a_fifo_update_runlist_locked() It is possible that multiple runlists are being updated in parallel by multiple threads since the lock taken by parent of gk20a_fifo_update_runlist_locked() is per-runlist Note that the concurrent threads would still construct their runlists into per-runlist buffer But we still have a race condition while submitting these runlists to hardware. With an application that creates and destroys multiple contexts in parallel this race condition gets realized and we see h/w reporting an error interrupt NV_PFIFO_INTR_SCHED_ERROR_CODE_BAD_TSG which means a bad TSG was submitted Fix this by adding a global lock for runlist submit and wait sequence This ensures that concurrent threads do not try to submit runlists to the hardware at the same time Bug 200452543 Bug 2405416 Change-Id: I2660a2e5d9af1da400e7f865361722dc0914f96f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1851114 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30

1 2 3 4

157 Commits