linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Konsta Holtta	44e4d69734	gpu: nvgpu: add channel.force_ctx_reload HAL Isolate the write to ccsr_channel_force_ctx_reload behind a HAL op. Jira NVGPU-1307 Change-Id: Iaef7d740f4a89e4a45c7de28f001a7dea98ce066 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017268 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:28 -08:00
Konsta Holtta	cd4b2f642c	gpu: nvgpu: add HAL for reading ccsr_channel Refactor read accesses to the ccsr_channel register for channel state to be done via a channel HAL op for all chips. A new op called read_state is added for this; information needed by other units is collected in a new struct nvgpu_channel_hw_state. Jira NVGPU-1307 Change-Id: Iff9385c08e17ac086d97f5771a54b56b2727e3c4 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017266 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:06:09 -08:00
Konsta Holtta	7189630e7c	gpu: nvgpu: drop fifo_ in channel HAL names Now that the moved HAL ops from fifo are in channel, rename the implementations to match. Jira NVGPU-1307 Change-Id: I7b9336f506c9e71bcd0af98886216958bd6695eb Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017264 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:56 -08:00
Konsta Holtta	5cde4c2140	gpu: nvgpu: move chip specific channel reg ops to common Extract out the HAL ops' implementation that now belongs to the channel unit. This unit is responsible for channel register accesses and the like (ccsr_*). Rename channel_gm20b_bind to gm20b_fifo_channel_bind to match with the rest of the naming. Same with channel_gv11b_unbind. Jira NVGPU-1307 Change-Id: I58b9d96dbdaf36bdb163a5729544a41faec828ab Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017262 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:43 -08:00
Konsta Holtta	c330d8fd98	gpu: nvgpu: add channel HAL section for ccsr_* Split out ops that belong to channel unit to a new section called channel. Channel is a broad concept; this includes just the code that accesses channel registers (ccsr_*). This is effectively just renaming; the implementation still stays put. The word "channel" is also dropped from certain HAL entries to avoid redundancy (e.g., channel.disable_channel -> channel.disable). fifo.get_num_fifos gets an entirely new name: channel.count. Jira NVGPU-1307 Change-Id: I9a08103e461bf3ddb743aa37ababee3e0c73c861 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017261 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 17:05:34 -08:00
Abdul Salam	6945209418	gpu: nvgpu: Add unified check for clk_arb support Currently clk_arb needs PSTATE to be true for dGPU. Setting PSTATE only FALSE, causes issue as clk_arb fails. There is no such dependency of PSTATE on iGPU. Making it unified with a call to check_clk_arb_support(). This call is implemented based on its dependency in iGPU, dGPU. check_clk_arb_support returns true if supported, else false. Jira NVGPU-1948 Change-Id: I108dc12bd6ad8d0e074352080c978b7dda9bee05 Signed-off-by: Abdul Salam <absalam@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2014775 Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-12 08:54:49 -08:00
Philip Elcan	ff80b0e6c1	gpu: nvgpu: gp10b: misc MISRA 10.3 fixes This fixes some miscellaneous MISRA 10.3 violations in gp10b for assignment of objects of different size or essential type. JIRA NVGPU-1008 Change-Id: I40e83cd5682c9407ce4301663d07578a40ce1814 Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2006586 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 12:55:15 -08:00
Deepak Nibade	fe27a7f934	gpu: nvgpu: add gr/ctx and gr/subctx APIs to set hwpm ctxsw mode gr_gk20a_update_hwpm_ctxsw_mode() right now validates the incoming hwpm mode, checks if it is already set, and if not, it will go ahead and set the new hwpm mode by calling g->ops.gr.ctxsw_prog HALs Instead of programming hwpm mode in gr_gk20a.c, move the programming to gr/ctx and gr/subctx units by adding below APIs nvgpu_gr_ctx_prepare_hwpm_mode() - validate the incoming mode and check if it is already set nvgpu_gr_ctx_set_hwpm_mode() - set pm mode in graphics context nvgpu_gr_subctx_set_hwpm_mode() - set pm mode in subcontext Add gpu_va field to struct pm_ctx_desc to store the gpu_va to be programmed into context Rename NVGPU_DBG_HWPM_CTXSW_MODE_* to NVGPU_GR_CTX_HWPM_CTXSW_MODE_* and move them to gr/ctx.h Remove below HALs since they are no longer used g->ops.gr.ctxsw_prog.set_pm_mode_no_ctxsw() g->ops.gr.ctxsw_prog.set_pm_mode_ctxsw() g->ops.gr.ctxsw_prog.set_pm_mode_stream_out_ctxsw() Jira NVGPU-1527 Jira NVGPU-1613 Change-Id: Id2a4d498182ec0e3586dc7265f73a25870ca2ef7 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2011093 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 10:25:34 -08:00
Mahantesh Kumbar	de3ff22726	gpu: nvgpu: ACR LSF loader config changes LSF loader cleanup, on gm20b/gp10b PMU falcon & other GR falcons uses different struct to store loader config which needs different functions to fill LSF loader config data, but on gv11b/gv10x/tu10a uses common falcon struct to store loader config, so made single function to fill LSF loader config data using ACR LSF struct & removed duplicate code. Removed ACR LSF loader ops which were part of PMU ops to cleanup dependency JIRA NVGPU-1148 Change-Id: I681829e05463d2517a4049433d8b0de3adeb06d9 Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2012853 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 03:28:49 -08:00
Mahantesh Kumbar	7b933d58e0	gpu: nvgpu: ACR refactor to manage LSF ucodes Added data struct under ACR struct to manage LS falcons ucode as LS falcon ucode holds multiple properties & can be set at acr init stage to bootstrap LS falcons as required, at present LS falcons code is part ACR & partially part of PMU code to setup LSF bootstrap, so, needed to clean up the dependency. JIRA NVGPU-1148 Change-Id: Ie206e129e3db838041db44d5227ab76a1de991c8 Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2012763 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 03:28:41 -08:00
Mahantesh Kumbar	f1bdef62b6	gpu: nvgpu: ucode blob prepare using ACR ops Moved ACR ucode blob prepare ops to struct nvgpu_acr from PMU ops as ACR needs to be independent from PMU. JIRA NVGPU-1147 Change-Id: I2ad1805fcbd0837c24f6f09b6bc292ad2c346fb6 Signed-off-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2007291 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 03:27:22 -08:00
Konsta Holtta	49506f257e	gpu: nvgpu: split update_runlist HAL API in two A comment for gk20a_fifo_update_runlist() says: /* add/remove a channel from runlist special cases below: runlist->active_channels will NOT be changed. (ch == NULL && !add) means remove all active channels from runlist. (ch == NULL && add) means restore all active channels on runlist. */ Those special cases call for a new function, so add that. Delete the update_runlist HAL op and add update_for_channel (like update_runlist without the special cases) and reload (no channel to add or remove, just the special cases). While at it, rename gk20a_fifo_update_runlist_ids to nvgpu_runlist_reload_ids. It's common across chips and does what the reload HAL does but for a list of several IDs. Jira NVGPU-1922 Change-Id: I9a99ab03a636a1214c021faad359d2b304a9472f Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2013058 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-08 12:56:09 -08:00
Deepak Nibade	a5eb150635	gpu: nvgpu: add new gr/config unit to initialize GR configuration Add new unit gr/config to initialize GR configuration like GPC/TPC count, MAX count and mask Create new structure nvgpu_gr_config that stores all the configuration and that is owned by the new unit Move below fields from struct gr_gk20a to nvgpu_gr_config in gr/config.h Struct gr_gk20a now only holds the pointer to struct nvgpu_gr_config u32 max_gpc_count; u32 max_tpc_per_gpc_count; u32 max_zcull_per_gpc_count; u32 max_tpc_count; u32 gpc_count; u32 tpc_count; u32 ppc_count; u32 zcb_count; u32 pe_count_per_gpc; u32 gpc_tpc_count; u32 gpc_ppc_count; u32 gpc_zcb_count; u32 pes_tpc_count[GK20A_GR_MAX_PES_PER_GPC]; u32 gpc_tpc_mask; u32 pes_tpc_mask[GK20A_GR_MAX_PES_PER_GPC]; u32 gpc_skip_mask; u8 map_tiles; u32 map_tile_count; u32 map_row_offset; Remove gr->sys_count since it was already no longer used common/gr/config/gr_config.c unit now exposes the APIs to initialize the configuration and also to query the configuration values nvgpu_gr_config_init() is called to initialize GR configuration from gr_gk20a_init_gr_config() and gr_gk20a_init_map_tiles() is simply renamed as nvgpu_gr_config_init_map_tiles() Expose new API nvgpu_gr_config_deinit() to deinit the configuration Expose nvgpu_gr_config_get_*() APIs to query above configuration fields stored in nvgpu_gr_config structure Update vgpu_gr_init_gr_config() to initialize the configuration from gr->config structure Chip specific HALs that access GR register for initialization are implemented in common/gr/config/gr_config_gm20b.c Set these HALs for all GPUs Jira NVGPU-1879 Change-Id: Ided658b43124ea61b9f273b82b73fdde4ed3c8f0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2012167 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-08 12:55:53 -08:00
Alex Waterman	f2979bcdac	gpu: nvgpu: Remove support_sparse() HAL in MM The support sparse HAL serves only one purpose: return true or false depending on whether the given chip supports sparse mappings. This HAL is used to, in turn, program (or not) the NVGPU_SUPPORT_SPARSE_ALLOCS enabled flag. So instead of having all this rigmarole to program this flag just program it for all native GPUs. Then, in the vGPU specific characteristics function disable it explicitly. This seems to have precedent already. JIRA NVGPU-1737 JIRA NVGPU-1934 Change-Id: I630928ad656aaffc09fdc6b7fec9fc423aa94c38 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2006796 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-07 15:44:41 -08:00
Terje Bergstrom	a9f404cb99	gpu: nvgpu: Introduce NVGPU_DEBUGGER build flag Introduce build flag for NVGPU_DEBUGGER. Also introduces Makefile flag NVGPU_REDUCED and disables NVGPU_DEBUGGER when doing a reduced build. Make user space build enable the reduced build. Change-Id: I84d6142811f674f2a7652e093b63ea5e93d9143e Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2002190 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-01 09:46:07 -08:00
Vinod G	1b1ebb0a8d	gpu: nvgpu: log mme esr register information Add new hal to log the mme exception register information. Support added for Turing only. On mme exception interrupt, read the mme_hww_esr register and log the error based on esr register bits. JIRA NVGPU-1241 Change-Id: Ied3db0cc8fe6e2a82ecafc9964875e2686ca0d72 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2005807 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-30 04:42:58 -08:00
Debarshi Dutta	20b15e6f40	gpu: nvgpu: move sema specific cmdbuf methods to common/sync/ sema cmdbuf specific functions are only for the sync functionality of nvgpu and do not belong to fifo. construct files sema_cmdbuf_gk20a.h and sema_cmdbuf_gk20a.c under common/sync to contain the sema specific cmdbuf functions for arch gk20a. Jira NVGPU-1308 Change-Id: Iebeebe7a3de627f2de08d4ced74bb1aabf1eb53c Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975922 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-25 02:46:06 -08:00
Debarshi Dutta	ebe6fa7fac	gpu: nvgpu: move syncpt specific cmdbuf methods to common/sync/ syncpt cmdbuf specific functions are only for the sync functionality of nvgpu and do not belong to fifo. construct files syncpt_cmdbuf_gk20a.h and syncpt_cmdbuf_gk20a.c under common/sync to contain the syncpt specific cmdbuf functions for arch gk20a. The word 'fifo' is also removed from the name of these functions. Jira NVGPU-1308 Change-Id: I1a1fd1d31f7decd1398f8e2ff625f95cf1f55033 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975920 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-25 02:45:40 -08:00
Debarshi Dutta	8b57b3b938	gpu: nvgpu: restructure sync cmdbufs specific gpu_ops sync cmdbuf specific ops pointers are moved into a new struct sync_ops under the parent struct gpu_ops. The HAL assignments to the gk20a and gv11b versions are updated to match the new struct type. Jira NVGPU-1308 Change-Id: I1d9832ed5e938cb65747f0f6d34088552f75e2bc Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975919 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-25 02:45:11 -08:00
Konsta Holtta	7439449c5c	gpu: nvgpu: move runlist base and entry size hal ops Avoid including the HW headers directly in the HAL listings: add indirection functions for the two ops that were naked: - runlist.eng_runlist_base_size - runlist.runlist_entry_size GV100 gets a new fifo HAL file as base_size is the first one (and currently the only one) of GV100-specific ops. NVGPU-1309 Change-Id: Idf28b5e26c798457132ef595fa55c65bcddb1b31 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997826 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:58 -08:00
Konsta Holtta	237cee5997	gpu: nvgpu: move chip specific runlist code to common Extract out the HAL ops' implementation that now belongs to the runlist unit. Jira NVGPU-1309 Change-Id: I66185de0ddace1728da5f55ae11daa0b752bebf1 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997824 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:40 -08:00
Konsta Holtta	6fda25e958	gpu: nvgpu: move runlist HAL ops to separate section Split out ops that belong to runlist unit to a new section called runlist. This is effectively just renaming; the implementation still stays put. Jira NVGPU-1309 Change-Id: Ib928164f8008f680d9cb13c969e3304ef727abba Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1997823 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-24 04:14:31 -08:00
Deepak Nibade	b40c655e12	gpu: nvgpu: move regops to separate unit Move regops (gk20a/regops_gk20a.c) to separate unit common/regops/regops.c Move corresponding header (gk20a/regops_gk20a.h) to include/nvgpu/regops.h Move rest of the platform HAL files to common/regops/ as well Fix all the header includes to include new public header Remove *_apply_smpc_war() declarations from headers. Corresponding functions were cleaned up already, and declarations were left somehow Jira NVGPU-620 Change-Id: I8b8065b9c91f69809bdeb1b4caecdc7582c8a992 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1998723 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-21 23:04:28 -08:00
Vinod G	1ff12f065e	gpu: nvgpu: Update pbdma data and header reset functions Two new fifo hals are added. read_pbdma_data and reset_pbdma_header. In turing the instruction that caused the interrupt will be stored in NV_PPBDMA_PB_DATA0 register or NV_PPBDMA_HDR_SHADOW register, which is decided based on NV_PPBDMA_PB_COUNT value and PB_HEADER type JIRA NVGPU-1240 Change-Id: I54a92e317a6054335439d2d61bced28aff3eecb7 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1990699 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-17 22:35:06 -08:00
Alex Waterman	489236d181	gpu: nvgpu: MISRA 21.2 fixes: __nvgpu_set_enabled() Rename __nvgpu_set_enabled() to nvgpu_set_enabled(). The original double underscore was present to indicate that this function is a function with potentially unintended side effects (enabling a feature has wide ranging impact). To not lose this documentation a comment was added to convey that this function must be used with care. JIRA NVGPU-1029 Change-Id: I8bfc6fa4c17743f9f8056cb6a7a0f66229ca2583 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1989434 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-15 12:54:19 -08:00
Deepak Nibade	4883f14fbb	gpu: nvgpu: map global_ctx buffers from gr/ctx unit Currently all the global context buffers are mapped into each graphics context. Move all the mapping/unmapping support to gr/ctx unit since all the mappings are owned by context itself Add nvgpu_gr_ctx_map_global_ctx_buffers() that maps all the global context buffers into given gr_ctx Add nvgpu_gr_ctx_get_global_ctx_va() that returns VA of the mapping for requested index Remove g->ops.gr.map_global_ctx_buffers() since it is no longer required. Also remove below APIs gr_gk20a_map_global_ctx_buffers() gr_gk20a_unmap_global_ctx_buffers() gr_tu104_map_global_ctx_buffers() Remove global_ctx_buffer_size from nvgpu_gr_ctx since it is no longer used Jira NVGPU-1527 Change-Id: Ic185c03757706171db0f5a925e13a118ebbdeb48 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1987739 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-09 10:46:48 -08:00
Deepak Nibade	1c17ae310c	gpu: nvgpu: add new unit for GR context Add new unit common/gr/ctx.c to manage GR context This unit provides interfaces to allocate/free/map/unmap GR context, patch context, pm context, ctxsw {preempt/spill/betacb/pagepool/rtvcb} buffers. It also provides APIs to set size of above buffers Add new header file include/nvgpu/gr/ctx.h to declare all the interfaces. Move nvgpu_gr_ctx, patch_desc, pm_ctx_desc, zcull_ctx_desc structures to this unit Add new structure nvgpu_gr_ctx_desc to hold context description parameters. For now we add sizes of all the buffers here. Add this structure to gr_gk20a for global reference Remove gr_gp10b_alloc_buffer() since it is no longer used Rename g->ops.gr.alloc_gfxp_rtv_cb() to g->ops.gr.init_gfxp_rtv_cb() since this HAL now only sets the size of rtvcb ctxsw buffer Remove gr->ctx_vars.buffer_size and gr->ctx_vars.buffer_total_size since they were redundant. We already have gr->ctx_vars.golden_image_size to denote golden image size Jira NVGPU-1527 Change-Id: I8847b347f80235209dd5e28d979e79984ab85408 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1987702 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-09 10:46:29 -08:00
Konsta Holtta	e05c0d13a0	gpu: nvgpu: add runlist unit to common Extract non-chip-specific code that manages the runlists (init, update, reschedule etc.) to a new file in the common directory. Move the declarations to a new matching runlist.h header. Jira NVGPU-1309 Change-Id: I3c7e0032899516487037f47ddc9a7e7aa4b0b33a Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978058 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:34 -08:00
Konsta Holtta	5504d368ec	gpu: nvgpu: add HAL for preempt next The reschedule_preempt_next functionality requires direct access to registers. Move it to be called via a HAL op for chips that have rescheduling support in HAL. Jira NVGPU-1309 Change-Id: I72d87d8e7ebd3fc05f094b83398cc1ab4b4027a5 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978057 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:25 -08:00
Deepak Nibade	93a05937f0	gpu: nvgpu: remove g->ops.gr.dump_ctxsw_stats g->ops.gr.dump_ctxsw_stats is redundant since we can directly call g->ops.gr.ctxsw_prog.dump_ctxsw_stats Also clean up gr_gp10b_dump_ctxsw_stats since it too becomes redundant Jira NVGPU-1527 Change-Id: I0ac5bcf6cf3dca30954d302766431496971708f4 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1986814 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-03 23:05:42 -08:00
Sagar Kamble	5efc446a06	gpu: nvgpu: make all falcons struct nvgpu_falcon* With intention to make falcon header free of private data we are making all falcon struct members (pmu.flcn, sec2.flcn, fecs_flcn, gpccs_flcn, nvdec_flcn, minion_flcn, gsp_flcn) in the gk20a, pointers to struct nvgpu_falcon. Falcon structures are allocated/deallocated by falcon_sw_init & _free respectively. While at it, remove duplicate gk20a.pmu_flcn and gk20a.sec2_flcn, refactor flcn_id assignment and introduce falcon_hal_sw_free. JIRA NVGPU-1594 Change-Id: I222086cf28215ea8ecf9a6166284d5cc506bb0c5 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1968242 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-03 02:58:38 -08:00
Deepak Nibade	bb677160e5	gpu: nvgpu: check tu104 specific timestamp buffer full error code In gk20a_gr_handle_fecs_error(), we right now check the error code in mailbox to identify if we hit timestamp buffer full error interrupt This error code right now is hard coded to 0x26 But on Turing ucode this error code is set to 0x32 Add new HAL g->ops.fecs_trace.get_buffer_full_mailbox_val() to get correct error code per platform and use this in gk20a_gr_handle_fecs_error() Bug 200471541 Bug 2469604 Change-Id: I7325354b39d35b1c8b218e554814316d22950469 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978144 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-31 09:43:39 -08:00
tkudav	3267530f22	gpu: nvgpu: Use device_info parsing HAL for Fifo Update the fifo code to use the HALs exposed by "Top" unit to read data from device_info table. The information for GRAPHICS engine in device_info table is now parsed using the get_device_info HAL from "Top" unit. Copy engine(CE) has multiple entries in the device_info table corresponding to each instance of the engine. Prior to Pascal, each instance of an engine was denoted by different engine type. For example in GM20B, there are engine types like COPY_ENGINE0, COPY_ENGINE1 and so on. In Pascal and chips beyond, a new field called "inst_id" is added and the engine_type is kept the same for different instances of an engine. For example in GP10B, all copy engine entries have same engine type i.e ENGINE_LCE, but different inst_ids. So for Pascal and chips beyond, we use a different HAL to get CE information from device_info table. JIRA NVGPU-1053 Change-Id: Ib40a616d903a5dbef5730678c2ebc3454b8e900d Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969400 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-20 09:26:01 -08:00
tkudav	38f8b3fb00	gpu: nvgpu: Add HALs for device_info table parsing The device_info table is an array of registers which contain engine specific data for engines like CE, graphics, nvdec, ioctrl etc. These registers contain data like intr_enum, reset_enum, pri_base and so on. The Top unit would include HAL to parse this table and get data for a particular engine. Some engines like CE have multiple entries in the device_info table corresponding to each instance of the engine. Prior to Pascal, each instance of an engine was denoted by different engine type. For example in GM20B, there are engine types like COPY_ENGINE0, COPY_ENGINE1 and so on. In Pascal and chips beyond, a new field called "inst_id" is added and the engine_type is kept the same. For example in GP10B, all copy engine entries have same engine type i.e ENGINE_LCE, but different inst_ids. So for Pascal and chips beyond, add HAL to get number of entries corresponding to an engine type. The "get_device_info" HAL will parse a specific instance of the engine using inst_id argument JIRA NVGPU-1053 Change-Id: Ie3058b1c1bfdd87bfa47e5f037d049d9d50cfc0b Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969399 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-20 09:25:57 -08:00
Thomas Fleury	3943f87d69	gpu: nvgpu: userd slab cleanup Follow-up change to rename g->ops.mm.bar1_map (and implementations) to more specific g->ops.mm.bar1_map_userd. Also use nvgpu_big_zalloc() to allocate userd slabs memory descriptors. Bug 2422486 Bug 200474793 Change-Id: Iceff3bd1d34d56d3bb9496c179fff1b876b224ce Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970891 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-17 12:33:43 -08:00
Debarshi Dutta	fcd216e170	gpu: nvgpu: move gk20a_fifo_engines_on_id to ops struct gk20a_fifo_engines_on_id uses H/W headers to return a valid active engine mask. This qualifies the function to be invoked via a struct gpu_ops function pointer instead. Jira NVGPU-1237 Change-Id: Ice30610ef51cf4471b3750f21d38e6648953e9e2 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:48 -08:00
Debarshi Dutta	7f58347ed9	gpu: nvgpu: move tsg functions to common Any tsg specific functions that do high-level software-centric operations belong to the TSG unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/tsg.c and also rename them to use the prefix nvgpu_tsg_* gk20a_fifo_set_ctx_mmu_error_tsg gk20a_fifo_abort_tsg gk20a_fifo_error_tsg gk20a_fifo_check_tsg_ctxsw_timeout Jira NVGPU-1237 Change-Id: I4e3da821a878d4b4a0a0b53fbb7f4c10f135f58d Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1934299 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:26 -08:00
Debarshi Dutta	57f03e3a20	gpu: nvgpu: move channel functions to common Any channel specific functions having high-level software-centric operations belong to the channel unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/channel.c. Also, rename the functions to use the prefix nvgpu_channel_*. gk20a_fifo_set_ctx_mmu_error_ch gk20a_fifo_error_ch gk20a_fifo_check_ch_ctxsw_timeout Jira NVGPU-1237 Change-Id: Id6b6d69bbed193befbfc4c30ecda1b600d846199 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1932358 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:17 -08:00
Konsta Holtta	07993bbbd8	gpu: nvgpu: add runlist_write_state HAL The function gk20a_fifo_sched_disable_rw accesses HW directly. Rename it and add a HAL indirection so that it can be called from chip-independent code. Also fix some trivial MISRA violations in the function. Jira NVGPU-1309 Change-Id: Icf320738d3d1d4baa40257a9da3ca2c6b7fefc0b Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971274 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 12:06:08 -08:00
Deepak Nibade	fdc15553bc	gpu: nvgpu: add new HAL to initialize preemption mode g->ops.gr.alloc_gr_ctx HAL right now allocates graphics context and also initializes preemption mode for various platforms Separate out a new HAL g->ops.gr.init_ctxsw_preemption_mode that initializes preemption mode and call it from gk20a_alloc_obj_ctx() after context is created g->ops.gr.alloc_gr_ctx now only allocates the context as the name suggests Jira NVGPU-1527 Change-Id: I8a44672d5ab2ebfe315e6334115265e4ee4f24f0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972254 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 00:35:39 -08:00
Deepak Nibade	6bbcdb51c6	gpu: nvgpu: remove redundant GR ops g->ops.gr.enable_cde_in_fecs and g->ops.gr.update_boosted_ctx are no longer required since we can directly call g->ops.gr.ctxsw_prog.set_cde_enabled and g->ops.gr.ctxsw_prog.set_pmu_options_boost_clock_frequencies respectively remove those functions and the ops Jira NVGPU-1526 Change-Id: Idb0ad5f634e78aac44ec325ba2b7f59c612b29e8 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972184 GVS: Gerrit_Virtual_Submit Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 00:35:29 -08:00
Sagar Kamble	147d5d9402	gpu: nvgpu: update GPCCS falcon base addr init GPCCS falcon base address was being set without invoking hal api. Remove FALCON_GPCCS_BASE. This patch defines gpu_ops.gr.gpccs_falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: Icfa7a26d1bb2d67c81f05a43f6ce906f59706b3d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969431 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:20 -08:00
Sagar Kamble	c6fc301a9b	gpu: nvgpu: update FECS falcon base addr init FECS falcon base address was being set without invoking hal api. Remove FALCON_FECS_BASE. This patch defines gpu_ops.gr.fecs_falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: I9c8e60be4ee81a154020c982893725a12ebb72ef Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969430 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:16 -08:00
Sagar Kamble	e6668a163f	gpu: nvgpu: update PMU falcon base addr init PMU falcon base address was being set without invoking hal api. Remove FALCON_PWR_BASE. This patch defines gpu_ops.pmu.falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: I5c3f27e89bdcc775025bc8d4fa9cf0af11ceb002 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969428 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:09 -08:00
Peng Liu	34df003519	gpu: nvgpu: using pmu counters for load estimate PMU counters #0 and #4 are used to count total cycles and busy cycles. These counts are used by podgov to estimate GPU load. PMU idle intr status register is used to monitor overflow. Overflow rarely occurs because frequency governor reads and resets the counters at a high cadence. When overflow occurs, 100% work load is reported to frequency governor. Bug 1963732 Change-Id: I046480ebde162e6eda24577932b96cfd91b77c69 Signed-off-by: Peng Liu <pengliu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1939547 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 18:22:54 -08:00
Thomas Fleury	7e68e5c83d	gpu: nvgpu: userd slab allocator We had to force allocation of physically contiguous memory for USERD in nvlink case, as a channel's USERD address is computed as an offset from fifo->userd address, and nvlink bypasses SMMU. With 4096 channels, it can become difficult to allocate 2MB of physically contiguous sysmem for USERD on a busy system. PBDMA does not require any sort of packing or contiguous USERD allocation, as each channel has a direct pointer to that channel's 512B USERD region. When BAR1 is supported we only need the GPU VAs to be contiguous, to setup the BAR1 inst block. - Add slab allocator for USERD. - Slabs are allocated in SYSMEM, using PAGE_SIZE for slab size. - Contiguous channels share the same page (16 channels per slab). - ch->userd_mem points to related nvgpu_mem descriptor - ch->userd_offset is the offset from the beginning of the slab - Pre-allocate GPU VAs for the whole BAR1 - Add g->ops.mm.bar1_map() method - gk20a_mm_bar1_map() uses fixed mapping in BAR1 region - vgpu_mm_bar1_map() passes the offset in TEGRA_VGPU_CMD_MAP_BAR1 - TEGRA_VGPU_CMD_MAP_BAR1 is called for each slab. Bug 2422486 Bug 200474793 Change-Id: I202699fe55a454c1fc6d969e7b6196a46256d704 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:10 -08:00
Deepak Nibade	6777bd5ed2	gpu: nvgpu: add separate unit for gr/ctxsw_prog Add separate new unit gr/ctxsw_prog that provides interface to access h/w header files hw_ctxsw_prog_.h Add below chip specific files that access above h/w unit and provide interface through g->ops.gr.ctxsw_prog.() HAL for rest of the units common/gr/ctxsw_prog/ctxsw_prog_gm20b.c common/gr/ctxsw_prog/ctxsw_prog_gp10b.c common/gr/ctxsw_prog/ctxsw_prog_gv11b.c Remove all the h/w header includes from rest of the units and code. Remove direct calls to h/w headers ctxsw_prog_() and use HALs g->ops.gr.ctxsw_prog.() instead In gr_gk20a_find_priv_offset_in_ext_buffer(), h/w header ctxsw_prog_extended_num_smpc_quadrants_v() is only defined on gk20a And since we don't support gk20a remove corresponding code Add missing h/w header ctxsw_prog_main_image_pm_mode_ctxsw_f() for some chips Add new h/w header ctxsw_prog_gpccs_header_stride_v() Jira NVGPU-1526 Change-Id: I170f5c0da26ada833f94f5479ff299c0db56a732 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1966111 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 14:41:04 -08:00
Vinod G	a747e3a3ba	gpu: nvgpu: RTV cb support for gfxp Add new buffer support for graphics preemption in Turing. Add new hal for allocate and commit rtv circular buffer for gfxp. Add new hal for free gr_ctx for TU104. JIRA NVGPUT-98 Change-Id: I4396fd50288db55da5f924fefa96a2e3d170094b Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1944975 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-05 17:03:53 -08:00
Konsta Holtta	94d4a42d10	gpu: nvgpu: add runlist_busy_engines HAL Split out the code to check which engines on a particular runlist are busy from gk20a_fifo_runlist_reset_engines() and make it a HAL op. Resetting engines is common across chips but status is read from registers. Jira NVGPU-1309 Change-Id: I7a63a2942a9e210481822eaf85795fc17dad0dc5 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1961822 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 11:54:27 -08:00
Sagar Kamble	1da7c720c0	gpu: nvgpu: reorganize falcon HAL code Move falcon HAL files under common/falcon unit and rename the files to falcon_*.c|h for consistency. JIRA NVGPU-1459 Change-Id: I9f39097f35fd6228e80945251c7b7ef9cc901398 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1953757 Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-21 23:04:33 -08:00

270 Commits