linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 02:22:34 +03:00

Author	SHA1	Message	Date
Nitin Kumbhar	a2314ee780	gpu: nvgpu: move no_of_sm to common.gr.config 1. Move no_of_sm from gr to common.gr.config 2. Add nvgpu_gr_config_get_no_of_sm() API in gr.config to fetch no_of_sm. JIRA NVGPU-1884 Change-Id: I3c6c20a12cd7f9939a349a409932195f17392943 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073583 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 11:55:20 -07:00
Nitin Kumbhar	03e137b552	gpu: nvgpu: move init_sm_id_table hal to hal.gr.config Move init_sm_id_table hal to common.hal.gr.config. Two separate hals for gm20b and gv100 are added. JIRA NVGPU-1884 Change-Id: Id307542db67b103ec25b02b41fd3b9d9bd8f30f0 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073582 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 11:55:05 -07:00
Seema Khowala	f66f3e1341	gpu: nvgpu: move fifo intr to hal/fifo Removed intr_0_error_mask ops Added below ops for fifo intr intr_0_enable intr_1_enable intr_0_isr intr_1_isr JIRA NVGPU-1310 Change-Id: I19bd1a380a89cffd582d6c4a0b7796a46fec5afb Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072144 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 11:03:39 -07:00
Antony Clince Alex	217be5e492	gpu: nvgpu: report sm machine check errors Introduce callbacks for reporting various SM machine check errors along with relevant information which can be used to root cause the reason for the error. Jira NVGPU-1839 Change-Id: I1396f5ecb1ae1a7a7bb5d7534f5f54920c68e366 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2071467 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 06:35:45 -07:00
Nitin Kumbhar	30eea4ff2b	gpu: nvgpu: create common.gr.zcull 1. Separate out zcull unit from gr 2. Move zcull HALs from gr to common.hal.gr.zcull 3. Move common zcull functions to common.gr.zcull JIRA NVGPU-1883 Change-Id: Icfc297cf3511f957aead01044afc6fd025a04ebb Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2076547 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 01:55:14 -07:00
Vinod G	863ab23445	gpu: nvgpu: Add interrupt hal unit for gr Create interrupt hal unit under hal.gr.intr. This holds the interrupt and exception related hals. Move enable_exceptions and enable_gpc_exceptions hal functions to hal.gr.init location. Modify enable_exceptions hal to pass gr->config and enable or disable parameters. Modify enable_gpc_exceptions to pass gr->config parameter. Add new hal function enable_interrupts with enable or disable parameter This hal helps to enable and disable the gr interrupts as needed. gr init calls that use these hals are modified to g->ops.gr.intr.enable_exceptions g->ops.gr.intr.enable_gpc_exceptions JIRA NVGPU-3016 Change-Id: Ib62f8bf0b5289b815c8eff4d32a47387f24af51b Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077857 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-23 12:53:53 -07:00
Deepak Nibade	e64e02aaef	gpu: nvgpu: move global circular buffer commit hal to hal.gr.init Move g->ops.gr.commit_global_bundle_cb() hal to hal.gr.init unit as g->ops.gr.init.commit_global_bundle_cb() Remove register header accessor from gr_gk20a_commit_global_ctx_buffers() and move it to hal functions Move hal definitions to gm20b/gp10b hal files appropriately Jira NVGPU-2961 Change-Id: I6358dce963857402aa1d4d5606bf75398b9be83d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077216 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-23 00:54:20 -07:00
Deepak Nibade	e7047d0151	gpu: nvgpu: move circular/pagepool buffer size hals to hal.gr.init unit Move g->ops.gr.get_global_ctx_cb_buffer_size() and g->ops.gr.get_global_ctx_pagepool_buffer_size() hals to hal.gr.init unit Move corresponding hal definitions to hal.gr.init unit Jira NVGPU-2961 Change-Id: Ifff3e2073f6d9bca5b37244f7e107bad885e7ca7 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077215 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-23 00:54:06 -07:00
Seshendra Gadagottu	60073d2156	gpu: nvgpu: move ltc related data to nvgpu_ltc Moved following ltc related data to struct nvgpu_ltc and has a reference to it from struct gk20a: struct nvgpu_spinlock ltc_enabled_lock; u32 max_ltc_count; u32 ltc_count; u32 slices_per_ltc; u32 cacheline_size; Added function remove_support for ltc and it is called during nvgpu remove sequence. Added following helper functions in ltc.h: u32 nvgpu_ltc_get_ltc_count(struct gk20a g); u32 nvgpu_ltc_get_slices_per_ltc(struct gk20a g); u32 nvgpu_ltc_get_cacheline_size(struct gk20a *g); Removed redudnant ltc.init_fs_state call from vgpu init sequence since it is getting called from nvgpu_init_ltc_support. NVGPU-2044 Change-Id: I3c256dc3866f894c38715aa2609e85bd2e5cfe5a Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073417 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 15:13:46 -07:00
Deepak Nibade	4c8aadf83c	gpu: nvgpu: add hal.gr.init hals to get global cb sizes Remove below variables from struct gr_gk20a u32 bundle_cb_default_size; u32 min_gpm_fifo_depth; u32 bundle_cb_token_limit; u32 attrib_cb_default_size; u32 alpha_cb_default_size; u32 attrib_cb_gfxp_default_size; u32 attrib_cb_gfxp_size; u32 attrib_cb_size; u32 alpha_cb_size; Instead add below hals in hal.gr.init unit to get all of above sizes u32 (get_bundle_cb_default_size)(struct gk20a g); u32 (get_min_gpm_fifo_depth)(struct gk20a g); u32 (get_bundle_cb_token_limit)(struct gk20a g); u32 (get_attrib_cb_default_size)(struct gk20a g); u32 (get_alpha_cb_default_size)(struct gk20a g); u32 (get_attrib_cb_gfxp_default_size)(struct gk20a g); u32 (get_attrib_cb_gfxp_size)(struct gk20a g); u32 (get_attrib_cb_size)(struct gk20a g, u32 tpc_count); u32 (get_alpha_cb_size)(struct gk20a g, u32 tpc_count); u32 (get_global_attr_cb_size)(struct gk20a g, u32 max_tpc); Define these hals for all gm20b/gp10b/gv11b/gv100/tu104 chips Also add hal.gr.init support for gv100 chip Remove all accesses to variables from struct gr_gk20a and start using newly defined hals Remove below hals to initialize sizes since they are no more required g->ops.gr.bundle_cb_defaults(g); g->ops.gr.cb_size_default(g); g->ops.gr.calc_global_ctx_buffer_size(g); Also remove definitions of above hals from all the chip files Jira NVGPU-2961 Change-Id: I130b578ababf22328d68fe19df581e46aebeccc9 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077214 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 12:48:03 -07:00
Thomas Fleury	7abb0324b4	gpu: nvgpu: move engine reset to common Move gk20a_fifo_reset_engine to common engine code. Added the following hals: - gr.halt_pipe - gr.reset Jira NVGPU-1306 Jira NVGPU-1315 Change-Id: I97b195720bcf8bf9dbc6882f0830e33942441b7b Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2035216 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 12:47:03 -07:00
Divya Singhatwaria	43d6dc82c9	gpu: nvgpu: Re-structure nvgpu_pmu struct - Make a new structure: nvgpu_pmu_pg for PG unit - This new struct combines all PG unit variables like elpg_stat, elpg_refcnt, pg_init etc. into one structure as a part of PG unit refactoring. - Use pmu_pg struct to access all PG variables. - Eg: &pmu->pmu_pg.elpg_mutex, &pmu->pmu_pg.pg_mutex and so on. NVGPU-1973 Change-Id: I9973072826f4089f6315827bce49fa30dbcbcdda Signed-off-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2071306 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 07:38:43 -07:00
Thomas Fleury	696d212718	gpu: nvgpu: move userd to separate unit Add userd unit under common/fifo Moved userd setup/cleanup from fifo: - nvgpu_userd_setup_sw - nvgpu_userd_cleanup_sw Moved common userd code from hals: - nvgpu_userd_init_slabs - nvgpu_userd_free_slabs - nvgpu_userd_init_channel Replaced the following hals - fifo.userd_gp_get - fifo.userd_gp_put - fifo.userd_pb_get - fifo.setup_userd - fifo.userd_entry_size With - userd.gp_get - userd.gp_put - userd.pb_get - userd.init_mem - userd.entry_size Also added the following hals - userd.setup_sw: init slabs and reserve userd gpu_va - userd.cleanup_sw: de-init slabs and free gpu_va - userd.setup_hw: setup writeback timeout Jira NVGPU-2713 Change-Id: Ide854a38531a3ce00e61045449ddd010c956bdeb Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2035116 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 06:25:55 -07:00
Seema Khowala	27e3546175	gpu: nvgpu: add new tsg functions for ctxsw timeout re-org Add nvgpu_tsg_set_error_notifier function for setting error_notifier for all channels of a tsg. Add nvgpu_tsg_timeout_debug_dump_state function for finding if timeout_debug_dump is set for any of the channels of a tsg. Add nvgpu_tsg_set_timeout_accumulated_ms to set timeout_accumulated_ms for all the channels of a tsg. JIRA NVGPU-1312 Change-Id: Ib2daf2d462c2cf767f5a6e6fd3436abf6860091d Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077626 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 05:20:01 -07:00
Vinod G	540241e47c	gpu: nvgpu: Add fifo_access hal to hal.gr.init Add new hal fifo_access to control the gr_gpfifo_ctl_r register to enable or disable the access bit and semaphore access bit. g->ops.gr.init.fifo_access function call with true or false parameter to enable or disable the fifo_access. JIRA NVGPU-2951 Change-Id: I67ad7ce9f176d7ce347e8acb425f7a4bb9e088ca Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077705 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-21 17:26:10 -07:00
Vinod G	7e0063b871	gpu: nvgpu: move get_access_map hal to hal.gr.init Move the get_access_map hal code to the corresponding hal files under hal.gr.init Update g->ops.gr.get_access_map to g->ops.gr.init_get_access_map JIRA NVGPU-2951 Change-Id: Ib3a45198c936f9b0ae8694a35b6dc6968810e136 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2076920 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-21 17:25:56 -07:00
Debarshi Dutta	64269c2c55	gpu: nvgpu: modify init_pbdma_intr_descs into separate HALs init_pbdma_intr_descs HAL ops is used to update the internal values of the struct intr within struct fifo_gk20a. Three kinds of intr_descriptors are filled i.e. device_fatal_0, channel_fatal_0 and restartable_0. Breaking them into separate HALs has the advantage of reusing the h/w headers corresponding to the device_fatal_0 as they are same across all the architectures while those of channel_fatal_0 varies. Another advantage is to now decouple pbdma from filling in values within the fifo_gk20a struct. A new method gk20a_fifo_init_pbdma_descs is constructed that initializes the above intr struct by calling the separate HAL ops for these. Jira NVGPU-2950 Change-Id: I78ddc61a5d9b2088d34259af90f8b85817bf19d9 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072741 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-21 04:37:27 -07:00
Debarshi Dutta	f9ca472d5f	gpu: nvgpu: move pbdma HAL functions to hal/fifo/pbdma The following HAL pointers are moved to a separate HAL unit named pbdma. pbdma_acquire_val get_pbdma_signature dump_pbdma_status handle_pbdma_intr_0 handle_pbdma_intr_1 read_pbdma_data reset_pbdma_header The functions corresponding to these HAL units are also moved to pbdma_{arch} files under hal/fifo correspondinging to arch gm20b, gp10b, gv11b and tu104. Any calls to gk20a_readl and gk20a_writel are replaced by nvgpu_readl and nvgpu_writel respectively. Jira NVGPU-2950 Change-Id: I9723f30ddf6582df02c03fceb1fba26a206e1230 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2071782 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-21 04:37:03 -07:00
Vinod G	f8b7a4f6d2	gpu: nvgpu: move wait_empty hal to hal.gr.init Move wait_empty hal function to hal.gr.init. Remove gv11b_gr_wait_empty hal function as it use the same implementation in gp10b_gr_wait_empty and has no register difference. JIRA NVGPU-2951 Change-Id: I4035e7cc5bf1510db9a250747467a873777526cf Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075950 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-21 02:15:19 -07:00
Vinod G	d466ab8007	gpu: nvgpu: move load_tpc_mask and setup_rop_mapping to hal.gr.init Move load_tpc_mask and setup_rop_mapping hal functions to hal.gr.init. Existing load_tpc_mask hal code is split to two parts, one as a common code in gr_load_tpc_mask and register write to init.tpc_mask hal functions. Modify pd_tpc_per_gpc and pd_skip_table_gpc hals in the hal.gr.init to pass struct nvgpu_gr_config as a parameter. JIRA NVGPU-2951 Change-Id: I52e26d0f023afa511a8cf8c3e4c54f45350be4ae Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2074892 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-20 01:04:04 -07:00
Deepak Nibade	ac655611fd	gpu: nvgpu: disable pm_mode ctxsw by default in common.gr.ctx nvgpu_gr_ctx_load_golden_ctx_image() in common.gr.ctx unit programs initial pm_mode in context. gk20a_alloc_obj_ctx() then disables pm_mode ctxsw by default. Fix this by disabling pm_mode ctxsw by default in nvgpu_gr_ctx_load_golden_ctx_image() itself. Remove corresponding code from gk20a_alloc_obj_ctx() Jira NVGPU-1887 Change-Id: I6e1f83cefcb6229394da353e4cd87f1f5a0b10d4 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2076273 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-19 13:45:17 -07:00
Deepak Nibade	43c4083bd5	gpu: nvgpu: remove tu104 hal to commit global ctx buffers Add new hals in unit hal.gr.init to commit RTV circular buffer g->ops.gr.init.commit_rtv_cb() g->ops.gr.init.commit_gfxp_rtv_cb() Remove tu104 hal to commit global ctx buffers gr_tu104_commit_global_ctx_buffers() since we have specific hals to commit RTB circular buffer Update gr_gk20a_commit_global_ctx_buffers() to directly call hal.gr.init hals to commit RTV buffers Jira NVGPU-2961 Change-Id: I12a53386654ebfeb98bf187385bb8b839070d569 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075230 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-19 11:25:19 -07:00
Deepak Nibade	c34c240409	gpu: nvgpu: remove tu104 hal to allocate global ctx buffers Add a new hal.gr.init unit hal g->ops.gr.init.get_rtv_cb_size() to retrieve RTV buffer size Update gr_gk20a_alloc_global_ctx_buffers() to initialize RTV buffer size if g->ops.gr.init.get_rtv_cb_size hal is present Remove gr_tu104_alloc_global_ctx_buffers() since it is no longer required Jira NVGPU-2961 Change-Id: I44be8dfdda5c813eac445192635a3a6c2b867b3a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075229 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-19 11:25:04 -07:00
Deepak Nibade	a3a508c21d	gpu: nvgpu: move gr.commit_global_timeslice hal to hal.gr.init unit Move g->ops.gr.commit_global_timeslice() hal operation to hal.gr.init unit as g->ops.gr.init.commit_global_timeslice() Drop channel pointer in parameter list since it was unused Also change return type to void since it never returns error Move corresponding gm20b and gv11b hal operations to hal.gr.init unit Jira NVGPU-2961 Change-Id: I68deef45af1d52149eb354a1478cc2b5f2e4ec2a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075228 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-19 11:24:50 -07:00
Deepak Nibade	f69050632d	gpu: nvgpu: add hal.gr.init hal to load method init bundle Add a new hal operation g->ops.gr.init.load_method_init() in hal.gr.init unit that reads method init netlist bundle and writes those values to h/w appropriately Use new hal in gr_gk20a_init_golden_ctx_image() instead of direct register accesses Jira NVGPU-2961 Change-Id: If1edd09445e55b5ad9cb1ec7b0f32cab9bfd6f05 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075227 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-19 11:24:35 -07:00
Antony Clince Alex	09d5059369	gpu: nvgpu: report fecs ctxsw errors Introduce hooks for reporting the following ctxsw errors. CTXSW_WATCHDOG CTXSW_CRC_MISMATCH FAULT_DURING_CTXSW Add missing accessors for CTXSW interrupt registers and CRC error mailbox enumeration type. Jira NVGPU-1860 Jira NVGPU-1865 Jira NVGPU-1862 Change-Id: I1a4953b874bdb212497f12ec1493bed30d9a0f67 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017998 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-19 01:08:03 -07:00
Vinod G	30fd2a5dcc	gpu: nvgpu: move gr.init_fs_state HAL to hal.gr.init unit Move GR HAL operation g->ops.gr.init_fs_state to hal.gr.init unit as g->ops.gr.init.fs_state. Copy the corresponding hal function for init fs_state to the hal.gr.init files. JIRA NVGPU-2951 Change-Id: Icaf47e8872cc74a5a7430026633c52b47cfc879b Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073381 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-18 16:56:48 -07:00
Thomas Fleury	6729d67f59	gpu: nvgpu: vpgu: use fifo common init/deinit code Native and vgpu were using different paths for fifo init/deinit code. Use same nvgpu_fifo_init_support for init code: nvgpu_fifo_init_support g->ops.fifo.setup_sw vgpu_fifo_setup_sw (NEW) Use same nvgpu_fifo_remove_support for deinit code: nvgpu_fifo_remove_support (NEW) g->ops.fifo.cleanup_sw (NEW) vgpu_fifo_cleanup_sw (NEW) Also implemented gk20a_fifo_cleanup_sw for native case. Jira NVGPU-1306 Jira NVGPU-2855 Change-Id: Iefe303cc224f804a206422e2efffda9da1616d89 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2029649 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-18 16:56:04 -07:00
Deepak Nibade	dbcce79b55	gpu: nvgpu: remove g->ops.gr.alloc_gr_ctx() hal Common code now directly calls gr_gk20a_alloc_gr_ctx() and vgpu code directly calls vgpu_gr_alloc_gr_ctx() Remove g->ops.gr.alloc_gr_ctx() hal since it is no longer required Jira NVGPU-1887 Change-Id: I65d19f4a8ae62967ff67d6f69b5af1b46abf9c1a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075233 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-18 15:45:54 -07:00
Deepak Nibade	7ea3a9833b	gpu: nvgpu: add common.gr.ctx apis to init/validate/set preemption modes Add below new apis in common.gr.ctx unit : Initialize preemption modes nvgpu_gr_ctx_init_compute_preemption_mode() nvgpu_gr_ctx_init_graphics_preemption_mode() Validate preemption modes nvgpu_gr_ctx_check_valid_preemption_mode() Set preemption modes nvgpu_gr_ctx_set_preemption_modes() Use new APIs instead of directly accessing preemption modes from struct nvgpu_gr_ctx Also move preemption mode #define values to include/nvgpu/gr/ctx.h Jira NVGPU-1887 Change-Id: I72cfc98bdc62df1b1a55f93c96514d4ea3c9dbf3 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2075240 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-18 12:55:16 -07:00
Seshendra Gadagottu	e6f9033048	gpu: nvgpu: cbc: move cbc de-init sequence Move cbc_remove_support from gr remove to generic nvgpu remove sequence. JIRA NVGPU-2896 JIRA NVGPU-2897 Change-Id: Ia9c1a81e849bfe0dc123a86473ae2b0d77792335 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2074251 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-17 05:15:59 -07:00
Seshendra Gadagottu	a2bc7d5923	gpu: nvgpu: cbc: move cbc related code from gr Moved cbc related code and data from gr to cbc unit. Ltc and cbc related data is moved from gr header: 1. Ltc related data moved from gr_gk20a -> gk20a and it will be moved eventually to ltc unit: u32 slices_per_ltc; u32 cacheline_size; 2. cbc data moved from gr_gk20a -> nvgpu_cbc u32 compbit_backing_size; u32 comptags_per_cacheline; u32 gobs_per_comptagline_per_slice; u32 max_comptag_lines; struct gk20a_comptag_allocator comp_tags; struct compbit_store_desc compbit_store; 3. Following config data moved gr_gk20a -> gk20a u32 comptag_mem_deduct; u32 max_comptag_mem; These are part of initial config which should be available during nvgpu_probe. So it can't be moved to nvgpu_cbc. Modified code to use above updated data structures. Removed cbc init sequence from gr and added in common cbc unit. This sequence is getting called from common nvgpu init code. JIRA NVGPU-2896 JIRA NVGPU-2897 Change-Id: I1a1b1e73b75396d61de684f413ebc551a1202a57 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2033286 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-17 05:15:35 -07:00
Vinod G	43672dd237	gpu: nvgpu: gr/init update move gr_gk20a_init_fs_state function to common/gr/init as nvgpu_gr_init_fs_state. JIRA NVGPU-1885 Change-Id: I37aad483be268e2b722883719376beb142c0b7ea Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072413 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-16 10:05:46 -07:00
Deepak Nibade	04786d1a2e	gpu: nvgpu: add hal.gr.init hal to enable/disable fe_go_idle timeout Add new hal operation g->ops.gr.init.fe_go_idle_timeout() in hal.gr.init unit to enable/disable fe_go_idle timeout Use this hal in gr_gk20a_init_golden_ctx_image() instead of direct register access Remove timeout disable/enable code in gk20a_init_sw_bundle() since parent API gr_gk20a_init_golden_ctx_image() is already taking care of that Jira NVGPU-2961 Change-Id: Ice72699059f031ca0b1994fa57661716a6c66cd2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072550 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-16 05:06:42 -07:00
Deepak Nibade	15d8941341	gpu: nvgpu: move gr.init_preemption_state HAL to hal.gr.init unit Move GR HAL operation g->ops.gr.init_preemption_state() to hal.gr.init unit as g->ops.gr.init.preemption_state() Create hal.gr.init unit files for gp10b and gv11b and copy over corresponding functions to new files This API now takes gfxp_wfi_timeout_unit and gfxp_wfi_timeout_count as parameter Define gfxp_wfi_timeout_unit in struct gr_gk20a as a boolean flag named gfxp_wfi_timeout_unit_usec Remove GFXP_WFI_TIMEOUT_UNIT_SYSCLK/USEC macros Jira NVGPU-2961 Change-Id: I4347b1e30c86c231e44cf274adccd8c70addcdab Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072549 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-16 05:06:28 -07:00
Deepak Nibade	09e2e8c838	gpu: nvgpu: remove write to gr_scc_init_r() register Register gr_scc_init_r() is deprecated and non-functional since maxwell Remove write to this register and also remove its accessors Jira NVGPU-2961 Change-Id: I7ef0c55290003234f795a66435c1f7093827662e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072548 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-16 05:06:13 -07:00
Deepak Nibade	7fa2189fb3	gpu: nvgpu: move fecs_trace operations under gr Move g->ops.fecs_trace.() HAL operations under gr operations as g->ops.gr.fecs_trace.() Also rename gk20a_ctxsw_() functions used in common code to the format nvgpu_gr_fecs_trace_() Jira NVGPU-1880 Change-Id: Idf2f8fb3d7ba2832bf1837fd97b70b3cee412123 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2070767 GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-16 05:05:41 -07:00
Deepak Nibade	1208ad7cef	gpu: nvgpu: rearrange linux specific fecs trace support We have 3 header files for FECS tracing support include/nvgpu/gr/fecs_trace.h : common header include/nvgpu/ctxsw_trace.h : header that includes both common and os-specific functions os/linux/ctxsw_trace.h : linux specific header Remove the second header since it is not needed. Move all structures that are needed in common code to include/nvgpu/gr/fecs_trace.h Move all function declarations that are needed in common code to include/nvgpu/gr/fecs_trace.h Move all linux specific declarations in os/linux/ctxsw_trace.h and rename this file as os/linux/fecs_trace_linux.h Also rename os/linux/ctxsw_trace.c to os/linux/fecs_trace_linux.c Jira NVGPU-1880 Change-Id: I05cc4489c4b6a64880b7d59c02b22cd2244d5e22 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2070766 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-16 05:05:32 -07:00
Sagar Kamble	f4174ef048	gpu: nvgpu: move nvgpu_falcon struct to nvgpu/falcon.h This struct was earlier moved to falcon_priv.h to give exclusive access to only falcon unit. However with HAL unit needing access to this we need to move it public header nvgpu/falcon.h. JIRA NVGPU-1993 Change-Id: Ia3b211798009107f64828c9765040d628448812a Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2069688 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-15 02:24:49 -07:00
Thomas Fleury	ffed5095db	gpu: nvgpu: move fifo init/deinit code to common Add fifo sub-unit to common.fifo to handle init/deinit code and global support functions. Split init into: - nvgpu_channel_setup_sw - nvgpu_tsg_setup_sw - nvgpu_fifo_setup_sw - nvgpu_runlist_setup_sw - nvgpu_engine_setup_sw - nvgpu_userd_setup_sw - nvgpu_pbdma_setup_sw Split de-init into - nvgpu_channel_cleanup_sw - nvgpu_tsg_cleanup_sw - nvgpu_fifo_cleanup_sw - nvgpu_runlist_cleanup_sw - nvgpu_engine_cleanup_sw - nvgpu_userd_cleanup_sw - nvgpu_pbdma_cleanup_sw Added the following HALs - runlist.length_max - fifo.init_pbdma_info - fifo.userd_entry_size Last 2 HALs should be moved resp. to pbdma and userd sub-units, when available. Added vgpu implementation of above hals - vgpu_runlist_length_max - vgpu_userd_entry_size - vgpu_channel_count Use hals in vgpu_fifo_setup_sw. Jira NVGPU-1306 Change-Id: I954f56be724eee280d7b5f171b1790d33c810470 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2029620 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-14 20:35:22 -07:00
Vinod G	56219f7c10	gpu: nvgpu: add more gr/init hal functions Register write from gr_gk20a_init_fs_state function are moved to hal. New hal added for setting the pd_tpc_per_gpc, pd_skip_table_gpc and cwd_gpcs_tpcs_num. pd_tpc_per_gpc helps to describe the number of tpcs in each logical gpc. pd_skip_table helps to skip certain TPCs during distribution. cwd_gpcs_tpcs_num helps to set number of tpcs and gpcs in CWD. remove write for depreciated NV_PBE_PRI_ZROP_SETTING_NUM_ACTIVE_FBPS and NV_PBE_PRI_CROP_SETTINS_NUM_ACTIVE_FBPS fields from BES_ZROP_SETTINGS and BES_CROP_SETTINGS registers. Both these fields changed to NUM_ACTIVE_LTCS from gm20b onwards and those are being set in existing hal functions. JIRA NVGPU-2951 Change-Id: I905b98356e8eadaf7e2481850de841c050ea50c5 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072249 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-14 15:34:53 -07:00
Vinod G	caac47c4fa	gpu: nvgpu: add new gr.init hals create new hals for wait_idle and wait_fe_idle under gr.init. modify functions to following hals and use same hals for all chips. gr_gk20a_wait_idle -> gm20b_gr_init_wait_idle gr_gk20a_wait_fe_idle -> gm20b_gr_init_wait_fe_idle JIRA NVGPU-2951 Change-Id: Ie60675a08cba12e31557711b6f05f06879de8965 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2072051 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-14 15:34:24 -07:00
Deepak Nibade	95f47ac13c	gpu: nvgpu: add new hal.gr.init HAL to reset sys/gpc/be units gr_gk20a_init_golden_ctx_image() right now resets sys/gpc/be units by directly accessing gr_fecs_ctxsw_reset_ctl_r() register Move this register write/read sequence to common.hal.gr.init unit through HAL operation g->ops.gr.init.override_context_reset() Use new HAL in gr_gk20a_init_golden_ctx_image() Also fix the delay() operations. delay() should be added before we read back gr_fecs_ctxsw_reset_ctl_r() register and not after Jira NVGPU-2961 Change-Id: I70d3a61b5aa60846815dee52ecac544066542695 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2070608 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 11:17:55 -07:00
Deepak Nibade	c4534b5ee3	gpu: nvgpu: add common.hal.gr.init unit Add new HAL unit common.hal.gr.init with below source files hal/gr/init/gr_init_gm20b.c hal/gr/init/gr_init_gm20b.h In gr_gk20a_init_golden_ctx_image() we force FE power mode on and also disable it. Extract out this sequence into new unit and expose new HAL operation that takes a boolean flag to enable/disable power mode g->ops.gr.init.fe_pwr_mode_force_on() Use new HAL operation in gr_gk20a_init_golden_ctx_image() Set this HAL for all the chips Jira NVGPU-2961 Change-Id: I1dd35d94fda5e5296af67c0abc944e200fb752ea Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2070607 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 11:17:40 -07:00
Vinod G	e8b6580953	gpu: nvgpu: remove pd_max_batches support remove unused pd_max_batches implementation. remove pd_max_batches support from gr_gk20a struct and sysfs Bug 200492671 Change-Id: Ibfd81a6aec88610175495018759c27341b637e52 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2070058 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 08:54:21 -07:00
Vinod G	3856aa54d8	gpu: nvgpu: remove czf_bypass support remove unused czf_bypass support clean up the czf_bypass from sysfs implementation, gr_gk20a struct, hal support in gp10b for init_czf_bypass and set_czf_bypass. Bug 200492671 Change-Id: I2412410838581341c777d07cf4b2fad2d4163956 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2070057 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 08:54:12 -07:00
Seema Khowala	cb91bf1e13	gpu: nvgpu: protect recovery with engines_reset_mutex Rename gr_reset_mutex to engines_reset_mutex and acquire it before initiating recovery. Recovery running in parallel with engine reset is not recommended. On hitting engine reset, h/w drops the ctxsw_status to INVALID in fifo_engine_status register. Also while the engine is held in reset h/w passes busy/idle straight through. fifo_engine_status registers are correct in that there is no context switch outstanding as the CTXSW is aborted when reset is asserted. Use deferred_reset_mutex to protect deferred_reset_pending variable If deferred_reset_pending is true then acquire engines_reset_mutex and call gk20a_fifo_deferred_reset. gk20a_fifo_deferred_reset would also check the value of deferred_reset_pending before initiating reset process Bug 2092051 Bug 2429295 Bug 2484211 Bug 1890287 Change-Id: I47de669a6203e0b2e9a8237ec4e4747339b9837c Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2022373 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 06:34:31 -07:00
Seema Khowala	7e2f124fd1	gpu: nvgpu: wait for gr.initialized before changing cg/pg set gr.initialized to false in the beginning of gk20a_gr_reset() and set it to true at the end of successful execution of gk20a_gr_reset. Use gk20a_gr_wait_initialized() to enable/disable cg/pg functions to make sure engine is out of reset and initialized. Bug 2092051 Bug 2429295 Bug 2484211 Bug 1890287 Change-Id: Ic7b0b71382c6d852a625c603dad8609c43b7f20f Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030827 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 06:34:17 -07:00
Seema Khowala	672e6bc31e	gpu: nvgpu: disable elpg before ctxsw_disable if fecs is sent stop_ctxsw method, elpg entry/exit cannot happen and may timeout. It could manifest as different error signatures depending on when stop_ctxsw fecs method gets sent with respect to pmu elpg sequence. It could come as pmu halt or abort or maybe ext error too. If ctxsw failed to disable, do not read engine info and just abort tsg. Bug 2092051 Bug 2429295 Bug 2484211 Bug 1890287 Change-Id: I5f3ba07663bcafd3f0083d44c603420b0ccf6945 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2014914 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 06:34:02 -07:00
Seema Khowala	d27f875d2c	gpu: nvgpu: change err to info print if failing eng id is -1 For handle_sched_error, change err to info print for failing eng id returned as -1 i.e. FIFO_INVAL_ENGINE_ID as no engine is found busy doing ctxsw. May be ctxsw already finished for the context for which ctxsw timeout intr was triggered. Possible Causes: a) On hitting engine reset, h/w drops the ctxsw_status to INVALID in fifo_engine_status register. Also while the engine is held in reset h/w passes busy/idle straight through. fifo_engine_status registers are correct in that there is no context switch outstanding as the CTXSW is aborted when reset is asserted. This is just a side effect of how gv100 and earlier versions of ctxsw_timeout behave. With gv10b and later, h/w snaps the context at the point of error so that s/w can see the tsg_id which caused the HW timeout. b) If engines are not busy and ctxsw state is valid then intr occurred in the past and if the ctxsw state has moved on to VALID from LOAD or SAVE, it means that whatever timed out eventually finished anyways. The problem with this is that s/w cannot conclude which context caused the problem as maybe more switches occurred before intr is handled. Bug 2092051 Bug 2429295 Bug 2484211 Bug 1890287 Change-Id: Ia79bee6e860fb179ee39024c963671d4f8245227 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030866 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-13 04:14:10 -07:00

1 2 3 4 5 ...

2720 Commits