linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 17:36:20 +03:00

Author	SHA1	Message	Date
Deepak Nibade	bb43f11a61	gpu: nvgpu: update common.gr doxygen Add below updates to common.gr doxygen: - Add doxygen comments for APIs that are mentioned in RM SWAD and in RM-common.gr traceability document. - Comment about valid ranges for input parameters of bunch of functions. - Add nvgpu_assert() to ensure correct value is passed as input parameter to number of functions. - Add references to relevant functions with @see. - Update Targets field for unit tests to cover newly doxygenated functions. - Update unit test test_gr_init_hal_pd_skip_table_gpc to take care of new asserts added into some APIs. Jira NVGPU-6180 Change-Id: Ie889bed96b6428b1fd86dcf30b322944464e9d12 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2469397 (cherry picked from commit 5d7d7e9ce1c4efe836ab842d7962a3aee4e8972f) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2469394 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-02-02 23:34:27 -08:00
Lakshmanan M	883c12529a	gpu: nvgpu: Add multi GR reset support for MIG * Added multi GR reset/recovery support for MIG. * Added a api to get the gr engine id using gr instance id. JIRA NVGPU-5650 JIRA NVGPU-5653 Change-Id: I12ece75a4c33f0944f404121b54879e814dda6df Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2443644 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Vedashree Vidwans	c0b9ae2f17	gpu: nvgpu: enable gr_reset in recovery on sim platform HALT_PIPELINE method is supported on nvgpu-next simulation platform. Send HALT_PIPELINE followed by gr reset during recovery for all types of platforms including simulation platform. Bug 3109773 Change-Id: Ib830075bb9414fa1765c762a652e63cddbe6a141 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406719 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	c36752fe3d	gpu: nvgpu: sim: make ring buffer independent of PAGE_SIZE The simulator ring buffer DMA interface supports buffers of the following sizes: 4, 8, 12 and 16K. At present, it is configured to 4K and it happens to match with the kernel PAGE_SIZE, which is used to wrap back the GET/PUT pointers once 4K is reached. However, this is not always true; for instance, take 64K pages. Hence, replace PAGE_SIZE with SIM_BFR_SIZE. Introduce macro NVGPU_CPU_PAGE_SIZE which aliases to PAGE_SIZE and replace latter with former. Bug 200658101 Jira NVGPU-6018 Change-Id: I83cc62b87291734015c51f3e5a98173549e065de Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2420728 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	995731171b	gpu: nvgpu: Do not reset PERFMON and BLG when MIG is enabled Do not reset PERFMON and BLG when MIG is enabled as PERFMON is a global engine which is shared by all syspipes. Individual PERF counters can be reset during gr syspipe reset. JIRA NVGPU-5650 Change-Id: I4a7fc9b6c62e94ee65779068ca257cb8e01c8cee Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2424604 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Lakshmanan M	2ecb5feaad	gpu: nvgpu: Skip graphics CB programming for MIG Added logic to skip the following graphics CB allocation, map and programming sequence when MIG is enabled. Global CB: 1) NVGPU_GR_GLOBAL_CTX_CIRCULAR 2) NVGPU_GR_GLOBAL_CTX_PAGEPOOL 3) NVGPU_GR_GLOBAL_CTX_ATTRIBUTE 4) NVGPU_GR_GLOBAL_CTX_CIRCULAR_VPR 5) NVGPU_GR_GLOBAL_CTX_PAGEPOOL_VPR 6) NVGPU_GR_GLOBAL_CTX_ATTRIBUTE_VPR 7) NVGPU_GR_GLOBAL_CTX_RTV_CIRCULAR_BUFFER CTX CB: 1) NVGPU_GR_CTX_CIRCULAR_VA 2) NVGPU_GR_CTX_PAGEPOOL_VA 3) NVGPU_GR_CTX_ATTRIBUTE_VA 4) NVGPU_GR_CTX_RTV_CIRCULAR_BUFFER_VA JIRA NVGPU-5650 Change-Id: I38c2859ce57ad76c58a772fdf9f589f2106149af Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2423450 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	94bc3a8135	gpu: nvgpu: rearch zbc code and update hals Update nvgpu_gr_zbc as: struct nvgpu_gr_zbc { struct nvgpu_mutex zbc_lock; /* Lock to access zbc table / struct zbc_color_table zbc_col_tbl; /* SW zbc color table pointer / struct zbc_depth_table zbc_dep_tbl; /* SW zbc depth table pointer / struct zbc_stencil_table zbc_s_tbl; /* SW zbc stencil table pointer / u32 min_color_index; / Minimum valid color table index / u32 min_depth_index; / Minimum valid depth table index / u32 min_stencil_index; / Minimum valid stencil table index / u32 max_color_index; / Maximum valid color table index / u32 max_depth_index; / Maximum valid depth table index / u32 max_stencil_index; / Maximum valid stencil table index / u32 max_used_color_index; / Max used color table index / u32 max_used_depth_index; / Max used depth table index / u32 max_used_stencil_index; / Max used stencil table index / }; Add global struct nvgpu_gr_zbc_table_indices struct nvgpu_gr_zbc_table_indices { u32 min_color_index; u32 min_depth_index; u32 min_stencil_index; u32 max_color_index; u32 max_depth_index; u32 max_stencil_index; }; Currently, hw zbc table registers are written during both gr_init_setup_sw() and gr_init_setup_hw(). - Modify nvgpu_gr_zbc_load_default_table() to nvgpu_gr_zbc_load_default_sw_table() to only update sw copy of zbc table during gr_init_setup_sw(). - Modify nvgpu_gr_zbc_load_table() to write zbc values stored in sw zbc table to hw registers. Re-structure zbc function as per zbc type i.e. color, depth and stencil. Add gr.zbc.init_table_indices() hal to initialize zbc indices. Valid ZBC table indices start from 1. HW indices start from 0 for color, depth and stencil tables. Note that the corresponding format registers follow ZBC index range starting at 1. - void (init_table_indices)(struct gk20a g, struct nvgpu_gr_zbc_table_indices zbc_indices); - Add corresponding functions for legacy chips - Add zbc color, depth and stencil table size hw defines - Remove ltc.zbc_table_size() hal - Update ltc.set_zbc_s_entry(), ltc.set_zbc_color_entry and ltc.set_zbc_depth_entry() accordingly. Bug 3122410 Bug 3122649 Change-Id: Ib799991ad35c6613534c0a6eb07f3bf24e600dc5 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2417620 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	d2bb5df3c7	gpu: nvgpu: remove NVGPU_GR_NUM_INSTANCES common.gr defined a temporary macro NVGPU_GR_NUM_INSTANCES to enable or disable multiple GR instances from common.gr unit. Multiple GR instance boot is now verified, so we can remove this temporary solution. Note that nvgpu_grmgr_get_num_gr_instances() will return more than 1 instance only if NVGPU_SUPPORT_MIG is enabled. Update unit tests to set number of syspipes to 1 to allow enumeration of GR instance by grmgr. Jira NVGPU-5648 Change-Id: I795901ae516843ae7b6c1794dae0f023a213ab1d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2418377 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	ebb66b5d50	gpu: nvgpu: add macros to get current GR instance Add macros to get current GR instance id and the pointer nvgpu_gr_get_cur_instance_ptr() nvgpu_gr_get_cur_instance_id() This approach makes sure that the caller is getting GR instance pointer under mutex g->mig.gr_syspipe_lock in MIG mode. Trying to access current GR instance outside of this lock in MIG mode dumps a warning. Return 0th instance in case MIG mode is disabled. Use these macros in nvgpu instead of direct reference to g->mig.cur_gr_instance. Store instance id in struct nvgpu_gr. This is to retrieve GR instance id in functions where struct nvgpu_gr pointer is already available. Jira NVGPU-5648 Change-Id: Ibfef6a22371bfdccfdc2a7d636b0a3e8d0eff6d9 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2413140 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	6a69ea235e	gpu: nvgpu: disable graphics specific init functions in MIG mode MIG mode does not support graphics, ELPG, and use cases like TPC floorsweeping. Skip all such initialization functions in common.gr unit if MIG mode is enabled. Set can_elpg to false if MIG mode is enabled. Jira NVGPU-5648 Change-Id: I03656dc6289e49a21ec7783430db9c8564c6bf1f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2411741 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	7a937a6190	gpu: nvgpu: add debug logs for common.gr debugging Add separate flag gpu_dbg_gr to enable common.gr specific debugging. Add this flag to all the existing debug logs that use gpu_dbg_fn or gpu_dbg_info for debugging. Also add many other debugging logs that might be helpful in debugging. Removing debug log in gv11b_gr_init_get_nonpes_aware_tpc() as it dumps too much data that does not seem useful. Batch all interrupt enable functions in gr_init_setup_hw() together for readability. Jira NVGPU-5648 Change-Id: I0b857650122cdb1f974b452d28c26e7f142baf61 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2411740 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	e0dd79cd43	gpu: nvgpu: rearch mc reset and enable hals Remove current mc hals - mc.reset() - mc.enable() - mc.disable() - mc.reset_mask() - mc.reset_engine() - mc.reset_engine_enable() Add new mc hals - mc.enable_units(g, units, enable) > enable/disable given unit(s) - mc.enable_dev(g, dev, enable) > enable/disable engine represented by given device pointer - mc.enable_devtype(g, devtype) > enable/disable all engines of given devtype Move common mc intr functions to common/mc/mc_intr.c. Add below common mc functions - nvgpu_mc_reset_units(g, units) > reset given logical OR of nvgpu unit bitmap - nvgpu_mc_reset_dev(g, dev) > reset given single engine via dev > if engine is graphics, reset gpcs for nvgpu_next - nvgpu_mc_reset_devtype(g, devtype) > reset all engines of given devtype > if devtype is graphics, reset gpcs for nvgpu_next Bug 200648985 Bug 3109773 Change-Id: Idc67a14a0a7cde83de44fbfbec13007fead3ed5c Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2408523 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	e6e7561084	gpu: nvgpu: execute nvgpu_gr_init_support for each GR instance nvgpu_gr_init_support() right now executes each of its function for each GR instance separately. Instead of looping for each function, move the GR engine initialization sequence to a separate gr_init_support_impl() and execute this function for each instance. Update below functions to take nvgpu_gr pointer as parameter. These functions need not worry about GR instance, instead they'll just operate on provided instance pointer. gr_init_setup_hw gr_init_config gr_init_setup_sw gr_init_sm_id_config_early gr_init_ctxsw_falcon_support Add new static function gr_init_support_finalize() to set the ready status and invoke waiters. Execute this per GR instance. gr_init_ecc_init() and nvgpu_cg_elcg_enable_no_wait() are not needed to be run per instance. gr_init_ecc_init() will be later updated to allocate meta data for all instances Jira NVGPU-5648 Change-Id: Ia6860f2bdfe0080aebf8930266d3f51bfd805e36 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2410703 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	bafeea3530	gpu: nvgpu: setup HW for each GR instance Get number of SMs from GR instance specific nvgpu_gr_config pointer instead of global SM count in below functions : nvgpu_gr_fs_state_init() gv11b_gr_init_sm_id_config() Update nvgpu_gr_config_get_gpc_skip_mask() to return 0 in case gpc_index is greater than available gpc_count. This is not MIG specific, but based on code review possible even today for existing chips. See gm20b_gr_init_pd_skip_table_gpc() Update nvgpu_gr_get_override_ecc_val() to return GR instance specific value. Execute gr_init_setup_hw() for each GR instance. Disable below failing unit tests: nvgpu_gr_fs_state.test_gr_fs_state_error_injection nvgpu_gr_init.test_gr_init_hal_config_error_injection Jira NVGPU-5648 Change-Id: Ie8f1c0c304c634756786d85facf336a5c9ae8195 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2410702 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	3df2ed4f82	gpu: nvgpu: setup SW for each GR instance Execute gr_init_setup_sw() for each GR instance. Update all of the functions called from this function to receive nvgpu_gr pointer explicitly. Separate out nvgpu_gr_zbc_init() call to gr_init_setup_sw() and rename gr_init_ctx_and_map_zbc() to gr_init_ctx_bufs() for more clarity. Call gr_init_ecc_init() from nvgpu_gr_init_support() since this does not need to be executed per GR instance. Initialize mutex etc in nvgpu_gr_alloc() for consistency. Jira NVGPU-5648 Change-Id: I8e990e11458c05c1b53a4d6710cc2ec3545762a8 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2410701 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	83691e088f	gpu: nvgpu: initialize ctx state for each GR instance Execute nvgpu_gr_init_ctx_state() for each GR instance. Move it under gr_init_ctxsw_falcon_support() which is already executed for each instance. Update the API to accept struct nvgpu_gr pointer for convenience. API does not need to know about other instances. For reset path, continue using g->gr instead of specific instance. This will be revisited when entire reset path is refactored. Jira NVGPU-5648 Change-Id: I8879bf3b44bb01f6b8053f1aecbd550f49837520 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2409535 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	b6c72410bb	gpu: nvgpu: execute CTXSW ucode initialization per GR instance Move CTXSW ucode initialization to separate static API gr_init_ctxsw_falcon_support() and execute this per GR instance with nvgpu_gr_exec_with_ret_for_each_instance() Jira NVGPU-5648 Change-Id: I6e0fa72bd568eaac027bb12edcdf90255336f0a1 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2409532 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seshendra Gadagottu	43242fa878	gpu: nvgpu: init ctxsw state after gr reset Ctxsw state will be lost after gr reset. After gr reset in recovery sequence, re-initialize ctxsw state to send below fecs methods: gr_fecs_method_push_adr_discover_image_size_v() gr_fecs_method_push_adr_discover_pm_image_size_v() gr_fecs_method_push_adr_discover_zcull_image_size_v() gr_fecs_method_push_adr_discover_preemption_image_size_v() Without these methods sent to ctxsw, fecs will generate host error interrupts indicating mismatches in ctxsw image. Above fecs methods needs to be sent even if they are already sent during golden context creation. Bug 3109773 Change-Id: I2aeb92da8fa1961903ab95ef90f47906a1bb32c4 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406685 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	da43acf639	gpu: nvgpu: execute early SM id config for each instance Execute gops.gr.init.sm_id_config_early() for each GR instance with nvgpu_gr_exec_with_ret_for_each_instance() Jira NVGPU-5648 Change-Id: I7023ed5c7d65d43eb7bb8384617464a39c846f56 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2408419 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	fc12a284bf	gpu: nvgpu: initialize per GR instance config Expose below two new APIs from common.grmgr unit nvgpu_grmgr_get_gr_num_gpcs() - get per instance number of GPCs nvgpu_grmgr_get_gr_gpc_phys_id() - get physical GPC id for MIG engine local id in corresponding instance Execute gr_init_config() for each GR instance. Add gr_config_init_mig_gpcs() to initialize GPC data in case MIG is enabled. Separate out gr_config_init_gpcs() for legacy GPC data initialization. These functions will inititialize below data in struct nvgpu_gr_config: max_gpc_count gpc_count gpc_mask gpc_tpc_mask[gpc_count] max_tpc_per_gpc_count Rest of the values in struct nvgpu_gr_config are either based on above values, or read from HW after setting GPC PRI window. In gr_config_alloc_struct_mem(), rename total_gpc_cnt to total_tpc_cnt since it represents total TPC count and not GPC. Remove use of temp3 variable since it does not give any idea on usage. Jira NVGPU-5648 Change-Id: I646cac2ddc312e72b241b1b2a0e51a5cce141535 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406390 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	002edb782a	gpu: nvgpu: move cur_gr_instance tracking to MIG infra Move cur_gr_instance from struct gk20a to struct nvgpu_mig since this tracking is really MIG specific. Jira NVGPU-5648 Change-Id: I27b124925c2291e352ef9456c7189da0bc447a42 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406389 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	717921a274	gpu: nvgpu: return intr mask of all GR engine instances nvgpu_gr_engine_interrupt_mask() earlier returned mask of all GR engine instance interrupts. During device refactor series, this got changed to return interrupt of only first instance. Change this again to return interrupt mask of all the GR engine instances since common.mc unit does not yet support APIs to enable interrupt of individual GR instance. Update nvgpu_gr_get_syspipe_id() API to take gr_instance_id as parameter instead of struct nvgpu_gr pointer. Definition of struct nvgpu_gr is not available outside of common.gr unit. Jira NVGPU-5648 Change-Id: I5320d1515eea6054150dc14706a16475bd650da7 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2405409 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	35fabed1e8	gpu: nvgpu: execute gr_init_prepare_hw() for each instance Rename gr_init_reset_enable_hw() to gr_init_prepare_hw() since this function does not actually do reset, but just prepares the HW after reset for other SW/HW initialization. Add a new function gr_init_prepare_hw_impl() that executes per-instance sequence to prepare GR hardware. Execute this inside nvgpu_gr_exec_with_ret_for_each_instance(). Note that enabling GR engine interrupts in MC is still expected to be done in one shot hence keep that code outside of gr_init_prepare_hw_impl() Remove redundant calls to gops.gr.init.fifo_access() and enable_gr_interrupts() from gr_init_setup_hw(). gr_init_prepare_hw() does this already and executes before gr_init_setup_hw() Jira NVGPU-5648 Change-Id: If0b7207f80c2fb00d894afebce04b06b7b61d432 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2405408 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	6745b0685e	gpu: nvgpu: support resetting each GR instance Add a new header file <nvgpu/gr/gr_instances.h> that supports below macros to execute various functions for GR instances 1) nvgpu_gr_exec_for_each_instance Execute a function for each GR instance by configuring GR remap window for that instance. Function being executed returns void. 2) nvgpu_gr_exec_with_ret_for_each_instance Execute a function for each GR instance by configuring GR remap window for that instance. Function being executed returns an error. 3) nvgpu_gr_exec_for_all_instances Execute a function for all GR instances at once. For this GR remap window needs to be disabled temporarily. If CONFIG_NVGPU_MIG is disabled, all above macros will turn into simple funciton calls. If CONFIG_NVGPU_MIG is disabled or if runtime flag NVGPU_SUPPORT_MIG is disabled, all above macros will turn into simple function calls that configure single GR instance. Separate out GR engine reset code into new API gr_reset_engine() and execute it with nvgpu_gr_exec_with_ret_for_each_instance(). PROD values need to be loaded in legacy mode, hence call nvgpu_cg_init_gr_load_gating_prod() inside nvgpu_gr_exec_for_all_instances(). Rename gr_init_prepare_hw() to more appropriate gr_reset_hw_and_load_prod() Moe gops.gr.init.fifo_access() call to gr_init_reset_enable_hw(). Add new API nvgpu_grmgr_get_gr_syspipe_id() to query GR instance syspipe id from common.grmgr unit. Add nvgpu_gr_get_syspipe_id() that returns same value stored in nvgpu_gr struct. Add cur_gr_instance field to struct nvgpu_gr to track current GR instance being programmed under remap window. Jira NVGPU-5648 Change-Id: I86920303427a6e6547ebf195daa37438365bb38e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2403550 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	34c24873a7	gpu: nvgpu: trigger gpc reset from common.gr GPC reset is right now triggered from common.mc unit for NVGPU_NEXT. Move the triggers to common code in common.gr unit. This way it is much more cleaner to handle multiple GR instances (added in subsequent patch) Hardcode GR engine instance to 0 for now since by default there is only one GR engine instance. Jira NVGPU-5648 Change-Id: I3fd4d0a50db5a8c4b3decf1df881af323cea50c1 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2403549 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Debarshi Dutta	38ce6fa717	gpu: nvgpu: change unnamed structs to named structs Following changes are made in this patch. 1) Change unnamed structs within gpu_ops to named structs with the prefix gops_. 2) Each named struct gops_ are moved into a separate gops specific file under include/nvgpu/gops/ 3) struct gpu_ops is moved into a separate file include/nvgpu/gpu_ops.h and all other dependent struct gops_ are included in this header. 4) Direct references to include/nvgpu/gops are removed from files as its enough to include gk20a.h. Change-Id: Ieb22cb853be567e3bef14f5f8a04674eebd902ea Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2398776 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	2427d45102	gpu: nvgpu: initialize gr ecc counters for each instance Add new API nvgpu_ecc_counter_init_per_gr() to initialize ECC counters per GR instance. Switch NVGPU_ECC_COUNTER_INIT_GR macro to use nvgpu_ecc_counter_init_per_gr() instead of nvgpu_ecc_counter_init(). Fix error handling path in nvgpu_gr_alloc(). Jira NVGPU-5648 Change-Id: I18f1bf8b245956bdb5a3e4bb6b03114282366ce6 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2402025 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	8cccb49bd2	gpu: nvgpu: collapse nvgpu_gr_prepare_sw into nvgpu_gr_alloc common.gr unit exports a separate API nvgpu_gr_prepare_sw to initialize some SW pieces required for nvgpu_gr_enable_hw(). A separate API is really unnecessary since same initialization can be performed in nvgpu_gr_alloc(). Remove nvgpu_gr_prepare_sw() and HAL gops.gr.gr_prepare_sw(). Initialize falcon and interrupt structures in loop from nvgpu_gr_alloc(). Move nvgpu_netlist_init_ctx_vars() from nvgpu_gr_prepare_sw() to common init path since netlist parsing need not be done from common.gr unit. It just needs to happen before nvgpu_gr_enable_hw(). Also, trigger nvgpu_gr_free() from gr_remove_support() instead of OS specific paths. Also remove nvgpu_gr_free() calls from probe error paths since nvgpu_gr_alloc is no longer called in probe path. Move interrupt and falcon data structure free calls to nvgpu_gr_free(). Also remove corresponding unit testing code that tests nvgpu_gr_prepare_sw() specifically. Update some unit tests to initialize ecc counters and netlist. Disable some unit tests that fail for reasons unknown. Jira NVGPU-5648 Change-Id: I82ec8160f76530bc40e0c11a9f26ba1c8f9cf643 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2400166 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	cfa360f5b8	gpu: nvgpu: allocate struct nvgpu_gr based on enumerated gr count Add new API nvgpu_grmgr_get_num_gr_instances() that returns number of GR instance enumerated by GR manager. This just returns number of sys pipes enabled since it is same as number of GR instances. For consistency until common.gr supports multiple GR instances completely, add a temporary macro NVGPU_GR_NUM_INSTANCES and set it to 1. If this macro is changed to 0 (for local MIG testing), fall back to use nvgpu_grmgr_get_num_gr_instances() to get enumerated number of GR instances. Use a for loop to initialize other variables of struct nvgpu_gr. Remove unnecessary NULL check in nvgpu_gr_alloc() since struct gk20a pointer can never be NULL in this path. Also remove corresponding unit test code. Jira NVGPU-5648 Change-Id: Id151d634a23235381229044f2a9af89e390886f2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2400151 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Shashank Singh	cdc96f900f	gpu: nvgpu: do sm id programming early Move sm id programming before loading ctxsw and gpccs firmwares. This is the actual sequence expected by ctxsw ucode. Legacy chips will use the same old sequence. Bug 200631350 Change-Id: I3cc1384982b238475af47da6a25e2acd6616fd84 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2398300 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Alex Waterman	fba96fdc09	gpu: nvgpu: Replace nvgpu_engine_info with nvgpu_device Delete the struct nvgpu_engine_info as it's essentially identical to struct nvgpu_device. Duplicating data structures is not ideal as it's terribly confusing what does what. Update all uses of nvgpu_engine_info to use struct nvgpu_device. This is often a fairly straight forward replacement. Couple of places though where things got interesting: - The enum_type that engine_info uses is defined in engines.h and has a bit of SW abstraction - in particular the GRCE type. The only place this seemed to be actually relevant (the IOCTL providing device info to userspace) the GRCE engines can be worked out by comparing runlist ID. - Addition of masks based on intr_id and reset_id; those can be computed easily enough using BIT32() but this is an area that could be improved on. This reaches into a lot of extraneous code that traverses the fifo active engines list and dramtically simplifies this. Now, instead of having to go through a table of engine IDs that point to the list of all host engines, the active engine list is just a list of pointers to valid engines. It's now trivial to do a for-all-active-engines type loop. This could even be turned into a generic macro or otherwise abstracted in the future. JIRA NVGPU-5421 Change-Id: I3a810deb55a7dd8c09836fd2dae85d3e28eb23cf Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2319895 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	010f818596	gpu: nvgpu: initialize gr struct in poweron path struct nvgpu_gr is right now initialized during probe and from OS specific code. To support multiple instances of graphics engine, nvgpu needs to initialize nvgpu_gr after number of engine instances have been enumerated in poweron path. Hence move nvgpu_gr_alloc() to poweron path and after gr manager has been initialized. Some of the members of nvgpu_gr are initialized in probe path and they too are in OS specific code. Move them to common code in nvgpu_gr_alloc() Add field fecs_feature_override_ecc_val to struct gk20a to store the override flag read from device tree. This flag is later copied to nvgpu_gr in poweron path. Update tpc_pg_mask_store() to check for g->gr being NULL before accessing golden image pointer. Update tpc_fs_mask_store() to return error if g->gr is not initialized. This path needs nvgpu_gr struct initialized. Also fix the incorrect NULL pointer check in tpc_fs_mask_store() which breaks the write path to this sysfs. Jira NVGPU-5648 Change-Id: Ifa2f66f3663dc2f7c8891cb03b25e997e148ab06 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2397259 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	8a9acf8a7e	gpu: nvgpu: move set_hww_esr_report_mask to golden context init The driver configures the sm hww global, warp ESR report masks during poweron as part of gops_gr.gr_init_support. However, during golden context init, these are overwritten with default entries from sw_ctx_load list; this leaves the report masks in a state inconsistent with the driver expectation. The driver should configure the sm hww warp, global ESR report masks during golden context init and not before it; Hence, move set_hww_esr_report_mask from power-on path to golden context init. In addition, update set_hww_esr_report_mask to do RMW, so as to retain the values loaded from sw_ctx_load list. Update global ESR report mask to enable all exceptions. Bug 3029888 Bug 2997718 Change-Id: Id7ad4cff5409982143f49695c95c5e1d1c9fdec9 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2367466 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Seema Khowala	a5ecf0da7c	gpu: nvgpu: add info prints for sw_ctx_load and sw_non_ctx_load This will help debug issues where registers are incorrectly updated by ctxsw ucode or are overwritten after nvgpu init sequence sets the value. Bug 3029888 Change-Id: I510763a767145500715fb260799b0dd98e59778f Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2365212 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Antony Clince Alex	077a07ff9f	gpu: nvgpu: add gr gops to enable/handle zrop/crop/rrh hww Add the following gr gops functions: - enable_gpc_crop_hww - enable_gpc_zrop_hww - handle_gpc_crop_hww - handle_gpc_zrop_hww - handle_gpc_rrh_hww These gr gops will be used in nvgpu-next. Add function: nvgpu_gr_rop_offset to compute rop pri offsets. Jira: NVGPU-5237 Change-Id: I9e2437c1d2893238b16ec7a134543e20c81b49f7 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2335687 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	2d94863cae	gpu: nvgpu: move is_tpc_addr and get_tpc_num to common gr.is_tpc_addr() and gr.get_tpc_num() are chip agnostic hals. Move these hals to common code. Jira NVGPU-5504 Change-Id: I50fa7ac876c8667de42df1830bd412b412538508 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2349272 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seshendra Gadagottu	3bd0430aa8	gpu: nvgpu: for nvgpu-next do not reset grce engines twice NVGPU_ENGINE_GRCE engines are getting reset twice, once in nvgpu_init_prepare_hw() and other time in nvgpu_ce_init_support(). To avoid this, remove NVGPU_ENGINE_GRCE engines reset from nvgpu_init_prepare_hw. JIRA NVGPU-5288 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Change-Id: Ic03dbff0a74e973ba423abfa004e49bdd8e451f7 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2336450 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	44f12288ad	gpu: nvgpu: add mc.reset_engine hal for nvgpu-next Engine reset process has changed for nvgpu-next. Add mc.reset_engine gops for nvgpu-next. Modify engine reset functions to use mc.reset_engine hal. Jira NVGPU-5145 Change-Id: I176800212042eaef71c8cbd4bc499805c5af0e60 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2312485 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	2e4fb38870	gpu: nvgpu: add eng_config hal for nvgpu_next Add gr.eng_config hal for nvgpu_next. Jira NVGPU-5049 Change-Id: Ieb342cb0416f965a3f80e3a6e3f0f43a853485ff Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2300534 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	5fa0d7f994	gpu: nvgpu: add bundle programming for nvgpu_next Update bundle programming for nvgpu_next. JIRA NVGPU-5004 Change-Id: I1c452a9e78cd018de86fb57de10291c4411e7d89 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2299128 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Rajesh Devaraj	50d71f7c56	gpu: nvgpu: report fecs ctxsw init error This patch adds callback to report fecs ctxsw init error to 3LSS. It also moves the related wrapper function to nvgpu_err header file and adds doxygen documentation. JIRA NVGPU-5042 Change-Id: I2a051cf19c2940859169799a4dd51adf8870eff4 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2300003 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Seema Khowala	57d6721ce3	gpu: nvgpu: add NULL check for su/lg_coalesce su_coalesce and lg_coalesce hals are chip specific and not all the chips need to set su/lg. Add NULL check for these hals. Also add hooks for nvgpu-next fuse. JIRA NVGPU-4868 Change-Id: Ic89d3fb7669f86dcdd6e36c7f832e64958cb9576 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2288652 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
vinodg	977cc73230	gpu: nvgpu: move wait_initialized to non-fusa section nvgpu_gr_wait_initialized function is being called from cg and pg subunit and only be used as part of non-fusa code. Add CONFIG_NVGPU_HAL_NON_FUSA checking for that function call. Jira NVGPU-4676 Change-Id: Ibfdbe336a5e56bc5a2974576cffb9fb5cb5d2cc9 Signed-off-by: vinodg <vinodg@nvidia.com> Reviewed-on: http://git-master.nvidia.com/r/c/linux-nvgpu/+/2276907 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
vinodg	a126e00e28	gpu: nvgpu: compile out unused code in gr init unit Add CONFIG_NVGPU_GRAPHICS check before calling g->gops>gr.init.preemption_state function. Add NULL checking of pointer before deferecing those pointers in de_init functions Jira NVGPU-4676 Change-Id: Id9be0aebdcab4a8fb2b03e92e67c1c207b5b8eab Signed-off-by: vinodg <vinodg@nvidia.com> Reviewed-on: http://git-master.nvidia.com/r/c/linux-nvgpu/+/2276898 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
Scott Long	7378e16778	gpu: nvgpu: gr: misra 12.1 fixes MISRA Advisory Rule states that the precedence of operators within expressions should be made explicit. This change removes the Advisory Rule 12.1 violations from gr code. Jira NVGPU-3178 Change-Id: I99a60f60f6edcc2acb7343c66d1c4c79752d4acb Signed-off-by: Scott Long <scottl@nvidia.com> Reviewed-on: http://git-master.nvidia.com/r/c/linux-nvgpu/+/2276774 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
dnibade	ab76dc1ad5	gpu: nvgpu: unit: add coverage tests for gops.gr.init.ecc_scrub_reg Add new unit test to cover gops.gr.init.ecc_scrub_reg HAL function gops.gr.init.ecc_scrub_reg HAL can generate TIMEOUT errors which are not returned to caller currently. Update this HAL to return int value for error propagation. Jira NVGPU-4458 Change-Id: I98f4d5af2ef17cc4301951fec4d660638c8ef72c Signed-off-by: dnibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2265456 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
vinodg	6286876706	gpu: nvgpu: fix code complexity issue in common.gr init unit ecc init code is moved to a sub function from gr_init_setup_sw to reduce the code complexity below 10. Jira NVGPU-4699 Change-Id: I32d31895c18554993f56918da71179000ca86122 Signed-off-by: vinodg <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2268270 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
Sagar Kamble	a73ca0b70e	gpu: nvgpu: split GR ECC initialization Split GR ECC initialization into GPC/TPC and FECS ECC init as FECS ECC errors during acr_construct_execute need to be reported and handled hence FECS ECC counters are required to be initialized before acr_construct_execute. GPC/TPC ECC counters are dependent on the GR config that will be initialized only after acr_construct_execute. nvgpu_gr_intr_init_support is moved to nvgpu_gr_prepare_sw. FECS ECC interrupt is enabled by default hence interrupt is not enabled through gr_fecs_host_int_enable_r in nvgpu_gr_prepare_sw. JIRA NVGPU-4439 Change-Id: Ifc9912f0578015a6ba1e9d38765c42633632b15f Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2261987 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
Sagar Kadamati	42ccc21c62	gpu: nvgpu: fix static violations in common * Updated types and added error checks * Modified GR condition for ctxsw disable count CERT-C error check was added to detect error on integer overflow But below logic couldn't detect first overflow, so updated condition INT_MAX < gr->ctxsw_disable_count --> it became true after overflow So, we didn't detected in first overflow and lead to assert on enable JIRA NVGPU-3400 Change-Id: I6b0265a464f8f19efa7b0761612c6e9ffb3bd2bd Signed-off-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2206282 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
Deepak Nibade	d7971e7444	gpu: nvgpu: add DGPU config for RTV circular buffer RTV circular context buffer is only supported on TU104 dGPU as of now. Hence compile out corresponding #define and code from safety build. Jira NVGPU-4373 Change-Id: I46a3efc92fb247fa08efb925447c248b2a4b9a57 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2255768 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00

1 2 3

124 Commits