linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Nitin Kumbhar	c69c5a7a60	gpu: nvgpu: use safe ops in ALIGN and ALIGN_MASK Shortcomings of ALIGN macros: - ALIGN_MASK down aligns when there is an wrapping/overflow instead of throwing an error. This can affect the size assumptions. - Alignment a's check will be bypassed when ALIGN_MASK is directly used. Fix these issues by 1) adding compile time error for non-unsigned type arguments 2) using unsigned type safe ops for addition and subtraction. Also, change users of ALIGN to pass unsigned types only. JIRA NVGPU-3515 Jira NVGPU-3411 Change-Id: I5b94a262e09aad473c420af750ead6b0f9d36a9b Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2128382 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-28 08:56:27 -07:00
Deepak Nibade	67350e2c9c	gpu: nvgpu: add flags to debugger specific headers Add debugger/cyclestats/fecs_trace compile time flags to debugger specific unit headers Jira NVGPU-3506 Change-Id: Iedea5f274243a389dce91edecbc80c58753d4805 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2137253 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-06-18 01:38:54 -07:00
Deepak Nibade	bc6feecb91	gpu: nvgpu: support active_unit_mask for subunit entries in hwpm_map In case of FBPA we need to consider mask of active FBPAs on dGPUs. For that we have GR unit HAL g->ops.gr.add_ctxsw_reg_pm_fbpa() Generic support to consider active mask of unit need not be in a HAL, move it to common code in add_ctxsw_buffer_map_entries_subunits() itself This API now supports providing active_unit_mask as its parameter In case we don't need to consider unit mask caller will simply pass ~U32(0U) to indicate all units are active In case of FBPA, add a new HAL g->ops.gr.hwpm_pm.get_active_fbpa_mask() which gets mask of active FBPAs, and pass this value to common API add_ctxsw_buffer_map_entries_subunits() Jira NVGPU-2895 Change-Id: I0d208ce53abcd36929c25a4d248868d6eaa5c70d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2069472 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-12 11:47:16 -07:00
Deepak Nibade	ad8a3ca53e	gpu: nvgpu: create hal.gr.hwpm_map unit Create a new HAL unit hal.gr.hwpm_map that provides chip specific support to common.gr.hwpm_map unit We currently have common.gr HAL g->ops.gr.add_ctxsw_reg_perf_pma() to handle chip specific alignment of perf_pma list We only adjust the offset of list and remaining code is same Hence delete above HAL, and add new HAL under hal.gr.hwpm_map g->ops.gr.hwpm_map.align_regs_perf_pma() which returns correct alignment if HAL is defined Remove gr_gv100_add_ctxsw_reg_perf_pma() and gr_gk20a_add_ctxsw_reg_perf_pma() APIs since they are no longer used Simplify perf_pma parsing by fixing alignment with new HAL and then directly calling add_ctxsw_buffer_map_entries() Jira NVGPU-2895 Change-Id: I1852db846e1f5441e482028c79a3f39c5142b0c2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2069471 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-12 11:47:01 -07:00

4 Commits