nv-tegra-mirror/linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 17:36:20 +03:00

Commit Graph

Author

SHA1

Message

Date

Deepak Nibade

bc6feecb91

gpu: nvgpu: support active_unit_mask for subunit entries in hwpm_map

In case of FBPA we need to consider mask of active FBPAs on dGPUs.
For that we have GR unit HAL g->ops.gr.add_ctxsw_reg_pm_fbpa()

Generic support to consider active mask of unit need not be in a HAL,
move it to common code in add_ctxsw_buffer_map_entries_subunits() itself
This API now supports providing active_unit_mask as its parameter

In case we don't need to consider unit mask caller will simply pass
~U32(0U) to indicate all units are active

In case of FBPA, add a new HAL g->ops.gr.hwpm_map.get_active_fbpa_mask()
which gets mask of active FBPAs, and pass this value to common API
add_ctxsw_buffer_map_entries_subunits()

Jira NVGPU-2895

Change-Id: I0d208ce53abcd36929c25a4d248868d6eaa5c70d
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2069472
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>

2019-03-12 11:47:16 -07:00
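
The active_unit_mask behaviour described in this commit can be illustrated with a minimal sketch. It is not the nvgpu implementation: struct map_entry, add_entries_subunits(), ALL_UNITS_ACTIVE, the loop order and the 4-byte offset step are all hypothetical names and assumptions chosen for illustration. The only points taken from the commit message are that a unit whose bit is clear in the mask is skipped and that an all-ones mask means every unit is active.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for passing ~U32(0U): all units are active. */
#define ALL_UNITS_ACTIVE (~(uint32_t)0U)

/* Hypothetical map entry: a register address and its offset in the buffer. */
struct map_entry {
	uint32_t addr;
	uint32_t offset;
};

/*
 * Add one entry per register for every unit whose bit is set in
 * active_unit_mask. Returns the number of entries written, or -1 if the
 * map is too small. Assumption: inactive units do not consume offsets.
 */
static int add_entries_subunits(struct map_entry *map, size_t max_entries,
				const uint32_t *reg_addrs, uint32_t num_regs,
				uint32_t unit_base, uint32_t num_units,
				uint32_t unit_stride,
				uint32_t active_unit_mask, uint32_t *offset)
{
	size_t count = 0;
	uint32_t unit, reg;

	/* A 32-bit mask can describe at most 32 units. */
	assert(num_units <= 32U);

	for (unit = 0U; unit < num_units; unit++) {
		if ((active_unit_mask & ((uint32_t)1U << unit)) == 0U) {
			continue; /* unit is inactive, skip its registers */
		}
		for (reg = 0U; reg < num_regs; reg++) {
			if (count >= max_entries) {
				return -1;
			}
			map[count].addr = unit_base + unit * unit_stride +
					  reg_addrs[reg];
			map[count].offset = *offset;
			*offset += 4U;
			count++;
		}
	}
	return (int)count;
}
```

A caller that does not care about the mask would pass ALL_UNITS_ACTIVE, while an FBPA-style caller would pass whatever mask its chip-specific hook reports.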

Deepak Nibade

ad8a3ca53e

gpu: nvgpu: create hal.gr.hwpm_map unit

Create a new HAL unit hal.gr.hwpm_map that provides chip specific
support to common.gr.hwpm_map unit

We currently have common.gr HAL g->ops.gr.add_ctxsw_reg_perf_pma()
to handle chip specific alignment of perf_pma list
We only adjust the offset of list and remaining code is same

Hence delete above HAL, and add new HAL under hal.gr.hwpm_map
g->ops.gr.hwpm_map.align_regs_perf_pma() which returns correct
alignment if HAL is defined

Remove gr_gv100_add_ctxsw_reg_perf_pma() and
gr_gk20a_add_ctxsw_reg_perf_pma() APIs since they are no longer used

Simplify perf_pma parsing by fixing alignment with new HAL and then
directly calling add_ctxsw_buffer_map_entries()

Jira NVGPU-2895

Change-Id: I1852db846e1f5441e482028c79a3f39c5142b0c2
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2069471
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>

2019-03-12 11:47:01 -07:00
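
The optional alignment hook described in this commit can be sketched roughly as follows. This is an illustration under stated assumptions, not the driver code: struct gpu_ops, align_up(), perf_pma_start_offset(), example_chip_align() and the value 256 are hypothetical. The commit message only says that the HAL returns the correct alignment when it is defined, and that the rest of the perf_pma handling is shared.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical ops table with one optional chip-specific hook. */
struct gpu_ops {
	uint32_t (*align_regs_perf_pma)(void); /* may be NULL */
};

/* Round value up to the next multiple of a power-of-two alignment. */
static uint32_t align_up(uint32_t value, uint32_t alignment)
{
	assert(alignment != 0U && (alignment & (alignment - 1U)) == 0U);
	return (value + alignment - 1U) & ~(alignment - 1U);
}

/*
 * Common code: adjust the starting offset of the perf_pma list only if
 * the chip defines the hook, then continue with the shared path.
 */
static uint32_t perf_pma_start_offset(const struct gpu_ops *ops,
				      uint32_t offset)
{
	if (ops->align_regs_perf_pma != NULL) {
		return align_up(offset, ops->align_regs_perf_pma());
	}
	return offset;
}

/* Hypothetical chip hook; 256 is an assumed value for illustration. */
static uint32_t example_chip_align(void)
{
	return 256U;
}
```

Keeping only the alignment value behind the hook means a chip without special requirements leaves the pointer NULL and shares every other line of the parsing path.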

Deepak Nibade

8ce2a97d8e

gpu: nvgpu: create common.gr.hwpm_map unit

Create new unit common.gr.hwpm_map with source file common/gr/hwpm_map.c
and public header include/nvgpu/gr/hwpm_map.h

Move all APIs in gr_gk20a.c that handle hwpm_map functionality to this
new unit. This unit now exposes below struct that is included in struct
gr_gk20a

struct nvgpu_gr_hwpm_map {
	u32 pm_ctxsw_image_size;

	u32 count;
	struct ctxsw_buf_offset_map_entry *map;

	bool init;
};

Expose below APIs
nvgpu_gr_hwpm_map_init() - initialize HWPM map meta-data with given size
nvgpu_gr_hwpm_map_deinit() - deinitialize HWPM map
nvgpu_gr_hwmp_map_find_priv_offset() - find a given offset in the map

The sequence to create the map by reading various netlist segments is
moved to a static API nvgpu_gr_hwpm_map_create()

Jira NVGPU-2894

Change-Id: I07d31169d2ff18a496eb79a726027b847d5f0e06
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/2032777
GVS: Gerrit_Virtual_Submit
Reviewed-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>

2019-03-08 05:55:29 -08:00
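
The init, lookup and deinit lifecycle listed in this commit could look roughly like the sketch below. All sketch_ names are hypothetical and deliberately differ from the real API. The entry layout (address plus offset), the placeholder table, building the map lazily on first lookup, and the sorted binary search are assumptions made for illustration; the commit message states only the struct fields and the purpose of each function.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

/* Assumed entry layout: register address and its offset in the PM image. */
struct sketch_map_entry {
	uint32_t addr;
	uint32_t offset;
};

/* Mirrors the field list in the commit message, with hypothetical names. */
struct sketch_hwpm_map {
	uint32_t pm_ctxsw_image_size;
	uint32_t count;
	struct sketch_map_entry *map;
	bool init;
};

static int cmp_entry(const void *a, const void *b)
{
	const struct sketch_map_entry *ea = a;
	const struct sketch_map_entry *eb = b;

	return (ea->addr > eb->addr) - (ea->addr < eb->addr);
}

/* Allocate the meta-data only; the map itself is not built yet. */
static struct sketch_hwpm_map *sketch_map_init(uint32_t size)
{
	struct sketch_hwpm_map *m = calloc(1, sizeof(*m));

	if (m != NULL) {
		m->pm_ctxsw_image_size = size;
	}
	return m;
}

static void sketch_map_deinit(struct sketch_hwpm_map *m)
{
	if (m != NULL) {
		free(m->map);
		free(m);
	}
}

/* Stand-in for reading netlist segments: copy and sort a placeholder table. */
static int sketch_map_create(struct sketch_hwpm_map *m)
{
	static const struct sketch_map_entry placeholder[] = {
		{ 0x300U, 0U }, { 0x100U, 4U }, { 0x200U, 8U },
	};
	uint32_t n = sizeof(placeholder) / sizeof(placeholder[0]);
	uint32_t i;

	m->map = malloc(sizeof(placeholder));
	if (m->map == NULL) {
		return -1;
	}
	for (i = 0U; i < n; i++) {
		m->map[i] = placeholder[i];
	}
	qsort(m->map, n, sizeof(*m->map), cmp_entry);
	m->count = n;
	m->init = true;
	return 0;
}

/* Look up addr; builds the map on first use. Returns 0 on success. */
static int sketch_map_find_priv_offset(struct sketch_hwpm_map *m,
				       uint32_t addr, uint32_t *priv_offset)
{
	struct sketch_map_entry key = { addr, 0U };
	const struct sketch_map_entry *hit;

	assert(m != NULL && priv_offset != NULL);
	if (!m->init && sketch_map_create(m) != 0) {
		return -1;
	}
	hit = bsearch(&key, m->map, m->count, sizeof(*m->map), cmp_entry);
	if (hit == NULL) {
		return -1;
	}
	*priv_offset = hit->offset;
	return 0;
}
```

Keeping the creation step static, as the commit message describes, lets callers depend only on the three public entry points.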

3 Commits