linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 17:36:20 +03:00

Author	SHA1	Message	Date
Vinod G	e22c4cbbec	gpu: nvgpu: add warpstate header for gr Move nvgpu_warpstate struct from gr_gk20a.h to warpstate.h This helps to avoid gr_gk20a.h include from some files. Jira NVGPU-3217 Change-Id: I53593a06a5203332cd3b517de835ad779718af11 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2107699 GVS: Gerrit_Virtual_Submit Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-29 22:08:21 -07:00
Vinod G	20cd4ce54f	gpu: nvgpu: create hal.gr.gr unit Move remaining chip specific gr hal files to hal.gr.gr unit. Remove unused headers include from hal files in hal.gr.gr unit Update gr hal headers include location in the files currently using these headers. Jira NVGPU-3219 Change-Id: Ic632020a90ac4b8ac1e0359e979864b42f0ef2c0 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2105489 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-26 16:14:55 -07:00
Deepak Nibade	0271ffd77b	gpu: nvgpu: remove max_ctxsw_ring_buffer_size from nvgpu_gr max_ctxsw_ring_buffer_size variable in struct nvgpu_gr is used to store max ring buffer size which is then referred into linux specific code We only use macro GK20A_CTXSW_TRACE_MAX_VM_RING_SIZE to initialize the variable. And max_ctxsw_ring_buffer_size does not belong to nvgpu_gr struct anyways Considering above remove max_ctxsw_ring_buffer_size from nvgpu_gr and use macro directly in linux specific code Jira NVGPU-3125 Change-Id: Ibed9901d2bde35633d9ad0df8bd08b414e075bf4 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2105653 GVS: Gerrit_Virtual_Submit Reviewed-by: Vinod Gopalakrishnakurup <vinodg@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-26 09:37:21 -07:00
Deepak Nibade	c474f7c288	gpu: nvgpu: add CSS hal to get max buffer size Currently max_css_buffer_size is incorrectly stored in struct nvgpu_gr Add a new hal g->ops.css.get_max_buffer_size() to get the size and remove the variable from struct nvgpu_gr Jira NVGPU-3125 Change-Id: If78fd86559526b84031051e281a98327a46fc11d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2105652 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-26 09:37:12 -07:00
Vinod G	344b164eea	gpu: nvgpu: remove gr_gk20a.h from gk20a.h Remove gr_gk20a.h from gk20a.h Add gr_gk20a.h in all gr hal files Removed ununsed gr_priv.h from two files Jira NVGPU-3217 Jira NVGPU-3218 Change-Id: Ic74c068782432e99ddba168f65a5cf42e1405305 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2104569 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-25 16:27:11 -07:00
Alex Waterman	efbe371fd5	gpu: nvgpu: Create hal/mm/gmmu and move gk20a GMMU code Make a hal/mm/gmmu sub-unit for the GMMU HAL code. Also move the gk20a specific HAL code there. gp10b will happen in the next patch. This change also updates all the GMMU related HAL usage, of which there is quite a bit. Generally the only change is a .gmmu needs to be inserted into the HAL path. Each HAL init was also updated. JIRA NVGPU-2042 Change-Id: I6c46bdfddb8e021f56103d9457fb3e2a226f8947 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2099693 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-23 12:45:54 -07:00
Deepak Nibade	fed6ee1afc	gpu: nvgpu: remove nvgpu_preemption_modes_rec struct g->ops.gr.get_preemption_mode_flags() hal is used to fetch information on supported preemption modes and default preemption mode Temporary struct nvgpu_preemption_modes_rec is used for this purpose and is defined in gk20a/gr_gk20a.h right now. Split above hal into two separate hals and move them to hal.gr.init unit g->ops.gr.init.get_supported__preemption_modes() g->ops.gr.init.get_default_preemption_modes() These hals now return respective flags in pointers passed in function parameter list, so there is no need to use temporary structure anymore Hence delete struct nvgpu_preemption_modes_rec Implement gm20b/gp10b chip specific hals in hal.gr.init unit. Delete g->ops.gr.get_preemption_mode_flags() hal Jira NVGPU-3126 Change-Id: I84f507fcd8d122bb7f0ecf697e8b4f16c9339ce1 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2102455 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-23 08:20:13 -07:00
Nitin Kumbhar	24af0d3330	gpu: nvgpu: fix num of sm used for mem alloc Instead of using GPC and TPC counts to allocate memory to hold sm info, use nvgpu_gr_config_get_no_of_sm() to get the actual number. This fixes memory issues (corruption and segmentation fault) seem when nvgpu_gpu_ioctl_wait_for_pause is used. Bug 2559631 Change-Id: Idcf9983fbbec7ec7f53835c59164e04bc45cd041 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2102557 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-23 04:15:36 -07:00
Vinod G	dc82262b99	gpu: nvgpu: Add gr_priv header file Move nvgpu_gr structure to private file gr_priv.h Include the private file where gr variables are used. JIRA NVGPU-3132 JIRA NVGPU-3079 Change-Id: Ib26ca5c5cb25fd8dd013a7c643278efc34aa55d4 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2098021 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-22 03:15:09 -07:00
Vinod G	556e139077	gpu: nvgpu: Cleanup for gr_gk20a header Removed unused struct from gr_gk20a.h Change static allocation for struct gr_gk20a to dynamic type. Change all the files that being affected by that change. Call gr allocation from corresponding init_support functions, which are part of the probe functions. nvgpu_pci_init_support in pci.c vgpu_init_support in vgpu_linux.c gk20a_init_support in module.c Call gr free before the gk20a free call in nvgpu_free_gk20a. Rename struct gr_gk20a to struct nvgpu_gr JIRA NVGPU-3132 Change-Id: Ief5e664521f141c7378c4044ed0df5f03ba06fca Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2095798 Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-19 00:04:00 -07:00
Alex Waterman	3a764030b1	gpu: nvgpu: Add new mm HAL and move cache code to that HAL Add a new MM HAL directory to contain all MM related HAL units. As part of this change add cache unit to the MM HAL. This contains several related fixes: 1. Move the cache code in gk20a/mm_gk20a.c and gv11b/mm_gv11b.c to the new cache HAL. Update makefiles and header includes to take this into account. Also rename gk20a_{read,write}l() to their nvgpu_ variants. 2. Update the MM gops: move the cache related functions to the new cache HAL and update all calls to this HAL to reflect the new name. 3. Update some direct calls to gk20a MM cache ops to pass through the HAL instead. 4. Update the unit tests for various MM related things to use the new MM HAL locations. This change accomplishes two architecture design goals. Firstly it removes a multiple HW include from mm_gk20a.c (the flush HW header). Secondly it moves code from the gk20a/ and gv11b/ directories into more proper locations under hal/. JIRA NVGPU-2042 Change-Id: I91e4bdca4341be4dbb46fabd72622b917769f4a6 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2095749 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-16 17:06:42 -07:00
Thomas Fleury	3c4d6c95df	gpu: nvgpu: move usermode to hal/fifo Moved the following HALs from fifo to usermode - fifo.ring_channel_doorbell -> usermode.ring_doorbell - fifo.doorbell_token -> usermode.doorbell_token - fifo.usermode_base -> usermode.base Created the following HAL - usermode.setup_hw Jira NVGPU-2978 Change-Id: I856ea24c126fa22d2f3fe860d4b14087c6d7330b Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2094813 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-16 13:04:27 -07:00
Vinod G	815c102e5d	gpu: nvgpu: move get_nonpes_aware_tpc hal to hal.gr.init Move get_nonpes_aware_tpc hal to hal.gr.init . This hal is implemented for gv11b. Update sm_id_numbering hal to pass the gr_config struct pointer as parameter to avoid dereferencing from gr inside hal. JIRA NVGPU-2951 Change-Id: I1e06b634cc36741e116e41e581a18c7f5b373945 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2093835 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-13 10:24:12 -07:00
Seema Khowala	312f91f991	gpu: nvgpu: move fence_gk20a to common/fence Move gk20a/fence_gk20a.c to common/fence/fence.c Renamed gk20a_fence_from_semaphore -> nvgpu_fence_from_semaphore gk20a_fence_from_syncpt -> nvgpu_fence_from_syncpt gk20a_alloc_fence_pool -> nvgpu_fence_pool_alloc gk20a_free_fence_pool -> nvgpu_fence_pool_free gk20a_alloc_fence -> nvgpu_fence_alloc gk20a_init_fence -> nvgpu_fence_init gk20a_fence_put -> nvgpu_fence_put gk20a_fence_get -> nvgpu_fence_get gk20a_fence_wait -> nvgpu_fence_wait gk20a_fence_is_expired -> nvgpu_fence_is_expired gk20a_fence_install_fd -> nvgpu_fence_install_fd gk20a_fence_ops struct -> nvgpu_fence_ops struct gk20a_fence struct -> nvgpu_fence_type struct JIRA NVGPU-1982 Change-Id: Ife77b2c3c386ff4368683c78ca02f00c99cddb4b Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2093002 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-10 17:24:52 -07:00
Nitin Kumbhar	c649ca9fd6	gpu: nvgpu: move gr config structs to priv header Move sm_info and nvgpu_gr_config struts to a private header and add APIs to access member fields. JIRA NVGPU-3060 Change-Id: I90f44333f19cb8cb939c0a0f90d9a03f6c036080 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2091563 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-10 15:04:21 -07:00
Nitin Kumbhar	1a843ba051	gpu: nvpgu: move zbc structs to priv header Move nvgpu_gr_zbc_entry and nvgpu_gr_zbc to a priv header and add APIs to access members of those structs. JIRA NVGPU-3060 Change-Id: I1255f3ebda03f599aed3706136c0909491023067 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2091214 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-08 01:55:33 -07:00
Vinod G	22cb47c077	gpu: nvgpu: move fbp_en_mask hal to hal.gr.init Move fbp_en_mask hal to hal.gr.init. Calls to g->ops.gr.fbp_en_mask is modified to g->ops.gr.init.fbp_en_mask JIRA NVGPU-2951 Change-Id: I555ec3691226a9dd8555fa91f5ec90010d83ddd3 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2081370 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-27 22:55:09 -07:00
Nitin Kumbhar	b5cd0c7956	gpu: nvpgu: move sm_to_cluster to common.gr.config 1. Move sm_to_cluster from gr to common.gr.config 2. Add nvgpu_gr_config_get_sm_info() API in gr.config to get sm_info for a given sm_id. JIRA NVGPU-1884 Change-Id: I71aa3bf010eeb594f4e08168c17e49f100521b83 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073584 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 11:55:35 -07:00
Nitin Kumbhar	a2314ee780	gpu: nvgpu: move no_of_sm to common.gr.config 1. Move no_of_sm from gr to common.gr.config 2. Add nvgpu_gr_config_get_no_of_sm() API in gr.config to fetch no_of_sm. JIRA NVGPU-1884 Change-Id: I3c6c20a12cd7f9939a349a409932195f17392943 Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073583 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 11:55:20 -07:00
Nitin Kumbhar	30eea4ff2b	gpu: nvgpu: create common.gr.zcull 1. Separate out zcull unit from gr 2. Move zcull HALs from gr to common.hal.gr.zcull 3. Move common zcull functions to common.gr.zcull JIRA NVGPU-1883 Change-Id: Icfc297cf3511f957aead01044afc6fd025a04ebb Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2076547 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 01:55:14 -07:00
Seshendra Gadagottu	60073d2156	gpu: nvgpu: move ltc related data to nvgpu_ltc Moved following ltc related data to struct nvgpu_ltc and has a reference to it from struct gk20a: struct nvgpu_spinlock ltc_enabled_lock; u32 max_ltc_count; u32 ltc_count; u32 slices_per_ltc; u32 cacheline_size; Added function remove_support for ltc and it is called during nvgpu remove sequence. Added following helper functions in ltc.h: u32 nvgpu_ltc_get_ltc_count(struct gk20a g); u32 nvgpu_ltc_get_slices_per_ltc(struct gk20a g); u32 nvgpu_ltc_get_cacheline_size(struct gk20a *g); Removed redudnant ltc.init_fs_state call from vgpu init sequence since it is getting called from nvgpu_init_ltc_support. NVGPU-2044 Change-Id: I3c256dc3866f894c38715aa2609e85bd2e5cfe5a Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2073417 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-22 15:13:46 -07:00
Abdul Salam	e943e6278a	gpu: nvgpu: Restructure nvgpu.common.volt This patch does the following. 1. Remove unused functions from volt_pmu.c. 2. Append public functions with nvgpu. 3. Remove GP106 functions and rename TU104 to generic functions. 4. Rename volt struct from gpu_ops. 5. Remove the unused volt.h header file. 6. Make local functions as static and put in order. 7. Remove unused inclusion on header files. 8. After 4, generic functions can be called directly instead of g->ops. Jira NVGPU-1956 Change-Id: Icaea0ca817d37cccfc09241baa2f047ec2688169 Signed-off-by: Abdul Salam <absalam@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2076535 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-21 10:55:29 -07:00
Seshendra Gadagottu	a2bc7d5923	gpu: nvgpu: cbc: move cbc related code from gr Moved cbc related code and data from gr to cbc unit. Ltc and cbc related data is moved from gr header: 1. Ltc related data moved from gr_gk20a -> gk20a and it will be moved eventually to ltc unit: u32 slices_per_ltc; u32 cacheline_size; 2. cbc data moved from gr_gk20a -> nvgpu_cbc u32 compbit_backing_size; u32 comptags_per_cacheline; u32 gobs_per_comptagline_per_slice; u32 max_comptag_lines; struct gk20a_comptag_allocator comp_tags; struct compbit_store_desc compbit_store; 3. Following config data moved gr_gk20a -> gk20a u32 comptag_mem_deduct; u32 max_comptag_mem; These are part of initial config which should be available during nvgpu_probe. So it can't be moved to nvgpu_cbc. Modified code to use above updated data structures. Removed cbc init sequence from gr and added in common cbc unit. This sequence is getting called from common nvgpu init code. JIRA NVGPU-2896 JIRA NVGPU-2897 Change-Id: I1a1b1e73b75396d61de684f413ebc551a1202a57 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2033286 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-17 05:15:35 -07:00
Seema Khowala	ffb1869144	gpu: nvgpu: add nvgpu_pg_elpg_protected_call macro gr_gk20a_elpg_protected_call is renamed as nvgpu_pg_elpg_protected_call and resides in common/ power_features/pg.c JIRA NVGPU-2014 Change-Id: Id027d9a81ca93e0d47bbeeeb537d5fcd882f68d3 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2034274 GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-08 16:25:36 -08:00
Deepak Nibade	fca82e45fb	gpu: nvgpu: move get_max_fbps/ltc/lts GR hals to TOP unit Below HALs to get max FBPs, max LTC per FBP, max LTS pet LTC values are right now defined by GR unit. g->ops.gr.get_max_fbps_count() g->ops.gr.get_max_ltc_per_fbp() g->ops.gr.get_max_lts_per_ltc() These HALs only read registers from hw_top_.h h/w unit, and as such belong to TOP unit. Move them appropriately as below g->ops.top.get_max_fbps_count() g->ops.top.get_max_ltc_per_fbp() g->ops.top.get_max_lts_per_ltc() Remove hw_top_.h h/w header include from gr_gk20a.c and gr_gm20b.c Jira NVGPU-2894 Change-Id: I995d9f56edb65c9de98d2d15d34ecb72920a65c6 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030672 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-05 03:47:53 -08:00
Vinod G	acf3c2df9b	gpu: nvgpu: create zbc subunit under gr Moved zbc related files to common/gr/zbc location. struct nvgpu_gr_zbc created for zbc variables. common zbc functions are moved to gr_zbc.c file. All zbc hal functions are moved with corresponding chip specific filename. JIRA NVGPU-1882 Change-Id: I1bdaa2d9416e6e77ab305f117647dc070438ee86 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2019760 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-22 03:47:16 -08:00
Vinod G	10d6603f39	gpu: nvgpu: rearrange zbc hal functions As part of creating zbc as gr subunit, zbc hal functions in gr are moved under struct zbc. Removed unused function - _gk20a_gr_zbc_set_table Removed unused hal function - add_zbc JIRA NVGPU-1882 Change-Id: I7560135210c45abb734d4041b3f7330a988b6978 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2017812 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-16 00:33:50 -08:00
Debarshi Dutta	061aa66adc	gpu: nvgpu: move engine specific functions to common/fifo The following changes are done in this patch. 1) gk20a_fifo_get_engine_info() is moved to common/fifo/engine.c and is renamed to gk20a_fifo_get_active_engine_info() to reflect accurately the purpose of the function. 2) move the definition of enum fifo_engine to <nvgpu/engines.h> and add the prefix NVGPU_ 3) move the following functions related to engines in fifo_gk20a.c to common/fifo/engines.c and replace their signature by adding the prefix nvgpu_engine and removing gk20a_fifo. gk20a_fifo_get_active_engine_info gk20a_fifo_engine_enum_from_type gk20a_fifo_get_engine_ids gk20a_fifo_is_valid_engine_id gk20a_fifo_get_gr_engine_id gk20a_fifo_act_eng_interrupt_mask gk20a_fifo_engine_interrupt_mask gk20a_fifo_get_all_ce_engine_reset_mask Jira NVGPU-1315 Change-Id: I63d9dcd905a0bebcc9a4c65776cf6ec7a0837acf Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2011298 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-15 09:44:19 -08:00
Deepak Nibade	00aeab6cca	gpu: nvgpu: add gpc_mask to gr/config unit We get gpc_mask by calling GR HAL g->ops.gr.get_gpc_mask() But gpc_mask should be logically owned by gr/config unit Hence add new gpc_mask field to nvgpu_gr_config Initialize it in nvgpu_gr_config_init() by calling a new HAL g->ops.gr.config.get_gpc_mask() if available If HAL is not defined we just initialize it based on gpc_count Expose new API nvgpu_gr_config_get_gpc_mask() to get gpc_mask and use this API now Remove gr_gm20b_get_gpc_mask() and HAL g->ops.gr.get_gpc_mask() Update GV100 and TU104 chip HALs to remove old and add new HAL Add gpc_mask to struct tegra_vgpu_constants_params to support this on vGPU. Also get gpc_mask from vGPU private data in vgpu_gr_init_gr_config() Jira NVGPU-1879 Change-Id: Ibdc89ea51df944dc7085920509e3536a5721efc0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2016084 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-14 02:28:58 -08:00
Philip Elcan	ab5684ce1b	gpu: nvgpu: channel: use u32 for syncpt id Make the APIs nvgpu_channel_sync_get_syncpt_id() and channel_sync_syncpt_get_id() return u32s rather than converting to ints and back. Also define FIFO_INVAL_SYNCPT_ID to use for invalid syncpt IDs rather than using magic numbers. JIRA NVGPU-1008 Change-Id: I4dde6b15fd3708fb0126b46c6fea8ac1b447c7ce Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2014821 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-11 12:55:36 -08:00
Deepak Nibade	a5eb150635	gpu: nvgpu: add new gr/config unit to initialize GR configuration Add new unit gr/config to initialize GR configuration like GPC/TPC count, MAX count and mask Create new structure nvgpu_gr_config that stores all the configuration and that is owned by the new unit Move below fields from struct gr_gk20a to nvgpu_gr_config in gr/config.h Struct gr_gk20a now only holds the pointer to struct nvgpu_gr_config u32 max_gpc_count; u32 max_tpc_per_gpc_count; u32 max_zcull_per_gpc_count; u32 max_tpc_count; u32 gpc_count; u32 tpc_count; u32 ppc_count; u32 zcb_count; u32 pe_count_per_gpc; u32 gpc_tpc_count; u32 gpc_ppc_count; u32 gpc_zcb_count; u32 pes_tpc_count[GK20A_GR_MAX_PES_PER_GPC]; u32 gpc_tpc_mask; u32 pes_tpc_mask[GK20A_GR_MAX_PES_PER_GPC]; u32 gpc_skip_mask; u8 map_tiles; u32 map_tile_count; u32 map_row_offset; Remove gr->sys_count since it was already no longer used common/gr/config/gr_config.c unit now exposes the APIs to initialize the configuration and also to query the configuration values nvgpu_gr_config_init() is called to initialize GR configuration from gr_gk20a_init_gr_config() and gr_gk20a_init_map_tiles() is simply renamed as nvgpu_gr_config_init_map_tiles() Expose new API nvgpu_gr_config_deinit() to deinit the configuration Expose nvgpu_gr_config_get_*() APIs to query above configuration fields stored in nvgpu_gr_config structure Update vgpu_gr_init_gr_config() to initialize the configuration from gr->config structure Chip specific HALs that access GR register for initialization are implemented in common/gr/config/gr_config_gm20b.c Set these HALs for all GPUs Jira NVGPU-1879 Change-Id: Ided658b43124ea61b9f273b82b73fdde4ed3c8f0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2012167 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-08 12:55:53 -08:00
Nicolas Benech	e9c00c0da9	gpu: nvgpu: add error codes to mm_l2_flush gv11b_mm_l2_flush was not checking error codes from the various functions it was calling. MISRA Rule-17.7 requires the return value of all functions to be used. This patch now checks return values and propagates the error upstream. JIRA NVGPU-677 Change-Id: I9005c6d3a406f9665d318014d21a1da34f87ca30 Signed-off-by: Nicolas Benech <nbenech@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1998809 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-30 16:44:35 -08:00
Terje Bergstrom	e04db419d7	gpu: nvgpu: Split pmgr.h into private and public pmgr/pmgr.h is used both by pmgr itself, and other units calling pmgr. Move all public dependencies to include/nvgpu/pmu/pmgr.h JIRA NVGPU-961 Change-Id: I753fd64d4bfd4667239cf0dcb2aea00a7e010e75 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1986071 GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-10 20:09:38 -08:00
Ranjanikar Nikhil Prabhakarrao	f0762ed483	gpu: nvgpu: add speculative barrier Data can be speculativerly stored and code flow can be hijacked. To mitigate this problem insert a speculation barrier. Bug 200447167 Change-Id: Ia865ff2add8b30de49aa970715625b13e8f71c08 Signed-off-by: Ranjanikar Nikhil Prabhakarrao <rprabhakarra@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972221 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-30 22:26:01 -08:00
Anup Mahindre	49f2692bc0	gpu: nvgpu: Remove redundant warnings from gk20a_ctrl_dev_ioctl Remove redundant warnings that are being generated when nvpgu is returning proper error codes. Add nvgpu_warns instead. Bug 200457091 Change-Id: Ida44cd6bd784ad4ce55b44a8cf974bb89a5f3301 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1980734 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-27 15:25:07 -08:00
Konsta Holtta	7aac00ee58	gpu: nvgpu: verify usermode mapping is at most 64K Commit `ca611e4d0e` (gpu: nvgpu: verify usermode mapping is at least PAGE_SIZE) was not quite the right thing to do; do_mmap() rounds the length up to a page boundary anyway, but the length must not be longer than the size of the usermode region which is 64 KB to avoid leaking access to other registers. Bug 2441531 Change-Id: Ib1c88a6725db62c8276b6e8b880631227a4fc8cd Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971339 Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: Allen Martin <amartin@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 00:35:22 -08:00
Anup Mahindre	75ff0feeff	gpu: nvgpu: Add characterstics field to expose max ctxsw ring buffer size NVGPU_CTXSW_IOCTL_RING_SETUP can be used to setup a custom ring buffer and it accepts size via arguments. nvgpu driver will return an error if size requested is greater than 128 * 4096 but this value is hardcoded and not exposed anywhere. Add characteristics field in nvgpu.h to expose this size so that corresponding nvrm_gpu API can use it. Bug 2169674 Change-Id: Icf9465d4eec6ba3a307ea9490bd5da563944e4f6 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1967596 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:27 -08:00
Allen Martin	ca611e4d0e	gpu: nvgpu: verify usermode mapping is at least PAGE_SIZE This is part of a move to 64KiB for usermode mapping to fix failures when the system page size is 64KiB. When remapping or zapping the vma, use the existing size, not hardcoded size. Also change the verification of the size when creating the mapping to verify it is at least as big as PAGE_SIZE. This allows 4KiB mappings to continue to work until nvrm_gpu is changed to use 64KiB mappings. Bug 2441531 Change-Id: I447ef8e9f84e6d70bbe96b527e267ec41c5630b8 Signed-off-by: Allen Martin <amartin@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1964687 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-10 15:24:49 -08:00
Alex Waterman	fc939e5fb6	gpu: nvgpu: Add IOCTL flag + plumbing for unified VAs Add a flag that let's userspace enable the unified VM functionality on a selective bassis. This feature is working for all cases except a single MODS trace. This will allow test coverage to be selectively added in certain userspace tests as well to help prevent this feature from bit rotting (as it has historically done). Also update the unit test for the page table management in the GMMU to reflect this new flag. It's been set to false since the target platform for safety is currently not using unified address spaces. Bug 200438879 Change-Id: Ibe005472910d1668e8372754be8dd792773f9d8c Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1951864 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-07 12:15:11 -08:00
Debarshi Dutta	c965ef8dc2	gpu: nvgpu: error handling for invalid ioctl call NVGPU_GPU_IOCTL_GET_EVENT_FD should return -EINVAL when invoked in any chips which donot have NVGPU_SUPPORT_DEVICE_EVENTS enabled. This is resulting in an use-after-free error in UBSAN from syzkaller fuzzing in the nvgpu driver. Also, as an addon remove the flag clk_arb_events_supported as the device events check can be made using the flag NVGPU_SUPPORT_DEVICE_EVENTS. Bug 200463292 Change-Id: I0ed0217704daa9e401b57a268a30b9f798928e4a Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1956070 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 11:54:17 -08:00
Terje Bergstrom	1cf6e4fc5e	gpu: nvgpu: Remove pmgr.h dependency from gk20a.h gk20a.h depends on definition of struct pmgr_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Also set pointer to NULL when freed. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: I21ff1ae93ac7b92a71502f97785252c04964e72f Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1954003 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-26 21:22:57 -08:00
Konsta Holtta	d49d64e720	gpu: nvgpu: store usermode regs bus addr directly Instead of just the base address of the main register range, store (also) the base address of usermode area. All regs may not be always available; on vgpu guests we have only the usermode regs. Store the usermode addr we get from a platform resource directly in gv11b_vgpu_probe() for vgpu. In that case the main reg addr is unset. The base address is computed in gk20a_pm_finalize_poweron() for native environments; when the reg addr is read from a resource, the chip is still unknown and as such the HAL op for reading the usermode base offset is unavailable. Bug 200145225 Bug 200467197 Change-Id: I8855bb54a6456eb63b69559c84398f7eeaec3513 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1951524 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-22 20:14:04 -08:00
Konsta Holtta	0567904ac0	Revert "gpu: nvgpu: Remove pmgr.h dependency from gk20a.h" This reverts commit `2dc48ceba1`. Bug 2443630 JIRA NVGPU-596 Change-Id: Id728c908cd89142245f1708fb423c0fff38ba96d Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1952266 Reviewed-by: Bo Yan <byan@nvidia.com> Tested-by: Bo Yan <byan@nvidia.com>	2018-11-16 11:26:03 -08:00
Anup Mahindre	a6138b7810	gpu: nvgpu: Add a characteristics flag to denote FECS tracing support Add a flag to nvgpu_gpu_characteristics to expose FECS tracing capability to userspace. This is required for adding nvrm_gpu APIs for CTXSW set of IOCTLs which were requested in several bugs. nvrm_gpu APIs would query this flag to check the availability of IOCTLs. Bug 2169678 Bug 2169677 Bug 2169675 Bug 2169674 Bug 2169673 Bug 2168342 Change-Id: Ie6ba80a4144637546b97fa93baae67b8d0c4d425 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1950559 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 02:53:39 -08:00
Terje Bergstrom	2dc48ceba1	gpu: nvgpu: Remove pmgr.h dependency from gk20a.h gk20a.h depends on definition of struct pmgr_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: I7ced14d6629e033b0ccef3a93a3dbf099e43ba4c Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1946662 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:34:06 -08:00
Sai Nikhil	94e00ab6ad	gpu: nvgpu: gk20a: fix MISRA 10.4 Violations [1/2] MISRA Rule 10.4 only allows the usage of arithmetic operations on operands of the same essential type category. Adding "U" at the end of the integer literals to have same type of operands when an arithmetic operation is performed. This fixes violation where an arithmetic operation is performed on signed and unsigned int types. JIRA NVGPU-992 Change-Id: Ifb8cb992a5cb9b04440f162918a8ed2ae17ec928 Signed-off-by: Sai Nikhil <snikhil@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1822587 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-09 13:27:08 -08:00
Srirangan Madhavan	ef5fdac7a6	gpu: nvgpu: Fix MISRA rule 15.6 violations MISRA Rule-15.6 requires that all if-else blocks and loop blocks be enclosed in braces, including single statement blocks. Fix errors due to single statement if-else and loop blocks without braces by introducing the braces. JIRA NVGPU-775 Change-Id: Ib70621d39735abae3fd2eb7ccf77f36125e2d7b7 Signed-off-by: Srirangan Madhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928745 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-05 22:13:16 -08:00
Nicolas Benech	b9e7ea65e1	gpu: nvgpu: Fix LibC MISRA 17.7 in os/linux MISRA Rule-17.7 requires the return value of all functions to be used. Fix is either to use the return value or change the function to return void. This patch contains fix for all 17.7 violations instandard C functions in OS/Linux interface. JIRA NVGPU-1036 Change-Id: I39b20f1d0e1a1da56d452f2c3d5ee049666cefe8 Signed-off-by: Nicolas Benech <nbenech@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1929900 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-31 15:25:23 -07:00
Konsta Holtta	37659f5c8e	gpu: nvgpu: mark usermode submit supported for gv11b Mark usermode submit supported in gv11b and add the characteristics flag to expose the capability to userspace. Bug 200145225 Change-Id: Id9dcb0c71c020bd509fbdbffb94a756c69377f20 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1795822 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:47 -07:00
Konsta Holtta	f33935f426	gpu: nvgpu: provide usermode region via mmap Add a mmap callback on the control device node for mapping the usermode register region to userspace. Each such mapping is removed when the GPU railgates, and brought back again on unrailgate. The mapping offset must be 0 and its size must be 4 KB. Bug 200145225 Change-Id: Ie8d3758da745b958376292691d7d1d02a24e7815 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1795819 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:25 -07:00

1 2

62 Commits