linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 02:22:34 +03:00

Author	SHA1	Message	Date
Sagar Kamble	ff8fbf1004	Revert "gpu: nvgpu: disable DGPU_THERMAL_ALERT for k5.9 temporarily" Bug 200669739 This reverts commit `d3f5905a0c`. Change-Id: I76a4ca4ec5be316c24410860a722a13383a3cea3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2501427 Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-19 14:37:00 -07:00
shashank singh	46cdc4d5ca	gpu: nvgpu: add field in characteristics struct for max gpfifo entries Expose max gpfifo entries supported by nvgpu-rm. This limit will then be propagated to application by nvrm_gpu. Jira NVGPU-5846 Change-Id: Ibbbed9e1929c3bcc4eaaec9636d76e9e115e0c0c Signed-off-by: shashank singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2482936 (cherry picked from commit b099a700aa055a5864ddb65cb546c9294c02b2b9) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2497486 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-19 10:09:44 -07:00
Divya Singhatwaria	cc34df76f9	gpu: nvgpu: Add support for ELPG_MS feature - To enable ELPG_MS feature, add identifier for MS_LTC engine. - The identifier is then passed as pg_engine_id to enable the MS_LTC engine. - Add enable flag NVGPU_ELPG_MS_ENABLED for enabling/disabling ELPG_MS feature at init. JIRA NVGPU-6430 Change-Id: Ie1f477918332d85ec98b3bd4d05b8e773d74eab8 Signed-off-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2480750 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-18 15:29:06 -07:00
scottl	75d98f55d7	gpu: nvgpu: add SUPPORT_MAPPING_MODIFY flags Add new NVGPU_SUPPORT_MAPPING_MODIFY enable flag that is used to control the value of the exported NVGPU_GPU_FLAGS_SUPPORT_MAPPING_MODIFY flag. These flags are currently only enabled on linux in non-virtualized environments. Jira NVGPU-6374 Change-Id: Ia85c353b767b4f7d0aebc04838f44996bc38c61f Signed-off-by: scottl <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2490986 Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-16 14:07:21 -07:00
Antony Clince Alex	f41e5975d8	gpu: nvgpu: add ioctl to configure l2 max_ways_evict_last Add ioctl support to configure and read the max number of lines/ways in a L2 cache set that can be marked as EVICT_LAST. This is accomplished through two new ltc hals: set_l2_max_ways_evict_last, get_l2_max_ways_evict_last. These hals will only be set for nvgpu-next chips. Incase of legacy chips, the IOCTLs will return error -ENOSYS. Generate following litter constants to get the number of sets in a l2 slice and the number of ways in each set: - GPU_LIT_NUM_LTC_LTS_SETS - GPU_LIT_NUM_LTC_LTS_WAYS Add gpu characteritics flag: NVGPU_L2_MAX_WAYS_EVICT_LAST_ENABLED to allow userspace driver to determine if L2_MAX_WAYS_EVICT_LAST ioctl is supported. Bug 200605474 Change-Id: Id3180f891399f5e128500f3835d762aee59953e0 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2445884 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-12 04:36:22 -08:00
Thomas Steinle	1b5a9b28ea	gpu: nvgpu: Add gr.ops NULL-ptr check This fix add NULL-ptr checks for some of the user-accessible ioctl. Bug 3240771 Bug 200696704 Change-Id: Ibe7f75b31b2521a530883253a93ba832f010dc80 Signed-off-by: Thomas Steinle <tsteinle@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2483635 (cherry picked from commit `cc717e3145`) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2490126 Tested-by: Dinesh T <dt@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-03-04 00:36:14 -08:00
Deepak Nibade	4ab8c87974	gpu: nvgpu: expose V2 device node hierarchy Nvgpu will move to new V2 device node hierarchy by default to be consistent with MIG mode hierarchy. As part of this, expose V2 device node hierarchy with this patch. Both legacy and V2 hierarchies will co-exist for some duration until V2 hierarchy is well tested out, and then legacy hierarchy will be removed. With V2 hierarchy, below is the directory structure for dev nodes : igpu: /dev/nvgpu/igpu0/<dev_node> dgpu: /dev/nvgpu/dgpu-0001\:01\:00.0/<dev_node> Jira NVGPU-5648 Change-Id: I1c7a1c41e6d59fb248bc98a145c5ecebd06e8bad Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2443712 GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-03-02 19:30:59 -08:00
shashank singh	019641e88c	gpu: nvgpu: limit number of gpfifo entries Limit number of gpfifo entries so that the size of gpfifo i.e. num_entries * size of each entry fits in u32 data type. Jira NVGPU-5846 Change-Id: I4d3560a6ed90044c88ee3a7acd2e6cb0591b7c5e Signed-off-by: shashank singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2474118 (cherry picked from commit 02ab9e163f5b413b6eb9817ab8ac5581ce7ef427) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2483947 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-02-18 17:29:17 -08:00
prsethi	09fb445878	gpu: nvgpu: remove ipa to pa conversion WAR WAR assume 1:1 IPA to PA mapping when hyp_read_ipa_pa_info fails for the syncpt address which falls into syncpt shim aperture. API nvgpu_mem_phys_sgl_ipa_to_pa() is taking care of IPA to PA mapping for the syncpts which makes this WAR invalid. Patch removes the WAR. Bug 200673604 Change-Id: I966711e11c2ff1b5b5dd3f5e09674bea66c5d04b Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2478068 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-02-03 13:54:45 -08:00
Sagar Kamble	5993b74351	gpu: nvgpu: gm20b: increase WDT timeout to 7s Intermittently observing WDT during ap_cudnn on porg-b01-sku0. Increasing the WDT timeout from 5s to 7s helps. Bug 3223062 Change-Id: Ia94d931d301f3ec229e0e4fbd06876d326a4077e Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2475066 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-28 17:30:53 -08:00
scottl	456a814db5	gpu: nvgpu: add linux MAPPING_MODIFY ioctl Add new MAPPING_MODIFY ioctl to the linux nvgpu driver. This ioctl is used (for example) by the NvRmGpuMappingModify API to change the kind of an existing mapping. For compressed mappings the ioctl can be used to do the following: * switch between two different compressed kinds * switch between compressed and incompressed kinds For incompressed mappings the ioctl can be used to do the following: * switch between two different incompressed kinds In order to properly update an existing mapping the nvgpu_mapped_buf structure has been extended to cache the following state when the mapping is first created: * the compression tag offset (if applicable) * the GMMU read/write flags * the memory aperture The unused ctag_lines field in the nvgpu_ctag_buffer_info structure has been replaced with a new ctag_offset field. Jira NVGPU-6374 Change-Id: I647ab9c2c272e3f9b52f1ccefc5e0de4577c14f1 Signed-off-by: scottl <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2468100 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-01-28 17:27:31 -08:00
Alex Waterman	11d3785faf	gpu: nvgpu: Rename struct nvgpu_runlist_info, fields in fifo Rename struct nvgpu_runlist_info to struct nvgpu_runlist; the info is not necessary. struct nvgpu_runlist is soon to be a first class object among the nvgpu object model. Also rename the fields runlist_info and active_runlist_info to simply runlists and active_runlists respectively. Again the info text is just not necessary and somewhat misleading. These structs _are_ the runlist representations in SW; they are not merely informational. Also add an rl_dbg() macro to print debug info specific to runlist management and some debug prints specifying the runlist topology for the running chip. Change-Id: Id9fcbdd1a7227cb5f8c75cca4abbff94fe048e49 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2470303 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-01-20 21:56:33 -08:00
Sagar Kamble	cf287a4ef5	gpu: nvgpu: retry tsg unbind if NEXT is set The NEXT bit can remain set for the channel if timeslice expires before scheduler clears it. Due to this nvgpu fails TSG unbind and in turn nvrm_gpu fails channel close. In this case, checking the channel hw state after some time can help see NEXT bit cleared by scheduler. Reenable the tsg and return -EAGAIN to nvrm_gpu for it to retry again. Bug 3144960 Change-Id: I35f417f02270e371a4e632986b73a00f8a4f921a Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2468391 Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-18 23:11:57 -08:00
Seshendra Gadagottu	2cc8fdfa81	gpu: nvgpu: skip clock queries for un-supported platforms Skip clock queries in acquire_platform_clocks for un-supported platforms. Only silicon and fpga has clocks support. Bug 3198706 Change-Id: Ie012525802ef6b66709527cac2d4186f5287818a Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2470284 Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-01-14 16:13:54 -08:00
Jon Hunter	ddf8f12197	gpu: nvgpu: Add support for Linux v5.11 For Linux v5.11, commit 6619ccf1bb1d ("dma-buf: Use struct dma_buf_map in dma_buf_vmap() interfaces") changes to the dma_buf_vmap() and dma_buf_vunmap() APIs to pass a new parameter of type 'struct dma_buf_map'. Update the NVGPU to support these updated APIs for Linux v5.11+. Finally, the legacy dma_buf_vmap() API returns NULL on error and not an error code and so correct the test of the return value in the function gk20a_cde_convert(). Bug 200687525 Change-Id: Ie20f101e965fa0f2c650d9b30ff4558ce1256c12 Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2469555 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-13 22:36:14 -08:00
dt	9b81c28dd3	gpu: nvgpu: Add PG199 support This is adding the device id in pci id table to support PG199. JIRA NVGPU-6375 Change-Id: Ib87bf903a55f6256ffc61582b1b42fbce5ea8033 Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2468622 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-12 12:36:36 -08:00
Deepak Nibade	a0fb91846d	gpu: nvgpu: set regop type based on per-resource ctxsw flag New profiler APIs set regop type based on whether context is bound or not in nvgpu_prof_get_regops_staging_data(). But it is possible that ctxsw is not enabled for some particular HWPM resource even if context is bound to profiler object. Fix this by extracting regop type based on per-resource ctxsw flag instead of bound context. Add reg_op_type[] array in profiler object to track regop type for each HWPM resource. Initialize the array based on resource ctxsw flag in nvgpu_profiler_pm_resource_reserve(). Update profiler_obj_validate_reg_op_offset() to get regop type from nvgpu_profiler_validate_regops_allowlist() and use this type and prof->reg_op_type[] to get actual type that should be used for that regop. Update validate_reg_ops() to validate the offset first since regop type is now determined in offset validation. Set ops[i].status to 0 for each validation iteration, and if op is valid set it to REGOP(STATUS_SUCCESS) at the end of iteration. Bug 2510974 Jira NVGPU-5360 Change-Id: Ib1f75d840d04d288789473adabda02cdc807eea0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460003 Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-05 12:38:17 -08:00
Deepak Nibade	869735cda4	gpu: nvgpu: add dynamic allowlist support Add gv11b and tu104 HALs to get allowed HWPM resource register ranges, offsets, and stride meta data. Add new enum nvgpu_pm_resource_hwpm_register_type for HWPM register type. Add new struct nvgpu_pm_resource_register_range_map to store all the register ranges for HWPM resources. Add pointer of map in struct nvgpu_profiler_object along with map entry count. Add new API nvgpu_profiler_build_regops_allowlist() to build the regops allowlist dynamically while binding the resources. Map entry count is received with get_pm_resource_register_range_map_entry_count() and only those resource ranges are added for which resource is reserved by profiler object. Add nvgpu_profiler_destroy_regops_allowlist() to destroy the allowlist while unbinding the resources. Add static functions allowlist_range_search() to search a register offset in HWPM resource ranges. Add another static function allowlist_offset_search() to search the offset in per-resource offset list. Add nvgpu_profiler_validate_regops_allowlist() that accepts an offset value, checks if it is in allowed ranges using allowlist_range_search() and then checks if offset is in allowlist using allowlist_offset_search(). Update gops.regops.exec_regops() to receive profiler object pointer as a parameter. Invoke nvgpu_profiler_validate_regops_allowlist() from validate_reg_ops() if prof pointer is not-null. This will be true only for new profiler stack and not legacy profilers. In gr_exec_ctx_ops(), skip regops execution if offset is invalid. Bug 2510974 Jira NVGPU-5360 Change-Id: I40acb91cc37508629c83106ea15b062250bba473 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2460001 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-01-05 12:38:06 -08:00
Sagar Kamble	17d1ecc43c	gpu: nvgpu: remove bpmp powergate calls for t186 and t194 and update is_railgated With Generic Power Domains (genpd), bpmp driver will manage the GPU powergating. With the nvgpu idle/unidle flows updated for VPR with genpd/RPM, the usage of the below tegra bpmp calls can be removed from nvgpu from railgate APIs for t186 and t194. Note that genpd is available in k4.14 onwards, so this will work on current downstream kernel. tegra_bpmp_running tegra_powergate_is_powered tegra_powergate_partition tegra_unpowergate_partition Runtime suspended state indicates that the device is railgated. Update the t186 and t194 is_railgated handlers with this. t210 railgate/unrailgate will be still managed by nvgpu as bpmp support is not present. Bug 200602747 JIRA NVGPU-5356 Change-Id: Iadfd794cb51bc41ca927b84fc212ac766d60094d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2376642 GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Sagar Kamble	bd7bda4f98	gpu: nvgpu: do_idle/unidle handling with runtime PM after probe Extend the runtime suspend/resume based idle/unidle logic in the probe case to handling done in gk20a_do_idle/unidle for nvgpu after the probe completion. If the railgating is disabled, setting autosuspend_delay to 0 will enable the suspend. If railgating is enabled, autosuspend delay will be > 0. Setting it to 0 will enable the immediate suspend. With this approach based on RPM, forced_reset logic is removed. force_reset_in_do_idle is also removed as railgating is supported. Bug 200602747 JIRA NVGPU-5356 Change-Id: Iaf6d5ab651b8200f0547b45d90f812110cf63c0e Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375941 GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Sagar Kamble	4012a97640	gpu: nvgpu: enable runtime PM before secure_alloc init during probe With genpd based runtime PM, the device railgating is managed by the PM core and the nvgpu manages the clocks. To suspend/resume the device for idling/unidling while initializing secure alloc, runtime PM is to be enabled during probe. nvgpu platform railgate handlers will be only managing the clocks. During probe, the nvgpu driver poweroff/poweron are not to be invoked as part of driver runtime suspend/resume hence probe state is added. After platform probe initializes the clock, explicit runtime resume of the device is required to sanely suspend it during gk20a_do_idle. Runtime PM configuration differs based on the NVGPU_CAN_RAILGATE capability, hence the runtime PM is enabled ("truly") only for the duration of nvgpu_probe and then the state is reverted at the beginning of gk20a_pm_late_init. Bug 200602747 JIRA NVGPU-5356 Change-Id: I1fbd03d3f49da07ccbee9714387e00ffc688864e Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375939 GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Sagar Kamble	5a5f082d24	gpu: nvgpu: move init_secure_alloc to platform late_probe With genpd based runtime PM, the device railgating is managed by the PM core and the nvgpu manages the clocks. To suspend/resume the device for idling/unidling while initializing secure alloc, runtime PM will be enabled before init_secure_alloc. nvgpu platform railgate handlers will be only managing the clocks. The clocks and secure alloc initialization was done in platform probe (applicable to tegra). To suspend (railgate and clks disable) and resume cleanly during secure alloc init, the platform probe should happen first that initializes the clocks. Post that device runtime PM will handle the device idle/unidle properly. Hence, move gk20a_tegra_init_secure_alloc to platform late_probe. Runtime PM changes are introduced in the later patches. Bug 200602747 JIRA NVGPU-5356 Change-Id: I5130ff43f7b75ddc51cb7096ba6532b3f5397258 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375938 GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Lili Sang	3f0ea98b73	gpu: nvgpu: Add get_gr_context support for Linux. Implement the feature of retrieving gr context contents for all chips. Two IOCTLs, NVGPU_DBG_GPU_IOCTL_GET_GR_CONTEXT_SIZE and _GET_GR_CONTEXT, are added. Bug 3102903 Change-Id: If11006f4e294f190785a2c3159ca491b9f3b5187 Signed-off-by: Lili Sang <lilis@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2449183 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Chris Johnson <cwj@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Jon Hunter	8c94013c4d	gpu: nvgpu: Add host1x support Add support for the upstream host1x driver with the 'Host1x/Tegra UAPI' series [0] applied. The host1x support is only enabled if the kernel configuration variable CONFIG_TEGRA_HOST1X_NEXT is set. Please note that the initial implementation only supports Tegra194. [0] https://patchwork.ozlabs.org/project/linux-tegra/list/?series=206532 Bug 3156385 Change-Id: If531a8b866b48ba5a2af021756a4b5d158b8d59a Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2429981 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Jon Hunter	18d1b1b536	gpu: nvgpu: Add abstraction layer for NVHost fence In preparation for adding support for the upstream Host1x driver, add a layer of abstraction for calling the NVHost fence APIs. This provides a clean interface for calling the appropriate APIs for either the downstream NVHost fence APIs or the upstream DMA fence and Host1x APIs, depending on which is being used. Change-Id: I055aee3fe5d3af83ebb11126c0a862e17b21f94c Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2450244 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Jon Hunter	d16259ccc9	gpu: nvgpu: Prepare for adding host1x support To prepare for supporting the upstream host1x driver with NVGPU, move functions that are common using either the upstream host1x driver or the downstream nvhost driver into a common file. Bug 3156385 Change-Id: I33a3b55cfa3eb4c03b7047b62f26a75f407b2dd4 Signed-off-by: Jon Hunter <jonathanh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2429980 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Vedashree Vidwans	2566977072	gpu: nvgpu: add coherent_dma_mask for tegra_vpr_dev Knext requires tegra_vpr_dev to contain non-zero coherent_dma_mask. Bug 200677584 Change-Id: I983236e3a7b4e807e26e3421bd3a78027e596692 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2451139 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Deepak Nibade	d584294545	gpu: nvgpu: set preemption mode for specific GR instance Pass gr_instance_id to function nvgpu_gr_setup_set_preemption_mode() which picks up correct nvgpu_gr struct pointer based on instance id. nvgpu_gr_get_cur_instance_ptr() is not needed in this special case since there is no PGRAPH register programming required to set preemption mode. All writes/updates are done on context image. Also fix unit tests accordingly to always select 0th GR instance. Jira NVGPU-5648 Change-Id: I46eff816d5a4afe784bf75b64ee9d698c77eb64a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435468 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Deepak Nibade	4478cb6b2f	gpu: nvgpu: support multiple instances for ctrl node IOCTLs Execute below IOCTL APIs for specific gr_instance_id with nvgpu_gr_exec_with_err_for_instance() NVGPU_GPU_IOCTL_SET_SM_DEBUG_MODE NVGPU_GPU_IOCTL_TRIGGER_SUSPEND NVGPU_GPU_IOCTL_WAIT_FOR_PAUSE NVGPU_GPU_IOCTL_RESUME_FOR_PAUSE NVGPU_GPU_IOCTL_CLEAR_SM_ERRORS Jira NVGPU-5648 Change-Id: I4a3fed67833bc1f9fd4085b18ab1fb522da167da Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2443805 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Vedashree Vidwans	0ffcd4ec75	gpu: nvgpu: enable vpr secure alloc on non-simulation platforms VPR is supported on non-simulation platforms. VPR allocations are successful with updates to secure_buffer_size for nvgpu-next. JIRA NVGPU-5302 Change-Id: I4d5561d5260e0feb7eb490c19df0bc53bf33a892 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2439218 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Deepak Nibade	af4f38fb6e	gpu: nvgpu: use correct gr_config pointer to return tpc masks In gk20a_ctrl_get_tpc_masks() receive gr_config pointer from caller instead of extracting with nvgpu_gr_get_config_ptr(). GR engine current instance value is not set here so this was always returning 0th instance gr_config pointer. Jira NVGPU-5648 Change-Id: I7f4df0c464cd357d05ac7d5c3b9e4abe0ac9a8af Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2441667 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: Lakshmanan M <lm@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Nicolin Chen	bd114e8d0f	gpu: nvgpu: Do not call tegra_pcm_writel/readl In K5.9 those two functions are changed to reject calls directly using global pmc pointer. So it's a bit complicated to revert to the point where GPU can feel free to call them. For an easier future mantainance, a new set of APIs are added to all existing kernels to let GPU driver control clamp without any direct access to PMC registers. Bug 200663781 Change-Id: Ifce4765525eee6d61083896fc9a126892cbb86ba Signed-off-by: Nicolin Chen <nicolinc@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2441010 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:48 -06:00
Lakshmanan M	f8429c8e7c	gpu: nvgpu: MIG attributes enhancement This CL covers the following code changes, 1) Added some more documentation for gpu_instance_id and gr_instance_id. 2) Used the gr_sys_pipe_id for gr_instance_id. 2) Removed gr_syspipe_id attribute. 4) Removed NVGPU_GPU_FLAGS_SUPPORT_MIG flag. 3) Changed the device node name to use gpu instance id + syspipe id combination insted of gpu instance id + gr engine instance id. Bug 2802347 Change-Id: Id6ca7db5765ab31b5d83472be35dde432c2281ed Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2440532 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Deepak Nibade	67a68771d4	gpu: nvgpu: skip physical instance dev nodes On Linux, nvrm_gpu can open channel/tsg/address space only using ctrl node. This tricks nvrm_gpu into considering physical instance as actual available fGPU if ctrl node is exposed for physical instance. There is no current requirement to expose physical instance ctrl node. It might be needed later for profiling use cases. For now, skip dev node creation for physical instance. Jira NVGPU-5648 Change-Id: I23398ba993f97e2d2f344876c0c6b0c82b336402 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2439880 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Deepak Nibade	dbad874d9e	gpu: nvgpu: use instance specific max subctx count Store nvgpu_cdev pointer in struct tsg_private and assign it in nvgpu_ioctl_tsg_open. In gk20a_tsg_ioctl_bind_channel_ex(), extract gpu_instance_id from cdev pointer and then extract instance specific max VEID count from gpu_instance_id. Use this max veid count to validate subcontext id coming from user. Jira NVGPU-5648 Change-Id: I71cea5180e1ced1a72818d160f1a951c1c6ec770 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2438925 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Deepak Nibade	dacdaf0778	gpu: nvgpu: return instance specific max subctx count nvgpu_channel_get_max_subctx_count() right now always returns max subctx count for 0th instance. Update this function to return max subctx count for GPU instance for which channel is allocated. For CE channels that are allocated and managed by nvgpu, cdev pointer is not set in channel private data (since it is assigned in OS specific code). For those channels continue returning max subctx count for 0th instance. CE channels should not need subcontexts anyways. Add nvgpu_cdev pointer in struct nvgpu_channel_linux. Assign it in __gk20a_channel_open() and clear it in gk20a_channel_release() Move code to get runlist and gpu_instance_id after nvgpu_get() call. Accesses to gk20a pointer should always come after nvgpu_get(). Also add a debug print to dump runlist_id and gpu_instance_id being used for channel. Jira NVGPU-5648 Change-Id: Idf58ccefdb7dc9fec78100f79c647e5a00b8fb29 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2438924 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:48 -06:00
Sagar Kamble	d3f5905a0c	gpu: nvgpu: disable DGPU_THERMAL_ALERT for k5.9 temporarily GPIOs are not working currently in k5.9. Bug 200669739 Change-Id: Ia7848d55d95f7986cf0fa73a34b15b7546d62e0f Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2437640 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	043d793e57	gpu: nvgpu: populate instance specific engine information Separate out nvgpu_gpu_fetch_engine_info_item() that populates engine_id/engine_instance/runlist_id for given nvgpu_device. Update Existing API nvgpu_gpu_get_engine_info() to use above function. Add new API nvgpu_gpu_get_gpu_instance_engine_info() that populates instance specific engine information. Update NVGPU_GPU_IOCTL_GET_ENGINE_INFO sequence to trigger nvgpu_gpu_get_gpu_instance_engine_info() for fGPU instances in MIG mode. Continue using nvgpu_gpu_get_engine_info() in non-MIG mode and for physical instance in MIG mode. Jira NVGPU-5648 Change-Id: Ia946748fa2b0c27efa7704847cdf9bb44a0749da Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2436753 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	6e1495f45f	gpu: nvgpu: set instance specific characteristics Update gk20a_ctrl_dev_ioctl() to fetch gpu_instance_id with nvgpu_get_gpu_instance_id_from_cdev() and gr_instance_id with nvgpu_grmgr_get_gr_instance_id(). Get instance specific GR engine configuration pointer with nvgpu_gr_get_gpu_instance_config_ptr() Update gk20a_ctrl_ioctl_gpu_characteristics() to return instance specific characteristics with below changes : - 0th GPU instance is a physical instance. Set a limited and relevant characteristics flags for 0th instance. For rest of the instances and non-MIG mode, continue fetching flags with nvgpu_ctrl_ioctl_gpu_characteristics_flags. - nvgpu_set_preemption_mode_flags() should be set only for non-MIG mode and non-zero instance in MIG mode. - In MIG mode, 0th instance does not support any classes. Rest of the instances support only compute, copy and gpfifo classes. Non-MIG mode supports all the classes including graphics ones. - Fetch gpu_instance_id/gr_sys_pipe_id/gr_instance_id from gpu_instance pointer. - Fetch max_veid_count_per_tsg from gpu_instance pointer. Also update nvgpu_gr_get_zcull_ptr() and nvgpu_gr_get_zbc_ptr() to return instance specific pointers. zcull/zbc are not supported in MIG mode, this is just for consistency of the code. Jira NVGPU-5648 Change-Id: I764526061542c48ed87659844e16dd0e0253c588 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2436752 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	7cdfcbafc0	gpu: nvgpu: use instance specific config pointer In gk20a_ctrl_get_num_vsms() and gk20a_ctrl_vsm_mapping() use GR instance specific config pointer to get number of SMs. Jira NVGPU-5648 Change-Id: I22b1aa2daf5dfd1524d9fc5c1c1a278a808b59fb Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2436751 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	7b4bff6ebf	gpu: nvgpu: remove unify_address_space enforcement Let nvrm_gpu decide if unified_address_space is required when requesting new address space. JIRA NVGPU_5302 Change-Id: Ib77be5e7c913802a01f7e7861e8bce3d47aed55f Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2427724 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: Sami Kiminki <skiminki@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	a252cc244a	gpu: nvgpu: modify alloc_as ioctl to accept mem size - Modify NVGPU_GPU_IOCTL_ALLOC_AS and struct nvgpu_alloc_as_args to accept start address and size of user memory. This allows configurable address space allocation. - Modify gk20a_as_alloc_share() and gk20a_vm_alloc_share() to receive va_range_start and va_range_end values. - gk20a_vm_alloc_share() initializes vm with low_hole = va_range_start, and user vma size = (va_range_end - va_range_start). - Modify nvgpu_as_alloc_space_args and nvgpu_as_free_space_args to accept 64 bit number of pages. Bug 2043269 JIRA NVGPU-5302 Change-Id: I243995adf5b7e0e84d6b36abe3b35a5ccabd7a37 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2385496 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Sami Kiminki <skiminki@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	179e04b442	gpu: nvgpu: use instance specific runlist id In __gk20a_channel_open(), if runlist_id is provided as -1, pick up correct GPU instance sprcific default runlist id using nvgpu_grmgr_get_gpu_instance_runlist_id(). Also, get GPU instance is using nvgpu_get_gpu_instance_id_from_cdev() If runlist_id is received as input, check if it is valid for given GPU instance with nvgpu_grmgr_is_valid_runlist_id() Jira NVGPU-5648 Change-Id: I69303a3dd81f28f474b40564da51254bcaa1ed15 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435467 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	be9271d721	gpu: nvgpu: add API to extract gk20a pointer from cdev Add new API nvgpu_get_gk20a_from_cdev() that extracts gk20a pointer from cdev pointer. This helps in keeping cdev related implementation details in ioctl.c and away from other device ioctl files. Also move struct nvgpu_cdev, nvgpu_class, and nvgpu_cdev_class_priv_data from os_linux.h to ioctl.h since all of these structures are more IOCTL related and better to keep them in ioctl specific header. Jira NVGPU-5648 Change-Id: Ifad8454fd727ae2389ccf3d1ba492551ef1613ac Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435466 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	d0a1f30e66	gpu: nvgpu: allocate object context for specific GR instance Add new API nvgpu_get_gpu_instance_id_from_cdev() that returns GPU instance id from nvgpu_cdev pointer. Store cdev pointer in channel private data channel_priv and ctrl node private data gk20a_ctrl_priv. Update below functions to pass cdev pointer : __gk20a_channel_open() gk20a_channel_open_ioctl() In gk20a_channel_ioctl(), extract gpu instance id using cdev pointer stored in channel_priv and new API nvgpu_get_gpu_instance_id_from_cdev(). Extract GR instance id using nvgpu_grmgr_get_gr_instance_id() Invoke context creation API inside nvgpu_gr_exec_with_err_for_instance() so that context is created with correct GR instance id. Jira NVGPU-5648 Change-Id: I5a4e79165e021b56181d08105b2185306a19703b Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435465 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	a1bbcff476	gpu: nvgpu: enumerate dev nodes per GPU instance in MIG mode In MIG mode, each of the dev nodes should be enumerated for each fGPU. And for physical instance only the "ctrl" node should be enumerated. Support this with below set of changes : - Add struct nvgpu_mig_static_info that describes static GPU instance configuration. GPCs are enumerated only during poweron and grmgr unit will populate instance information based on number of GPCs. For linux, GPU poweron happens only with first gk20a_busy() call and instance information is not available during probe() time. Hence this static table is a temporary solution until proper solution is identified. - Add nvgpu_default_mig_static_info for iGPU and nvgpu_default_pci_mig_static_info for dGPU that describes GPU instance partition. - Add new function nvgpu_prepare_mig_dev_node_class_list() that parses the static table and creates one class per instance in MIG mode. Non-MIG mode classes are now enumerated in nvgpu_prepare_default_dev_node_class_list(). - Add new structure nvgpu_cdev_class_priv_data to store private data for each cdev. This will hold instance specific information and pointer to private data will be maintained in struct class and also passed as private data while creating device node with device_create() - Add nvgpu_mig_phys_devnode() to set dev node path/names for fGPUs and add nvgpu_mig_fgpu_devnode() to set dev node path/names for physical instance in MIG mode. - Add new field mig_physical_node to struct nvgpu_dev_node. This field is set if corresponding dev node should be created for physical instance in MIG mode. For now set it only for "ctrl" node. Jira NVGPU-5648 Change-Id: Ic97874eece1fbe0083b3ac4c48e36e06004f1bc2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2434586 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	b5369b8d35	gpu: nvgpu: track classes using dynamic linked list Remove devnode_class pointer from struct nvgpu_os_linux and replace it by a list head. Add new structure nvgpu_class to store class related meta-data and create it dynamically in nvgpu_create_class(). Add new function nvgpu_prepare_dev_node_class_list() to prepare list of all classes that are required for each GPU. For now there is only one class per GPU, but in MIG mode multiple classes will be created with one class per instance. Update gk20a_user_init() to loop through list of classes and create dev nodes for each class. gk20a_user_deinit() frees up the linked list. Jira NVGPU-5648 Change-Id: I891a55c0ce1c2ff9db094564529b3f569df9735c Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428501 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	a3e39c685d	gpu: nvgpu: track dev nodes using dynamic linked list Remove static dev node meta data from struct nvgpu_os_linux and replace it by a dynamic list. Struct nvgpu_os_linux will only keep track of list head and number of entries. Add new structure nvgpu_cdev to store meta data of each dev node and create/setup it dynamically in gk20a_user_init(). Once done, add the new node under list head maintained in nvgpu_os_linux. Add a static list dev_node_list[] that contains list of dev node names and file operations. This static list is used to create nvgpu_cdev data structures and to register new device nodes. Update all dev node open file operations (e.g. gk20a_as_dev_open()) to extract struct gk20a pointer from device pointer of dev node. gk20a device is the parent of dev node device. Jira NVGPU-5648 Change-Id: If070c3428afd6215e45b4919335d9f43e04c36f9 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428500 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	9082bcf3bd	gpu: nvgpu: move ctrl priv tracking to struct nvgpu_os_linux Move ctrl node priv tracking variables from struct nvgpu_os_linux.ctrl to struct nvgpu_os_linux. This will unblock dev node creation without using the static data structures in struct nvgpu_os_linux. Jira NVGPU-5648 Change-Id: I57db0c601282534e6e2ea535d3ca27934f86fc2a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428499 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Deepak Nibade	2a6c473fe6	gpu: nvgpu: remove interface names and static classes to create dev nodes Remove static class definition and registration for iGPU and dGPU. Create the class dynamically in gk20a_user_init() and setup the callback function to create devnode name based on GPU type. For now add nvgpu_pci_devnode() callback for dGPU that sets correct dev node path for dGPUs. For iGPU, Android apparently does not honor dev node path set in callback and hence override the device name for iGPU with function nvgpu_devnode(). Destroy the class in gk20a_user_deinit(). This will overall be helpful in adding multiple classes and dev nodes for each GPU instance in MIG mode. Set GPU device pointer as the parent of new devices created with device_create(). This is helpful in getting GPU device name in callback function nvgpu_pci_devnode(). Update functions to not pass class structure and interface names : nvgpu_probe() gk20a_user_init() gk20a_user_deinit() nvgpu_remove() Remove static interface name format like INTERFACE_NAME since it is no longer needed. Update GK20A_NUM_CDEVS to 10 since there are 10 dev nodes per GPU right now. Jira NVGPU-5648 Change-Id: I5d41db5a0f87fa4a558297fb4135a9fbfcd51080 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2423492 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00

1 2 3 4 5 ...

822 Commits