Extend the runtime suspend/resume based idle/unidle logic used in the
probe case to the handling done in gk20a_do_idle/unidle for nvgpu
after probe completes.
If railgating is disabled, setting autosuspend_delay to 0 enables the
suspend. If railgating is enabled, the autosuspend delay is > 0, and
setting it to 0 enables an immediate suspend.
With this RPM-based approach, the forced_reset logic is removed.
force_reset_in_do_idle is also removed since railgating is supported.
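A minimal sketch of the idle/unidle flow under this approach (hedged;
the exact call sites in nvgpu may differ):

    #include <linux/pm_runtime.h>

    /* gk20a_do_idle(): delay 0 lets the device suspend immediately */
    pm_runtime_set_autosuspend_delay(dev, 0);
    pm_runtime_put_sync_autosuspend(dev);

    /* gk20a_do_unidle(): restore the configured delay and resume;
     * railgate_delay is > 0 when railgating is enabled, 0 otherwise */
    pm_runtime_set_autosuspend_delay(dev, railgate_delay);
    pm_runtime_get_sync(dev);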
Bug 200602747
JIRA NVGPU-5356
Change-Id: Iaf6d5ab651b8200f0547b45d90f812110cf63c0e
Signed-off-by: Sagar Kamble <skamble@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375941
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-by: Sachin Nikam <snikam@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
With genpd based runtime PM, device railgating is managed by the PM
core and nvgpu manages the clocks. To suspend/resume the device for
idling/unidling while initializing secure alloc, runtime PM must be
enabled during probe.
The nvgpu platform railgate handlers will only manage the clocks.
During probe, the nvgpu driver poweroff/poweron must not be invoked
as part of driver runtime suspend/resume, hence a probe state is
added.
After platform probe initializes the clocks, an explicit runtime
resume of the device is required to sanely suspend it during
gk20a_do_idle. The runtime PM configuration differs based on the
NVGPU_CAN_RAILGATE capability, hence runtime PM is truly enabled only
for the duration of nvgpu_probe and the state is then reverted at
the beginning of gk20a_pm_late_init.
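A hedged sketch of the probe-time window (exact hook placement is an
assumption based on the description above):

    /* around nvgpu_probe(): runtime PM is truly enabled */
    pm_runtime_enable(dev);
    /* nvgpu_probe() runs; gk20a_do_idle/unidle can now suspend and
     * resume the device via runtime PM */

    /* at the start of gk20a_pm_late_init(): revert, then reconfigure
     * runtime PM according to NVGPU_CAN_RAILGATE */
    pm_runtime_disable(dev);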
Bug 200602747
JIRA NVGPU-5356
Change-Id: I1fbd03d3f49da07ccbee9714387e00ffc688864e
Signed-off-by: Sagar Kamble <skamble@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375939
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: Sachin Nikam <snikam@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
With genpd based runtime PM, device railgating is managed by the PM
core and nvgpu manages the clocks. To suspend/resume the device for
idling/unidling while initializing secure alloc, runtime PM will be
enabled before init_secure_alloc.
The nvgpu platform railgate handlers will only manage the clocks.
Clock and secure alloc initialization was done in platform probe
(applicable to Tegra).
To suspend (railgate and disable clocks) and resume cleanly during
secure alloc init, the platform probe that initializes the clocks
should happen first. After that, device runtime PM will handle device
idle/unidle properly.
Hence, move gk20a_tegra_init_secure_alloc to platform late_probe.
The runtime PM changes are introduced in later patches.
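A minimal sketch of the resulting hook, assuming the platform
late_probe callback is wired as described (the argument type of
gk20a_tegra_init_secure_alloc is an assumption):

    static int gk20a_tegra_late_probe(struct device *dev)
    {
        /* clocks were already initialized in platform probe, so
         * runtime suspend/resume around this call behaves sanely */
        return gk20a_tegra_init_secure_alloc(dev_get_drvdata(dev));
    }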
Bug 200602747
JIRA NVGPU-5356
Change-Id: I5130ff43f7b75ddc51cb7096ba6532b3f5397258
Signed-off-by: Sagar Kamble <skamble@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2375938
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-by: Sachin Nikam <snikam@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
In K5.9 those two functions were changed to reject calls that
directly use the global pmc pointer, so it is complicated to revert
to a point where the GPU driver can freely call them.
For easier future maintenance, a new set of APIs is added to all
existing kernels to let the GPU driver control the clamp without any
direct access to PMC registers.
Bug 200663781
Change-Id: Ifce4765525eee6d61083896fc9a126892cbb86ba
Signed-off-by: Nicolin Chen <nicolinc@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2441010
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: Sachin Nikam <snikam@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
This CL covers the following code changes:
1) Added some more documentation for gpu_instance_id
and gr_instance_id.
2) Used the gr_sys_pipe_id for gr_instance_id.
3) Removed the gr_syspipe_id attribute.
4) Removed the NVGPU_GPU_FLAGS_SUPPORT_MIG flag.
5) Changed the device node name to use the gpu instance id + syspipe
id combination instead of gpu instance id + gr engine instance id.
Bug 2802347
Change-Id: Id6ca7db5765ab31b5d83472be35dde432c2281ed
Signed-off-by: Lakshmanan M <lm@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2440532
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Automatic_Commit_Validation_User
Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
On Linux, nvrm_gpu can open a channel/tsg/address space only via the
ctrl node. If a ctrl node is exposed for the physical instance, this
tricks nvrm_gpu into considering the physical instance an actually
available fGPU.
There is no current requirement to expose the physical instance ctrl
node; it might be needed later for profiling use cases.
For now, skip dev node creation for the physical instance.
Jira NVGPU-5648
Change-Id: I23398ba993f97e2d2f344876c0c6b0c82b336402
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2439880
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Store the nvgpu_cdev pointer in struct tsg_private and assign it in
nvgpu_ioctl_tsg_open.
In gk20a_tsg_ioctl_bind_channel_ex(), extract gpu_instance_id from
the cdev pointer and then the instance-specific max VEID count from
gpu_instance_id.
Use this max VEID count to validate the subcontext id coming from
userspace.
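A hedged sketch of the check in gk20a_tsg_ioctl_bind_channel_ex()
(the VEID lookup helper name is a hypothetical placeholder):

    u32 gpu_instance_id =
            nvgpu_get_gpu_instance_id_from_cdev(g, tsg_priv->cdev);
    /* hypothetical helper returning the instance's max VEID count */
    u32 max_subctx_count =
            nvgpu_grmgr_get_max_veid_count(g, gpu_instance_id);

    if (arg->subcontext_id >= max_subctx_count)
            return -EINVAL;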
Jira NVGPU-5648
Change-Id: I71cea5180e1ced1a72818d160f1a951c1c6ec770
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2438925
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
nvgpu_channel_get_max_subctx_count() right now always returns the max
subctx count for the 0th instance. Update this function to return the
max subctx count for the GPU instance for which the channel is
allocated.
For CE channels that are allocated and managed by nvgpu, the cdev
pointer is not set in the channel private data (since it is assigned
in OS-specific code). For those channels, continue returning the max
subctx count for the 0th instance; CE channels should not need
subcontexts anyway.
Add an nvgpu_cdev pointer in struct nvgpu_channel_linux. Assign it in
__gk20a_channel_open() and clear it in gk20a_channel_release().
Move the code that gets the runlist and gpu_instance_id to after the
nvgpu_get() call; accesses to the gk20a pointer should always come
after nvgpu_get().
Also add a debug print to dump the runlist_id and gpu_instance_id
being used for the channel.
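A sketch of the resulting lookup (the cdev field lives in OS-specific
channel data; field and helper names beyond this change are
assumptions):

    u32 nvgpu_channel_get_max_subctx_count(struct nvgpu_channel *ch)
    {
            u32 instance_id = 0U;  /* CE channels: no cdev set */

            if (ch->os_priv_cdev != NULL)
                    instance_id = nvgpu_get_gpu_instance_id_from_cdev(
                                    ch->g, ch->os_priv_cdev);

            return nvgpu_grmgr_get_max_veid_count(ch->g, instance_id);
    }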
Jira NVGPU-5648
Change-Id: Idf58ccefdb7dc9fec78100f79c647e5a00b8fb29
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2438924
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Separate out nvgpu_gpu_fetch_engine_info_item(), which populates
engine_id/engine_instance/runlist_id for a given nvgpu_device.
Update the existing API nvgpu_gpu_get_engine_info() to use this
function.
Add a new API nvgpu_gpu_get_gpu_instance_engine_info() that populates
instance-specific engine information.
Update the NVGPU_GPU_IOCTL_GET_ENGINE_INFO sequence to trigger
nvgpu_gpu_get_gpu_instance_engine_info() for fGPU instances in
MIG mode. Continue using nvgpu_gpu_get_engine_info() in
non-MIG mode and for the physical instance in MIG mode.
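The dispatch can be sketched as below (the MIG predicate and argument
lists are illustrative; the two handlers are from this change):

    if (mig_mode && !is_physical_instance)
            /* fGPU instance: instance-specific engine list */
            err = nvgpu_gpu_get_gpu_instance_engine_info(g,
                            gpu_instance_id, args);
    else
            /* non-MIG mode, or physical instance in MIG mode */
            err = nvgpu_gpu_get_engine_info(g, args);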
Jira NVGPU-5648
Change-Id: Ia946748fa2b0c27efa7704847cdf9bb44a0749da
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2436753
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Update gk20a_ctrl_dev_ioctl() to fetch gpu_instance_id with
nvgpu_get_gpu_instance_id_from_cdev() and gr_instance_id with
nvgpu_grmgr_get_gr_instance_id().
Get the instance-specific GR engine configuration pointer with
nvgpu_gr_get_gpu_instance_config_ptr().
Update gk20a_ctrl_ioctl_gpu_characteristics() to return
instance-specific characteristics with the changes below:
- The 0th GPU instance is the physical instance. Set a limited and
relevant set of characteristics flags for the 0th instance.
For the rest of the instances and for non-MIG mode, continue fetching
flags with nvgpu_ctrl_ioctl_gpu_characteristics_flags.
- nvgpu_set_preemption_mode_flags() should be invoked only in non-MIG
mode and for non-zero instances in MIG mode.
- In MIG mode, the 0th instance does not support any classes. The
rest of the instances support only the compute, copy and gpfifo
classes. Non-MIG mode supports all classes, including the graphics
ones.
- Fetch gpu_instance_id/gr_sys_pipe_id/gr_instance_id from the
gpu_instance pointer.
- Fetch max_veid_count_per_tsg from the gpu_instance pointer.
Also update nvgpu_gr_get_zcull_ptr() and nvgpu_gr_get_zbc_ptr() to
return instance-specific pointers. zcull/zbc are not supported in MIG
mode; this is just for code consistency.
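A sketch of the class reporting rule above (the flag and predicate
names are illustrative placeholders, not nvgpu identifiers):

    if (!mig_mode)
            flags = ALL_CLASSES;              /* incl. graphics */
    else if (gpu_instance_id == 0U)
            flags = 0U;                       /* physical: none */
    else
            flags = COMPUTE | COPY | GPFIFO;  /* fGPU instances */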
Jira NVGPU-5648
Change-Id: I764526061542c48ed87659844e16dd0e0253c588
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2436752
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
- Modify NVGPU_GPU_IOCTL_ALLOC_AS and struct nvgpu_alloc_as_args to
accept the start address and size of user memory. This allows
configurable address space allocation.
- Modify gk20a_as_alloc_share() and gk20a_vm_alloc_share() to receive
va_range_start and va_range_end values.
- gk20a_vm_alloc_share() initializes the vm with
low_hole = va_range_start and
user vma size = (va_range_end - va_range_start).
- Modify nvgpu_as_alloc_space_args and nvgpu_as_free_space_args to
accept a 64-bit number of pages.
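A sketch of how the new arguments shape the VM (field names follow
the description above; the surrounding code is an assumption):

    /* gk20a_vm_alloc_share(): user vma starts at the requested base
     * and spans the requested size */
    u64 low_hole  = args->va_range_start;
    u64 user_size = args->va_range_end - args->va_range_start;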
Bug 2043269
JIRA NVGPU-5302
Change-Id: I243995adf5b7e0e84d6b36abe3b35a5ccabd7a37
Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2385496
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: Sami Kiminki <skiminki@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Tested-by: Sami Kiminki <skiminki@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
In __gk20a_channel_open(), if runlist_id is provided as -1, pick the
correct GPU instance specific default runlist id using
nvgpu_grmgr_get_gpu_instance_runlist_id(). The GPU instance id is
obtained using nvgpu_get_gpu_instance_id_from_cdev().
If a runlist_id is received as input, check that it is valid for the
given GPU instance with nvgpu_grmgr_is_valid_runlist_id().
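A sketch of the selection logic (function names as given above;
argument order is an assumption):

    u32 gpu_instance_id =
            nvgpu_get_gpu_instance_id_from_cdev(g, cdev);

    if (runlist_id == -1) {
            /* no runlist requested: use the instance's default */
            runlist_id = nvgpu_grmgr_get_gpu_instance_runlist_id(g,
                            gpu_instance_id);
    } else if (!nvgpu_grmgr_is_valid_runlist_id(g, gpu_instance_id,
                    runlist_id)) {
            /* requested runlist is not valid for this instance */
            err = -EINVAL;
    }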
Jira NVGPU-5648
Change-Id: I69303a3dd81f28f474b40564da51254bcaa1ed15
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435467
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Add a new API nvgpu_get_gk20a_from_cdev() that extracts the gk20a
pointer from a cdev pointer. This helps keep cdev-related
implementation details in ioctl.c and away from other device ioctl
files.
Also move struct nvgpu_cdev, nvgpu_class, and nvgpu_cdev_class_priv_data
from os_linux.h to ioctl.h, since all of these structures are IOCTL
related and better kept in the ioctl-specific header.
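A plausible shape for the helper (a sketch; the internal layout of
nvgpu_cdev is an assumption):

    struct gk20a *nvgpu_get_gk20a_from_cdev(struct nvgpu_cdev *cdev)
    {
            /* keeps cdev internals out of the other ioctl files */
            return cdev->class->g;
    }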
Jira NVGPU-5648
Change-Id: Ifad8454fd727ae2389ccf3d1ba492551ef1613ac
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435466
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Add a new API nvgpu_get_gpu_instance_id_from_cdev() that returns the
GPU instance id from an nvgpu_cdev pointer.
Store the cdev pointer in the channel private data channel_priv and
in the ctrl node private data gk20a_ctrl_priv.
Update the functions below to pass the cdev pointer:
__gk20a_channel_open()
gk20a_channel_open_ioctl()
In gk20a_channel_ioctl(), extract the gpu instance id using the cdev
pointer stored in channel_priv and the new API
nvgpu_get_gpu_instance_id_from_cdev().
Extract the GR instance id using nvgpu_grmgr_get_gr_instance_id().
Invoke the context creation API inside
nvgpu_gr_exec_with_err_for_instance() so that the context is created
with the correct GR instance id.
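A sketch of the wrapped call (the wrapper and lookup are from this
change; the payload expression is illustrative):

    u32 gr_instance_id =
            nvgpu_grmgr_get_gr_instance_id(g, gpu_instance_id);

    err = nvgpu_gr_exec_with_err_for_instance(g, gr_instance_id,
                    setup_bind(ch, &args));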
Jira NVGPU-5648
Change-Id: I5a4e79165e021b56181d08105b2185306a19703b
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2435465
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
In MIG mode, each of the dev nodes should be enumerated for each
fGPU, and for the physical instance only the "ctrl" node should be
enumerated. Support this with the set of changes below:
- Add struct nvgpu_mig_static_info that describes the static GPU
instance configuration. GPCs are enumerated only during poweron, and
the grmgr unit will populate instance information based on the number
of GPCs. For Linux, GPU poweron happens only with the first
gk20a_busy() call, so instance information is not available at
probe() time. Hence this static table is a temporary solution until a
proper solution is identified.
- Add nvgpu_default_mig_static_info for iGPU and
nvgpu_default_pci_mig_static_info for dGPU; these describe the GPU
instance partition.
- Add a new function nvgpu_prepare_mig_dev_node_class_list() that
parses the static table and creates one class per instance in MIG
mode. Non-MIG mode classes are now enumerated in
nvgpu_prepare_default_dev_node_class_list().
- Add a new structure nvgpu_cdev_class_priv_data to store private
data for each cdev. This holds instance-specific information; a
pointer to the private data is maintained in struct class and also
passed as private data when creating the device node with
device_create().
- Add nvgpu_mig_fgpu_devnode() to set dev node paths/names for fGPUs
and nvgpu_mig_phys_devnode() to set dev node paths/names for the
physical instance in MIG mode.
- Add a new field mig_physical_node to struct nvgpu_dev_node. This
field is set if the corresponding dev node should be created for the
physical instance in MIG mode. For now, set it only for the "ctrl"
node.
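A sketch of how the new flag gates enumeration (the loop context is
an assumption):

    /* in MIG mode, the physical instance only gets nodes that have
     * mig_physical_node set, i.e. only the "ctrl" node for now */
    if (is_physical_instance && !node->mig_physical_node)
            continue;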
Jira NVGPU-5648
Change-Id: Ic97874eece1fbe0083b3ac4c48e36e06004f1bc2
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2434586
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Lakshmanan M <lm@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Remove the devnode_class pointer from struct nvgpu_os_linux and
replace it with a list head.
Add a new structure nvgpu_class to store class-related metadata and
create it dynamically in nvgpu_create_class().
Add a new function nvgpu_prepare_dev_node_class_list() to prepare the
list of all classes required for each GPU.
For now there is only one class per GPU, but in MIG mode multiple
classes will be created, one class per instance.
Update gk20a_user_init() to loop through the list of classes and
create dev nodes for each class.
gk20a_user_deinit() frees the linked list.
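A sketch of the resulting loop in gk20a_user_init() (list head, field
and helper names beyond nvgpu_class are assumptions):

    struct nvgpu_class *class;

    nvgpu_list_for_each_entry(class, &l->class_list_head,
                              nvgpu_class, list_entry) {
            err = create_dev_nodes_for_class(l, class);
            if (err != 0)
                    goto fail;
    }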
Jira NVGPU-5648
Change-Id: I891a55c0ce1c2ff9db094564529b3f569df9735c
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428501
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Remove the static dev node metadata from struct nvgpu_os_linux and
replace it with a dynamic list. struct nvgpu_os_linux will only keep
track of the list head and the number of entries.
Add a new structure nvgpu_cdev to store the metadata of each dev node
and set it up dynamically in gk20a_user_init(). Once done, add the
new node under the list head maintained in nvgpu_os_linux.
Add a static list dev_node_list[] that contains the dev node names
and file operations. This static list is used to create the
nvgpu_cdev data structures and to register the new device nodes.
Update all dev node open file operations (e.g. gk20a_as_dev_open())
to extract the struct gk20a pointer from the device pointer of the
dev node; the gk20a device is the parent of the dev node device.
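A sketch of the extraction in an open handler, assuming get_gk20a()
is the existing helper that maps a struct device to its gk20a (the
node_dev variable is illustrative):

    /* node_dev is the device created for this dev node; its parent
     * is the gk20a device */
    struct gk20a *g = get_gk20a(node_dev->parent);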
Jira NVGPU-5648
Change-Id: If070c3428afd6215e45b4919335d9f43e04c36f9
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428500
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Remove the static class definition and registration for iGPU and
dGPU. Create the class dynamically in gk20a_user_init() and set up
the callback function that creates the devnode name based on GPU
type.
For now, add the nvgpu_pci_devnode() callback for dGPU, which sets
the correct dev node path for dGPUs. For iGPU, Android apparently
does not honor the dev node path set in the callback, hence the
device name for iGPU is overridden with the function nvgpu_devnode().
Destroy the class in gk20a_user_deinit().
Overall this helps in adding multiple classes and dev nodes for each
GPU instance in MIG mode.
Set the GPU device pointer as the parent of new devices created with
device_create(). This helps in getting the GPU device name in the
callback function nvgpu_pci_devnode().
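A sketch of the dGPU callback (the v5.x class devnode signature; the
path format is an assumption):

    static char *nvgpu_pci_devnode(struct device *dev, umode_t *mode)
    {
            /* parent is the GPU device set via device_create() */
            return kasprintf(GFP_KERNEL, "nvgpu-pci/%s/%s",
                             dev_name(dev->parent), dev_name(dev));
    }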
Update the functions below to not pass the class structure and
interface names:
nvgpu_probe()
gk20a_user_init()
gk20a_user_deinit()
nvgpu_remove()
Remove the static interface name format such as INTERFACE_NAME since
it is no longer needed.
Update GK20A_NUM_CDEVS to 10, since there are now 10 dev nodes per
GPU.
Jira NVGPU-5648
Change-Id: I5d41db5a0f87fa4a558297fb4135a9fbfcd51080
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2423492
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Use a fixed address mapping for the PMA byte buffer so that the
address of this buffer always fits in 32 bits.
This also requires moving the unmap sequence to an OS-specific
function, since a different unmap API is now needed for Linux and
QNX.
Also call nvgpu_prof_free_pma_stream_priv_data() before
nvgpu_profiler_free_pma_stream(), since the former uses mm->perfbuf,
which is released in the latter.
Bug 2510974
Jira NVGPU-5360
Change-Id: I398b0ca4f96527d6e09c9aacacb4b43c90f5bfc9
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2424691
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Background: there is a race that occurs when the l2_fb_ops ioctl is
invoked. The race occurs as part of the flush() call while a
gk20a_idle() is in progress.
This patch handles the race by making changes in the l2_fb_ops ioctl
itself. For cases where pm_runtime is disabled or railgating is
disabled, we allow this ioctl call to always go ahead, as power is
assumed to be always on.
For the other case, we first check the status of g->power_on. In the
driver, g->power_on is set to true once unrailgating is completed and
is set to false just before railgating.
On Linux, the driver invokes gk20a_idle(), but there is a delay after
which the call to rpm_suspend()'s callback gets triggered. This leads
to a scenario where we cannot efficiently rely on the runtime PM APIs
to block an imminent suspend or to exit if a suspend is currently in
progress. Previous attempts at solving this have led to ineffective
solutions and made the code much more complicated to maintain.
With regard to the above, this patch attempts to simplify the way
this can be solved. The patch calls gk20a_busy() when
g->power_on = true. This prevents the race with gk20a_idle(). Based
on the rpm_resume and rpm_suspend upstream code, a resume is
prioritized over a suspend unless a suspend is already in progress,
i.e. the delay period has been served and the suspend has invoked the
callback. There is a very small window for this to happen, and the
ioctl can then power up the device, as evident from the gk20a_busy()
calls.
The nvgpu power state is queried using nvgpu_is_powered_off() to
determine whether to skip the resume. The power state is protected by
a spinlock.
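The resulting guard can be sketched as below (railgate_enabled and
the spinlock handling are paraphrased from the description above):

    if (!railgate_enabled || !pm_runtime_enabled(dev)) {
            /* power assumed always on: go ahead with the flush */
    } else if (nvgpu_is_powered_off(g)) {  /* under power spinlock */
            /* suspend imminent or done: skip the resume and flush */
    } else {
            err = gk20a_busy(g);  /* blocks the gk20a_idle() race */
    }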
Bug 200507468
Change-Id: I5c02dfa8ea855732e59b759d167152cf45a1131f
Signed-off-by: Debarshi Dutta <ddutta@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2299545
(cherry picked from commit 06942bd268)
Signed-off-by: Sagar Kamble <skamble@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2425682
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
In Linux kernel v4.14 and below, the gpu sysfs node is created under
/sys/devices. In Linux kernel v5.x it is created under
/sys/devices/platform.
Create a symbolic link gpu.0 under /sys/devices/ as various tests
and scripts expect it to be there.
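A sketch of the link creation (the kobject walk up to /sys/devices is
an assumption):

    /* /sys/devices/gpu.0 -> /sys/devices/platform/<gpu> on v5.x;
     * dev->kobj.parent here is /sys/devices/platform */
    err = sysfs_create_link(dev->kobj.parent->parent, &dev->kobj,
                            "gpu.0");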
Bug 200665782
Change-Id: I807ce72fad94438f927df25e829082e771b72543
Signed-off-by: Sagar Kamble <skamble@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2426544
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Fuse registers should be queried with physical gpc-ids, not logical
ones. For tu104 and earlier chips, physical gpc-ids are the same as
logical ones in a non-floorswept config, but for newer chips they may
differ.
Also, the logical-to-physical mapping is not present for a
floorswept GPC, so query the gpc_tpc mask only up to the GPCs
actually present.
Jira NVGPU-6080
Change-Id: I84c4a3c1f256fdd1927f4365af26e9892fe91beb
Signed-off-by: shashank singh <shashsingh@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2417721
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
The simulator ring buffer DMA interface supports buffers of the
following sizes: 4, 8, 12 and 16K. At present, it is configured to
4K, which happens to match the kernel PAGE_SIZE used to wrap the
GET/PUT pointers back once 4K is reached. However, this is not always
true; take 64K pages, for instance. Hence, replace PAGE_SIZE with
SIM_BFR_SIZE.
Introduce the macro NVGPU_CPU_PAGE_SIZE, which aliases to PAGE_SIZE,
and replace the latter with the former.
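A sketch of the resulting definitions and wrap (cmd_size is an
illustrative increment):

    #define SIM_BFR_SIZE        (4U * 1024U)  /* DMA ring size: 4K */
    #define NVGPU_CPU_PAGE_SIZE PAGE_SIZE     /* explicit CPU alias */

    /* wrap GET/PUT on the ring size, not the CPU page size, so a
     * 64K-page kernel does not overrun the 4K ring */
    put = (put + cmd_size) % SIM_BFR_SIZE;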
Bug 200658101
Jira NVGPU-6018
Change-Id: I83cc62b87291734015c51f3e5a98173549e065de
Signed-off-by: Antony Clince Alex <aalex@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2420728
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Mapping large buffers to the GMMU ends up needing many pages for the
PTE tables. Allocating these one by one can become a performance
bottleneck, particularly in the virtualized case.
This adds the following changes:
- As the TLB invalidation doesn't have access to mem_off, allow
top-level allocation by alloc_cache_direct().
- Define NVGPU_PD_CACHE_SIZE, the allocation size for a new slab for
the PD cache, effectively set to 64K bytes.
- Use the PD cache for any allocation < NVGPU_PD_CACHE_SIZE. When
freeing cached entries, avoid prefetch errors by invalidating the
entry (memset to 0).
- Try to fall back to direct allocation of smaller chunks on
contiguous allocation failures.
- Unit test changes.
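A sketch of the resulting policy (the slab-path helper name and cache
argument are assumptions):

    if (bytes >= NVGPU_PD_CACHE_SIZE) {
            /* top-level allocation, usable by TLB invalidation */
            err = nvgpu_pd_cache_alloc_direct(g, pd, bytes);
    } else {
            /* carve the PD out of a cached 64K slab */
            err = nvgpu_pd_cache_alloc(g, cache, pd, bytes);
    }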
Bug 200649243
Change-Id: I0a667af0ba01d9147c703e64fc970880e52a8fbc
Signed-off-by: dt <dt@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2404371
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Right now the PMA byte buffer address is allocated in the range of
0x1ffc010000. The register that stores this address is only 32 bits
wide and there is no corresponding _hi() register, so the address
must fit in 32 bits.
Update the nvgpu_vm_init() parameters in nvgpu_perfbuf_init_vm() so
that a low_hole of only 4K is used. This allows the address to be
allocated in the range of 0x4000000.
Also map the byte buffer before the PMA stream buffer so that the
byte buffer always gets the lower address.
Only one PMA stream buffer is allowed to be mapped right now, so this
works for now; in the future multiple buffers can be mapped and this
solution will need to be reworked.
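A sketch of the VM setup change (the nvgpu_vm_init() parameter layout
here is an assumption):

    /* a low_hole of 4K lets user mappings start near the bottom of
     * the VA space, so the byte buffer lands around 0x4000000 */
    mm->perfbuf.vm = nvgpu_vm_init(g, big_page_size, SZ_4K,
                                   user_size, kernel_size,
                                   false, false, "perfbuf");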
Bug 2510974
Jira NVGPU-5360
Change-Id: Ief1a9ee54d554e3bc13c7a9567934dcbeaefbcc6
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2418520
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Support a new IOCTL to manage PMA stream metadata by adding the API
below:
nvgpu_prof_ioctl_pma_stream_update_get_put()
Add nvgpu_perfbuf_update_get_put() to handle all the updates coming
from userspace and to pass all required information.
Add gops.perf.update_get_put() to handle all HW accesses required in
the perf HW unit.
Add gops.perf.bind_mem_bytes_buffer_addr() to bind the
available-bytes buffer while binding HWPM streamout.
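A sketch of the layering (argument lists are illustrative):

    /* IOCTL entry -> common code -> per-chip perf HAL */
    int nvgpu_perfbuf_update_get_put(struct gk20a *g,
                    u64 bytes_consumed, u64 *bytes_available,
                    bool wait, u64 *put_ptr)
    {
            /* all register accesses stay inside the HAL */
            return g->ops.perf.update_get_put(g, bytes_consumed,
                            bytes_available, wait, put_ptr);
    }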
Bug 2510974
Jira NVGPU-5360
Change-Id: Ibacc2299b845e47776babc081759dfc4afde34fe
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406484
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com>
Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com>
Reviewed-by: Antony Clince Alex <aalex@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Add two new IOCTL APIs to allocate/free the pma stream. Add two new
functions to handle this:
nvgpu_prof_ioctl_alloc_pma_stream()
nvgpu_prof_ioctl_free_pma_stream()
Allocation of the pma stream includes the steps below:
- Initializing the perfbuf VM
- Mapping the PMA buffer into the perfbuf VM
- Mapping the PMA byte buffer into the perfbuf VM
- Mapping the PMA byte buffer to the CPU virtual address space
Store all of the above data in struct nvgpu_profiler_object for
reference. OS-specific data is stored in struct
nvgpu_profiler_object_priv.
Update the HWPM streamout bind/unbind sequence to enable/disable
perfbuf respectively.
Also take care of releasing the pma stream resources in the profiler
object close path if they are not explicitly released by userspace
via the IOCTL call.
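A sketch of the allocation sequence listed above (helper names are
illustrative where not given):

    err = nvgpu_perfbuf_init_vm(g);               /* perfbuf VM */
    if (err == 0)
            err = map_pma_buffer(prof);           /* PMA buf -> VM */
    if (err == 0)
            err = map_pma_bytes_buffer(prof);     /* byte buf -> VM */
    if (err == 0)
            err = map_bytes_buffer_cpu(prof);     /* byte buf -> CPU */
    /* on failure, or on profiler object close without an explicit
     * free IOCTL, the same resources are unwound in reverse order */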
Bug 2510974
Jira NVGPU-5360
Change-Id: I126633746cabc4e293c7ad7c49806866a897949d
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406483
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com>
Reviewed-by: Antony Clince Alex <aalex@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>