linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 09:12:24 +03:00

Author	SHA1	Message	Date
Martin Radev	9487599da1	gpu: nvgpu: update SETUP_BIND uapi This patch updates the SETUP_BIND uapi for conditionally exposing the GPFIFO, UserD and Usermode MMIO GPU VAs to userspace. Bug 3938139 Change-Id: Ifa42d592d5224de075c8c9640dd67cab398a9786 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2908614 Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-25 20:20:51 -07:00
Santosh BS	a062676e71	gpu: nvgpu: log mask for multimedia engines Introducing gpu_dbg_mme for multimedia debug prints. Jira NVGPU-9429 Bug 3962979 Change-Id: I9c84c9336a10af864f61d314dc811d038d1d2d87 Signed-off-by: Santosh BS <santoshb@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2908237 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-25 09:22:05 -07:00
srajum	3207a29fd7	gpu: nvgpu: enable run time log mask for safety debug and standard debug builds - Enable /dev/nvgpu/igpu0/log_mask node on safety debug and standard debug builds which will help to enable log mask at run time. - Enabled "CONFIG_NVGPU_SYSFS" flag for release and safety debug builds which will build sysfs files to enable /dev/nvgpu/igpu0/log_mask node for these builds - Created "standard_debug" profile which is used to enable /dev/nvgpu/igpu0/log_mask node only for standard debug builds Bug 3958585 Change-Id: Id35cd617550974ee805c83e026e2bb5f3be768c6 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2903881 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-24 08:53:56 -07:00
atanand	4b2e1b4a38	gpu: nvgpu: Print PC and ESR for GR errors when fault occurs Added print for PC, WARP and GLOBAL ESR when the fault occurs. Bug 3973433 Change-Id: I8ec1d2aa16125d1628ca7321629b9f771322268a Signed-off-by: atanand <atanand@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2900112 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-22 19:15:08 -07:00
Rajesh Devaraj	36fcd45ca4	userspace: disable unit test for gr_config err injection This patch disables test_gr_config_error_injection JIRA NVGPU-9897 Change-Id: I771598687f4a1efc9a9a4229dbfd5524c5b6732f Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2900702 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: V M S Seeta Rama Raju Mudundi <srajum@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-22 13:16:51 -07:00
Rajesh Devaraj	ce22f1efb1	gpu: nvgpu: add functions to query vgpc config This patch adds the following functions which can be used to set/query skyline configuration: nvgpu_gr_config_set_singleton_mask nvgpu_gr_config_get_singleton_mask nvgpu_gr_config_set_num_singletons nvgpu_gr_config_get_num_singletons nvgpu_gr_config_set_num_tpc_in_skyline nvgpu_gr_config_get_num_tpc_in_skyline nvgpu_gr_config_set_gpc_skyline nvgpu_gr_config_get_gpc_skyline nvgpu_gr_config_set_virtual_gpc_id nvgpu_gr_config_get_virtual_gpc_id nvgpu_gr_config_set_sm_info_virtual_gpc_index nvgpu_gr_config_get_sm_info_virtual_gpc_index JIRA NVGPU-9897 Change-Id: If80e19f709472d74b42cb9c4b47dc4ce4f9a54dc Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2888783 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-22 13:16:41 -07:00
Sagar Kamble	278f87e0ce	gpu: nvgpu: unregister reboot notifier for pci driver Below oops is encountered as reboot notifier was not unregistered on nvgpu unload. On registering new reboot notifier on reload of the nvgpu module, kernel is not able to access prior freed notifier buffer. Reboot notifier unregistration was not present in nvgpu_pci_remove. Add it. Unable to handle kernel paging request at virtual address Internal error: Oops: 96000004 [#1] PREEMPT SMP CPU: 0 PID: 997 Comm: modprobe Tainted: G W 5.10.152-tegra #2 pstate: a2400009 (NzCv daif +PAN -UAO +TCO BTYPE=--) pc : blocking_notifier_chain_register+0x50/0xf0 lr : blocking_notifier_chain_register+0x3c/0xf0 blocking_notifier_chain_register+0x50/0xf0 register_reboot_notifier+0x2c/0x40 nvgpu_pci_shutdown+0xaac/0x1b00 [nvgpu] pci_device_probe+0xf8/0x1f0 really_probe+0x104/0x510 driver_probe_device+0x108/0x190 device_driver_attach+0x80/0x90 __driver_attach+0xfc/0x1e0 bus_for_each_dev+0x84/0x100 driver_attach+0x34/0x50 bus_add_driver+0x15c/0x260 driver_register+0x68/0x130 __pci_register_driver+0x78/0x90 nvgpu_pci_init+0x3c/0x74 [nvgpu] init_module+0x18/0xe9c [nvgpu] do_one_initcall+0x50/0x310 do_init_module+0x50/0x200 load_module+0x22dc/0x2800 Bug 3495440 Change-Id: I4367352c57d4858ee0b8dda7d1181c9d01632791 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2907442 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Ramalingam C <ramalingamc@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-22 02:03:41 -07:00
prsethi	6f4b7d5cc2	gpu: nvgpu: fix the memory corruption issue Memory for tpc_index in gpc_tpc_physical_id_map array is allocated only for number of tpcs while it should be number of tpcs*size of index. Change fixes the memory allocation to avoid memory corruption. Bug 3994374 Change-Id: Ibc593b1d0baba980787ae50f02ea20072525888c Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2906890 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Kishan Palankar <kpalankar@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-20 07:32:57 -07:00
Santosh BS	54b01e881b	gpu: nvgpu: multimedia engine enumeration changes - Changes to fetch and expose supported multimedia engines to umd - Unit and litter defines for multimedia engines - Add functions to get runlist id Jira NVGPU-9429 Bug 3962979 Signed-off-by: Santosh BS <santoshb@nvidia.com> Change-Id: I072b4aac803c4a70d3659857cb0d804755c5dbd7 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2900765 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: V M S Seeta Rama Raju Mudundi <srajum@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-18 23:40:19 -07:00
Divya	ef1fb41e54	gpu: nvgpu: add pmu hals to resolve mismatch Add the following HALs to avoid the duplication of code for future chips: - set_mailbox1 - get_ecc_address - get_ecc_status - set_ecc_status JIRA NVGPU-9758 Change-Id: I54ce3dfaae2873dbcd88edabbd877eca9f3d1fdb Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2898016 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-16 06:27:08 -07:00
prsethi	24a533c9dc	nvgpu: print the caller name with quiesce Currently quiesce method does not print the caller name which makes it difficult to find the reason behind the issue. Change prints caller name and invocation line number. Bug 4098984 Change-Id: I34a0f557c411f997022668e187060c1c1247b15f Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2900585 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-15 15:54:43 -07:00
Martin Radev	d70c9a708b	gpu: nvgpu: expose local-to-logical/physical GPC mappings Expose the local-to-logical/physical GPC mappings for devtools needs. Bug 3944943 Change-Id: I2aa69ccef19627d41f3e2b8dcc9235401ae1f782 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2900289 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-15 06:08:33 -07:00
Austin Tajiri	7522c3ee31	gpu: nvgpu: skip security check in ctxsw init Force ctxsw ucode to skip the security level check by clearing mailbox 1 before starting FECS/GPCCS. Jira NVGPU-9217 Change-Id: Id4286d0882a29a849128e62c5421c8ae6071e3b1 Signed-off-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2891013 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-15 06:07:23 -07:00
Austin Tajiri	7e0351f291	gpu: nvgpu: retry MMU fault buffer read When handling MMU faults, retry the first MMU fault buffer read multiple times until it contains valid data. There may be a delay between the MMU fault interrupt triggering and the fault buffer containing valid data. Jira NVGPU-9217 Change-Id: I06b442acaf54a4e036795de65345b423f9b424bf Signed-off-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2881909 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-15 06:07:18 -07:00
Rajesh Devaraj	a321679a5d	gpu: nvgpu: add is_gsp_supported flag This patch adds is_gsp_supported flag and initializes it for GA10B, TU104. Further, this flag is checked before initializing GSP LITE falcon. JIRA NVGPU-9983 Change-Id: If0a4a3095c15cac113895f3d114e731f35211c5d Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2902651 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-15 01:19:26 -07:00
Martin Radev	789ebda23d	nvgpu: Expose HES PM resource This patch adds the HES profiling resource. Bug 3944963 Change-Id: Ie7ea4d060cfdc6803262166c1c89d0c2d155c9e3 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2901996 GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com> Reviewed-by: Kishan Palankar <kpalankar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com>	2023-05-12 20:58:19 -07:00
Martin Radev	cad414d310	gpu: nvgpu: Expose NUMA domain id This patch exposes the NUMA domain id for the Linux device to userspace. This information is necessary for userspace to make better utilization of physical memory allocation and CPU scheduling. Bug 3972227 Change-Id: I9ad0369076d22531e154074d616e5a23e374a7e9 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2897294 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-10 06:29:32 -07:00
srajum	80a21343a0	userspace: Enable unit tests on l4t for GA10B JIRA NVGPU-9909 Change-Id: I4917384b855ebfe6e3c428f0a268ad09a6bfb573 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2892996 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-09 22:27:09 -07:00
mpoojary	a9b995bc3f	gpu: nvgpu: Copy correct struct from RPC payload for pmu acr Size of nv_pmu_rpc_struct_acr_bootstrap_gr_falcons is copied from the RPC payload for pmu acr instead of nv_pmu_rpc_header in pmu rpc handler. This causes KASAN slab out-of-bounds error. Bug 3727012 Change-Id: I633dac9167f9ed896dba956dc56e4081aaab6465 Signed-off-by: mpoojary <mpoojary@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2891392 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-09 16:08:42 -07:00
vivekku	bd5ab81ccc	gpu: nvgpu: nvs: queue direction update Changes: - update nvgpu_nvs_ctrl_queue to have queue direction as it is required by gsp scheduler to erase queue individually - queue direction is updated during ioctl call to create queue and is used only by gsp scheduler. So no other module should be affected by it. - need to pass the size of struct which is u32 so downgrading it from u64 to u32 is intentional, MISRA C violation 10.3 can be ignored here Bug 4027512 Change-Id: I6ef6e4b06124e25da3d004a2d8822516c3ac2105 Signed-off-by: vivekku <vivekku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2881804 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-08 23:37:56 -07:00
Divya	a4175b1265	gpu: nvgpu: resolve pmu mismatches Add the following pmu HALs for PMU registers to avoid duplication of code for future chips: - get_bar0_addr - get_bar0_data - get_bar0_timeout - get_bar0_ctl - get_bar0_error_status - set_bar0_error_status - get_bar0_fecs_error - set_bar0_fecs_error - get_mailbox - get_pmu_debug JIRA NVGPU-9758 Change-Id: If8b9c91ecd51d526babf12e3cee09048d736f0f4 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2897156 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-05 19:48:04 -07:00
Deepak Goyal	811f8546cf	nvgpu: sysfs: remove 'allow_all' from prod build REG_OPS whitelist can be bypassed using sysfs /sys/devices/gpu.0/allow_all Sysfs are privileged, need Root access to set this sysfs. This node should not be exposed by NVGPU-RM in Linux prod builds. Bug 4083557 Change-Id: I984212df7a9f2dfeb6759cc502ae485daa1d82d4 Signed-off-by: Deepak Goyal <dgoyal@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2893829 Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-05 11:49:28 -07:00
Rajesh Devaraj	01d3ed09b0	gpu: nvgpu: update get_access_map This patch re-names the variables used in gr_init_get_access_map API: whitelist - gr_access_map num_entries - gr_access_map_num_entries wl_addr_[] - gr_access_map_[] JIRA NVGPU-9849 Change-Id: I3a0a59410af8983867af5bc2f9ff200e56e190c4 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2891567 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-05 02:31:10 -07:00
Shashank Singh	9512b9f1de	gpu: nvgpu: remove user managed addr space capability flag Remove NVGPU_GPU_IOCTL_ALLOC_AS_FLAGS_USERSPACE_MANAGED and NVGPU_AS_ALLOC_USERSPACE_MANAGED flags which are used for supporting userspace managed address-space. This functionality is not fully implemented in the kernel, nor is it going to be implemented in the near future. Jira NVGPU-9832 Bug 4034184 Change-Id: I3787d92c44682b02d440e52c7a0c8c0553742dcc Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2882168 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-04 11:39:30 -07:00
Martin Radev	84bb919909	gpu: nvgpu: Setup GFX-capable TPCs This patch dispatches to the appropriate HAL to select the GFX-capable TPCs. Bug 3944931 Change-Id: Ifb7338bea2cd59581133b7a2ba723f5d8bfa507c Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2891725 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-04 11:38:54 -07:00
Divya	c728f09c18	gpu: nvgpu: add sysfs node for golden img status - Add a sysfs node "golden_img_status" to show if golden_image size and ptr are already initialized or not. - This node helps to know golden image status before attempting to modify gpc/tpc/fbp masks. Bug 3960290 Change-Id: I3c3de69b369bcaf2f0127e897d06e21cb8e2d68e Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2868729 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-04 11:36:37 -07:00
prsethi	c49ac865de	gpu: nvgpu: init golden ctx image during nvgpu poweron Safety build temporal requirement is that on FECS power up it should go through entire initialization methods. init_golden_image callback is being called from devctl/ioctl path and triggers FECS method 10 and 11. As these methods are part of APP init, they are not called during resume, which causes a quiesce on safety builds. To fix this issue, call the callback from the poweron API. Bug 4082813 Bug 4037712 Change-Id: I2d27203d3cb4326ae7d8bd6025693fd61d5237df Signed-off-by: prsethi <prsethi@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2893218 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-04 03:14:19 -07:00
srajum	1de86155d8	userspace: skip "test_branches.branches" unit test - "test_branches.branches" unit test is failing for vectorcast build on ga10b, so skipping this. JIRA NVGPU-9909 Change-Id: Ie713f1b92e29af63c23ba597cf2c16945337b780 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2897716 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-03 21:04:46 -07:00
Richard Zhao	be7b4aac61	gpu: nvgpu: linux: pci calls vgpu power on for virt dev VF driver shares same poweron function with legacy vgpu. Jira GVSCI-15779 Change-Id: I44381f7ebe1cfd6c72ff886aa9f54d54c189a8b6 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2884178 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-01 00:24:56 -07:00
Richard Zhao	f9242c032e	gpu: nvgpu: linux: add virtual function support vf probe is called by pci general probe when platform data indicates it's a virtual device. vf_linux covers PCIE specific initialization, then call common vgpu probe. Jira GVSCI-15779 Change-Id: I47ce1c4807b23363a9062ff0cbc8e08b9c6cdc97 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2884177 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-01 00:24:45 -07:00
Richard Zhao	8a411096c9	gpu: nvgpu: linux: vgpu to use dt_node for tegra_gr_comm For PCIE gpu device, it uses PCIE controller dt node to store nvgpu/vgpu dt properties, since PCIE endpoint device does not have any corresponding dt node. So we pass dt_node directly to tegra_gr_comm, together with EP device since tegra_gr_comm relies on dev_err. Jira GVSCI-15779 Change-Id: I73c5210e2d8b6a728c74823c1e62fed765776365 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2884176 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-by: Santosh BS <santoshb@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-01 00:24:34 -07:00
Richard Zhao	03da61c9ce	gpu: nvgpu: vgpu: create vgpu_common file vgpu_common file contains common functions that could be shared by legacy vgpu and vf. Jira GVSCI-15779 Change-Id: Ie301eb29dfceed95bcd96a1024663f31eb7558fd Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2884175 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-01 00:24:28 -07:00
Richard Zhao	678df244b9	gpu: nvgpu: vgpu: remove bar1 support All vgpu supported chips don't support bar1 anymore. Jira GVSCI-15779 Change-Id: Ia2dc4a69f3cb96df6650e663d2a40302fc826dd2 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2884174 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Santosh BS <santoshb@nvidia.com> Reviewed-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-05-01 00:24:07 -07:00
Johnny Liu	4e6be49eee	gpu: nvgpu: Correct the name of devfreq governor Remove the duplicated nvhost_pod_scaling_governor_v2 and switch to the nvhost_pod_scaling_governor. Bug 4074863 Signed-off-by: Johnny Liu <johnliu@nvidia.com> Change-Id: Ia6016cab46e97f04366d6cf14355b2e7e0989df8 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2895568 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Laxman Dewangan <ldewangan@nvidia.com> Tested-by: Laxman Dewangan <ldewangan@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-28 08:10:31 -07:00
Divya	3e5424bee3	gpu: nvgpu: gv11b: ap_compute fix - During nvgpu_poweron, PERFMON_INIT RPC and ACR_INIT_WPR_REGION command is sent to PMU in two different threads. - For perfmon RPC method is used and for ACR, CMD-MSG queue is used. - Since the pmu thread and poweron thread run in parallel, the pmu sequence acquired by both can have the same seq_id. - For Perfmon RPC, nvgpu_pmu_seq_free_release() is called followed by nvgpu_pmu_seq_release(). - This causes clearing of sequence for the next command. - To resolve this, instead of nvgpu_pmu_seq_free_release(), just free the rpc-payload after getting ack for perfmon and then do sequence release. - This ensures that the ACR cmd sent just after perfmon RPC does not get the same seq_id and the sequence is not cleared. Bug 4074021 Change-Id: Id9972cb719458062d8c7d9e226a25599026c052b Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2889840 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-28 03:32:42 -07:00
srajum	11c07fdfa8	gpu: nvgpu: Disable unit tests failed on ga10b - Disable unit tests that failed on ga10b; they will be re-enabled once fixed, which is tracked in Bug 4084656. JIRA NVGPU-9909 Change-Id: I2795b198d084b2f03b7f71f8add14ffb2954d2b0 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2886239 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-04-28 02:08:25 -07:00
srajum	53941baa93	gpu: nvgpu: fixing unit tests for ga10b - Add support for unit tests to run on orin platform. JIRA NVGPU-9909 Change-Id: I60a059840fd0d2733b0a1f2b3c1f722f8616868e Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2892228 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-28 02:08:14 -07:00
srajum	b2345cd01a	gpu: nvgpu: fixing unit tests for ga10b - Add support for unit tests to run on orin platform. JIRA NVGPU-9909 Change-Id: If4ca69b77d0d8483c0e9f6a6a5a64c3c3e050d65 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2737876 Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-28 02:08:09 -07:00
srajum	63057907ee	gpu: nvgpu: Add mock support for all ga10b registers - Add support for unit tests to run on orin platform. JIRA NVGPU-9909 Change-Id: I532e667c4b30c36ca19776cd4ac8ef8fb1147d03 Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2886066 Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-28 02:08:04 -07:00
Sagar Kamble	4ee71f9852	gpu: nvgpu: guard ecc sysfs remove with NVGPU_DISABLE_ECC_STATS Following error is seen on unloading nvgpu on platforms with NVGPU_DISABLE_ECC_STATS set to true. [ 3712.384639] Internal error: Oops: 96000004 [#1] PREEMPT SMP [ 3712.388479] pc : sysfs_remove_file_ns+0x28/0x50 [ 3712.389119] lr : sysfs_remove_file_ns+0x28/0x50 ... [ 3712.400640] sysfs_remove_file_ns+0x28/0x50 [ 3712.401280] device_remove_file+0x34/0x50 [ 3712.414720] nvgpu_ecc_sysfs_remove+0x74/0xc0 [nvgpu] [ 3712.428800] nvgpu_ecc_remove_support+0x38/0x80 [nvgpu] [ 3712.442240] nvgpu_put+0xb8/0x160 [nvgpu] [ 3712.456319] nvgpu_pci_remove+0x168/0x250 [nvgpu] [ 3712.456959] pci_device_remove+0x4c/0x100 This is happening as ecc sysfs files are not created, however their removal is attempted. Add NVGPU_DISABLE_ECC_STATS check for ecc sysfs removal. Bug 3495440 Change-Id: I9726cf43a740c2b591ca39bdc572e8f4ff5684d3 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2891876 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-by: Martin Radev <mradev@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-26 20:08:45 -07:00
Divya	b54cb9fd97	gpu: nvgpu: add pmu hals Add the following HALs for following PMU registers: - get_irqstat - set_irqsclr - set_irqsset - get_exterrstat - set_exterrstat - get_exterraddr JIRA NVGPU-9758 Change-Id: Ib153d3189ff493fdb726ec2d1e81b863476fc667 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2886108 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-04-21 11:33:56 -07:00
Martin Radev	924dd58da0	gpu: nvgpu: remove IO_COHERENT flag This patch removes the IO_COHERENT flag as IO coherence is the default setting. Bug 3959027 Change-Id: I9800c2b8b161f7bdc2d6856639dd03488881882d Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2887630 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-04-21 11:32:05 -07:00
Martin Radev	81d95456b9	gpu: nvgpu: Rename PLATFORM_ATOMIC to SYSTEM_COHERENT To support current and future usecases, it would be beneficial to select the SYSTEM_COHERENT aperture explicitly. The benefits are: - platform atomic code is cleaned-up. - userspace can select the SYSTEM_COHERENT aperture for any specific usecases. Bug 3959027 Change-Id: I6489ebe87fa75cc760930277bad5e0cacca80eb6 Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2864177 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-21 11:31:53 -07:00
srajum	0babc089da	gpu: nvgpu: add unit tests in required_tests.ini - Every unit test should be added to required_tests.ini; otherwise a warning "Test not in required tests" is thrown while running tests. JIRA NVGPU-9870 Change-Id: Ic1268273fb42386cdd9204705b431ea4d3904afd Signed-off-by: srajum <srajum@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2888269 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-19 10:51:11 -07:00
Ramalingam C	24475ad46b	gpu: nvgpu: pci power management for iGPU-PCIe devices Use the PCI power management functions for iGPU-PCIe devices. JIRA NVGPU-9896 Change-Id: I1ac4ae67fa727e0a8e37ed2037d1417c5c19bb17 Signed-off-by: Ramalingam C <ramalingamc@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2886799 Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Seema Khowala <seemaj@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-14 08:20:47 -07:00
Austin Tajiri	24bebfabaf	gpu: nvgpu: add engine base vector HALs Add HALs for getting the base vectors for stall and nonstall engine interrupts. The engine interrupt IDs are added to these base vectors to determine the engine stall and nonstall interrupt vectors. Jira NVGPU-9217 Change-Id: Ieaf0e75caac0f7e23684b80466fbf1dc3a57f68d Signed-off-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2880426 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-04-13 22:16:13 -07:00
Shashank Singh	21cb70f58d	gpu: nvgpu: remove kind control capability Kind is controlled by nvgpu userspace library so related capability flags can be removed from kernel and uapi interface. Jira NVGPU-9832 Bug 4034184 Change-Id: Id2b0a4e1cd784638362116b8d99177467fba998b Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2880391 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-13 12:21:46 -07:00
Austin Tajiri	56a4680a3b	gpu: nvgpu: refactor gr.intr.handle_sw_method - Add definitions of the gfx/compute classes and methods that are generated from the hw/sw class header files. Use these definitions instead of the hard-coded ones so that mismatches may be caught by the HAL checker. - Abstract out the sw method handling functionality of gr.intr.handle_sw_method into gr.intr.handle_gfx_sw_method and gr.intr.handle_compute_sw_method and have gr.intr.handle_sw_method call these two new HALs. Jira NVGPU-9217 Change-Id: Ia30fcba6174878d9b5b7b5910c564c879a702ddc Signed-off-by: Austin Tajiri <atajiri@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2885547 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-04-13 12:20:33 -07:00
Divya	7a4fff4b17	gpu: nvgpu: add hal for pmu sequence cleanup - On older chips, PMU uses CMD-MSG queue method to communicate with NvGPU. - From Turing onwards, PMU uses RPC method for this. - During poweroff, we release pmu_sequence and reset the members of the structure. - For chips that use RPC, we need to free the payload as well and then reset the members. - Add pmu_seq_cleanup hal for this. Bug 4019694 Bug 4059157 Change-Id: Ieb474fe4ed81f54d78480214cde53b51d45652c6 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2882267 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-04-12 16:28:52 -07:00
Divya	db9a411a06	gpu: nvgpu: sync free of rpc_payload - During driver unload, shutdown or RG path as part of pmu destroy, pmu sequences have to be cleaned up to free payload memory and allocation info which is stored as part of pmu_sequence. - While doing so there can be race condition with pmu_isr or nvgpu_pmu_rpc_execute path where it waits for fw ack. - This race condition can lead to freeing of payload memory before nvgpu_pmu_sequences_cleanup() does. - This can lead to memory corruption or double free issue when the cleanup code again tries to free the payload mem. - To resolve this add a new function nvgpu_pmu_seq_free_release() which will check for seq->id in pmu seq tbl before freeing the memory and other info from pmu_sequence. - Use this nvgpu_pmu_seq_free_release() in non-blocking RPC calls and also when fw ack fails or driver is dying scenario. - For blocking call, synchronise freeing of rpc payload memory by using a new boolean seq_free_status. Bug 4019694 Bug 4059157 Change-Id: Id45a6914a2d383a654539a87861c471a77fb6850 Signed-off-by: Divya <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2882210 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2023-04-12 16:28:41 -07:00

9891 Commits