linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-24 10:34:43 +03:00

Author	SHA1	Message	Date
Richard Zhao	671dbbb145	gpu: nvgpu: remove vm->guest_managed gpu server now moved to use kernel vma range too, so guest_managed is not used anymore. Jira GVSCI-10900 Change-Id: I838cad24194faf72fe5ef53053e5dacc9f6588c1 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2546189 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-28 18:09:44 -07:00
Richard Zhao	2845f2b66e	gpu: nvgpu: unify nvgpu_has_syncpoints - move nvgpu_has_syncpoints to common code and only checks flag NVGPU_HAS_SYNCPOINTS - the debugfs node disable_syncpoints also enable/disable the flag NVGPU_HAS_SYNCPOINTS Jira GVSCI-10881 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I8dc5dd17ad404238203a048abf49ff2b434fce11 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2542738 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Sachin Nikam <snikam@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-28 18:09:14 -07:00
Antony Clince Alex	68e11c8bd3	gpu: nvgpu: remove nvgpu_next_gpuid.h Replace all usages of NVGPU_NEXT_GPUID and NVGPU_NEXT_DGPU_GPUID with NVGPU_GPUID_GA10B and NVGPU_GPUID_GA100. Remove nvgpu_next_gpuid.h and update yaml. Jira NVGPU-4771 Change-Id: I3baf0de4eb5266b79aabd5c6ddf8442bf8f73419 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2547735 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-27 05:03:09 -07:00
Antony Clince Alex	d2919409e9	gpu: nvgpu: rename/collpase nvgpu_next functions and structs Replace all nvgpu_next functions/structs either by 1) collapsing them into nvgpu legacy functions/structs 2) renaming them as follows: - nvgpu_next_() => nvgpu_(ga10b/ga100)_() - nvgpu_next_() => (ga10b/ga100)_() - nvgpu_next_() => nvgpu_() [only if this doesn't cause collision] - nvgpu_next_() = > nvgpu__extra() Create hal.sim unit and move Ampere+ SIM code into it. Jira NVGPU-4771 Change-Id: I215594a0d0df4bd663bd875a0d0db47bcb9ff6a2 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2548056 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-27 05:02:58 -07:00
Antony Clince Alex	f9cac0c64d	gpu: nvgpu: remove nvgpu_next files Remove all nvgpu_next files and move the code into corresponding nvgpu files. Merge nvgpu-next-*.yaml into nvgpu-.yaml files. Jira NVGPU-4771 Change-Id: I595311be3c7bbb4f6314811e68712ff01763801e Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2547557 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Mahantesh Kumbar <mkumbar@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-27 05:02:53 -07:00
Antony Clince Alex	c7d43f5292	gpu: nvgpu: remove usage of CONFIG_NVGPU_NEXT The CONFIG_NVGPU_NEXT config is no longer required now that ga10b and ga100 sources have been collapsed. However, the ga100, ga10b sources are not safety certified, so mark them as NON_FUSA by replacing CONFIG_NVGPU_NEXT with CONFIG_NVGPU_NON_FUSA. Move CONFIG_NVGPU_MIG to Makefile.linux.config and enable MIG support by default on standard build. Jira NVGPU-4771 Change-Id: Idc5861fe71d9d510766cf242c6858e2faf97d7d0 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2547092 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-27 05:02:47 -07:00
Richard Zhao	ff75647d59	gpu: nvgpu: unify power state management code The management code of g->power_on_state on different OS are almost same, so moved the code to the common place. Jira GVSCI-10882 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I890015867b7bbdf3f749ab275ffd085ef76dfec2 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2542846 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-23 09:26:49 -07:00
Konsta Hölttä	e44ece25ba	gpu: nvgpu: keep usermode region flags on railgate When the gpu is railgated, the usermode region mappings must be cleared. This is already done with zap_vma_ptes() but as an extra measure the vm flags are also zeroed. That is an oversight, so delete that code; in particular the VM_DONTCOPY flag is important so that the mapping does not follow fork, as the design does not allow that. Bug 200726443 Change-Id: I84ed4e38b7de1f0c8cbf4cca6276abfa2409ac3b Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538481 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-22 19:30:00 -07:00
Antony Clince Alex	524418980d	gpu: nvgpu: ga10b: fix typo in header guard Header guard macro was incorrectly defined as 'USERD_GK10B_H' instead of 'USERD_GA10B_H'. Change-Id: I5e22fe55eba068b73614989da3b7e524da5a95f5 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2547000 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-22 14:14:10 -07:00
Sagar Kamble	e099f8b1fd	gpu: nvgpu: specify DMA_TO_DEVICE direction to map_attachment for RO maps nvgpu does map attachment with DMA_BIDIRECTIONAL direction for buffers irrespective of the GPU mapping type. nvmap will allow map attachment with only DMA_TO_DEVICE direction for RO buffers for secure buffer access. nvgpu does RO GPU mapping if the buffer is RO for CPU or user requests to map as RO. In both cases the dma_buf map attachment should be done with DMA_TO_DEVICE direction as the intent for accessing the SGT is reading from GPU. Also, map the gpfifo buffer as read_only as it is intended to be read only. The userd buffer is accessed by the GPU through iova and is not GMMU mapped. Bug 200731819 Change-Id: Ifc60973f298f7cacab16c5dedecbb40c5f33ed1d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539312 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-22 14:10:22 -07:00
Sagar Kamble	7e1f9b8b19	gpu: nvgpu: fix the circular header inclusion To use enum gk20a_mem_rw_flag declaration, gmmu.h was included in qnx nvgpu_nvmap_user.h. However with that recursive header inclusion happens as below: nvgpu_nvmap_user.h <- nvgpu/gmmu.h <- nvgpu/nvgpu-mem.h <- nvgpu_rmos/include/nvgpu_mem.h <- nvgpu_nvmap_user.h Remove this recursion by moving gk20a_mem_rw_flag to gmmu.h. Also move nvgpu_aperture to gmmu.h. With this approach gmmu.h can be included in nvgpu-mem.h as it is independent of OS header. Bug 200717195 Bug 3250920 Change-Id: I6d3011d830e3778d8d4224ddfcc2eb85a49e444b Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2531788 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-21 14:49:24 -07:00
Lakshmanan M	19186c8a02	gpu: nvgpu: select map access type from dmabuf permission and user request Add api to translate dmabuf's fmode_t to gk20a_mem_rw_flag for read only/read write mapping selection. By default dmabuf fd mapping permission should be a maximum access permission associated to a particual dmabuf fd. Remove bit flag MAP_ACCESS_NO_WRITE and add 2 bit values for user access requests NVGPU_VM_MAP_ACCESS_DEFAULT\|READ_ONLY\| READ_WRITE. To unify map access type handling in Linux and QNX move the parameter NVGPU_VM_MAP_ACCESS_* check to common function nvgpu_vm_map. Set MAP_ACCESS_TYPE enabled flag in common characteristics init function as it is supported for Linux and QNX. Bug 200717195 Bug 3250920 Change-Id: I1a249f7c52bda099390dd4f371b005e1a7cef62f Signed-off-by: Lakshmanan M <lm@nvidia.com> Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2507150 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-21 14:48:32 -07:00
Vedashree Vidwans	261bdb9cc2	gpu: nvgpu: fix MISRA violations in common.fifo - Rule 8.6 requires each identifier with external linkage to have exactly one external definitions. - Define macro instead of constant value of maximum gpfifo entries. Jira NVGPU-6262 Change-Id: If8f4b9fa06db62a03d487b9e3dd8157a40cffe16 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2540462 (cherry picked from commit 03a42b55b29dc3503999f4728589f0bd17b75c76) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2544846 Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-18 03:01:34 -07:00
Sagar Kadamati	3e43f92f21	gpu: nvgpu: add ga10b & ga100 sources Mass copy ga10b & ga100 sources from nvgpu-next repo. TOP COMMIT-ID: 98f530e6924c844a1bf46816933a7fe015f3cce1 Jira NVGPU-4771 Change-Id: Ibf7102e9208133f8ef3bd3a98381138d5396d831 Signed-off-by: Sagar Kadamati <skadamati@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2524817 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-17 12:56:16 -07:00
Shashank Singh	82734765d8	gpu: nvgpu: corrections to dma alloc API documentation - Mention about NVGPU_DMA_PHYSICALLY_ADDRESSED flag and other not being used on safety. - Correct the range for input size. Jira NVGPU-6415 Change-Id: I6dc8b087edcb640a96e07c90016099cdcf456e60 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538377 (cherry picked from commit 5a492b0aec1a7bfe0ebf904c48814faecabef99c) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2543401 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-17 07:48:19 -07:00
Lakshmanan M	ee2aaef308	gpu: nvgpu: Report non zero num_sub_partition_per_fbpa value only for dGPU All Tegra iGPUs don't have real FBPA/FBSP units at all. So num_sub_partition_per_fbpa should be 0 for iGPUs. JIRA NVGPU-5656 Change-Id: I30050caf8f9f6b5185404a64dbbbe02f67046093 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2545978 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-16 15:06:30 -07:00
Shashank Singh	9bd91499e3	gpu: nvgpu: fix findings in common.nvgpu from DVR Fix nvgpu_get_litter_value() doxygen output. Jira NVGPU-6597 Change-Id: I67ad29d9b9e880695a450fd030ba110bd739cd9b Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2544113 (cherry picked from commit c6b2826700b7435671e31b96d998921680cc9d9c) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2545314 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-15 13:27:07 -07:00
dt	12a0e3fe61	gpu: nvgpu: Add support to print mig config lists This is adding support to show available mig configs when MIG is disabled for nvgpu-next. JIRA NVGPU-6721 Change-Id: I8ba742b7850902c1eea4728655c75d795e0bb3a2 Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2543472 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-15 13:25:46 -07:00
Divya Singhatwaria	4874bdfbac	gpu: nvgpu: Address DVR issues for common.power_features Fix the common.power_features DVR issues found as part of 5.2 SWUD Lite units design verification. 1.Add note about various *CG features. 2. nvgpu_cg_init_gr_load_gating_prod description fixed. JIRA NVGPU-6610 Change-Id: Id28eaa9d15a5481d28a5fd2cc407c82734a6c165 Signed-off-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541739 (cherry picked from commit d19e95407748689a26ae5b5920e6fb50f4399d1f) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2542078 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-15 09:08:21 -07:00
Sagar Kamble	e0e337fb83	gpu: nvgpu: set nvgpu power state to POWERED_OFF on poweron fail When force closing the app, poweron needed in channel close path will fail as pg_task kthread creation fails with -EINTR (process is SIGKILL'd so threads don't get created). Upon poweron failure, device nodes are removed and the nvgpu power state is not reset to NVGPU_STATE_POWERED_OFF. Hence on further gk20a_busy attempts, poweron is not attempted and gpu remains unusable from thereon. Change the state to POWERED_OFF from POWERING_ON on poweron fail. Bug 3308828 Change-Id: I2360f11a4937dfe93eb7933b30c13748fb570898 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2543797 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-15 04:58:28 -07:00
Debarshi Dutta	8f9ac1dea9	gpu: nvgpu: split away power node removal Presently, gk20a_user_deinit is used to remove all device nodes including "power" node as well. Split removal of power node into a separate function gk20a_power_node_deinit to enable other device removal during the normal runtime_suspend path to facilitate the fast path for MIG reconfiguration. Powernode can be removed only during a call to Rmmod. This also enables separately powering off the device nodes in the unlikely case of a poweron failure. Bug 3308828 Jira NVGPU-6920 Change-Id: Ib045a09a992a63c468492a837b273cca41e20f15 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2543014 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-15 04:57:57 -07:00
Divya Singhatwaria	a1d0957a9b	gpu: nvgpu: Update GP10B FW version Updated PMU ucode taken from P4 CL#30066529 for t18x igpu. The ucode resolves the ELPG_DISALLOW_ACK timeout failure P4 CL link for this PMU ucode changes: https://p4sw-swarm.nvidia.com/changes/30066529 Bug 200588696 Change-Id: Ic45c37c75924c581d6ef91ffd754da287d63f4c6 Signed-off-by: Divya Singhatwaria <dsinghatwari@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2544140 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-14 12:58:06 -07:00
Debarshi Dutta	45a1489409	gpu: nvgpu: enable compiling out DGPU specific flag in Hal.Bus unit read_sw_scratch, and write_sw_scratch belonging to gops_bus struct is moved under CONFIG_NVGPU_DGPU compiler flag as these are currently called by DGPU bios specific routines. Jira NVGPU-6402 Change-Id: I5ff22e6d9ad323b0c209f2b4458b8ee3a4a62226 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2542959 (cherry picked from commit 71da44a5dbe3d969d6551dc366813208faf4ed05 in rel-33) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2544003 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-14 05:34:24 -07:00
Seshendra Gadagottu	ae243fa1eb	gpu: nvgpu: set l3_alloc hint based on L3 errata If errata for L3 SCF cache not supported is set, then force l3_alloc hint to false, so that L3 memory traffic will not be generated from nvgpu driver. Bug 3186312 Bug 3288192 Change-Id: Icf776673c2975fdc04cc02bfae28ef26c8deba4d Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539599 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-12 07:24:09 -07:00
Lakshmanan M	4a3a9d46e3	gpu: nvgpu: Use gr_instance specific api to query the num of sm Replaced get_no_of_sm() with gr_instance specific api nvgpu_gr_config_get_no_of_sm() JIRA NVGPU-5656 Change-Id: I01b786402dde857e7cc30d5370429d02ebe3f428 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2543245 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-11 18:05:07 -07:00
Deepak Nibade	c4dee40c49	gpu: nvgpu: fix MISRA violations in common.gr 1. misra_c_2012_rule_8_6_violation: "gm20b_gr_init_fe_go_idle_timeout" is declared but never defined. Fix by adding config CONFIG_NVGPU_HAL_NON_FUSA for header declaration of "gm20b_gr_init_fe_go_idle_timeout" 2. misra_c_2012_rule_5_7_violation: Identifier "ops" is already used to represent a type. Fix by renaming local variable ops to nonstall_ops in gm20b_gr_intr_nonstall_isr() 3. missing_default: No default case found for the switch statement "switch (offset << 2)" Fix by adding break and default statements to switch case in gv11b_gr_intr_handle_sw_method() Jira NVGPU-6779 Change-Id: I8df097ec66479edcd2e81bf46bab5b5db52ac8c8 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541246 (cherry picked from commit c4d9fe0449f8c6ee209051abfe58c6f3a745808d) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2543012 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-11 18:04:33 -07:00
Lakshmanan M	5394175d5b	gpu: nvgpu: Move get_num_hwpm_perfmon() after golden context creation Query the num_perfmon requires golden context to be ready. Accessing golden context might require gr_instance_id, specific to a GR engine. On TOT, get_num_hwpm_perfmon() called from perfmon HAL which might require to call nvgpu_gr_exec_with_err_for_instance(). It internally calls nvgpu_grmgr_config_gr_remap_window() to change gr_window_remap register points to a current gr_instance_id for MIG. This approach indirectly mandates to call nvgpu_gr_exec_with_err_for_instance() which can be completely avoided. get_num_hwpm_perfmon() is just a query call which can be moved after the golden context creation. Using this logic, we can avoid unnecessary invocation of nvgpu_gr_exec_with_err_for_instance() during perform specific HAL accesses. 1) Moved get_num_hwpm_perfmon() after golden context creation. 2) Added nvgpu_assert() if (g->num_sys_perfmon == 0U). JIRA NVGPU-5656 Change-Id: I59a6ab4df93763adbc0765fa5e4d1712b2477521 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2542438 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-10 19:53:17 -07:00
Konsta Hölttä	4b3591aafb	gpu: nvgpu: avoid faulty elpg protection Don't store the return value of elpg re-enable if disable fails; this could make the local status value zero again, causing the elpg-protected call to be executed with elpg still enabled and elpg re-enabled twice. Commit `c905858565` ("gpu: nvgpu: add cg and pg function") introduced this bug; failure of re-enabling after a failed disable might be another problem (and it's not clear why this is done in the first place) which isn't propagated to the caller, but that would belong to another patch. Bug 200565050 Change-Id: I7cf7a0887ae59e85bf0c56c38aaaadfefd16cc1c Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541859 Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-10 19:51:13 -07:00
Lakshmanan M	7d473f4dcc	gpu: nvgpu: Expose logical mask for MIG 1) Expose logical mask instead of physical mask when MIG is enabled. For legacy, NvGpu expose physical mask. 2) Added fb related info in struct nvgpu_gpu_instance(). 4) Added utility api to get the logical id for a given local id nvgpu_grmgr_get_gr_gpc_logical_id() 5) Added grmgr api to get max_gpc_count nvgpu_grmgr_get_max_gpc_count(). 5) Added grmgr's fbp api to get num_fbps and its enable masks. nvgpu_grmgr_get_num_fbps() nvgpu_grmgr_get_fbp_en_mask() nvgpu_grmgr_get_fbp_rop_l2_en_mask() 6) Used grmgr's fbp apis in ioctl_ctrl.c 7) Moved fbp_init_support() in nvgpu_early_init() 8) Added nvgpu_assert handling in grmgr.c 9) Added vgpu hal for get_max_gpc_count(). JIRA NVGPU-5656 Change-Id: I90ac2ad99be608001e7d5d754f6242ad26c70cdb Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538508 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-10 03:05:21 -07:00
Richard Zhao	e2d8bdc38d	gpu: nvgpu: unify nvgpu_get_gpfifo_entry_size moved nvgpu_get_gpfifo_entry_size implementation to common code. Jira GVSCI-10880 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ia6ccee5e26836662f7c2196ff41658ff41e3a570 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541575 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-09 19:27:25 -07:00
Deepak Nibade	67399a1892	gpu: nvgpu: unit: BVEC test for common.class unit class_validate_setup is already testing for valid/invalid boundary values for common.class APIs. Append the valid/invalid list with BVEC test values. Fix obsolete gops_class doxygen documentation. Jira NVGPU-6403 Change-Id: Id713db614919842324f6d655b36dd57043958919 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539797 (cherry picked from commit 6aed159f9f3eeea553a442af37e3bcc840152154) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539795 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-09 14:06:07 -07:00
Richard Zhao	9ac7550f35	gpu: nvgpu: unify NV_READ_ONCE and NV_WRITE_ONCE Implemented NV_READ_ONCE and NV_WRITE_ONCE in common code. Jira GVSCI-10879 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I5465b4bd1cd44fc7bc1592da01d6be455b1fcdcc Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2541559 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-09 03:15:39 -07:00
Seshendra Gadagottu	5ec1e0cc21	gpu: nvgpu: make gp10b_tegra_acquire_platform_clocks public Made gp10b_tegra_acquire_platform_clocks as public function so that each gpu architecture can supply different number of clock list. Jira NVGPU-6707 Change-Id: Iad2156a63e00913374ce5fa4274c95e7488fdb31 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2511795 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Sivaram Nair <sivaramn@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-08 21:54:23 -07:00
Deepak Nibade	6fb2b892ce	gpu: nvgpu: set/check mmu nack flags only for GPC exceptions gv11b_mm_mmu_fault_handle_mmu_fault_refch() right now checks/sets mmu_nack_handled flag for MMU faults from all clients (i.e. GPC/HUB). Handling of MMU nack in MMU fault handling path is only needed if MMU nack exception is triggered by SM in GPC. Hence set and check this flag only if source client is GPC. In certain cases it is possible that CE engine triggers back to back MMU faults on same channel. When this happens, and because of above incorrect mmu_nack_handled flag handling, mmu_nack_handled flag is set while handling second MMU fault from CE. And because of this gv11b_mm_mmu_fault_handle_mmu_fault_refch() function could end up dropping extra channel refcounts and trigger access after free scenarios on that channel. Bug 3315942 Change-Id: I28d8311edf34a041364dddedb5fc3a5b83132f85 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2540497 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Sagar Kamble <skamble@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-08 06:48:56 -07:00
Richard Zhao	1685a2404f	gpu: nvgpu: vgpu: add b0cc profiler support - added new commands to bind/unbind hwpm/hwpm_streamout/smpc - added new command to updat get/put for PMA buffer - tune function nvgpu_perfbuf_update_get_put so it could be reused on server side. - enable profiler v2 device for gv11b Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I4226c89ec3040e53dee5381ac8a30c9fd598e5ef Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537683 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:30:03 -07:00
Richard Zhao	a3c4236574	gpu: nvgpu: profiler: create bind/unbind hals - created gops_profiler - added HALs for bind/unbind hwpm/hwpm_streamout/smpc - it helps enable b0cc on vgpu Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I9fd30b134d54a92d1ce8108172aa77237c702bc0 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537682 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:57 -07:00
Richard Zhao	4ea92a530b	gpu: nvgpu: profiler: remove profiler obj from hwpm bind/unbind It helps the hwpm bind/unbind functions to be reused on server side. Server side does not track profiler object. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ib692c686e940b8123c934b5bb6ba843e09a27246 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537681 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:52 -07:00
Richard Zhao	7664bee12f	gpu: nvgpu: profiler: remove profiler obj from smpc bind/unbind It helps the smpc bind/unbind functions to be reused on server side. Server side does not track profiler object. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I5e62901cabb56cb2f2d40d51a249b1404b292f5a Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537680 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:46 -07:00
Richard Zhao	9b66fca165	gpu: nvgpu: move .exec_regops to only execute regops HAL .exec_regops used to first validate regops then execute it, now moving it to only execute the regops. - It helps B0CC on HV. On server side it does not track profiler object, but regops validation uses the profiler, so moving validation to client side. - The change also remove ctx_buffer_offset checking in validate_reg_op_offset. The offset already checked again whitelists which have be verified when update whitelist. Also vgpu does not have information of ctx and golden image. - Added function nvgpu_regops_exec to cover both regops validation and execution. Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I434e027290e263a8a64a25a55500f7294038c9c4 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2534252 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:29:40 -07:00
Lakshmanan M	08cd42093d	gpu: nvgpu: Add multi gr l2_evict support 1) Added l2_evict support for multi gr 2) Added multi gr handling for the following apis, nvgpu_gr_get_cilp_preempt_pending_chid nvgpu_gr_clear_cilp_preempt_pending_chid JIRA NVGPU-5656 Change-Id: Iee6142a49b9a569f2b440077762164af8aee9fb3 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2539734 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-07 13:46:40 -07:00
Lakshmanan M	df87591b7d	gpu: nvgpu: Add multi gr handling for debugger and profiler 1) Added multi gr handling for dbg_ioctl apis. 2) Added nvgpu_assert() in gr_instances.h (for legacy mode). 3) Added multi gr handling for prof_ioctl apis. 4) Added multi gr handling for profiler. 5) Added multi gr handling for ctxsw enable/disable apis. 6) Updated update_hwpm_ctxsw_mode() HAL for multi gr handling. JIRA NVGPU-5656 Change-Id: I3024d5e6d39bba7a1ae54c5e88c061ce9133e710 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538761 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-04 18:07:47 -07:00
Sagar Kamble	1dd3e0761c	gpu: nvgpu: fix the usermode mappings deadlock during railgate and munmap Following locking sequence leads to deadlock: 1. gk20a_pm_prepare_poweroff (alter_usermode_mappings): ctrl_privs_lock -> mmap_lock 2. __do_munmap (usermode_vma_close): mmap_lock -> ctrl_privs_lock This lock contention can be resolved by retrying the usermode mapping alteration after a while releasing the ctrl_priv_lock for munmap to proceed. Below is the kernel panic log with deadlock. [] INFO: task kworker/1:1:116 blocked for more than 120 seconds. [] Tainted: G W 5.10.17-tegra #1 [] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [] task:kworker/1:1 state:D stack: 0 pid: 116 ppid: 2 flags:0x00000028 [] Workqueue: pm pm_runtime_work [] Call trace: [] __switch_to+0x104/0x160 [] __schedule+0x3d4/0x900 [] schedule+0x74/0x100 [] rwsem_down_write_slowpath+0x250/0x4b0 [] down_write+0x6c/0x80 [] alter_usermode_mappings+0xb4/0x160 [nvgpu] [] nvgpu_hide_usermode_for_poweroff+0x24/0x30 [nvgpu] [] gk20a_pm_prepare_poweroff+0xe8/0x140 [nvgpu] [] gk20a_pm_runtime_suspend+0x78/0xf0 [nvgpu] [] pm_generic_runtime_suspend+0x3c/0x60 [] genpd_runtime_suspend+0xb0/0x2c0 [] __rpm_callback+0x90/0x150 [] rpm_callback+0x34/0xa0 [] rpm_suspend+0xe0/0x5e0 [] pm_runtime_work+0xbc/0xc0 [] process_one_work+0x1c0/0x4a0 [] worker_thread+0x11c/0x430 [] kthread+0x148/0x170 [] ret_from_fork+0x10/0x18 [] INFO: task nvrm_gpu_tests:1273 blocked for more than 121 seconds. [] Tainted: G W 5.10.17-tegra #1 [] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [] task:nvrm_gpu_tests state:D stack: 0 pid: 1273 ppid: 1245 flags:0x00000000 [] Call trace: [] __switch_to+0x104/0x160 [] __schedule+0x3d4/0x900 [] schedule+0x74/0x100 [] schedule_preempt_disabled+0x28/0x40 [] __mutex_lock.isra.0+0x184/0x5c0 [] __mutex_lock_slowpath+0x24/0x30 [] mutex_lock+0x5c/0x70 [] usermode_vma_close+0x30/0x50 [nvgpu] [] remove_vma+0x34/0x60 [] __do_munmap+0x1f4/0x4a0 [] __vm_munmap+0x74/0xd0 [] __arm64_sys_munmap+0x3c/0x50 [] el0_svc_common.constprop.0+0x7c/0x1a0 [] do_el0_svc+0x34/0xa0 [] el0_svc+0x1c/0x30 [] el0_sync_handler+0xa8/0xb0 [] el0_sync+0x160/0x180 [] ---[ end Kernel panic - not syncing: hung_task: blocked tasks ]--- Bug 200703921 Change-Id: Ie7f017c92f20061d3bf891079f7fc7fe390f7cf7 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2533853 Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-04 18:06:11 -07:00
Deepak Nibade	419a65965b	gpu: nvgpu: add mutex for gr_ctx initialization If user calls IOCTL to allocate object context for two channels in same TSG in parallel, nvgpu_gr_setup_alloc_obj_ctx() could end up racing and trying to allocate object context for both channels at the same time. This could result in corrupting object context. Fix this by introducing per-TSG mutex ctx_init_lock to serialize context initialization for all channels within TSG. In ideal scenario nvrm_gpu is the only caller of all the IOCTLs, and nvrm_gpu makes sure to initialize object context for each channel in serial order. Because of this new lock does not cause any contention. Jira NVGPU-6431 Change-Id: Ibb1cbb4878748929bb7f23e8666c283c39ecbf5a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538333 (cherry picked from commit 8be447838dc1ecbd5637eb6bd13b8f338eaf33cd) Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2538773 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-03 15:59:43 -07:00
dt	5e82717c96	gpu: nvgpu: Add powernode support to vgpu As the normal gpu is powered on by writing one to power-node, the patch is adding power node for vgpu. Change-Id: I08fbbe8694e02c826a0d5692f5a4c0f4efd396ff Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537053 Reviewed-by: Richard Zhao <rizhao@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-02 19:40:39 -07:00
Sagar Kamble	ac30c4cb65	gpu: nvgpu: change acr bootstrap completion info message Following information message was printed unconditionally. Often, it is not useful. nvgpu_acr_wait_for_completion:100 [INFO] flcn-0: sctl reg 7021 cpuctl reg 50 It is okay to move this to nvgpu_acr_dbg. bug 200734207 Change-Id: Ie66caf20d0e2eb692532e26bf89417342a054cf8 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2536471 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-02 19:40:26 -07:00
Lakshmanan M	edbcf5cfc6	gpu: nvgpu: add multi gr handling for debugger Added multi gr handling for debugger apis. Replaced g->gr with nvgpu_gr_get_cur_instance_ptr(g). JIRA NVGPU-5656 Change-Id: I010eff39b1ebec231b4dbdd53caffc25e1cd54c4 Signed-off-by: Lakshmanan M <lm@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537784 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2021-06-01 12:50:48 -07:00
Debarshi Dutta	11d27743f8	gpu: nvgpu: add NULL checks before freeing ZBC and ZCULL Disabling NVGPU_SUPPORT_MIG in suspend path leads to inconsistencies. During driver removal without the flag set, the driver still tries to free structures that might not have been allocated in the first place. e.g. nvgpu_gr_zbc_deinit, nvgpu_gr_zcull_deinit. Added NULL checks for ZBC and ZCULL structures before freeing them as a solution. Jira NVGPU-6832 Change-Id: I8a0c64ca982d11fee55542abd3c5bce5a51b4007 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2535101 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Dinesh T <dt@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-01 07:58:57 -07:00
Tejal Kudav	9f43914933	gpu: nvgpu: Move Intr handling common code to CIC CIC (Central Interrupt controller) will be responsible for the interrupt handling. common.cic unit is the placeholder for all interrupt related code. Move interrupt related defines and Public APIs present in common.mc to common.cic. Note: The common.mc interrupts related struct definitions are not moved as part of this patch. Adapt the code to use interrupt handling related defines and public APIs migrated from common.mc to common.cic JIRA NVGPU-6899 Change-Id: I747e2b556c0dd66d58d74ee5bb36768b9370d276 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2535618 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-31 19:37:31 -07:00
Deepak Nibade	9034b1676e	gpu: nvgpu: compile out GFxP support in safety GFxP preemption for graphics contexts is not supported in safety. But the support was enabled along with CONFIG_NVGPU_GRAPHICS since GFxP preemption was protected under same config. Add a separate config CONFIG_NVGPU_GFXP to protect all GFxP specific code, enum values, and HALs. Disable the config in safety profile. Jira NVGPU-6893 Change-Id: Iebb5f754a1025dfa6e05a94704bdb8a7123b599a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2534986 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-28 15:17:36 -07:00
dt	c1b302652e	gpu: nvgpu: Add fix for dev_node leak This is adding fix for dev_node leak when user_deinit called. The dev_nodes in linux are created in two phases. In first phase the power dev_nodes(one for legacy and other for v2) are created. The second phase other dev_nodes are created. While creating the dev_nodes the power cdev_region overwritten by cdev_region. This is fixed by introducing new cdev_region and updating respective nodes. JIRA NVGPU-6721 Change-Id: Iec78db8e5fe40cc0b14fb3fecc35b8881dff716f Signed-off-by: dt <dt@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2535265 Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Lakshmanan M <lm@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-05-28 11:39:58 -07:00

1 2 3 4 5 ...

8464 Commits