linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 18:16:01 +03:00

Author	SHA1	Message	Date
Nitin Kumbhar	7cec4ba326	gpu: nvgpu: add platform control for gc off The GC-OFF feature shall be available only for selective dGPUs like Volta, etc. To enable this, add a platform flag to control GC-OFF feature for a given dGPU. If GC-OFF is not enabled for a dGPU, EPERM error will be returned by kernel interfaces. JIRA NVGPU-1100 Change-Id: Ic9e4492b2bb8916d520e78ecb6a500ccd349b70c Signed-off-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1923249 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-27 15:24:10 -08:00
Preetham Chandru R	9ad31113e8	gpu: nvgpu: RDMA implementation This change adds RDMA support for tegra iGPU. 1. CUDA process allocates the memory and passes the VA and size to the custom kernel driver. 2. The custom kernel driver maps the user allocated buf and does the DMA to/from it. 3. Only supports iGPU + cudaHostAlloc sysmem 4. Works only for a given process. 5. Address should be sysmem page aligned and size should be multiple of sysmem page size. 6. The custom kernel driver must register a free_callback when get_page() function is called. Bug 200438879 Signed-off-by: Preetham Chandru R <pchandru@nvidia.com> Change-Id: I43ec45734eb46d30341d0701550206c16e051106 Reviewed-on: https://git-master.nvidia.com/r/1953780 (cherry picked from commit `d6278955f6`) Reviewed-on: https://git-master.nvidia.com/r/1821407 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-21 05:35:45 -08:00
Thomas Fleury	4b1cfa5636	gpu: nvgpu: uppercase for VBIOS version Use uppercase to display VBIOS version to match nvflash_eng, spreadsheets, and Docker's manifest. Bug 200473234 Change-Id: Idb3f802c41da8ebd0268386687be6a99c38dd9c3 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975518 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Eric Yuen <eyuen@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-19 14:55:15 -08:00
Antony Clince Alex	7fb33cf87b	gpu: nvgpu: Defer pstate deinit to driver remove The PMU pstate deinit was invoked as part of GPU power off. This frees and clears the pmgr_pmu struct which causes the pmu remove support to crash when it tries to access the pmgr_pmu object for freeing up the pmu board objects. Deferred pstate deinit to nvgpu driver removal as there is no reason for it to be invoked as part of the prepare poweroff sequence. JIRA NVGPU-1618 Change-Id: I2eb52000f0732d0abed54946e0843367b119d443 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971225 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-18 12:13:42 -08:00
Kary Jin	5b1b9eeab1	gpu: nvgpu: Add reboot handler Add a reboot handler to make sure that nvgpu does not try to busy the GPU if the system is going down. If the system is going down then any number of subsystems nvgpu depends on may already have been deinitialized. Bug 200333709 Bug 200454316 Change-Id: I2ceaf7ca4fb88643310874b5b26937ef44c6e3dd Signed-off-by: Kary Jin <karyj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1927018 (cherry picked from commit `9d2e50de42`) Reviewed-on: https://git-master.nvidia.com/r/1927030 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-17 11:23:56 -08:00
Thomas Fleury	89200e3c75	gpu: nvgpu: use GPL license for linux code Linux specific code should have GPL license Bug 2463898 Change-Id: I38de0a6e57a2154f3d736cd0373015a8fa146987 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1973408 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 18:23:52 -08:00
Deepak Nibade	924502875c	gpu: nvgpu: dGpu VDK support Modified the pci dev_id from tu102 to tu104. JIRA NVGPU-1564 Change-Id: Ib057d11ccd5d69d00b9c569ba947f4328b49885a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1774971 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 10:55:55 -08:00
Konsta Holtta	7aac00ee58	gpu: nvgpu: verify usermode mapping is at most 64K Commit `ca611e4d0e` (gpu: nvgpu: verify usermode mapping is at least PAGE_SIZE) was not quite the right thing to do; do_mmap() rounds the length up to a page boundary anyway, but the length must not be longer than the size of the usermode region which is 64 KB to avoid leaking access to other registers. Bug 2441531 Change-Id: Ib1c88a6725db62c8276b6e8b880631227a4fc8cd Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971339 Reviewed-by: Alex Waterman <alexw@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: Allen Martin <amartin@nvidia.com> Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 00:35:22 -08:00
Alex Waterman	0645492bae	gpu: ngpu: Add PHYSICALLY_ADDRESSED flag to Linux DMA debug string Add this flag name to the DMA debug string that is used for sizing the buf used to print DMA debugging info. This was missed when adding this new DMA flag. Change-Id: I2d97f8532f512811f7804e03fff2dbaabe8479a7 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971677 Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-13 16:36:13 -08:00
Abdul Salam	8d2c1141d3	gpu: nvgpu: Remove support for GP106 Delete gp106 HALs and GPUIDs As first part, below are removed 1. HAL files 2. GPUIDs and its check in hal init 3. Unused _gp106 files Bug 200457373 Change-Id: Ic713e3ef728c006d5935ab638d6ff0e1583486d3 Signed-off-by: Abdul Salam <absalam@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1949495 GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-13 04:56:14 -08:00
Peng Liu	34df003519	gpu: nvgpu: using pmu counters for load estimate PMU counters #0 and #4 are used to count total cycles and busy cycles. These counts are used by podgov to estimate GPU load. PMU idle intr status register is used to monitor overflow. Overflow rarely occurs because frequency governor reads and resets the counters at a high cadence. When overflow occurs, 100% work load is reported to frequency governor. Bug 1963732 Change-Id: I046480ebde162e6eda24577932b96cfd91b77c69 Signed-off-by: Peng Liu <pengliu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1939547 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 18:22:54 -08:00
Anup Mahindre	75ff0feeff	gpu: nvgpu: Add characterstics field to expose max ctxsw ring buffer size NVGPU_CTXSW_IOCTL_RING_SETUP can be used to setup a custom ring buffer and it accepts size via arguments. nvgpu driver will return an error if size requested is greater than 128 * 4096 but this value is hardcoded and not exposed anywhere. Add characteristics field in nvgpu.h to expose this size so that corresponding nvrm_gpu API can use it. Bug 2169674 Change-Id: Icf9465d4eec6ba3a307ea9490bd5da563944e4f6 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1967596 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:27 -08:00
Thomas Fleury	7e68e5c83d	gpu: nvgpu: userd slab allocator We had to force allocation of physically contiguous memory for USERD in nvlink case, as a channel's USERD address is computed as an offset from fifo->userd address, and nvlink bypasses SMMU. With 4096 channels, it can become difficult to allocate 2MB of physically contiguous sysmem for USERD on a busy system. PBDMA does not require any sort of packing or contiguous USERD allocation, as each channel has a direct pointer to that channel's 512B USERD region. When BAR1 is supported we only need the GPU VAs to be contiguous, to setup the BAR1 inst block. - Add slab allocator for USERD. - Slabs are allocated in SYSMEM, using PAGE_SIZE for slab size. - Contiguous channels share the same page (16 channels per slab). - ch->userd_mem points to related nvgpu_mem descriptor - ch->userd_offset is the offset from the beginning of the slab - Pre-allocate GPU VAs for the whole BAR1 - Add g->ops.mm.bar1_map() method - gk20a_mm_bar1_map() uses fixed mapping in BAR1 region - vgpu_mm_bar1_map() passes the offset in TEGRA_VGPU_CMD_MAP_BAR1 - TEGRA_VGPU_CMD_MAP_BAR1 is called for each slab. Bug 2422486 Bug 200474793 Change-Id: I202699fe55a454c1fc6d969e7b6196a46256d704 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:10 -08:00
Deepak Nibade	6777bd5ed2	gpu: nvgpu: add separate unit for gr/ctxsw_prog Add separate new unit gr/ctxsw_prog that provides interface to access h/w header files hw_ctxsw_prog_.h Add below chip specific files that access above h/w unit and provide interface through g->ops.gr.ctxsw_prog.() HAL for rest of the units common/gr/ctxsw_prog/ctxsw_prog_gm20b.c common/gr/ctxsw_prog/ctxsw_prog_gp10b.c common/gr/ctxsw_prog/ctxsw_prog_gv11b.c Remove all the h/w header includes from rest of the units and code. Remove direct calls to h/w headers ctxsw_prog_() and use HALs g->ops.gr.ctxsw_prog.() instead In gr_gk20a_find_priv_offset_in_ext_buffer(), h/w header ctxsw_prog_extended_num_smpc_quadrants_v() is only defined on gk20a And since we don't support gk20a remove corresponding code Add missing h/w header ctxsw_prog_main_image_pm_mode_ctxsw_f() for some chips Add new h/w header ctxsw_prog_gpccs_header_stride_v() Jira NVGPU-1526 Change-Id: I170f5c0da26ada833f94f5479ff299c0db56a732 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1966111 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 14:41:04 -08:00
Vaibhav Kachore	4c362aefc0	gpu: nvgpu: fix debugfs register access "gk20a_busy" and "gk20a_idle" should be called before and after accessing mailbox0 and mailbox1 registers respectively. Bug 200472922 Change-Id: I6da07f84f1b4e9dc3b2034cd6aefe41f2a507348 Signed-off-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1967358 Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-10 23:43:46 -08:00
Allen Martin	ca611e4d0e	gpu: nvgpu: verify usermode mapping is at least PAGE_SIZE This is part of a move to 64KiB for usermode mapping to fix failures when the system page size is 64KiB. When remapping or zapping the vma, use the existing size, not hardcoded size. Also change the verification of the size when creating the mapping to verify it is at least as big as PAGE_SIZE. This allows 4KiB mappings to continue to work until nvrm_gpu is changed to use 64KiB mappings. Bug 2441531 Change-Id: I447ef8e9f84e6d70bbe96b527e267ec41c5630b8 Signed-off-by: Allen Martin <amartin@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1964687 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-10 15:24:49 -08:00
Thomas Fleury	0a90a5c5a9	WAR: gpu: nvgpu: disable MSI for kernel 4.14 With MSI enabled and kernel 4.14, there are some occurrences of interrupts not being launched (no issue on kernel 4.9). If an interrupt is not served for a software method, we end up most of the time with PBDMA busy, while engine is idle. Disable MSI for dGPU on kernel 4.14 Bug 200460636 Change-Id: I3c5657d63195de5c714f44d331ac992199811d9f Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1968073 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-10 14:15:22 -08:00
Peter Daifuku	ebf874c351	nvgpu: pmu: cleanup init thread on destroy In nvgpu_kill_task_pg_init(), call nvgpu_thread_join() if the init thread is no longer running in order to reclaim thread resources. Bug 2452799 JIRA ESRM-437 Change-Id: Id9c67f689027f00039ac2df226ee9c28ad89dd1d Signed-off-by: Peter Daifuku <pdaifuku@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1967983 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-10 14:15:14 -08:00
Alex Waterman	fc939e5fb6	gpu: nvgpu: Add IOCTL flag + plumbing for unified VAs Add a flag that lets userspace enable the unified VM functionality on a selective basis. This feature is working for all cases except a single MODS trace. This will allow test coverage to be selectively added in certain userspace tests as well to help prevent this feature from bit rotting (as it has historically done). Also update the unit test for the page table management in the GMMU to reflect this new flag. It's been set to false since the target platform for safety is currently not using unified address spaces. Bug 200438879 Change-Id: Ibe005472910d1668e8372754be8dd792773f9d8c Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1951864 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-07 12:15:11 -08:00
Alex Waterman	7d9d835631	gpu: nvgpu: Unified VA space for vGPUs Enable unified address spaces for all vGPU configurations. Bug 200105199 Change-Id: Ic175214dafccaba5850c1e1995ff0b5280a4ad09 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1955625 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 15:24:20 -08:00
Debarshi Dutta	c965ef8dc2	gpu: nvgpu: error handling for invalid ioctl call NVGPU_GPU_IOCTL_GET_EVENT_FD should return -EINVAL when invoked on any chip that does not have NVGPU_SUPPORT_DEVICE_EVENTS enabled. This is resulting in a use-after-free error in UBSAN from syzkaller fuzzing in the nvgpu driver. Also, as an add-on, remove the flag clk_arb_events_supported as the device events check can be made using the flag NVGPU_SUPPORT_DEVICE_EVENTS. Bug 200463292 Change-Id: I0ed0217704daa9e401b57a268a30b9f798928e4a Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1956070 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Konsta Holtta <kholtta@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 11:54:17 -08:00
Debarshi Dutta	1e78d47f15	gpu: nvgpu: replace input parameter tsgid with pointer to struct tsg_gk20a gv11b_fifo_preempt_tsg needs to access the runlist_id of the tsg as well as pass the tsg pointer to other public functions such as gk20a_fifo_disable_tsg_sched. This qualifies the preempt_tsg to use a pointer to a struct tsg_gk20a instead of just using the tsgid. Jira NVGPU-1461 Change-Id: I01fbd2370b5746c2a597a0351e0301b0f7d25175 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959068 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 08:15:06 -08:00
Anup Mahindre	ae57a78c73	gpu: nvgpu: Return size of ring buffer from NVGPU_CTXSW_IOCTL_RING_SETUP NVGPU_CTXSW_IOCTL_RING_SETUP is used to setup a ring buffer of custom size for FECS tracing. It uses size field from its arguments to setup a user-mapped ring buffer for holding FECS Trace entries. The value from this field is rounded up to nearest page-size boundary. This rounded up value is supposed to be returned by the IOCTL (as per description of the field in nvgpu.h). That is currently not the case and the IOCTL just returns the same value as that was passed. This change fixes this issue by returning updated value. Bug 200469520 Change-Id: I477aefaede9a4cdba921026466db3fb8fbfd0712 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1955337 Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-29 10:15:05 -08:00
Alex Waterman	c49e9e4bcd	gpu: nvgpu: split the nvgpu_sgt unit from nvgpu_mem Split the nvgpu_sgt code out from the nvgpu_mem code. Although the two chunks of code are related the SGT code is distinct and as such should be its own unit. To do this a new source file has been added - nvgpu_sgt.c - which contains all the nvgpu_sgt common APIs. These are the facade APIs to abstract the actual details of how any given nvgpu_sgt is actually implemented. An abstract unit - nvgpu_sgt_os - was also defined. This unit exists solely for the nvgpu_sgt unit to call so that the OS specific nvgpu_sgt_os_create_from_mem() API can be moved from the common nvgpu_sgt unit. Note this also updates the name of what the OS specific units are expected to call. Common code may still use the generic nvgpu_sgt_create_from_mem() API. JIRA NVGPU-1391 Change-Id: I37f5b2bbf9f84c0fb6bc296c3e04ea13518bd4d0 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1946012 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-29 03:15:17 -08:00
Sharif Inamdar	98dca979d6	Revert "nvgpu: Change the path in the dependent files" This breaks the Android builds This reverts commit `4e7333967d`. Change-Id: I537c3a86d0bdce52ad8e3f42a1e8a7535199ea0a Signed-off-by: Sharif Inamdar <isharif@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959910	2018-11-28 02:39:51 -08:00
Thomas Fleury	2b762363ac	gpu: nvgpu: flag for physically addressed buffers Some buffers like userd are physically addressed. If nvlink is enabled, or device is not iommuable, this requires buffer to be physically contiguous. Add NVGPU_DMA_PHYSICALLY_ADDRESSED to identify such buffers, in order to force physically contiguous allocation, only in above cases. Bug 2422486 Change-Id: I6426e23b064904e812e6b33e6d706391648a51ae Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959034 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-27 21:42:57 -08:00
Anuj Gangwar	4e7333967d	nvgpu: Change the path in the dependent files changes in path because we move the nvhost linux user-interface from include/linux/ to include/uapi/linux depends on I2e116dc8f6c33f53c03fb56b923931b6e600b534 Bug 2062672 Change-Id: If2e165852432d5795cf6680cfeb5d4b661fdee74 Signed-off-by: Anuj Gangwar <anujg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1953731 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Bibek Basu <bbasu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-27 03:15:36 -08:00
Terje Bergstrom	1cf6e4fc5e	gpu: nvgpu: Remove pmgr.h dependency from gk20a.h gk20a.h depends on definition of struct pmgr_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Also set pointer to NULL when freed. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: I21ff1ae93ac7b92a71502f97785252c04964e72f Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1954003 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-26 21:22:57 -08:00
Rajesh Devaraj	bc1ee5a281	gpu: nvgpu: gk20a.c unification Renamed gk20a.c to nvgpu_init.c and moved it to be part of common code. JIRA NVGPU-1397 JIRA VQRM-2094 JIRA VQRM-4169 Change-Id: I716542a55f1f7acd82da5bd5e7b22d59e0f5cf23 Signed-off-by: Rajesh Devaraj <rdevaraj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1956049 GVS: Gerrit_Virtual_Submit Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-25 23:54:10 -08:00
Petlozu Pravareshwar	1652646d44	nvlink: Update nvlink core header file path As part of unifying tegra nvlink SW(linux and qnx), the nvlink core header file path is changed. This change updates the path on nvgpu files accordingly. Bug 200406382 Change-Id: I4c330fe6706134b11749f5c7a9ba7d64e3de95f1 Signed-off-by: Petlozu Pravareshwar <petlozup@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1941092 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: Abhishek Sahu <absahu@nvidia.com> Reviewed-by: Rakesh Babu Bodla <rbodla@nvidia.com> Reviewed-by: Tejal Kudav <tkudav@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-24 00:33:47 -08:00
Konsta Holtta	d49d64e720	gpu: nvgpu: store usermode regs bus addr directly Instead of just the base address of the main register range, store (also) the base address of usermode area. All regs may not be always available; on vgpu guests we have only the usermode regs. Store the usermode addr we get from a platform resource directly in gv11b_vgpu_probe() for vgpu. In that case the main reg addr is unset. The base address is computed in gk20a_pm_finalize_poweron() for native environments; when the reg addr is read from a resource, the chip is still unknown and as such the HAL op for reading the usermode base offset is unavailable. Bug 200145225 Bug 200467197 Change-Id: I8855bb54a6456eb63b69559c84398f7eeaec3513 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1951524 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-22 20:14:04 -08:00
Sagar Kamble	fd332ca6b4	gpu: nvgpu: s/_flcn_/_falcon_ There is mixed usage of falcon & flcn in function and data types. Let's update all with "falcon" for consistency with file names. JIRA NVGPU-1459 Change-Id: I02dbc866ce2cca009f2e8b87cfe11a919ec10749 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1953793 Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-21 23:04:36 -08:00
Alex Waterman	998f13dc8a	gpu: nvgpu: Unified VA space for dGPUs Enable the unified address space flag for all dGPUs. Bug 200105199 Change-Id: I082742344f100bf7d27abf0580ddd6134aae8f90 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1955624 Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-21 18:44:26 -08:00
Alex Waterman	2c5f4a54d5	gpu: nvgpu: Unified VA space for gp10b and gv11b Enable the unified address space config for o gp10b o gv11b gm20b is suffering from a problem in a T214 MODS test. This should work for the time being in more recent chips. Also this will increase the soak time these changes get before being released. Other chips (vGPUs, dGPUs) will (possibly) be enabled at a later date. Bug 200105199 Change-Id: I03a6803c6369d89e8a318886fc642b55c5538dd9 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1951858 Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-16 13:13:44 -08:00
Konsta Holtta	0567904ac0	Revert "gpu: nvgpu: Remove pmgr.h dependency from gk20a.h" This reverts commit `2dc48ceba1`. Bug 2443630 JIRA NVGPU-596 Change-Id: Id728c908cd89142245f1708fb423c0fff38ba96d Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1952266 Reviewed-by: Bo Yan <byan@nvidia.com> Tested-by: Bo Yan <byan@nvidia.com>	2018-11-16 11:26:03 -08:00
Seema Khowala	1f54ea09e3	gpu: nvgpu: rename has_timedout and make it thread safe Currently has_timedout variable is protected by wmb at places where it is being set and there is no corresponding rmb whenever has_timedout variable is read. This is prone to errors for concurrent execution. This change is supposed to fix this issue. Rename has_timedout variable of channel struct to ch_timedout. Also to avoid rmb every time ch_timedout is read, ch_timedout_spinlock is added to protect ch_timedout variable for taking care of concurrent execution. Bug 2404865 Bug 2092051 Change-Id: I0bee9f50af0a48720aa8b54cbc3af97ef9f6df00 Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1930935 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 15:35:57 -08:00
smadhavan	503b897b45	gpu: nvgpu: Fix MISRA rule 8.3 violations MISRA rule 8.3 requires that all declarations of a function shall use the same parameter names and type qualifiers. There are cases where the parameter names do not match between function prototype and declaration. This patch will fix some of these violations by renaming the prototype parameter. JIRA NVGPU-847 Change-Id: I980ca7ba8adc853de9c1b6f6c7e7b3e4ac12f88e Signed-off-by: smadhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1926980 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 15:35:47 -08:00
Anup Mahindre	a6138b7810	gpu: nvgpu: Add a characteristics flag to denote FECS tracing support Add a flag to nvgpu_gpu_characteristics to expose FECS tracing capability to userspace. This is required for adding nvrm_gpu APIs for CTXSW set of IOCTLs which were requested in several bugs. nvrm_gpu APIs would query this flag to check the availability of IOCTLs. Bug 2169678 Bug 2169677 Bug 2169675 Bug 2169674 Bug 2169673 Bug 2168342 Change-Id: Ie6ba80a4144637546b97fa93baae67b8d0c4d425 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1950559 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-15 02:53:39 -08:00
Scott Long	e24df49765	gpu: nvgpu: nvgpu_memcpy changes to linux os code MISRA Rule 21.15 prohibits use of memcpy() with incompatible ptrs to qualified/unqualified types. To circumvent this issue we've introduced a new MISRA-compliant nvgpu_memcpy() function. While linux os code does not need to be MISRA-compliant this change switches over all memcpy() uses to nvgpu_memcpy() with appropriate casts applied to maintain consistency within the nvgpu source base. JIRA NVGPU-849 Change-Id: I2c21a7845df5709dafa19508c121f8afa27cc4fc Signed-off-by: Scott Long <scottl@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1950995 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Nitin Kumbhar <nkumbhar@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 21:44:35 -08:00
Terje Bergstrom	2dc48ceba1	gpu: nvgpu: Remove pmgr.h dependency from gk20a.h gk20a.h depends on definition of struct pmgr_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: I7ced14d6629e033b0ccef3a93a3dbf099e43ba4c Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1946662 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:34:06 -08:00
Terje Bergstrom	07760eb9a1	gpu: nvgpu: Remove clk.h dependency from gk20a.h gk20a.h depends on definition of struct clk_pmupstate. Change that to a pointer and use forward declaration, and allocation and free functions. Fix a few build breaks by adding explicit includes where previously a header file had gotten included implicitly. JIRA NVGPU-596 Change-Id: Iafe7d72a6fd31543653e0e10e2d2e552b6c3514b Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1945286 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:33:38 -08:00
Terje Bergstrom	bca27e31e3	gpu: nvgpu: Fix clk_gp106.h and clk_gv100.h headers clk_gp106.h and clk_gv100.h define conflicting symbols, which prevent including them both at the same time. One of the conflicting structs is namemap_cfg, which has different definitions in clk_gp106.h and include/nvgpu/clk.h. Move all constants used only by clk_*.c to be defined there, delete the extra namemap_cfg structure definition, and modify code to cope with the unified namemap_cfg. JIRA NVGPU-596 Change-Id: Id68919da4567ec1507eda0cfaa19bf047a7bfc59 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1945285 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-14 13:33:29 -08:00
Richard Zhao	f2cb8c5d2e	gpu: nvgpu: vgpu: unify fecs trace move fecs_trace_vgpu.c to be common, leaving only few functions os specific. struct gk20a_fecs_trace_header was moved to header, to share with os specific code. Jira EVLR-3275 Change-Id: I372aeb539cbca3abb87e997c9e35e6d682f9cb96 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1831991 GVS: Gerrit_Virtual_Submit Reviewed-by: Aparna Das <aparnad@nvidia.com> Reviewed-by: Nirav Patel <nipatel@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-13 19:13:37 -08:00
Sai Nikhil	94e00ab6ad	gpu: nvgpu: gk20a: fix MISRA 10.4 Violations [1/2] MISRA Rule 10.4 only allows the usage of arithmetic operations on operands of the same essential type category. Adding "U" at the end of the integer literals to have same type of operands when an arithmetic operation is performed. This fixes violation where an arithmetic operation is performed on signed and unsigned int types. JIRA NVGPU-992 Change-Id: Ifb8cb992a5cb9b04440f162918a8ed2ae17ec928 Signed-off-by: Sai Nikhil <snikhil@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1822587 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-09 13:27:08 -08:00
Terje Bergstrom	88e374d5eb	gpu: nvgpu: Move gk20a.c to os/linux gk20a.c is used only in Linux build. It's in theory common code, but in practice implements OS specific policies. Also implement os/posix/gk20a.c to implement gk20a_init_gpu_characteristics(), gk20a_get() and gk20a_put() which are called from common code. JIRA NVGPU-596 Change-Id: I6a6079ca6d4c6a225f0dd0e1cd7c439333a704bf Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1944884 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:44:18 -08:00
Alex Waterman	032b37bee5	gpu: nvgpu: Update debug crash dump Update the debug crash dump to be clearer, more concise and avoid many of the misformatting issues that have crept in over the last couple years. This also changes the debug prints to move from pr_err() in the Linux kernel to nvgpu_err(). This makes it easier to filter all nvgpu messages in a log file with a single grep command. Change-Id: I00ca9e6c32da7a79c8f6903a139bf6b43e89618a Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1940515 GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:42:38 -08:00
Alex Waterman	ac5763eb0c	gpu: nvgpu: Re-order the debug output Originally the order for output was: 1. Dump platform deps (sync-points/host1x stuff) 2. Dump PBDMA status 3. Dump engine status 4. Dump channel status The updated ordering is: 1. Dump channel status 2. Dump PBDMA status 3. Dump engine status 4. Dump platform deps (sync-points/host1x stuff) The purpose of this is to put the useful information first and relegate the less useful info to later in the dump. We naturally scan downwards and treat stuff at the top as most important. The end goal is to make the debug dump as useful in as little time as possible. So instead of making an engineer dig through a complex jumble of information to find the useful stuff the hope is that the useful stuff is immediately available. Change-Id: I9d2b755676b7e5dc2f8949f14dc36f3d337e2a3f Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1940514 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:42:34 -08:00
Alex Waterman	7222826680	gpu: nvgpu: Return bool from nvgpu_log_mask_enabled This function returns a boolean describing if a given log mask is enabled for a given GPU. Previously this returned an int but the bool type is far better suited for this. Also implement this function in posix, as it may be useful to have implemented there if any common code chooses to use this function. Change-Id: I7382e73df83282763df1bdbccbbb219c9f3e6f1b Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1938341 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 21:42:14 -08:00
Terje Bergstrom	7525c1337b	gpu: nvgpu: Remove the GPU-NEXT conditional Remove build conditional for GPU-NEXT. It was used for including code for tu104, but now it's part of main nvgpu. Leave a TURING conditional to not need Turing code in other builds. JIRA NVGPU-961 Change-Id: I74177863c451d78b6db6165249561f15eadc3cc3 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1936803 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 19:35:09 -08:00
Amurthyreddy	710aab6ba4	gpu: nvgpu: MISRA 14.4 boolean fixes MISRA rule 14.4 doesn't allow the usage of non-boolean variable as boolean in the controlling expression of an if statement or an iteration statement. Fix violations where a non-boolean variable is used as a boolean in the controlling expression of if and loop statements. JIRA NVGPU-1022 Change-Id: I957f8ca1fa0eb00928c476960da1e6e420781c09 Signed-off-by: Amurthyreddy <amurthyreddy@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1941002 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-07 10:35:13 -08:00

213 Commits