linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-22 17:36:20 +03:00

Author	SHA1	Message	Date
Shashank Singh	9512b9f1de	gpu: nvgpu: remove user managed addr space capability flag Remove NVGPU_GPU_IOCTL_ALLOC_AS_FLAGS_USERSPACE_MANAGED and NVGPU_AS_ALLOC_USERSPACE_MANAGED flags which are used for supporting userspace managed address-space. This functionality is not fully implemented in the kernel, nor is it going to be implemented in the near future. Jira NVGPU-9832 Bug 4034184 Change-Id: I3787d92c44682b02d440e52c7a0c8c0553742dcc Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2882168 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2023-05-04 11:39:30 -07:00
Martin Radev	6249220e09	gpu: nvgpu: fix nvgpu_css_allocate_perfmon_ids This patch fixes nvgpu_css_allocate_perfmon_ids which leads to a buffer overflow if the allocation of perfmon ids does not succeed. If the allocation of perfmon ids cannot be satisfied, bitmap_find... would return CSS_MAX_PERFMON_IDS and nvgpu_bitmap_set would still be called with start after the bitmap array. This results in a buffer overflow. Bug 3814963 Change-Id: I4caff36cf0c920b4445e1841d16ba2b4c3d19aaa Signed-off-by: Martin Radev <mradev@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2786747 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Prateek Sethi <prsethi@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-10-05 20:13:27 -07:00
Sagar Kamble	693305c0fd	gpu: nvgpu: subcontext add/remove support Subcontext PDBs and valid mask in the instance blocks of the channels in various subcontexts has to be updated when new subcontext is created or a subcontext is removed. Replayable fault state is cached in the channel structure. Replayable fault state for subcontext is set based on first channel's bind parameter. It was earlier programmed in function channel_setup_ramfc. init_inst_block_core is updated to setup TSG level pdb map and mask. Added new hal gv11b_channel_bind to enable the subcontext on channel bind. Bug 3677982 Change-Id: I58156c5b3ab6309b6a4b8e72b0e798d6a39c1bee Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2719994 Reviewed-by: Ankur Kishore <ankkishore@nvidia.com> GVS: Gerrit_Virtual_Submit <buildbot_gerritrpt@nvidia.com>	2022-09-08 21:00:20 -07:00
Sagar Kamble	5b55088970	gpu: nvgpu: skip subctx pdb init during as-channel bind While creating a new channel, ioctls are called in the below sequence: 1. GPU_IOCTL_OPEN_CHANNEL 2. AS_IOCTL_BIND_CHANNEL 3. TSG_IOCTL_BIND_CHANNEL_EX 4. CHANNEL_ALLOC_GPFIFO_EX 5. CHANNEL_ALLOC_OBJ_CTX. subctx pdbs and valid mask are programmed in the channel instance block in the channel ioctls AS_IOCTL_BIND_CHANNEL & CHANNEL_ALLOC_GPFIFO_EX. Programming them in the ioctl AS_IOCTL_BIND_CHANNEL is redundant. Remove related hal g->ops.mm.init_inst_block_for_subctxs. The hal init_inst_block will program context pdb and big page size. The hal init_inst_block_core will program context pdb, big page size and subctx 0 pdb. This is used by h/w units (fecs, pmu, hwpm, bar1, bar2, sec2, gsp, perfbuf etc.). For user channels, subctx pdbs are programmed as part of ramfc setup. Bug 3677982 Change-Id: I6656b002d513404c1fd7c3d349933e80cca7e604 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2680907 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-06-28 23:33:31 -07:00
Antony Clince Alex	c0f4723339	gpu: nvgpu: perbuf: update PMA buffer mapping The PMA unit can only access GPU VAs within a 4GB window, hence both the user allocated PMA buffer and the kernel allocated bytes available buffer should lie in the same 4GB window. This is accomplished by carving out and reserving a 4GB VA space in perbuf.vm and using fixed GPU VAs to ensure that both buffers are bound within the same 4GB window. In addition, update ALLOC_PMA_STREAM to use pma_buffer_offset, pma_buffer_map_size fields correctly. Bug 3503708 Change-Id: Ic5297a22c2db42b18ff5e676d565d3be3c1cd780 Signed-off-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2671637 Reviewed-by: svcacv <svcacv@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Vaibhav Kachore <vkachore@nvidia.com> GVS: Gerrit_Virtual_Submit	2022-03-07 15:17:35 -08:00
Richard Zhao	9ab1271269	gpu: nvgpu: common: fix compile error of new compile flags This prepares for adding the below CFLAGS: -Werror -Wall -Wextra \ -Wmissing-braces -Wpointer-arith -Wundef \ -Wconversion -Wsign-conversion \ -Wformat-security \ -Wmissing-declarations -Wredundant-decls -Wimplicit-fallthrough Jira GVSCI-11640 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: Ia8f508c65071aa4775d71b8ee5dbf88a33b5cbd5 Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2555056 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2022-01-13 12:36:14 -08:00
Konsta Hölttä	f4ec400d5f	gpu: nvgpu: simplify nvgpu_timeout_init nvgpu_timeout_init() returns an error code only when the flags parameter is invalid. There are very few possible values for flags, so extract the two most common cases - cpu clock based and a retry based timeout - to functions that cannot fail and thus return nothing. Adjust all callers to use those, simplifying error handling quite a bit. Change-Id: I985fe7fa988ebbae25601d15cf57fd48eda0c677 Signed-off-by: Konsta Hölttä <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2613833 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-10-26 13:47:32 -07:00
Richard Zhao	1685a2404f	gpu: nvgpu: vgpu: add b0cc profiler support - added new commands to bind/unbind hwpm/hwpm_streamout/smpc - added new command to update get/put for PMA buffer - tune function nvgpu_perfbuf_update_get_put so it could be reused on server side. - enable profiler v2 device for gv11b Jira GVSCI-10351 Signed-off-by: Richard Zhao <rizhao@nvidia.com> Change-Id: I4226c89ec3040e53dee5381ac8a30c9fd598e5ef Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2537683 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc_kernel_abi <svc_kernel_abi@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2021-06-08 01:30:03 -07:00
Vedashree Vidwans	00d1e10ff2	gpu: nvgpu: accept small_big_split in vm_init Currently, when unified address space is not requested, nvgpu_vm_init splits user vm at a fixed address of 56G. Modify nvgpu_vm_init to allow user to specify small big page vm split. JIRA NVGPU-5302 Change-Id: I6ed33a4dc080f10a723cb9bd486f0d36c0cee0e9 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2428326 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Sami Kiminki <skiminki@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: Sami Kiminki <skiminki@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	c6aae8c049	gpu: nvgpu: use fixed address mapping for pma byte buffer Use fixed address mapping for pma byte buffer so that the address of this buffer always fits in 32 bits. This also requires to move unmap sequence to OS specific function since different unmap API is now needed for linux and QNX. Also call nvgpu_prof_free_pma_stream_priv_data() before nvgpu_profiler_free_pma_stream() since former uses mm->perfbuf which is released in later. Bug 2510974 Jira NVGPU-5360 Change-Id: I398b0ca4f96527d6e09c9aacacb4b43c90f5bfc9 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2424691 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	673cd507a8	gpu: nvgpu: add mm gops to get default va size Currently, default va aperture size, user size and kernel size are defined as fixed macros. However, max va bits can be chip specific. Add below mm gops API to obtain default aperture, user and/or kernel virtual memory size. void (get_default_va_sizes)(u64 aperture_size, u64 user_size, u64 kernel_size); JIRA NVGPU-5302 Change-Id: Ie0c60ca08ecff6613ce44184153bda066803d7d9 Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2414840 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	9e94e118fe	gpu: nvgpu: ensure pma byte buffer address fits in 32 bits Right now PMA byte buffer address is allocated in the range of 0x1ffc010000. The register that stores this address is only 32-bit and there is no corresponding _hi() register, so the address must fit in 32 bits. Update nvgpu_vm_init() parameters in nvgpu_perfbuf_init_vm() so that a low_hole of only 4K is used. This allows the address to be allocated in the range of 0x4000000. Also map byte buffer before PMA stream buffer so that byte buffer always gets lower address. There is only one PMA stream buffer allowed to be mapped right now so this works for now. But in future multiple buffers can be mapped and this solution needs to be reworked. Bug 2510974 Jira NVGPU-5360 Change-Id: Ief1a9ee54d554e3bc13c7a9567934dcbeaefbcc6 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2418520 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	221475f753	gpu: nvgpu: add profiler apis to manage PMA stream Support new IOCTL to manage PMA stream meta data by adding below API nvgpu_prof_ioctl_pma_stream_update_get_put() Add nvgpu_perfbuf_update_get_put() to handle all the updates coming from userspace and to pass all required information. Add gops.perf.update_get_put() to handle all HW accesses required in perf HW unit. Add gops.perf.bind_mem_bytes_buffer_addr() to bind the available bytes buffer while binding HWPM streamout. Bug 2510974 Jira NVGPU-5360 Change-Id: Ibacc2299b845e47776babc081759dfc4afde34fe Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2406484 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-mobile-cert <svc-mobile-cert@nvidia.com> Reviewed-by: Antony Clince Alex <aalex@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Vedashree Vidwans	49c9f0c137	gpu: nvgpu: accept user vma size in vm init Modify nvgpu_vm_init to accept low_hole, user_reserved and kernel_reserved. This will simplify argument limit checks and make code more legible. JIRA NVGPU-5302 Change-Id: I62773dd7b06264a3b6cb8896239b24c49fa69f9b Signed-off-by: Vedashree Vidwans <vvidwans@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2394901 Reviewed-by: automaticguardword <automaticguardword@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> GVS: Gerrit_Virtual_Submit Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Deepak Nibade	f34711d3de	gpu: nvgpu: split perfbuf initialization gk20a_perfbuf_map() allocates perfbuf VM, maps the user buffer into new VM, and then triggers gops.perfbuf.perfbuf_enable(). This HAL then does the following: - Allocate perfbuf instance block - Initialize perfbuf instance block - Reset stream buffer - Program instance block address in PMA registers - Program user buffer address into PMA registers The new profiler interface will have its own API to set up the PMA stream, and it requires the above setup to be done in two phases: perfbuf initialization and then user buffer setup. Split above functionalities into below functions - nvgpu_perfbuf_init_vm() - Allocate perfbuf VM - Call gops.perfbuf.init_inst_block() to initialize perfbuf instance block - gops.perfbuf.init_inst_block() - Allocate perfbuf instance block - Initialize perfbuf instance block - Program instance block address in PMA registers using gops.perf.init_inst_block() - In case of vGPU, trigger TEGRA_VGPU_CMD_PERFBUF_INST_BLOCK_MGT command to gpu server - gops.perf.init_inst_block() - Reset stream buffer - Program user buffer address into PMA registers Also add corresponding cleanup functions as below : gops.perf.deinit_inst_block() gops.perfbuf.deinit_inst_block() nvgpu_perfbuf_deinit_vm() Bug 2510974 Jira NVGPU-5360 Change-Id: I486370f21012cbb7fea84fe46fb16db95bc16790 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2372984 Tested-by: mobile promotions <svcmobile_promotions@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:13:28 -06:00
Philip Elcan	06fd513e1e	gpu: nvgpu: move common.unit into common.mc nvgpu.common.unit was just an enum used for passing to nvgpu.common.mc APIs. So, move the enum into mc.h, and replace the include of unit.h with mc.h where appropriate. And update the yaml arch. JIRA NVGPU-4144 Change-Id: I210ea4d3b49cd494e43add1b52f3fbcdb020a1e3 Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2216106 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:10:29 -06:00
Shashank Singh	6fd0d972ae	nvgpu: gpu: include qnx_init unit in doxygen documentation -Include qnx_init unit in doxygen documentation. -Add documentation for gk20a_busy/idle and similar functions. -Remove must_check return value as misra already reports violation for that. Jira NVGPU-2571 Change-Id: I9573cb61865677944809dcc494d92f63cc6e0f58 Signed-off-by: Shashank Singh <shashsingh@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2176755 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2020-12-15 14:05:52 -06:00
Debarshi Dutta	17486ec1f6	gpu: nvgpu: rename tsg_gk20a and channel_gk20a structs rename struct tsg_gk20a to struct nvgpu_tsg and rename struct channel_gk20a to struct nvgpu_channel Jira NVGPU-3248 Change-Id: I2a227347d249f9eea59223d82f09eae23dfc1306 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2112424 GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-06 02:56:53 -07:00
Alex Waterman	c71e764348	gpu: nvgpu: Remove alloc_inst_block from mm HAL The alloc_inst_block() function in the MM HAL is not a HAL. It does not abstract any HW accesses; instead it just wraps a dma allocation. As such remove it from the HAL and move the single gk20a implementation to common/mm/mm.c as nvgpu_alloc_inst_block(). JIRA NVGPU-2042 Change-Id: I0a586800a11cd230ca43b85f94a35de107f5d1e1 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2109049 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-05-01 15:06:42 -07:00
Deepak Nibade	c474f7c288	gpu: nvgpu: add CSS hal to get max buffer size Currently max_css_buffer_size is incorrectly stored in struct nvgpu_gr Add a new hal g->ops.css.get_max_buffer_size() to get the size and remove the variable from struct nvgpu_gr Jira NVGPU-3125 Change-Id: If78fd86559526b84031051e281a98327a46fc11d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2105652 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-26 09:37:12 -07:00
Deepak Nibade	1533951567	gpu: nvgpu: move cyclestats_snapshot data to struct gk20a cyclestats_snapshot data and lock is right now stored in struct nvgpu_gr Use case itself is not specific to GR engine but in general it applies to other units outside of GR too. Hence it makes sense to move both data and lock to struct gk20a instead of keeping them in struct nvgpu_gr Update all cyclestats_snapshot code to refer data/lock from struct gk20a Remove gr_priv.h header include from cyclestats_snapshot.c Some of the functions were mistakenly declared in gr_gk20a.h. Move them to cyclestats_snapshot.h and rename them to form nvgpu_css_*() Jira NVGPU-1103 Change-Id: I3fb32fe96f0ca6613f4640c8bd227b9e0e02dca3 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2104848 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-25 07:57:34 -07:00
Vinod G	dc82262b99	gpu: nvgpu: Add gr_priv header file Move nvgpu_gr structure to private file gr_priv.h Include the private file where gr variables are used. JIRA NVGPU-3132 JIRA NVGPU-3079 Change-Id: Ib26ca5c5cb25fd8dd013a7c643278efc34aa55d4 Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2098021 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-22 03:15:09 -07:00
Vinod G	556e139077	gpu: nvgpu: Cleanup for gr_gk20a header Removed unused struct from gr_gk20a.h Change static allocation for struct gr_gk20a to dynamic type. Change all the files that being affected by that change. Call gr allocation from corresponding init_support functions, which are part of the probe functions. nvgpu_pci_init_support in pci.c vgpu_init_support in vgpu_linux.c gk20a_init_support in module.c Call gr free before the gk20a free call in nvgpu_free_gk20a. Rename struct gr_gk20a to struct nvgpu_gr JIRA NVGPU-3132 Change-Id: Ief5e664521f141c7378c4044ed0df5f03ba06fca Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2095798 Reviewed-by: Seshendra Gadagottu <sgadagottu@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-19 00:04:00 -07:00
Seshendra Gadagottu	6fb37f04ca	gpu: nvgpu: move chip specific perf files to hal Moved chip specific perf related files to hal from common. JIRA NVGPU-3131 Change-Id: I4fb1b6a619712ee82e20b2b08a76ad5361e690c1 Signed-off-by: Seshendra Gadagottu <sgadagottu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2095563 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-11 18:53:55 -07:00
Thomas Fleury	4ef4939797	gpu: nvgpu: add base_shift and alloc_size ramin HALs Added the following HALs - ramin.base_shift - ramin.alloc_base Use above HALs in mm, instead of using hw definitions. Defined nvgpu_inst_block_ptr to - get inst_block address, - shift it by base_shift - assert upper 32 bits are 0 - return lower 32 bits Added missing #include for <nvgpu/mm.h> Jira NVGPU-3015 Change-Id: I558a6f4c9fbc6873a5b71f1557ea9ad8eae2778f Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077840 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-07 15:54:10 -07:00
Philip Elcan	8efcf68017	gpu: nvgpu: perf: fix MISRA 10.3 violations MISRA Rule 10.3 prohibits assignment of objects of different essential or narrower type. This fixes a number of MISRA 10.3 violations in the common/perf unit. JIRA: NVGPU-3023 Change-Id: I7edc51c62649b8e642c22ee911bc57d67b388000 Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2084044 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-04-01 15:54:17 -07:00
Philip Elcan	257115e06c	gpu: nvgpu: add nvgpu_bitmap_set and nvgpu_bitmap_clear Introduce nvgpu_bitmap_set() and nvgpu_bitmap_clear() APIs to wrap the bitmap_set() and bitmap_clear() APIs, respectively. The new nvgpu_* versions accept unsigned length parameters since length is logically an unsigned value where bitmap_set and bitmap_clear accept signed values. We inherit bitmap_set and bitmap_clear from the OS, so we can't directly change those. Also, change uses of the old APIs to the new ones. These changes resolve MISRA Rule 10.3 violations for implicit assignment of objects of different essential or narrower type. JIRA NVGPU-2953 Change-Id: I2c8f790049232a791f248b350c485bb07452315b Signed-off-by: Philip Elcan <pelcan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2077624 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-25 13:35:24 -07:00
Deepak Nibade	278ec56a6f	gpu: nvgpu: move ops.gr.get_pmm_per_chiplet_offset() HAL to PERF unit g->ops.gr.get_pmm_per_chiplet_offset() HAL operation accesses registers owned by PERF unit, hence move this HAL to PERF unit as g->ops.perf.get_pmm_per_chiplet_offset() Jira NVGPU-2894 Change-Id: I8fb1160329829dc50326aad4b7d1a59e088bba9f Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2030693 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-05 02:50:21 -08:00
Adeel Raza	d828e013db	gpu: nvgpu: common: MISRA rule 15.6 fixes MISRA rule 15.6 requires that all if/else/loop blocks should be enclosed by brackets. This patch adds brackets to single line if/else/loop blocks in the common directory. JIRA NVGPU-775 Change-Id: I0dfb38dbf256d49bc0391d889d9fbe5e21da5641 Signed-off-by: Adeel Raza <araza@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2011655 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-05 19:23:47 -08:00
Nicolas Benech	cb2a05dd92	gpu: nvgpu: Fix LibC MISRA 17.7 in common MISRA Rule-17.7 requires the return value of all functions to be used. Fix is either to use the return value or change the function to return void. This patch contains fix for all 17.7 violations in standard C functions in common code. JIRA NVGPU-1036 Change-Id: Id6dea92df371e71b22b54cd7a521fc22812f9b69 Signed-off-by: Nicolas Benech <nbenech@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1929899 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-01 17:15:37 -07:00
Debarshi Dutta	6c8be7cfe2	gpu: nvgpu: move header location of gk20a.h Change path corresponding to gk20a.h to <nvgpu/gk20a.h> corresponding to files in the following directories. gk20a/ vgpu/ gv100/ tu104/ common/bus/ common/fb/ common/ltc/ common/mc/ common/perf/ Jira NVGPU-597 Change-Id: I7b4f5e5ea3d13a4d1810c5db35fbc26fe5da443e Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1846826 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-24 23:16:10 -07:00
Amurthyreddy	88d21daedb	gpu: nvgpu: MISRA 14.4 err/ret/status as boolean MISRA rule 14.4 doesn't allow the usage of integer types as booleans in the controlling expression of an if statement or an iteration statement. Fix violations where the integer variables err, ret, status are used as booleans in the controlling expression of if and loop statements. JIRA NVGPU-1019 Change-Id: Ia2ec5f1db3c7a1884efe5ba7b8b4d9ebbd021734 Signed-off-by: Amurthyreddy <amurthyreddy@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1921373 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-24 17:00:11 -07:00
Amurthyreddy	c114b9e77e	gpu: nvgpu: MISRA 14.4 err/ret/status as boolean MISRA rule 14.4 doesn't allow the usage of integer types as booleans in the controlling expression of an if statement or an iteration statement Fix violations where the integer variables err, ret, status are used as booleans in the controlling expression of if and loop statements. JIRA NVGPU-1019 Change-Id: Ia950828797b8eff4bc754269ea2d9fa272f59436 Signed-off-by: Amurthyreddy <amurthyreddy@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1919111 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Scott Long <scottl@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:11 +05:30
Deepak Nibade	92c1949392	gpu: nvgpu: add separate unit for cyclestats_snapshot Add new separate unit common/perf/cyclestats_snapshot.c and add corresponding header file include/nvgpu/cyclestats_snapshot.h This unit is h/w independent and simply calls gops.perf.* HALs exposed by perf unit to do the h/w configurations Also remove gv11b/css_gr_gv11b.* files as h/w specific sequence implemented in them is already moved to perf unit Rename all cyclestats_snapshot HALs in the form nvgpu_css_*() Jira NVGPU-1103 Change-Id: I303f6becb313ac918e06c495a5fe299947a1f0b1 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1916652 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:11 +05:30
Deepak Nibade	362f9a7371	gpu: nvgpu: use MC hals to get MC reset mask in perf unit In common/perf/perf_*.c we right now include MC h/w headers to get MC reset mask for perfmon Use MC HAL gops.mc.reset_mask() to get reset mask instead of using direct MC register headers Jira NVGPU-1102 Change-Id: I06f0b3638775c07c9e4e7ac80efd479871322d50 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1852620 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30
Deepak Nibade	412c9fa30c	gpu: nvgpu: add separate unit for perfbuf Add separate unit for perfbuf in common/perf/perfbuf.c which does not need to include any h/w file. This unit will utilize HALs exported by perf_*.c units for h/w accesses. Add corresponding header file at include/nvgpu/perfbuf.h Add new HAL gops.perfbuf with below operations : gops.perfbuf.perfbuf_enable() gops.perfbuf.perfbuf_disable() Remove below debug session specific HALs gops.dbg_session_ops.perfbuffer_enable() gops.dbg_session_ops.perfbuffer_disable() Delete file gv11b/dbg_gpu_gv11b.c since it is no longer needed now as it was only including perfbuf sequence Also remove perfbuf sequences from gk20a/dbg_gpu_gk20a.c Jira NVGPU-1102 Change-Id: I57b87c9f0dcd85784f8002bc92728b6d78a68d98 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1819303 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30
Deepak Nibade	71a4ca9935	gpu: nvgpu: add separate unit for perf Add separate unit for perf under common/perf/ to provide accesses to h/w unit hw_perf_*_.c Implement below HALs in gm20b and gv11b specific h/w files and set them to appropriate chips gops.perf.enable_membuf() gops.perf.disable_membuf() gops.perf.membuf_reset_streaming() gops.perf.get_membuf_pending_bytes() gops.perf.set_membuf_handled_bytes() gops.perf.get_membuf_overflow_status() Jira NVGPU-1102 Change-Id: I161990fdb7283f33c0fb2ab6a8051f4bfc3bb181 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1819302 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30

37 Commits
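
Illustration for commit 6249220e09 (fix nvgpu_css_allocate_perfmon_ids): the message describes a search that returns CSS_MAX_PERFMON_IDS when no free range exists, followed by a bitmap set that ran anyway. The C sketch below shows that class of bug and the guard that avoids it. It is not the driver code. The array-of-flags bitmap, the value 256 for CSS_MAX_PERFMON_IDS, and the helpers find_free_run() and allocate_perfmon_ids() are all assumptions made up for this example.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed value for illustration; the driver defines its own limit. */
#define CSS_MAX_PERFMON_IDS 256u

/* Hypothetical stand-in for the driver's bitmap: one flag per perfmon id. */
struct perfmon_ids {
    bool used[CSS_MAX_PERFMON_IDS];
};

/*
 * Return the first index of a run of `count` free ids, or
 * CSS_MAX_PERFMON_IDS when no such run exists ("not found" == size).
 */
static uint32_t find_free_run(const struct perfmon_ids *ids, uint32_t count)
{
    uint32_t run = 0u;

    for (uint32_t i = 0u; i < CSS_MAX_PERFMON_IDS; i++) {
        run = ids->used[i] ? 0u : run + 1u;
        if (run == count) {
            return i + 1u - count;
        }
    }
    return CSS_MAX_PERFMON_IDS;
}

/*
 * Reserve `count` consecutive ids. Returns 0 and stores the first id in
 * *start_out on success, -1 on failure.
 */
static int allocate_perfmon_ids(struct perfmon_ids *ids, uint32_t count,
                                uint32_t *start_out)
{
    uint32_t start;

    if (count == 0u || count > CSS_MAX_PERFMON_IDS) {
        return -1;
    }

    start = find_free_run(ids, count);

    /*
     * The guard: without this check, the loop below would write
     * past the end of ids->used when the search fails.
     */
    if (start >= CSS_MAX_PERFMON_IDS) {
        return -1;
    }

    for (uint32_t i = start; i < start + count; i++) {
        ids->used[i] = true;
    }
    *start_out = start;
    return 0;
}
```

In this sketch a failed search is reported to the caller before any write happens, so the out-of-range start index never reaches the marking loop.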