linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 09:57:08 +03:00

Author	SHA1	Message	Date
Deepak Nibade	0aa1ef7c0c	gpu: nvgpu: move gv11b fecs_trace HAL to gr/fecs trace unit Rename gv11b/fecs_trace_gv11b.* files to common/gr/fecs_trace/fecs_trace_gv11b.* Also move HAL API gk20a_fecs_trace_get_buffer_full_mailbox_val() to gr/fecs_trace unit and rename it as gm20b_fecs_trace_get_buffer_full_mailbox_val() Protect gm20b/gv11b HAL code under CONFIG_GK20A_CTXSW_TRACE Remove tu104/fecs_trace_tu104.* since tu104 will re-use gv11b HAL Fix g->ops.fecs_trace.get_buffer_full_mailbox_val() for vgpu/gv11b and use gv11b HAL Jira NVGPU-1880 Change-Id: If78480e36be4e5f0fd659019518f233d8805486d Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2029259 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-05 02:49:16 -08:00
Deepak Nibade	11757aabbd	gpu: nvgpu: move poll and trace read APIs to gr/fecs_trace Remove below calls from fecs_trace_gk20a.c gk20a_fecs_trace_ring_read() gk20a_fecs_trace_poll() gk20a_fecs_trace_periodic_polling() gk20a_fecs_trace_reset() And move them to common gr/fecs_trace unit with below renames nvgpu_gr_fecs_trace_ring_read() nvgpu_gr_fecs_trace_poll() nvgpu_gr_fecs_trace_periodic_polling() nvgpu_gr_fecs_trace_reset() Also update above calls to support QNX use cases by adding vm_update_mask as a parameter Add below HALs for QNX support. These HALs will not be set for linux g->ops.fecs_trace.vm_dev_write() g->ops.fecs_trace.vm_dev_update() Jira NVGPU-1880 Change-Id: Idc305b9288a1df5ca86622b95d6e62a23fdfde7e Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2029258 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-03-05 02:49:01 -08:00
Deepak Nibade	73d62c0c52	gpu: nvgpu: move fecs_trace_enable/disable APIs to gr/fecs_trace Move below APIs from gk20a/fecs_trace_gk20a.c gk20a_fecs_trace_enable() gk20a_fecs_trace_disable() gk20a_fecs_trace_is_enabled() gk20a_fecs_trace_reset_buffer() gk20a_fecs_trace_buffer_size() gk20a_gr_max_entries() and move them to new gr/fecs_trace unit with below renames nvgpu_gr_fecs_trace_enable() nvgpu_gr_fecs_trace_disable() nvgpu_gr_fecs_trace_is_enabled() nvgpu_gr_fecs_trace_reset_buffer() nvgpu_gr_fecs_trace_buffer_size() nvgpu_gr_fecs_trace_max_entries() Use new functions in the driver instead of old ones Export gk20a_fecs_trace_periodic_polling() in fecs_trace_gk20a.h header since it is needed in gr/fecs_trace for transition This include and the function itself will be later moved to gr/fecs_trace unit Move struct nvgpu_gpu_ctxsw_trace_filter and all filter TSG macros in the form NVGPU_GPU_CTXSW_TAG_* to gr/fecs_trace.h Jira NVGPU-1880 Change-Id: Ic95b99554e626033a111452f311bbc026ec604e2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2027530 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-27 02:47:11 -08:00
Deepak Nibade	2104ded777	gpu: nvgpu: move struct gk20a_fecs_trace to gr/fecs_trace unit Move struct gk20a_fecs_trace to new gr/fecs_trace unit and rename it as struct nvgpu_gr_fecs_trace Add enable_lock mutex and enable_count to this structure to support QNX use cases Remove init field from struct gk20a_fecs_trace Rename gk20a_fecs_trace_init() to nvgpu_gr_fecs_trace_init() and move it to new unit Rename gk20a_fecs_trace_deinit() to nvgpu_gr_fecs_trace_deinit() and move it to new unit Update gk20a_fecs_trace_enable() to start thread only when enable_count == 1, otherwise we just increment enable_count Update gk20a_fecs_trace_disable() to stop thread when enable_count == 0, otherwise we just decrement enable_count Before this patch struct gk20a_fecs_trace was not visible in new unit, and hence all mutex_acquire for list_lock were done in fecs_trace_gk20a.c file Since new struct is now available in new unit, move mutex_lock/release calls to gr/fecs_trace unit now Jira NVGPU-1880 Change-Id: I5abfa0165fa1c31716f3d6f2f669284f8959d7cf Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/2024562 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-by: Bharat Nihalani <bnihalani@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-02-25 03:45:59 -08:00
Aparna Das	d4f1a138dc	gpu: nvgpu: add vmid param to fecs trace bind_channel OS specific implementation of fecs trace bind_channel function needs to handle special case for vserver to retrieve vmid from channel id. Native code should be independent of server code. Modify struct fecs_trace member function bind_channel to pass vmid parameter enabling retrieving and passing vmid from server code. Jira GVSCI-44 Change-Id: I96223376f2068e2cbf60a9c9b35ff564a65e5dc3 Signed-off-by: Aparna Das <aparnad@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970693 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2019-01-04 11:15:06 -08:00
Deepak Nibade	bb677160e5	gpu: nvgpu: check tu104 specific timestamp buffer full error code In gk20a_gr_handle_fecs_error(), we right now check the error code in mailbox to identify if we hit timestamp buffer full error interrupt This error code right now is hard coded to 0x26 But on Turing ucode this error code is set to 0x32 Add new HAL g->ops.fecs_trace.get_buffer_full_mailbox_val() to get correct error code per platform and use this in gk20a_gr_handle_fecs_error() Bug 200471541 Bug 2469604 Change-Id: I7325354b39d35b1c8b218e554814316d22950469 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1978144 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-31 09:43:39 -08:00
Seema Khowala	790ba09554	gpu: nvgpu: handle timestamp buffer full ctxsw_intr0 If enabled, fecs trace updating happens from ucode side even when there is no fecs trace dumper application to consume it. Due to this, trace buffer will get eventually full and ucode will trigger ctxsw_intr0. Reset fecs_trace buffer to handle timestamp buffer full ctxsw_intr0. Bug 2361571 Bug 200472922 Change-Id: Ia26a17635fc6bd6e8663b8af983acc91839ecfcd Signed-off-by: Seema Khowala <seemaj@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1965370 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-07 14:54:02 -08:00
Konsta Holtta	ec87761b7d	gpu: nvgpu: pass gr ctx to fecs_trace_bind_channel Simplify object ownership by passing the gr_ctx around directly instead of reading from tsg via a channel; the caller holds the gr_ctx already. Jira NVGPU-1149 Change-Id: I2a1c96f88c4eac6493c83ac17b51af1c680e5418 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1822026 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-23 04:30:28 -08:00
smadhavan	c4ac750e98	nvgpu: gk20a: MISRA Rule 21.2 header guard fixes MISRA rule 21.2 doesn't allow the use of macro names which start with an underscore. These leading underscores are to be removed from the macro names. This patch will fix such violations caused by include guards by renaming them to follow the convention, 'NVGPU_PARENT-DIR_HEADER_H' JIRA NVGPU-1028 Change-Id: I478be317d067a75cdc8cb7fe9577a66d06318a11 Signed-off-by: smadhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1813068 GVS: Gerrit_Virtual_Submit Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-09-06 16:15:30 -07:00
Vaibhav Kachore	7bf80a1c69	gpu: nvgpu: remove use of NVGPU_CTXSW_FILTER_ISSET - Remove the usage of NVGPU_CTXSW_FILTER_ISSET splattered across nvgpu, and replace with a MACRO defined in common code. The usage is still inside Linux, but this helps the subsequent unification efforts, e.g. to unify the fecs trace path. - Remove "uapi/linux/nvgpu.h" from common code. EVLR-3078 Change-Id: I60b0e1627576a4b255671d58530d8c773ea6154c Signed-off-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1803210 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-08-31 12:16:19 -07:00
Vaibhav Kachore	ca3215c6b2	gpu: nvgpu: add support for FECS VA - On t186, ucode expects physical address to be programmed for FECS trace buffer. - On t194, ucode expects GPU VA to be programmed for FECS trace buffer. This patch adds extra support to handle this change for linux native. - Increase the size of FECS trace buffer (as few entries were getting dropped due to overflow of FECS trace buffer.) - This moves FECS trace buffer handling in global context buffer. - This adds extra check for updation of mailbox1 register. (Bug 200417403) EVLR-2077 Change-Id: I7c3324ce9341976a1375e0afe6c53c424a053723 Signed-off-by: Vaibhav Kachore <vkachore@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1536028 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Nirav Patel <nipatel@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-06-14 06:44:08 -07:00
Terje Bergstrom	7885500a42	gpu: nvgpu: Change license for common files to MIT Change license of OS independent source code files to MIT. JIRA NVGPU-218 Change-Id: I1474065f4b552112786974a16cdf076c5179540e Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1565880 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2017-09-26 11:37:32 -07:00
Sunny He	9d37d8b78c	gpu: nvgpu: Reorg fecs_trace HAL initialization Reorganize HAL initialization to remove inheritance and construct the gpu_ops struct at compile time. This patch only covers the fecs_trace sub-module of the gpu_ops struct. Perform HAL function assignments in hal_gxxxx.c through the population of a chip-specific copy of gpu_ops. Jira NVGPU-74 Change-Id: I84485ad64997270c6a0fce3c95dc9ceb0094cf6c Signed-off-by: Sunny He <suhe@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1527419 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>	2017-07-27 16:34:44 -07:00
Anton Vorontsov	1c40d09c4c	gpu: nvgpu: Add support for FECS ctxsw tracing bug 1648908 This commit adds support for FECS ctxsw tracing. Code is compiled conditionnaly under CONFIG_GK20_CTXSW_TRACE. This feature requires an updated FECS ucode that writes one record to a ring buffer on each context switch. On RM/Kernel side, the GPU driver reads records from the master ring buffer and generates trace entries into a user-facing VM ring buffer. For each record in the master ring buffer, RM/Kernel has to retrieve the vmid+pid of the user process that submitted related work. Features currently implemented: - master ring buffer allocation - debugfs to dump master ring buffer - FECS record per context switch (with both current and new contexts) - dedicated device for ctxsw tracing (access to VM ring buffer) - SOF generation (and access to PTIMER) - VM ring buffer allocation, and reconfiguration - enable/disable tracing at user level - event-based trace filtering - context_ptr to vmid+pid mapping - read system call for ctxsw dev - mmap system call for ctxsw dev (direct access to VM ring buffer) - poll system call for ctxsw dev - save/restore register on ELPG/CG6 - separate user ring from FECS ring handling Features requiring ucode changes: - enable/disable tracing at FECS level - actual busy time on engine (bug 1642354) - master ring buffer threshold interrupt (P1) - API for GPU to CPU timestamp conversion (P1) - vmid/pid/uid based filtering (P1) Change-Id: I8e39c648221ee0fa09d5df8524b03dca83fe24f3 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: http://git-master/r/1022737 GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>	2016-03-23 07:48:47 -07:00

14 Commits