linux-nvgpu

mirror of git://nv-tegra.nvidia.com/linux-nvgpu.git synced 2025-12-23 18:16:01 +03:00

Author	SHA1	Message	Date
Tejal Kudav	a307b6eb77	gpu: nvgpu: Move nvlink HAL files to common/nvlink Move the nvlink HAL code to unit specific directory as part of nvgpu restructuring. This move is done after removing usage of other unit's hardware headers from nvlink. Also confirmed that no other unit files are including nvlink hardware headers. JIRA NVGPU-966 Change-Id: I301e3f8de37c5792a3e1e799b97e5fdfc131f058 Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1975259 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-21 13:24:19 -08:00
tkudav	3267530f22	gpu: nvgpu: Use device_info parsing HAL for Fifo Update the fifo code to use the HALs exposed by "Top" unit to read data from device_info table. The information for GRAPHICS engine in device_info table is now parsed using the get_device_info HAL from "Top" unit. Copy engine(CE) has multiple entries in the device_info table corresponding to each instance of the engine. Prior to Pascal, each instance of an engine was denoted by different engine type. For example in GM20B, there are engine types like COPY_ENGINE0, COPY_ENGINE1 and so on. In Pascal and chips beyond, a new field called "inst_id" is added and the engine_type is kept the same for different instances of an engine. For example in GP10B, all copy engine entries have same engine type i.e ENGINE_LCE, but different inst_ids. So for Pascal and chips beyond, we use a different HAL to get CE information from device_info table. JIRA NVGPU-1053 Change-Id: Ib40a616d903a5dbef5730678c2ebc3454b8e900d Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969400 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-20 09:26:01 -08:00
tkudav	38f8b3fb00	gpu: nvgpu: Add HALs for device_info table parsing The device_info table is an array of registers which contain engine specific data for engines like CE, graphics, nvdec, ioctrl etc. These registers contain data like intr_enum, reset_enum, pri_base and so on. The Top unit would include HAL to parse this table and get data for a particular engine. Some engines like CE have multiple entries in the device_info table corresponding to each instance of the engine. Prior to Pascal, each instance of an engine was denoted by different engine type. For example in GM20B, there are engine types like COPY_ENGINE0, COPY_ENGINE1 and so on. In Pascal and chips beyond, a new field called "inst_id" is added and the engine_type is kept the same. For example in GP10B, all copy engine entries have same engine type i.e ENGINE_LCE, but different inst_ids. So for Pascal and chips beyond, add HAL to get number of entries corresponding to an engine type. The "get_device_info" HAL will parse a specific instance of the engine using inst_id argument. JIRA NVGPU-1053 Change-Id: Ie3058b1c1bfdd87bfa47e5f037d049d9d50cfc0b Signed-off-by: Tejal Kudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969399 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: svc-mobile-misra <svc-mobile-misra@nvidia.com> Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-20 09:25:57 -08:00
Thomas Fleury	3943f87d69	gpu: nvgpu: userd slab cleanup Follow-up change to rename g->ops.mm.bar1_map (and implementations) to more specific g->ops.mm.bar1_map_userd. Also use nvgpu_big_zalloc() to allocate userd slabs memory descriptors. Bug 2422486 Bug 200474793 Change-Id: Iceff3bd1d34d56d3bb9496c179fff1b876b224ce Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970891 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-17 12:33:43 -08:00
Debarshi Dutta	fcd216e170	gpu: nvgpu: move gk20a_fifo_engines_on_id to ops struct gk20a_fifo_engines_on_id uses H/W headers to return a valid active engine mask. This qualifies the function to be invoked via a struct gpu_ops function pointer instead. Jira NVGPU-1237 Change-Id: Ice30610ef51cf4471b3750f21d38e6648953e9e2 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1970032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:48 -08:00
Debarshi Dutta	7f58347ed9	gpu: nvgpu: move tsg functions to common Any tsg specific functions that do high-level software-centric operations belong to the TSG unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/tsg.c and also rename them to use the prefix nvgpu_tsg_* gk20a_fifo_set_ctx_mmu_error_tsg gk20a_fifo_abort_tsg gk20a_fifo_error_tsg gk20a_fifo_check_tsg_ctxsw_timeout Jira NVGPU-1237 Change-Id: I4e3da821a878d4b4a0a0b53fbb7f4c10f135f58d Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1934299 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:26 -08:00
Debarshi Dutta	57f03e3a20	gpu: nvgpu: move channel functions to common Any channel specific functions having high-level software-centric operations belong to the channel unit and not the FIFO unit. Move the below public functions as well as their dependent static functions to common/fifo/channel.c. Also, rename the functions to use the prefix nvgpu_channel_*. gk20a_fifo_set_ctx_mmu_error_ch gk20a_fifo_error_ch gk20a_fifo_check_ch_ctxsw_timeout Jira NVGPU-1237 Change-Id: Id6b6d69bbed193befbfc4c30ecda1b600d846199 Signed-off-by: Debarshi Dutta <ddutta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1932358 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 21:54:17 -08:00
Konsta Holtta	07993bbbd8	gpu: nvgpu: add runlist_write_state HAL The function gk20a_fifo_sched_disable_rw accesses HW directly. Rename it and add a HAL indirection so that it can be called from chip-independent code. Also fix some trivial MISRA violations in the function. Jira NVGPU-1309 Change-Id: Icf320738d3d1d4baa40257a9da3ca2c6b7fefc0b Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1971274 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 12:06:08 -08:00
Deepak Nibade	fdc15553bc	gpu: nvgpu: add new HAL to initialize preemption mode g->ops.gr.alloc_gr_ctx HAL right now allocates graphics context and also initializes preemption mode for various platforms Separate out a new HAL g->ops.gr.init_ctxsw_preemption_mode that initializes preemption mode and call it from gk20a_alloc_obj_ctx() after context is created g->ops.gr.alloc_gr_ctx now only allocates the context as the name suggests Jira NVGPU-1527 Change-Id: I8a44672d5ab2ebfe315e6334115265e4ee4f24f0 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972254 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 00:35:39 -08:00
Deepak Nibade	6bbcdb51c6	gpu: nvgpu: remove redundant GR ops g->ops.gr.enable_cde_in_fecs and g->ops.gr.update_boosted_ctx are no longer required since we can directly call g->ops.gr.ctxsw_prog.set_cde_enabled and g->ops.gr.ctxsw_prog.set_pmu_options_boost_clock_frequencies respectively remove those functions and the ops Jira NVGPU-1526 Change-Id: Idb0ad5f634e78aac44ec325ba2b7f59c612b29e8 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1972184 GVS: Gerrit_Virtual_Submit Reviewed-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-14 00:35:29 -08:00
Sagar Kamble	cb1c2b7845	gpu: nvgpu: update MINION falcon base addr init Prepare new hal api g->ops.nvlink.falcon_base_addr to get the MINION falcon base address. JIRA NVGPU-1587 Change-Id: I83a38bf78fd582ea715248900587c1e8e209da3c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969433 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:27 -08:00
Sagar Kamble	ccb035c587	gpu: nvgpu: update GSP falcon base addr init GSPLITE falcon base address was being set without invoking hal api. This patch defines gpu_ops.gsp.falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: Id187b34d022f90c09b8762cdab7769323b607cc0 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969432 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:24 -08:00
Sagar Kamble	147d5d9402	gpu: nvgpu: update GPCCS falcon base addr init GPCCS falcon base address was being set without invoking hal api. Remove FALCON_GPCCS_BASE. This patch defines gpu_ops.gr.gpccs_falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: Icfa7a26d1bb2d67c81f05a43f6ce906f59706b3d Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969431 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:20 -08:00
Sagar Kamble	c6fc301a9b	gpu: nvgpu: update FECS falcon base addr init FECS falcon base address was being set without invoking hal api. Remove FALCON_FECS_BASE. This patch defines gpu_ops.gr.fecs_falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: I9c8e60be4ee81a154020c982893725a12ebb72ef Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969430 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:16 -08:00
Sagar Kamble	84b493e644	gpu: nvgpu: update SEC2 falcon base addr init SEC2 falcon base address was being set without invoking hal api. Remove FALCON_SEC_BASE. This patch defines gpu_ops.sec2.falcon_base_addr hal api to get this base address. Also, don't initialize the base for non-supported falcons. JIRA NVGPU-1587 Change-Id: Iad19a9987416076cf9090d30a48ff83369cf73c2 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969429 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:13 -08:00
Sagar Kamble	e6668a163f	gpu: nvgpu: update PMU falcon base addr init PMU falcon base address was being set without invoking hal api. Remove FALCON_PWR_BASE. This patch defines gpu_ops.pmu.falcon_base_addr hal api to get this base address. JIRA NVGPU-1587 Change-Id: I5c3f27e89bdcc775025bc8d4fa9cf0af11ceb002 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969428 Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:09 -08:00
Sagar Kamble	e86949f5a2	gpu: nvgpu: update NVDEC falcon base addr init NVDEC falcon base address was being set without invoking hal api. Remove FALCON_NVDEC_BASE. This patch defines gpu_ops.fb.falcon_base_addr hal api to get this base address. Currently gp106 and tu104 have these implemented. gv100 uses the gp106 hal interface. Also, don't initialize the base for non-supported falcons. JIRA NVGPU-1587 Change-Id: I0be759b8462ede9b85690a70431480afdee9602c Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1969427 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-12 15:14:05 -08:00
Peng Liu	34df003519	gpu: nvgpu: using pmu counters for load estimate PMU counters #0 and #4 are used to count total cycles and busy cycles. These counts are used by podgov to estimate GPU load. PMU idle intr status register is used to monitor overflow. Overflow rarely occurs because frequency governor reads and resets the counters at a high cadence. When overflow occurs, 100% work load is reported to frequency governor. Bug 1963732 Change-Id: I046480ebde162e6eda24577932b96cfd91b77c69 Signed-off-by: Peng Liu <pengliu@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1939547 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 18:22:54 -08:00
Thomas Fleury	7e68e5c83d	gpu: nvgpu: userd slab allocator We had to force allocation of physically contiguous memory for USERD in nvlink case, as a channel's USERD address is computed as an offset from fifo->userd address, and nvlink bypasses SMMU. With 4096 channels, it can become difficult to allocate 2MB of physically contiguous sysmem for USERD on a busy system. PBDMA does not require any sort of packing or contiguous USERD allocation, as each channel has a direct pointer to that channel's 512B USERD region. When BAR1 is supported we only need the GPU VAs to be contiguous, to setup the BAR1 inst block. - Add slab allocator for USERD. - Slabs are allocated in SYSMEM, using PAGE_SIZE for slab size. - Contiguous channels share the same page (16 channels per slab). - ch->userd_mem points to related nvgpu_mem descriptor - ch->userd_offset is the offset from the beginning of the slab - Pre-allocate GPU VAs for the whole BAR1 - Add g->ops.mm.bar1_map() method - gk20a_mm_bar1_map() uses fixed mapping in BAR1 region - vgpu_mm_bar1_map() passes the offset in TEGRA_VGPU_CMD_MAP_BAR1 - TEGRA_VGPU_CMD_MAP_BAR1 is called for each slab. Bug 2422486 Bug 200474793 Change-Id: I202699fe55a454c1fc6d969e7b6196a46256d704 Signed-off-by: Thomas Fleury <tfleury@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959032 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 16:24:10 -08:00
Deepak Nibade	6777bd5ed2	gpu: nvgpu: add separate unit for gr/ctxsw_prog Add separate new unit gr/ctxsw_prog that provides interface to access h/w header files hw_ctxsw_prog_.h Add below chip specific files that access above h/w unit and provide interface through g->ops.gr.ctxsw_prog.() HAL for rest of the units common/gr/ctxsw_prog/ctxsw_prog_gm20b.c common/gr/ctxsw_prog/ctxsw_prog_gp10b.c common/gr/ctxsw_prog/ctxsw_prog_gv11b.c Remove all the h/w header includes from rest of the units and code. Remove direct calls to h/w headers ctxsw_prog_() and use HALs g->ops.gr.ctxsw_prog.() instead In gr_gk20a_find_priv_offset_in_ext_buffer(), h/w header ctxsw_prog_extended_num_smpc_quadrants_v() is only defined on gk20a And since we don't support gk20a remove corresponding code Add missing h/w header ctxsw_prog_main_image_pm_mode_ctxsw_f() for some chips Add new h/w header ctxsw_prog_gpccs_header_stride_v() Jira NVGPU-1526 Change-Id: I170f5c0da26ada833f94f5479ff299c0db56a732 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1966111 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-11 14:41:04 -08:00
Vinod G	a747e3a3ba	gpu: nvgpu: RTV cb support for gfxp Add new buffer support for graphics preemption in Turing. Add new hal for allocate and commit rtv circular buffer for gfxp. Add new hal for free gr_ctx for TU104. JIRA NVGPUT-98 Change-Id: I4396fd50288db55da5f924fefa96a2e3d170094b Signed-off-by: Vinod G <vinodg@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1944975 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-12-05 17:03:53 -08:00
Konsta Holtta	94d4a42d10	gpu: nvgpu: add runlist_busy_engines HAL Split out the code to check which engines on a particular runlist are busy from gk20a_fifo_runlist_reset_engines() and make it a HAL op. Resetting engines is common across chips but status is read from registers. Jira NVGPU-1309 Change-Id: I7a63a2942a9e210481822eaf85795fc17dad0dc5 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1961822 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-30 11:54:27 -08:00
Mahantesh Kumbar	281c150080	gpu: nvgpu: enable pstate support for tu10x -Enable supported on tu10x JIRA NVGPU-1150 Change-Id: Id32d5a966de3fbbfff5271bf2d5a127f0aa87b5f Reviewed-on: https://git-master.nvidia.com/r/1929896 Signed-off-by: Vaikundanathan S <vaikuns@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1957829 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-29 05:35:10 -08:00
Abdul Salam	45b0cf9a61	gpu: nvgpu: Remove clk arb HALs from GV100 GV100 doesn't support clk arbitration. Setting this to NULL will help in removing GP106 clk arb. Bug 200457373 Change-Id: Ibcf823a6269e66ff90e67c4158a2ec86441066d5 Signed-off-by: Abdul Salam <absalam@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1959171 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: Alex Waterman <alexw@nvidia.com> Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-27 16:43:59 -08:00
Mahantesh Kumbar	7672890f48	gpu: nvgpu: Add Change Sequencer Add change sequencer for PS3.5. Add HAL to select if change sequencer is needed. Add calls from pstate.c to change sequence sw and pmu setup. JIRA NVGPU-1157 Change-Id: I0722c4bf875577ba04f56f49f21cb1a149b1d37b Reviewed-on: https://git-master.nvidia.com/r/1929788 Signed-off-by: Vaikundanathan S <vaikuns@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1950409 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-24 00:34:04 -08:00
Sagar Kamble	1da7c720c0	gpu: nvgpu: reorganize falcon HAL code Move falcon HAL files under common/falcon unit and rename the files to falcon_*.c|h for consistency. JIRA NVGPU-1459 Change-Id: I9f39097f35fd6228e80945251c7b7ef9cc901398 Signed-off-by: Sagar Kamble <skamble@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1953757 Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-21 23:04:33 -08:00
Terje Bergstrom	d6a9b1dae1	gpu: nvgpu: Move gv100 perf policy to pmu_perf While code communicating with PMU perf got moved to pmu_perf, the file implementing gv100 specifics got left behind. Move that, to pmu_perf, too. JIRA NVGPU-596 Change-Id: I2b59970ca60fee8c6c1f19b54dcebfb65c1fde80 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1944887 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-09 13:28:45 -08:00
Terje Bergstrom	f00d9ca1aa	gpu: nvgpu: Move pmu HAL files to common/pmu Move PMU and ACR HAL source code files to live under common/pmu. Also update the #include paths and delete unnecessary #include dependencies. JIRA NVGPU-961 Change-Id: I29a220bce6de0a46b6a5fe8ff7f9dc4d67395348 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1935626 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-08 20:04:06 -08:00
Konsta Holtta	513cb21f26	gpu: nvgpu: move doorbell token number to HAL Add a fifo HAL for querying the doorbell token of a specific channel and call it instead of doing the calculation directly. For Volta the token is just the channel id plus the possible base number. Bug 200145225 Change-Id: Ifbb150191575fdc72e413a14c799cab7e52d8c14 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1849639 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-06 21:56:26 -08:00
tkudav	1cdcc54a53	gpu: nvgpu: Use nvlink speed from VBIOS Different SKUs may require different nvlink speed and hence the nvlink speed value should come from VBIOS. The initpll number corresponding to speed is present in VBIOS Low Power Nvlink table header. Parse this data from VBIOS and set corresponding nvlink speed and minion initpll DLCMD as default. We can no longer update the GV100 VBIOS with necessary nvlink speed value. Hence the hardcoding stays for GV100. The nvlink speed should match across the endpoints. So in speed_config fops, communicate the speed to nvlink core-driver for co-ordination with Tegra endpoint. Bug 2418403 Change-Id: Ib6f60951d4ca1c275968707d4cc6d738ba3a3f08 Signed-off-by: tkudav <tkudav@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1938046 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-06 02:14:32 -08:00
Srirangan Madhavan	ef5fdac7a6	gpu: nvgpu: Fix MISRA rule 15.6 violations MISRA Rule-15.6 requires that all if-else blocks and loop blocks be enclosed in braces, including single statement blocks. Fix errors due to single statement if-else and loop blocks without braces by introducing the braces. JIRA NVGPU-775 Change-Id: Ib70621d39735abae3fd2eb7ccf77f36125e2d7b7 Signed-off-by: Srirangan Madhavan <smadhavan@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928745 GVS: Gerrit_Virtual_Submit Reviewed-by: Adeel Raza <araza@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-11-05 22:13:16 -08:00
Deepak Nibade	e059f3cb12	gpu: nvgpu: add separate unit for netlist All the netlist parsing code is currently under GR unit, but netlist ucode parsing does not really have any logical dependency to GR Hence separate out a new unit common/netlist/ that parses the netlist image and stores/exposes its content through netlist_vars structure Structure nvgpu_netlist_vars is added to structure gk20a Move netlist parsing code to common/netlist/netlist.c and chip specific files to common/netlist/netlist_<chip>.c Move simulation netlist parsing to common/netlist/netlist_sim.c Rename g.ops.gr_ctx HAL to g.ops.netlist Rename all the exported structures to be in the form of nvgpu_* Rename all exported functions to be in the form of nvgpu_netlist_*() Add netlist initialization to GPU boot path, and add deinitialization to GPU remove path Jira NVGPU-1317 Change-Id: I9af86e3b3230a89db5260cc8ed96ff5f72938c9a Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1936454 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-31 09:00:49 -07:00
Deepak Nibade	ac1a2f0897	gpu: nvgpu: use HAL to read fecs_ctx_state_store_major_rev_id() In gk20a/gr_ctx_gk20a.c we right now directly read the GR register gr_fecs_ctx_state_store_major_rev_id_r() which adds the dependency to GR h/w header Add a new HAL g.ops.gr.get_fecs_ctx_state_store_major_rev_id() to read this register and use this instead Also remove h/w header from gr_ctx_gk20a.c Jira NVGPU-1317 Change-Id: Iab64fbfacff4d7ce4f3b61ca90b00ddc77e29551 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1936453 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-31 09:00:40 -07:00
Konsta Holtta	f8188089df	gpu: nvgpu: save only used part of channel ram for dump Reduce the size of memory allocations in the channel debug dump by capturing only the necessary values from the instance block. This also simplifies the allocation path slightly with the downside of having to add a capture_channel_ram_dump HAL for reading the interesting parts explicitly beforehand to the now smaller staging buffer. Also rename struct ch_state to struct nvgpu_channel_dump_info. Jira NVGPU-886 Change-Id: I5d7518d9d474b0b728b183383bc83d89ecf91b98 Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1928207 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-30 15:35:26 -07:00
Konsta Holtta	a39d91b591	gpu: nvgpu: gv100: support usermode submit Use usermode_base HAL from gv11b and turn on NVGPU_SUPPORT_USERMODE_SUBMIT for gv100. Bug 200145225 Change-Id: I9f60a1fb07ae19ee9e0de9e28d56789fe282907f Signed-off-by: Konsta Holtta <kholtta@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1924509 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-29 08:04:56 -07:00
Terje Bergstrom	bc379d5eed	gpu: nvgpu: Split L2 interrupt handling to MC and L2 L2 interrupt is processed by first reading from MC which L2 triggered the interrupt and then calling a function per L2 slice to get the details. Move the outer loop to MC unit, and the inner loop and L2 accesses to LTC unit. JIRA NVGPU-954 Change-Id: I69b7bb82e4574b0519cdcd73b94d7d3e3fa6ef9e Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1851328 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-24 17:00:01 -07:00
Deepak Nibade	1b2a0833e0	gpu: nvgpu: add separate unit for debugger Rename gk20a/dbg_gpu_gk20a.c to common/debugger.c and make it a separate common unit Also rename gk20a/dbg_gpu_gk20a.h to include/nvgpu/debugger.h We had two different HALs for debugger - gops.debugger and gops.dbg_session_ops Combine them into one single HAL gops.debugger and remove gops.dbg_session_ops Rename all exported APIs from debugger.h to be in the form of nvgpu_*() Jira NVGPU-1013 Change-Id: I136dc7786e3b2065921eb03b99f16049212f3cd2 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1920075 Reviewed-by: Sachin Jadhav <sachinj@nvidia.com> Tested-by: Sachin Jadhav <sachinj@nvidia.com>	2018-10-24 00:30:19 -07:00
Amurthyreddy	c94643155e	gpu: nvgpu: MISRA 14.4 err/ret/status as boolean MISRA rule 14.4 doesn't allow the usage of integer types as booleans in the controlling expression of an if statement or an iteration statement. Fix violations where the integer variables err, ret, status are used as booleans in the controlling expression of if and loop statements. JIRA NVGPU-1019 Change-Id: I8c9ad786a741b78293d0ebc4e1c33d4d0fc8f9b4 Signed-off-by: Amurthyreddy <amurthyreddy@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1921260 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-22 08:53:34 -07:00
Deepak	7e8ca5f5e7	gpu: nvgpu: Remove cyclic dependency PMU<->GR. -Created & used HAL for dumping gr falcon stats. -Trimmed the fecs_dump_falcon_stats to re-use code from generic falcon debug dump. JIRA NVGPU-621 Change-Id: Ia008726915112b33f0aca68a48cb98b8ed2c3475 Signed-off-by: Deepak <dgoyal@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1923353 Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User GVS: Gerrit_Virtual_Submit Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-16 05:54:55 -07:00
matthewb	4b10960329	gpu: nvgpu: HAL-ify pmm type broadcast values The PMM type-specific broadcast->unicast expansion calculation was using incorrect values. This caused the invalid register accesses to be generated. This change HAL-ifies the values, so that the expansion will be performed correctly. Bug 200454109 Change-Id: I96c15de27b5e16e4db2e788fd98e6bf7d6e7d564 Signed-off-by: Matthew Braun <matthewb@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1919476 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:11 +05:30
Deepak Nibade	92c1949392	gpu: nvgpu: add separate unit for cyclestats_snapshot Add new separate unit common/perf/cyclestats_snapshot.c and add corresponding header file include/nvgpu/cyclestats_snapshot.h This unit is h/w independent and simply calls gops.perf.* HALs exposed by perf unit to do the h/w configurations Also remove gv11b/css_gr_gv11b.* files as h/w specific sequence implemented in them is already moved to perf unit Rename all cyclestats_snapshot HALs in the form nvgpu_css_*() Jira NVGPU-1103 Change-Id: I303f6becb313ac918e06c495a5fe299947a1f0b1 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1916652 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:11 +05:30
aalex	e1a4bc8401	Revert "Revert "gpu: nvgpu: refactor SET_SM_EXCEPTION_MASK ioctl"" This patch was reverted as the "set_sm_exception_type_mask" HAL assignment for gp10b was missing causing regression on Pascal platform. Added missing gp10b HAL assignment for setting SM exception mask. Bug 200447406 This reverts commit `ce5228e094`. Change-Id: Ic48f4661fd4b6100310f8b4d23d902847e31f5df Signed-off-by: aalex <aalex@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1837653 Reviewed-by: svc-misra-checker <svc-misra-checker@nvidia.com> Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com> GVS: Gerrit_Virtual_Submit Reviewed-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-by: Aparna Das <aparnad@nvidia.com> Tested-by: Sandarbh Jain <sanjain@nvidia.com> Reviewed-by: Nirav Patel <nipatel@nvidia.com> Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:10 +05:30
Deepak Nibade	412c9fa30c	gpu: nvgpu: add separate unit for perfbuf Add separate unit for perfbuf in common/perf/perfbuf.c which does not need to include any h/w file. This unit will utilize HALs exported by perf_*.c units for h/w accesses. Add corresponding header file at include/nvgpu/perfbuf.h Add new HAL gops.perfbuf with below operations : gops.perfbuf.perfbuf_enable() gops.perfbuf.perfbuf_disable() Remove below debug session specific HALs gops.dbg_session_ops.perfbuffer_enable() gops.dbg_session_ops.perfbuffer_disable() Delete file gv11b/dbg_gpu_gv11b.c since it is no longer needed now as it was only including perfbuf sequence Also remove perfbuf sequences from gk20a/dbg_gpu_gk20a.c Jira NVGPU-1102 Change-Id: I57b87c9f0dcd85784f8002bc92728b6d78a68d98 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1819303 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30
Deepak Nibade	71a4ca9935	gpu: nvgpu: add separate unit for perf Add separate unit for perf under common/perf/ to provide accesses to h/w unit hw_perf_*_.c Implement below HALs in gm20b and gv11b specific h/w files and set them to appropriate chips gops.perf.enable_membuf() gops.perf.disable_membuf() gops.perf.membuf_reset_streaming() gops.perf.get_membuf_pending_bytes() gops.perf.set_membuf_handled_bytes() gops.perf.get_membuf_overflow_status() Jira NVGPU-1102 Change-Id: I161990fdb7283f33c0fb2ab6a8051f4bfc3bb181 Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1819302 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30
Deepak Nibade	83ad80de50	gpu: nvgpu: remove VPR HALs from dGPUs gops.fb.dump_vpr_wpr_info() accesses both VPR and WPR registers. Split this into two different HALs gops.fb.dump_vpr_info() and gops.fb.dump_wpr_info() Also unset HALs accessing VPR registers on dGPUs We don't support VPR on dGPUs Remove fb_mmu_vpr_info_r() register and all its accessors from dGPU headers Bug 2173122 Change-Id: I5b2712f8c5389e422a84c375a7e836add48bfd1c Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1850947 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30
Deepak Nibade	7ae214a5d1	gpu: nvgpu: remove big page setting on pascal+ We don't support big page size beginning Pascal, so set HAL gops.fb.set_mmu_page_size() to NULL on all those platforms Also remove these accessors from corresponding platforms fb_mmu_ctrl_use_pdb_big_page_size_v() fb_mmu_ctrl_use_pdb_big_page_size_true_f() fb_mmu_ctrl_use_pdb_big_page_size_false_f() Bug 2173122 Change-Id: I7353412860a7a6f8a993ca9184a0dc3ca9d749af Signed-off-by: Deepak Nibade <dnibade@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1850946 GVS: Gerrit_Virtual_Submit Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-by: Automatic_Commit_Validation_User Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:08 +05:30
Anup Mahindre	96768f617f	gpu: nvgpu: Add gv11b_gr_clear_sm_error_state All chips were currently using gm20b_gr_clear_sm_error_state It was wrong for chips based on volta and later as the implementation didn't consider non pes-aware vsms mapping Add new HAL implementation for clear_sm_error_state for volta based and later chips to fix this. Bug 200448172 Change-Id: I65988c8cbb35d13089ac628e8333d9a3b58e0eb1 Signed-off-by: Anup Mahindre <amahindre@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1837188 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:07 +05:30
Terje Bergstrom	2c298b8c21	gpu: nvgpu: Move FB reset to MC unit FB reset is done by accessing MC register. Move the code to MC unit. JIRA NVGPU-954 Change-Id: I1636887af805f016da5490af65e808f9ac015cde Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1823385 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:07 +05:30
Terje Bergstrom	2c17e71aa1	gpu: nvgpu: Add MC APIs for reset masks Add API for querying reset mask corresponding to a unit. The reset masks need to be read from MC HW header, and we do not want all units to access MC HW headers themselves. JIRA NVGPU-954 Change-Id: I49ebbd891569de634bfc71afcecc8cd2358805c0 Signed-off-by: Terje Bergstrom <tbergstrom@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1823384 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:07 +05:30
Alex Waterman	966d1a08be	gpu: nvgpu: Standardize HAS_SYNCPOINTs check Nvgpu uses many ways to check if sync points are enabled. The four ways used to be: platform->has_syncpoints, g->has_syncpoints, nvgpu_is_enabled(g, NVGPU_HAS_SYNCPOINTS), gk20a_platform_has_syncpoints(). This patch standardizes all usage to now be nvgpu_has_syncpoints() which is based on gk20a_platform_has_syncpoints() - just renamed to be general to nvgpu. All usage of the other forms have now been consolidated. However, under the hood nvgpu_has_syncpoints() does check the is_enabled flag. This flag is now set where g->has_syncpoints used to be set based on the platform data. The basic dependency chain is this: nvgpu_has_syncpoints -> NVGPU_HAS_SYNCPOINTS -> platform->has_syncpoints. However, note: there are several places where syncpoints can be disabled if some other driver initialization fails (for ex. host1x). Also note that nvgpu_has_syncpoints() also considers a disable variable set by debugfs. Bug 2327574 Change-Id: Ia2375a80f5f2e27285e6175568dd13e6bb25fd33 Signed-off-by: Alex Waterman <alexw@nvidia.com> Reviewed-on: https://git-master.nvidia.com/r/1803975 Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com> Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>	2018-10-12 17:35:07 +05:30

234 Commits