delta/nouveau.git - github.com: Gnurou/nouveau.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	gm20b: secure-boot FECS falconsecure_boot/base	Alexandre Courbot	2015-10-26	2	-4/+6
\| \| \| \| \| \|	Enable secure boot of FECS for GM20B. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
*	gr: support for securely-booted FECS firmware	Alexandre Courbot	2015-10-26	1	-10/+46
\| \| \| \| \| \| \| \|	Trigger the loading of FECS/GPCCS using secure boot if required, and start managed falcons using the CPUCTL_ALIAS register since CPUCTL is protected in that case. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
*	core: add support for secure boot	Alexandre Courbot	2015-10-26	5	-0/+1809
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On GM20x and later GPUs, firmware for some essential falcons (notably FECS) must be authenticated by a NVIDIA-produced signature and loaded by a high-secure falcon in order to access certain registers, in a process known as Secure Boot. Secure Boot requires the building of a binary blob containing the firmwares and signatures of the falcons to be loaded. This blob is then given to a high-secure falcon running a signed loader firmware that copies the blob into a write-protected region, checks that the signatures are valid, and finally loads the verified firmware into the managed falcons and switches them to a priviledged mode. This patch adds code that performs this process using PMU as the high-secure falcon, and wires it into the device core. Currently, only the secure loading of the FECS firmware is handled, but support for other falcons (notably GPCCS and PMU) is upcoming. The reason for limiting to FECS is that GR must initiate the loading of FECS at init time, which, being managed by secure boot, will trigger the loading of all other managed firmwares. A solution to this needs to be discussed. This code is tested on Tegra/GM20B and some minor work is required for dGPU support, but the fundations are here for general support of Secure Boot. This work is based on Deepak Goyal's initial port of Secure Boot to Nouveau. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
*	core: fix return in error path of device probe	Alexandre Courbot	2015-10-26	1	-1/+2
\| \| \| \| \| \|	We want to unlock nv_devices_mutex in this error path as well. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
*	fifo/gm20b: kick channel during cleanup	Alexandre Courbot	2015-10-26	5	-24/+121
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	GM20B requires a channel kick to be performed during gpfifo cleanup, or the FIFO will attempt to fetch memory from the previous context as a channel is recycled. A previous commit attempted to do this for all Kepler GPUs, but due to bug reports that pinned it down it has been reverted. The present commit limits its scope to GM20B only. The only effective change of this patch is to add a call to gk104_fifo_gpfifo_kick() in gpfifo_fini for GM20B, but doing so requires to export quite a few extra functions, hence its non-trivial length. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
*	instmem/gk20a: exclusively acquire instobjs	Alexandre Courbot	2015-10-26	1	-9/+6
\| \| \| \| \| \| \| \|	Although I would not have expected this to happen, we seem to run into race conditions if instobjs are accessed concurrently. Use a global lock for safety. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
*	Compile fixes for GM20B	Alexandre Courbot	2015-10-26	2	-1/+2
\|
*	Compile fixes	Alexandre Courbot	2015-10-26	3	-5/+7
\|
*	clk/g84: Enable reclocking for GDDR3 G94-G200	Roy Spliet	2015-10-22	1	-1/+1
\| \| \| \| \| \| \| \|	Your milage may vary, as it's only been tested on a single G94 and one G96. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Tested-by: Pierre Moreau <pierre.morrow@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	bus/hwsq: Implement VBLANK waiting heuristic	Roy Spliet	2015-10-22	5	-2/+41
\| \| \| \| \| \| \| \| \|	Avoids waiting for VBLANKS that never arrive on headless or otherwise unconventional set-ups. Strategy taken from MEMX. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Tested-by: Pierre Moreau <pierre.morrow@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fb/ramnv50: Script changes for G94 and up	Roy Spliet	2015-10-22	1	-6/+30
\| \| \| \| \| \| \| \| \| \| \|	10053c is not even read on some cards, and I have no idea exactly what the criteria are. Likely NVIDIA pre-scans the VBIOS and in their driver disables all features that are never used. The practical effect should be the same as this implementation though. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Tested-by: Pierre Moreau <pierre.morrow@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fb/ramnv50: Deal with cards without timing entries	Roy Spliet	2015-10-22	3	-7/+50
\| \| \| \| \| \| \| \|	Like Pierre's G94. We might want to structure Kepler similarly in a follow-up. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Tested-by: Pierre Moreau <pierre.morrow@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fb/ramnv50: Voltage GPIOs	Roy Spliet	2015-10-22	2	-0/+42
\| \| \| \| \| \| \| \|	Does not seem to be necessary for NVA0, hence untested by me. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Tested-by: Pierre Moreau <pierre.morrow@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fb/ramgt215: Restructure r111100 calculation for DDR2	Roy Spliet	2015-10-22	1	-30/+34
\| \| \| \| \| \| \| \|	Seems to be mostly equal to DDR3 on < GT218, should improve stability for DDR2 reclocks. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fb/ramgt215: Change FBVDD/Q when BIOS asks for it	Roy Spliet	2015-10-22	3	-0/+20
\| \| \| \| \|	Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fb/ramgt215: Transform GPIO ramfuc method from FBVREF-specific to generic	Roy Spliet	2015-10-22	2	-24/+19
\| \| \| \| \| \| \| \|	In preparation of changing FBVDDQ, as observed on at least one GDDR3 card. While at it, adhere to func.log[1] properly for consistency. Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	bios/rammap: Identify DLLoff for >= GF100	Roy Spliet	2015-10-22	5	-12/+39
\| \| \| \| \|	Signed-off-by: Roy Spliet <rspliet@eclipso.eu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	pci: Handle 5-bit and 8-bit tag field	Pierre Moreau	2015-10-22	6	-0/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the hardware supports extended tag field (8-bit ones), then enable it. This is usually done by the VBIOS, but not on some MBPs (see fdo#86537). In case extended tag field is not supported, 5-bit tag field is used which limits the possible number of requests to 32. Apparently bits 7:0 of 0x08841c stores some number of outstanding requests, so cap it to 32 if extended tag is unsupported. Fixes: fdo#86537 v2: Restrict changes to chipsets >= 0x84 v3: * Add nvkm_pci_mask to pci.h * Mask bit 8 before setting it v4: * Rename `add` argument of nvkm_pci_mask to `value` * Move code from nvkm_pci_init to g84_pci_init and remove PCIe and chipset checks v5: * Rebase code on latest PCI structure * Restore PCIe check * Fix namings in nvkm_pci_mask * Rephrase part of the commit message Signed-off-by: Pierre Moreau <pierre.morrow@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	disp,pm: constify nvkm_object_func structures	Julia Lawall	2015-10-22	2	-2/+2
\| \| \| \| \| \| \| \| \| \|	These nvkm_object_func structures are never modified. All other nvkm_object_func structures are declared as const. Done with the help of Coccinelle. Signed-off-by: Julia Lawall <Julia.Lawall@lip6.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	gr: add FERMI_COMPUTE_B class to GF110+	Ilia Mirkin	2015-10-22	3	-0/+3
\| \| \| \| \| \| \| \|	GF110+ supports both the A and B compute classes, make sure to accept both. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	gr: document mp error 0x10	Ilia Mirkin	2015-10-22	1	-0/+1
\| \| \| \| \| \| \| \| \|	NVIDIA provided the documentation for mp error 0x10, INVALID_ADDR_SPACE, which apparently happens when trying to use an atomic operation on local or shared memory (instead of global memory). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	drm: fix memory leak	Sudip Mukherjee	2015-10-22	1	-1/+3
\| \| \| \| \| \| \| \|	If pm_runtime_get_sync() we were going to "out" but we missed freeing vma. Signed-off-by: Sudip Mukherjee <sudip@vectorindia.org> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	drm: remove unused function	Sudip Mukherjee	2015-10-22	2	-15/+0
\| \| \| \| \| \| \| \| \| \|	coverity.com reported that memset was using a buffer of size 0, on checking the code it turned out that the function was not being used. So remove it. Signed-off-by: Sudip Mukherjee <sudip@vectorindia.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	WIPpmu/gk107: enable PGOB codepaths	Ben Skeggs	2015-10-22	1	-1/+1
\| \| \| \| \| \|	Reported to be needed as per fdo#70354 comment #61. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	WIPpmu/gk104: check fuse to determine presence of PGOB	Ben Skeggs	2015-10-22	1	-0/+4
\| \| \| \| \| \| \|	Not 100% confirmed, but seems to match from the few boards I've looked at so far. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	pci: prepare for chipset-specific initialisation tasks	Ben Skeggs	2015-10-22	2	-0/+4
\| \| \| \|	Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	pci/nv46: attempt to fix msi, and re-enable by default	Ben Skeggs	2015-10-22	6	-12/+12
\| \| \| \| \| \| \| \|	Was not able to obtain a trace of NVRM due to kernel version annoyances, however, experimentally confirmed that the WAR we use on NV50/G8x boards works here too. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	pci/g94: split implementation from nv40	Ben Skeggs	2015-10-22	6	-26/+67
\| \| \| \| \| \| \|	An upcoming patch will implement functionality that we don't use on any NV40 chipset. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	pci/g84: split implementation from nv50	Ben Skeggs	2015-10-22	6	-5/+49
\| \| \| \| \| \| \|	An upcoming patch will implement functionality that we don't use on the original NV50. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	ibus/gf100: increase wait timeout to avoid read faults	Samuel Pitoiset	2015-10-22	6	-4/+77
\| \| \| \| \| \| \| \| \| \| \| \|	Increase clock timeout of some unknown engines in order to avoid failure at high gpcclk rate. This fixes IBUS read faults on my GF119 when reclocking is manually enabled. Note that memory reclocking is completely broken and NvMemExec has to be disabled to allow core clock reclocking only. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	gm204/6: add voltage control using the new gk104 volt class	Martin Peres	2015-10-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	I got confirmation that we can read and change the voltage with the same code. The divider is also computed correctly on the gm204 we got our hands on. Thanks to Yoshimo on IRC for executing the tests on his gm204! Signed-off-by: Martin Peres <martin.peres@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	gm107: add voltage control using the new gk104 volt class	Martin Peres	2015-10-22	1	-0/+1
\| \| \| \| \| \| \|	Let's ignore the other desktop Maxwells until I get my hands on one and confirm that we still can change the voltage. Signed-off-by: Martin Peres <martin.peres@free.fr>
*	volt/gk104: add support for pwm and gpio modes	Martin Peres	2015-10-22	6	-7/+133
\| \| \| \| \| \| \| \| \| \| \| \| \|	Most Keplers actually use the GPIO-based voltage management instead of the new PWM-based one. Use the GPIO mode as a fallback as it already gracefully handles the case where no GPIOs exist. All the Maxwells seem to use the PWM method though. v2: - Do not forget to commit the PWM configuration change! Signed-off-by: Martin Peres <martin.peres@free.fr>
*	volt: add support for non-vid-based voltage controllers	Martin Peres	2015-10-22	2	-1/+12
\| \| \| \| \| \| \| \|	This patch is not ideal but it definitely beats a rewrite of the current interface and is very self-contained. Signed-off-by: Martin Peres <martin.peres@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	bios/volt: add support for pwm-based volt management	Martin Peres	2015-10-22	2	-3/+29
\| \| \| \| \|	Signed-off-by: Martin Peres <martin.peres@free.fr> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	ttm: set the DMA mask for platform devices	Alexandre Courbot	2015-10-22	1	-6/+19
\| \| \| \| \| \| \| \| \| \|	So far the DMA mask was not set for platform devices, which limited them to a 32-bit physical space. Allow dma_set_mask() to be called for non-PCI devices, and also take the IOMMU bit into account since it could restrict the physically addressable space. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	ttm: convert to DMA API	Alexandre Courbot	2015-10-22	1	-7/+5
\| \| \| \| \| \| \| \|	The pci_dma_* functions are now superseeded in the kernel by the DMA API. Make the conversion to this more generic API. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	instmem/gk20a: make use of the IOMMU bit	Alexandre Courbot	2015-10-22	1	-4/+6
\| \| \| \| \| \| \| \|	Use the IOMMU bit specified in platform data instead of hardcoding it to the bit used by current Tegra GPUs. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	platform: allow to specify the IOMMU bit	Alexandre Courbot	2015-10-22	6	-10/+46
\| \| \| \| \| \| \| \| \| \| \|	Current Tegra code taking advantage of the IOMMU assumes a hardcoded value for the IOMMU bit. Make it a platform property instead for flexibility. v2 (Ben Skeggs): remove nvkm dependence on drm structures Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	instmem/gk20a: use direct CPU access	Alexandre Courbot	2015-10-22	2	-116/+317
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The Great Nouveau Refactoring Take II brought us a lot of goodness, including acquire/release methods that are called before and after an instobj is modified. These functions can be used as synchronization points to manage CPU/GPU coherency if we modify an instobj using the CPU. This patch replaces the legacy and slow PRAMIN access for gk20a instmem with CPU mappings and writes. A LRU list is used to unmap unused mappings after a certain threshold (currently 1MB) of mapped instobjs is reached. This allows mappings to be reused most of the time. Accessing instobjs using the CPU requires to maintain the GPU L2 cache, which we do in the acquire/release functions. This triggers a lot of L2 flushes/invalidates, but most of them are performed on an empty cache (and thus return immediately), and overall context setup performance greatly benefits from this (from 250ms to 160ms on Jetson TK1 for a simple libdrm program). Making L2 management more explicit should allow us to grab some more performance in the future. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	drm: remove unnecessary usage of object handles	Ben Skeggs	2015-10-22	10	-61/+31
\| \| \| \| \| \| \|	No longer required in a lot of cases, as objects are identified over NVIF via an alternate mechanism since the rework. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	ltc/gf100: add flush/invalidate functions	Alexandre Courbot	2015-10-22	5	-0/+39
\| \| \| \| \| \| \| \|	Allow clients to manually flush and invalidate L2. This will be useful for Tegra systems for which we want to write instmem using the CPU. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	ltc: add hooks for invalidate and flush	Alexandre Courbot	2015-10-22	3	-0/+20
\| \| \| \| \| \| \| \|	These are useful for systems without a coherent CPU/GPU bus. For such systems we may need to maintain the L2 ourselves. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	timer: re-introduce nvkm_wait_xsec macros	Alexandre Courbot	2015-10-22	1	-0/+10
\| \| \| \| \| \| \| \| \|	Reintroduce macros allowing us to test a register against a certain mask, since this is the most common usage pattern for the more generic nvkm_xsec macros and makes the code more concise and readable. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	pmu: do not assume a PMU is present	Alexandre Courbot	2015-10-22	1	-1/+1
\| \| \| \| \| \| \| \| \|	Some devices may not have a PMU. Avoid a NULL pointer dereference in such cases by checking whether the pointer given to nvkm_pmu_pgob() is valid. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	gem: return only valid domain when there's only one	Ilia Mirkin	2015-10-22	2	-2/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On nv50+, we restrict the valid domains to just the one where the buffer was originally created. However after the buffer is evicted to system memory, we might move it back to a different domain that was not originally valid. When sharing the buffer and retrieving its GEM_INFO data, we still want the domain that will be valid for this buffer in a pushbuf, not the one where it currently happens to be. This resolves fdo#92504 and several others. These are due to suspend evicting all buffers, making it more likely that they temporarily end up in the wrong place. Cc: stable@vger.kernel.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92504 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	bios: fix OF loading	Ilia Mirkin	2015-10-12	3	-11/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently OF bios load fails for a few reasons: - checksum failure - bios size too small - no PCIR header - bios length not a multiple of 4 In this change, we resolve all of the above by ignoring any checksum failures (since OF VBIOS tends not to have a checksum), and faking the PCIR data when loading from OF. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	fbcon: take runpm reference when userspace has an open fd	Ben Skeggs	2015-10-12	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \|	We need to do this in order to prevent accesses to the device while it's powered down. Userspace may have an mmap of the fb, and there's no good way (that I know of) to prevent it from touching the device otherwise. This fixes some nasty races between runpm and plymouth on some systems, which result in the GPU getting very upset and hanging the boot. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	nouveau: Disable AGP for SiS 761	Ondrej Zary	2015-10-12	2	-2/+9
\| \| \| \| \| \| \| \| \| \| \| \| \|	SiS 761 chipset does not support AGP cards but has AGP capability (for the onboard video). At least PC Chips A31G board using this chipset has an AGP-like AGPro slot that's wired to the PCI bus. Enabling AGP will fail (GPU lockup and software fbcon, X11 hangs). Add support for matching just the host bridge in nvkm_device_agp_quirks and add entry for SiS 761 with mode 0 (AGP disabled). Signed-off-by: Ondrej Zary <linux@rainbow-software.org> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
*	display: allow up to 16k width/height for fermi+	Ilia Mirkin	2015-10-12	1	-1/+5
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>