Age | Commit message | Author | Files | Lines
2014-10-16 | radeonsi/compute: Enable PIPE_SHADER_IR_NATIVE for compute shaders v2 [clover-elf-final] | Tom Stellard | 4 | -71/+123
v2:
- Drop dependency on LLVM >= 3.5.1
- Rename si_create_shader() to si_shader_binary_read()
2014-10-16 | r600g/compute: Enable PIPE_SHADER_IR_NATIVE for compute shaders v2 | Tom Stellard | 8 | -97/+180
v2:
- Drop dependency on LLVM >= 3.5.1
2014-10-16 | gallium/radeon: Add query for symbol specific config information | Tom Stellard | 3 | -0/+86
This adds a query which allows drivers to access the config information of a specific function within the LLVM generated ELF binary. This makes it possible for the driver to handle ELF binaries with multiple kernels / global functions.
2014-10-16 | clover: Add environment variables for dumping kernel code v2 | Tom Stellard | 1 | -11/+75
There are two debug variables:

CLOVER_DEBUG, which you can set to any combination of llvm, clc, and asm (separated by commas) to dump LLVM IR, OpenCL C, and native assembly.

CLOVER_DEBUG_FILE, which you can set to a file name to dump output to a file instead of stderr. If you set this variable, the output is split into three separate files with different suffixes: .cl for OpenCL C, .ll for LLVM IR, and .asm for native assembly. Note that data is always appended to the files when it is written.

v2:
- Code cleanups
- Add CLOVER_DEBUG_FILE environment variable for dumping to a file.

Reviewed-by: Francisco Jerez <currojerez@riseup.net>
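The behaviour described in this message can be sketched in C as follows. This is only an illustration under stated assumptions, not clover's actual code (clover itself is C++): the names `DBG_LLVM`, `DBG_CLC`, `DBG_ASM`, `parse_debug_flags`, and `debug_file_name` are hypothetical. Only the variable values (llvm, clc, asm) and the suffixes (.ll, .cl, .asm) come from the commit message.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical flag bits, one per dump type named in the commit message. */
enum { DBG_LLVM = 1u << 0, DBG_CLC = 1u << 1, DBG_ASM = 1u << 2 };

/* Parse a comma-separated value such as "llvm,asm" into a bitmask.
 * Unknown tokens are ignored; NULL (variable unset) yields 0. */
static unsigned parse_debug_flags(const char *value)
{
    unsigned flags = 0;

    if (!value)
        return 0;

    while (*value) {
        size_t len = strcspn(value, ",");   /* length of the current token */

        if (len == 4 && strncmp(value, "llvm", 4) == 0)
            flags |= DBG_LLVM;
        else if (len == 3 && strncmp(value, "clc", 3) == 0)
            flags |= DBG_CLC;
        else if (len == 3 && strncmp(value, "asm", 3) == 0)
            flags |= DBG_ASM;

        value += len;
        if (*value == ',')
            value++;                        /* skip the separator */
    }
    return flags;
}

/* Build "<base><suffix>" for a single dump type: .cl, .ll or .asm. */
static int debug_file_name(char *out, size_t size, const char *base,
                           unsigned flag)
{
    const char *suffix = flag == DBG_CLC  ? ".cl"
                       : flag == DBG_LLVM ? ".ll"
                                          : ".asm";
    return snprintf(out, size, "%s%s", base, suffix);
}
```

A caller would typically pass `getenv("CLOVER_DEBUG")` to the parser and open each resulting file name with `fopen(path, "a")`, which matches the note that data is always appended.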
2014-10-16 | clover: Register an llvm diagnostic handler v3 | Tom Stellard | 1 | -0/+25
This will allow us to handle internal compiler errors.

v2:
- Code cleanups.

v3:
- More cleanups.

Reviewed-by: Francisco Jerez <currojerez@riseup.net>
2014-10-16 | clover: Add support for compiling to native object code v3 | Tom Stellard | 2 | -9/+204
v2:
- Split build_module_native() into three separate functions.
- Code cleanups.

v3:
- More cleanups.

Reviewed-by: Francisco Jerez <currojerez@riseup.net>
2014-10-16 | gallium: Add PIPE_SHADER_IR_NATIVE to enum pipe_shader_ir | Tom Stellard | 2 | -3/+4
Drivers can return this value for PIPE_COMPUTE_CAP_IR_TARGET if they want clover to give them native object code. Reviewed-by: Francisco Jerez <currojerez@riseup.net>
2014-10-16 | clover: Factor kernel argument parsing into its own function v2 | Tom Stellard | 1 | -81/+92
v2:
- Code cleanups.

Reviewed-by: Francisco Jerez <currojerez@riseup.net>
2014-10-15 | r600g,radeonsi: Only set use_staging_texture = TRUE once | Michel Dänzer | 1 | -8/+5
There is no need to check whether to set the flag once it has already been set. Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2014-10-15 | r600g,radeonsi: Use staging texture for transfers if any miplevel is tiled | Michel Dänzer | 1 | -1/+1
We set the NO_CPU_ACCESS flag for BO allocation in that case, so direct CPU access may not work. Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2014-10-15 | winsys/radeon: Use separate caching buffer manager for each set of flags | Michel Dänzer | 3 | -41/+32
Otherwise the caching buffer manager may return a buffer which was created with a different set of flags, which can cause trouble. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2014-10-15 | configure.ac: check for libexpat when no pkg-config is available | Andres Gomez | 1 | -8/+7
Previously, when no pkg-config file was available for libexpat, we would just add the needed linker flags without any extra check. Now we also check that the library and the headers are installed in the build environment. Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
2014-10-14 | clover: Fix regression in module serialization | Tom Stellard | 1 | -0/+1
We need to serialize semantic information for arguments, which was added in 06139c56fa070f84a931a4ddbdb894c9e8d24f55. Reviewed-by: Francisco Jerez <currojerez@riseup.net>
2014-10-14 | i965/fs: Use the correct regs_written on unspill instructions | Jason Ekstrand | 1 | -0/+1
Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
2014-10-14 | st/gbm: fix order of arguments passed to is_format_supported | Ilia Mirkin | 1 | -1/+1
Reported by Coverity Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Cc: mesa-stable@lists.freedesktop.org
2014-10-14 | nouveau: 3d textures are unsupported, limit 3d levels to 1 | Ilia Mirkin | 1 | -0/+3
Ideally there would be a swrast fallback, but the driver isn't ready for that. This should avoid crashes if someone tries to use 3d textures though. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: mesa-stable@lists.freedesktop.org
2014-10-14 | freedreno: use tgsi_lowering | Rob Clark | 8 | -1673/+6
Now that the freedreno_lowering code is moved to tgsi_lowering, remove our private copy and switch over to using the common version. Signed-off-by: Rob Clark <robclark@freedesktop.org>
2014-10-14 | r300/compiler: remove useless check | David Heidelberger | 1 | -5/+2
This code is already inside an if (!variable->C->is_r500) block, so there is no need to check twice. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: David Heidelberger <david.heidelberger@ixit.cz>
2014-10-14 | ilo: Build pipe-loader for ilo | Nick Sarnie | 2 | -0/+40
Trivial patch to create the pipe loader for ilo. All the code was already there. Signed-off-by: Nick Sarnie <commendsarnex@gmail.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
2014-10-14 | automake: explicitly set TARGET_RADEON_{WINSYS,COMMON} | Emil Velikov | 3 | -5/+5
Originally the variables were set only once via the ?= operator but that causes issues when doing incremental builds. They appear to be undefined and missing from the dependency list despite their addition to LIBADD. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=84807 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>
2014-10-14 | vc4: Fix render target NPOT alignment at small miplevels. | Eric Anholt | 1 | -3/+12
The texturing hardware takes the POT level 0 width/height and minifies those. This is different from what we were doing, for example, for 273-wide's level 5: POT(273>>5) == 8, while POT(273)>>5 == 16. Fixes piglit-depthstencil-render-miplevels 273.
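The arithmetic in this message can be checked with a small C sketch. The helper names (`next_pot`, `minify`, `pot_of_minified`, `minified_pot`) are hypothetical and are not taken from the vc4 driver. They only contrast the two orderings the message describes: rounding up to a power of two after minifying versus minifying the power-of-two level 0 size.

```c
/* Round v up to the next power of two (v itself if already a power of two). */
static unsigned next_pot(unsigned v)
{
    unsigned p = 1;

    while (p < v)
        p <<= 1;
    return p;
}

/* Mipmap minification: halve once per level, never going below 1. */
static unsigned minify(unsigned v, unsigned level)
{
    unsigned m = v >> level;

    return m ? m : 1;
}

/* Old ordering described in the message: POT(width >> level). */
static unsigned pot_of_minified(unsigned width, unsigned level)
{
    return next_pot(minify(width, level));
}

/* Ordering the texturing hardware is said to use: POT(width) >> level. */
static unsigned minified_pot(unsigned width, unsigned level)
{
    return minify(next_pot(width), level);
}
```

For a 273-wide texture at level 5, the first ordering gives 8 (273 >> 5 is 8, already a power of two) while the second gives 16 (512 >> 5), which is the mismatch the commit message cites.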
2014-10-14 | vc4: Add support for having 0 vertex elements used. | Eric Anholt | 2 | -6/+47
You have to load at least 1, according to the simulator. Fixes 4 piglit tests and even more ES2 conformance tests.
2014-10-13 | auxiliary/os: Add DragonFly BSD support in os_get_total_physical_memory. | Vinson Lee | 1 | -0/+2
This patch fixes this build error on DragonFly BSD.

  CC os/os_misc.lo
os/os_misc.c: In function 'os_get_total_physical_memory':
os/os_misc.c:132:2: error: #error Unsupported *BSD

Signed-off-by: Vinson Lee <vlee@freedesktop.org>
Reviewed-by: Brian Paul <brianp@vmware.com>
2014-10-13 | glx: Fix glxUseXFont for glxWindow and glxPixmaps | Daniel Manjarres | 1 | -0/+5
The current implementation of glxUseXFont requires creating a temporary pixmap and graphics context, which requires a real old-school X11 Window, not a glxDrawable. This patch changes things so that glxUseXFont also accepts a glxWindow or glxPixmap and looks up the underlying X11 Drawable. Without this patch, glxUseXFont generates a giant stream of X errors about bad drawables and bad graphics contexts. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=54372 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Brian Paul <brianp@vmware.com>
2014-10-14 | ilo: clear writer pointer after unmapping | Chia-I Wu | 1 | -0/+1
It does not look like an issue now, but it is good to be future-proof. Spotted by Courtney Goeltzenleuchter. Signed-off-by: Chia-I Wu <olvaffe@gmail.com>
2014-10-13 | vc4: Write the VPM read setup multiple times to queue all the inputs. | Eric Anholt | 1 | -3/+18
There's a 4-element fifo, and the size (number of dwords per vertex) field is just 4 bits. Fixes glsl-routing on sim.
2014-10-13 | vc4: Add support for the TXL opcode. | Eric Anholt | 1 | -5/+15
There's a bit at the bottom of cube map stride (which has some formatting bugs in the docs) which flips the bias coordinate to being an absolute LOD.
2014-10-13 | vc4: Improve the accuracy of SIN and COS. | Eric Anholt | 1 | -11/+17
This gets them to pass glsl-sin/cos. There was an obvious problem: I was using the FRC code on the scaled input value, which means that we had a range of [0, 1], while our Taylor series is most accurate across [-0.5, 0.5]. We can just slide things over, but that means flipping the sign of the coefficients. After that, it was just a matter of stuffing more coefficients in.
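The range-reduction idea can be sketched in C as below. This is an illustration under assumptions, not the vc4 code: `sketch_frc` and `sketch_sin` are hypothetical names, the sketch uses double precision, and the number of series terms is chosen arbitrarily. It relies on the identity sin(2πf) = -sin(2π(f - 0.5)), which is why shifting the fractional part from [0, 1) to [-0.5, 0.5) flips the sign of every coefficient.

```c
/* Fractional part in [0, 1), like a FRC instruction (floor-based). */
static double sketch_frc(double v)
{
    double t = (double)(long long)v;   /* truncates toward zero */

    if (t > v)
        t -= 1.0;                      /* fix up negative inputs to get floor */
    return v - t;
}

/* sin(x) via range reduction to [-pi, pi) and a sign-flipped Taylor series. */
static double sketch_sin(double x)
{
    const double two_pi = 6.283185307179586;
    double f = sketch_frc(x / two_pi); /* scaled input, in [0, 1) */
    double u = f - 0.5;                /* slide over to [-0.5, 0.5) */
    double y = two_pi * u;             /* back to radians, [-pi, pi) */

    /* sin(2*pi*f) = sin(y + pi) = -sin(y), so the series for sin(y) is
     * evaluated with every coefficient negated: -y + y^3/3! - y^5/5! + ... */
    double term = -y;
    double sum = term;

    for (int n = 1; n <= 9; n++) {
        term *= -(y * y) / ((2 * n) * (2 * n + 1));
        sum += term;
    }
    return sum;
}
```

Centering the reduced argument on zero keeps |y| at most π, so the truncated series converges with far fewer terms than it would over [0, 2π).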
2014-10-13 | i965: Use unsynchronized maps for the program cache on LLC platforms. | Kenneth Graunke | 1 | -7/+28
There's no reason to stall on pwrite - the CPU always appends to the buffer and never modifies existing contents, and the GPU never writes it. Further, the CPU always appends new data before submitting a batch that requires it. This code predates the unsynchronized mapping feature, so we simply didn't have the option when it was written. Ideally, we would do this for non-LLC platforms too, but unsynchronized mapping support only exists for LLC systems. Saves a bunch of stall avoidance copies when uploading shaders. v2: Rebase on changes to previous patch. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> [v1]
2014-10-13 | i965: Issue performance warnings when copying the program cache BO. | Kenneth Graunke | 1 | -0/+3
We don't really want unnecessary buffer copying, so it'd be nice to know when it's happening. v2: Drop stall warnings when doing a read-only CPU mapping of the cache BO. The GPU also uses it in a read-only fashion, so there won't be any stalls, even though the buffer is busy. (Thanks to Chris Wilson for catching this mistake.) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> [v1]
2014-10-13 | i965: Issue performance warnings on MapBufferRange stalls. | Kenneth Graunke | 1 | -3/+4
This is easy: we just need to use brw_map_bo instead of mapping it directly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>
2014-10-13 | vc4: Match VS outputs to FS inputs. | Eric Anholt | 3 | -18/+135
If the VS doesn't output a value that the FS needs, we still need to read the right contents for the remaining FS inputs, by emitting padding. And if the VS outputs something the FS doesn't need, we shouldn't put it in the VPM at all (so the code producing it can get DCEed). Fixes 77 piglit tests.
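A minimal C sketch of the matching rule described here follows. It assumes each VS output and FS input is identified by a plain integer semantic ID, which is a simplification, and `SLOT_PADDING` and `match_vs_outputs` are hypothetical names rather than the driver's actual data structures. The layout is driven by the FS inputs: each one takes either the index of the matching VS output or a padding marker, and VS outputs that no FS input references never appear.

```c
#include <stddef.h>

/* Marker for an FS input that the VS does not write (padding is emitted). */
#define SLOT_PADDING (-1)

/*
 * For each FS input, record the index of the VS output with the same
 * semantic ID, or SLOT_PADDING if there is none. VS outputs that no FS
 * input uses are simply never referenced, so they take no slot at all.
 * Returns the number of slots written (one per FS input).
 */
static size_t match_vs_outputs(const int *fs_inputs, size_t num_fs,
                               const int *vs_outputs, size_t num_vs,
                               int *slots)
{
    for (size_t i = 0; i < num_fs; i++) {
        slots[i] = SLOT_PADDING;
        for (size_t j = 0; j < num_vs; j++) {
            if (vs_outputs[j] == fs_inputs[i]) {
                slots[i] = (int)j;
                break;
            }
        }
    }
    return num_fs;
}
```

Because unreferenced VS outputs never get a slot, the code that computes them has no remaining consumer, which is what allows dead code elimination to remove it.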
2014-10-13 | configure: use $libdir/dri as default for VA-API | Christian König | 1 | -2/+2
Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2014-10-13 | configure: remove superfluous VA-API line from configure.ac | Christian König | 1 | -1/+0
We don't have GALLIUM_STATE_TRACKERS_DIRS any more. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2014-10-13 | configure: respect $libdir for the OMX installation dir | Christian König | 1 | -5/+2
Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2014-10-13 | configure: Revert "ask vdpau.pc for the default location of the vdpau drivers" | Christian König | 1 | -8/+3
This reverts commit bbe6f7f865cd4316b5f885507ee0b128a20686eb. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2014-10-13 | vc4: Add support for the CEIL opcode. | Eric Anholt | 1 | -0/+22
Not as big of a deal as SSG, but still +9 piglit tests.
2014-10-13 | vc4: Add support for the SSG opcode. | Eric Anholt | 1 | -0/+12
2014-10-13 | docs: add news item and link release notes | Emil Velikov | 2 | -0/+14
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>
2014-10-13 | docs: Add sha256 sums for the 10.3.1 release | Emil Velikov | 1 | -1/+3
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit fa98c74692634de4f87694a40a299b59c4716ee5)
2014-10-13 | Add release notes for the 10.3.1 release | Emil Velikov | 1 | -0/+156
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit 088d3501786a2ff0833de45951b63acbe6560a0f)
2014-10-13 | docs: Add sha256 sums for the 10.2.9 release | Emil Velikov | 1 | -1/+3
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit 52bd154980e306b8bc9b9d2edc0e728a9f8f3bf6)
2014-10-13 | Add release notes for the 10.2.9 release | Emil Velikov | 1 | -0/+99
Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit 9f1149876f2d010c871751a53d02d4d2b6aef1fe)
2014-10-12 | r600g: Implement GL_ARB_sample_shading | Glenn Kennard | 12 | -120/+385
Also fixes two-sided lighting, which was broken at least on pre-evergreen by commit b1eb00. Signed-off-by: Glenn Kennard <glenn.kennard@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>
2014-10-12 | radeonsi: use tgsi_shader_info in si_llvm_emit_fs_epilogue | Marek Olšák | 1 | -71/+61
This is the last use of tgsi_parse_token in radeonsi. It looks ugly because the code was re-indented, but there is really no change in behavior. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
2014-10-12 | radeonsi: remove si_shader_output_values::index | Marek Olšák | 1 | -17/+6
It's redundant now. It led to a simplification in si_llvm_emit_streamout, because outidx == reg. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
2014-10-12 | radeonsi: use tgsi_shader_info in si_llvm_emit_vs_epilogue | Marek Olšák | 1 | -26/+13
That code was really ugly. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
2014-10-12 | radeonsi: remove shader->input[] and output[] arrays and dependencies | Marek Olšák | 3 | -89/+2
They were reinventing tgsi_shader_info. They are unused now. radeon_llvm_context::load_input can be NULL if input fetching is implemented in some other way. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
2014-10-12 | radeonsi: move param_offset out of shader->input[] and output[] | Marek Olšák | 3 | -7/+10
Those are going away. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
2014-10-12 | radeonsi: use tgsi_shader_info to get a list of GS outputs | Marek Olšák | 2 | -14/+12
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>