summaryrefslogtreecommitdiff
AgeCommit message (Collapse)AuthorFilesLines
2016-05-30anv: automake: bring back VULKAN_ENTRYPOINT_CPPFLAGSEmil Velikov1-2/+6
We should not have removed them in the first place. There's a subtle difference between generating the complete sources and using them which was not obvious as we nuked them. Without this, the release tarball ends up without various hunks of the generated sources, thus things fail at a later stage as we attempt to build them. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30anv: automake: ship the json files in the release tarballEmil Velikov1-1/+3
Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30softpipe: add sp_buffer.h to the sources list (release tarball)Emil Velikov1-0/+1
Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
2016-05-30freedreno: make sure we pick up ir3_nir_trig.py in the release tarballEmil Velikov1-0/+1
Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
2016-05-30isl: add isl_priv.h to the sources listEmil Velikov1-0/+1
Otherwise it will be missing from the release tarball. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30isl: move the sources lists to Makefile.sourcesMauro Rossi2-32/+39
[Emil Velikov: use the file in the autoconf build] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30isl: automake: list builddir before srcdir in the includes listEmil Velikov1-4/+3
As seen elsewhere - we want to include the freshly built sources as opposed the the (likely) stale ones in the srcdir. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30isl: automake: flatten the tests rulesEmil Velikov1-9/+4
Fold the unneeded extra variable tests_ldadd, the explicit sources section (single file with the default extension) and flip the check_PROGRAMS <> TESTS order (TESTS includes scripts, while check_PROGRAMS is binaries only). Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30isl: automake: remove unneeded install-lib-links.mk includeEmil Velikov1-2/+0
One uses the makefile to create compatibility symlinks (to $top_builddir/libs) for shared libraries/modules. As we don't create any here, there's no need to include the file. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30isl: automake: remove unneeded SUBDIRSEmil Velikov1-3/+0
As we do not include any other subdirs but self, we don't need to set it. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30genxml: move the sources (headers) list to Makefile.sourcesMauro Rossi2-6/+9
[Emil Velikov: use the file in the autoconf build] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com>
2016-05-30anv: bail out if anv_wsi_init() failsEmil Velikov1-1/+3
Otherwise we'll end up setting up a device with no winsys integration. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> --- Hard-coding the rendernode name in anv_physical_device_init() is a bad idea really. We could/should be using drmGetDevices() to get info on all the devices (master/render/etc. node names, pci location etc.) and apply our heuristics on top of that. That can come up as a follow up change.
2016-05-30anv: resolve wayland-only buildEmil Velikov1-0/+2
Ensure that the final X11/XCB hunk is guarded by the correct macro. Otherwise we'll require the symbol even when building without said platform. Cc: Cedric Sodhi <manday@openmail.cc> Reported-by: Cedric Sodhi <manday@openmail.cc> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30anv: Fix use of uninitialized variable.Robert Foss1-2/+6
The return variable was not set for failure paths. It has now been changed to VK_ERROR_INITIALIZATION_FAILED for failure paths. Coverity: 1358944 Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> [Emil Velikov: rebase against master, s/vulkan/anv/] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
2016-05-30gallium: push offset down to driverStanimir Varbanov9-5/+53
Push offset down to drivers when importing dmabuf. This is needed to more fully support EGL_EXT_image_dma_buf_import when a non-zero offset is specified. Tesing has been done for freedreno, and compile tested following gallium drivers: nouveau,svga,virgl,r600,r300,radeonsi,swrast,i915,ilo Signed-off-by: Stanimir Varbanov <stanimir.varbanov@linaro.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
2016-05-30st/dri: cleanup image_from_fd/dma_buf pathsStanimir Varbanov1-37/+44
Signed-off-by: Stanimir Varbanov <stanimir.varbanov@linaro.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
2016-05-30st/dri: add handling of R8 and GR88 DRI fourcc formatsStanimir Varbanov1-0/+20
This helps to import dmabuf buffers from DRM_FORMAT_R8 and DRM_FORMAT_GR88 used for example by GStreamer for YUV to RGB conversion using shaders. Signed-off-by: Stanimir Varbanov <stanimir.varbanov@linaro.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>
2016-05-30radeonsi: Don't offset OFFCHIP_BUFFERING on pre-VI cards.Bas Nieuwenhuizen1-2/+6
Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96239 Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2016-05-29i965: Expose GL 4.3 on Gen8+.Francisco Jerez3-4/+3
ARB_compute_shader was the last feature missing. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Skip gen4 pre/post-send dependency workaronds for the first/last block.Francisco Jerez1-2/+2
We know that there cannot be any destination dependency race if we reach the beginning or end of the program without having found any other instruction the send could possibly race with. This avoids emitting a pile of useless moves at the beginning or end of the program in the most common case in which the program has a single basic block only. On the original i965 I get the following shader-db results: total instructions in shared programs: 3354165 -> 3215637 (-4.13%) instructions in affected programs: 3183065 -> 3044537 (-4.35%) helped: 13498 HURT: 0 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Skip SIMD lowering source unzipping for regular scalar regions.Francisco Jerez1-2/+21
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Factor out region zipping and unzipping from the SIMD lowering pass.Francisco Jerez1-60/+84
Just to make sure we keep the SIMD lowering pass tidy when we introduce additional logic to try to optimize out the copy instructions used to zip and unzip the destination and source regions into multiple packed regions of the lowered instruction width. Shouldn't cause any functional changes. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Generalize regions_overlap() from copy propagation to handle ↵Francisco Jerez2-12/+42
non-VGRF files. This will be useful in several places. The only externally visible difference (other than non-VGRF files being supported now) is that the region sizes are now passed in byte units instead of in GRF units because the loss of precision would have become a problem in the SIMD lowering pass. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Refactor offset() into a separate function taking the width as ↵Francisco Jerez2-16/+21
argument. This will be useful in the SIMD lowering pass to avoid having to construct a builder object of the known region width just to pass it as argument to offset(), which doesn't do anything with it other than taking the builder dispatch_width as region width. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Implement opt_sampler_eot() in terms of logical sends.Francisco Jerez2-55/+40
This makes the whole LOAD_PAYLOAD munging unnecessary which simplifies the code and will allow the optimization to succeed in more cases independent of whether the LOAD_PAYLOAD instruction can be found or not. The following patch is squashed in: SQUASH: i965/fs: Add basic dataflow check to opt_sampler_eot(). The sampler EOT optimization pass naively assumes that the texturing instruction provides all the data used by the FB write just because they're standing next to each other. The least we should be checking is whether the source and destination regions of the FB write and texturing instructions match. Without this the previous seemingly harmless patch would have caused opt_sampler_eot() to misoptimize a shader from dota-2 causing DCE to eliminate all of its 78 instructions except for the final sampler EOT message (!). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Fix UB list sentinel dereference in opt_sampler_eot().Francisco Jerez1-6/+9
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Take opt_redundant_discard_jumps out of the optimization loop.Francisco Jerez1-2/+1
No shader-db regressions. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Run SIMD and logical send lowering after the optimization loop.Francisco Jerez1-4/+25
There are two reasons why this is useful: - It avoids the introduction of an amount of partial writes emitted by the SIMD lowering pass to zip and unzip register regions early during optimization, which can make subsequent optimization less effective. - It substantially reduces the burden on the compiler when a large fraction of the instructions in the program need to be split (e.g. during SIMD32 builds). Individual halves of split instructions will be optimized identically (if they can still be optimized at all), so doing it up front can duplicate the amount of instructions the optimizer has to deal with which causes the compilation time to explode in some cases due to the worse-than-linear runtime behaviour of the back-end. It seems helpful to re-run a few optimization passes in cases where any of the lowering passes was able to make progress. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Add FS_OPCODE_FB_WRITE_LOGICAL to has_side_effects().Francisco Jerez1-0/+1
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Allow constant propagation into logical send sources.Francisco Jerez1-0/+34
Logical sends are eventually lowered into a series of copies so they can take almost anything as source. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Let CSE handle logical sampler sends as expressions.Francisco Jerez1-0/+13
This will prevent some shader-db regressions when we start plumbing logical sends through the optimizer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29i965/fs: Pass a BAD_FILE register to the logical FB write when oMask is unused.Francisco Jerez2-2/+3
This will let the optimizer know that the sample mask value is unused so its definition can be DCE'ed. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-30glsl: fix xfb_offset unsized array validationTimothy Arceri4-31/+41
This partially fixes CTS test: GL44-CTS.enhanced_layouts.xfb_get_program_resource_api The test now fails at a tes evaluation shader with unsized output arrays. The ARB_enhanced_layouts spec says: "It is a compile-time error to apply xfb_offset to the declaration of an unsized array." So this seems like a bug in the CTS. Reviewed-by: Dave Airlie <airlied@redhat.com>
2016-05-30glsl: dont crash when attempting to assign a value to a builtin defineTimothy Arceri1-1/+1
For example GL_ARB_enhanced_layouts = 3; Fixes: GL44-CTS.enhanced_layouts.glsl_contant_immutablity Reviewed-by: Dave Airlie <airlied@redhat.com>
2016-05-30egl/dri3: don't crash on no context.Dave Airlie1-2/+4
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94925 Pointed out by Karol Herbst on irc. Signed-off-by: Dave Airlie <airlied@redhat.com> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2016-05-30mesa/program_interface_query: fix transform feedback varyings.Dave Airlie2-35/+48
The spec says gl_NextBuffer and gl_SkipComponents need to be returned to userspace in the program interface queries. We currently throw those away, this requires a complete piglit run to make sure no drivers fallover due to the extra varyings. This fixes: GL45-CTS.program_interface_query.transform-feedback-built-in Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Signed-off-by: Dave Airlie <airlied@redhat.com>
2016-05-30glsl/ast: subroutineTypes can't be returned from functions.Dave Airlie1-0/+9
These types can't be returned. This fixes: GL43-CTS.shader_subroutine.subroutines_not_allowed_as_variables_constructors_and_argument_or_return_types for the return type case. Reviewed-by: Chris Forbes <chrisforbes@google.com> Signed-off-by: Dave Airlie <airlied@redhat.com>
2016-05-30glsl: use has_double() helperTimothy Arceri1-2/+1
Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>
2016-05-30glsl: fix explicit uniform block alignmentTimothy Arceri1-3/+1
This stops the offset being bumped again when and an explicit alignment has already been applied. Fixes alignment issues in: GL44-CTS.enhanced_layouts.uniform_block_alignment Note the test still fails due to unrelated issues with doubles. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>
2016-05-29i965: Shrink stage_prog_data param array lengthJordan Justen6-14/+6
It appears we were over-allocating these arrays. Previously we would use nir->num_uniforms directly for scalar programs, and multiply it by 4 for vec4 programs. Instead we should have been dividing by 4 in both cases to convert from bytes to a gl_constant_value count. The size of gl_constant_value is 4 bytes. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-29nv50,nvc0: fix the max_vertices=0 caseIlia Mirkin3-2/+4
This is apparently legal. Drop any emit/restarts, and pass a 1 to the hardware. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
2016-05-29st/mesa: fix setting of point_size_per_vertex in ES contextsIlia Mirkin1-2/+18
GL ES 2.0+ does not have a GL_PROGRAM_POINT_SIZE enable, unlike desktop GL. So we have to go and check the last pre-rasterizer stage to see whether it outputs a point size or not. This fixes a number of dEQP tests that use a geometry or tessellation shader to emit points primitives. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org>
2016-05-29mesa: skip level checking for FramebufferTexture*D if texture is zeroMarek Olšák1-3/+3
From the OpenGL 4.5 core spec: "An INVALID_VALUE error is generated if texture is not zero and level is not a supported texture level for textarget, as described above." Other FramebufferTexture functions already do the right thing. This fixes the main menu in F1 2015. Cc: 11.1 11.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com>
2016-05-28st/mesa: expose OES_shader_io_blocks when we have enough for ES 3.1Ilia Mirkin3-2/+8
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com>
2016-05-28swr: [rasterizer] Do not define _mm256_storeu2_m128i with icc.Vinson Lee1-1/+1
Fix build error with icc. CXX libswrAVX_la-swr_clear.lo icpc: command line warning #10006: ignoring unknown option '-Wdelete-non-virtual-dtor' In file included from ./rasterizer/jitter/jit_api.h(31), from swr_context.h(30), from swr_clear.cpp(24): ./rasterizer/common/os.h(135): error: expected an identifier void _mm256_storeu2_m128i(__m128i *hi, __m128i *lo, __m256i a) ^ Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>
2016-05-28i965: add missing return in if statementThomas Hindoe Paaboel Andersen1-0/+1
Re-add the "return false" that was removed in 0c02d7002d6c005b4c1fe997b5ef5916978dd183 It seems that something went wrong when merging the patch. The patch sent to the mailing list does not directly match what was committed. https://lists.freedesktop.org/archives/mesa-dev/2016-May/118198.html Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
2016-05-28gk110/ir: fix unspilling of predicates from registersIlia Mirkin1-0/+28
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96258 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.2 11.1" <mesa-stable@lists.freedesktop.org>
2016-05-28nvc0: remove outdated surfaces validation code for GK104Samuel Pitoiset1-70/+0
This code was used for validating surfaces with compute but now we use pipe_image_view instead. Anyway, surfaces support should be re-introduced properly once OpenCL happens. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2016-05-28nvc0: do not always invalidate 3D CBs when using computeSamuel Pitoiset1-8/+17
Constant buffers are aliased between 3D and CP on Fermi, but we should only invalidate them when a compute shader actually uses CBs and not all the time after a lauching grid. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2016-05-27i965: Update compute workgroup size limit calculation for SIMD32.Francisco Jerez1-11/+3
This should have the side effect of enabling the ARB_compute_shader extension on Gen8+ hardware and all Gen7 platforms that didn't previously expose it (VLV and IVB GT1) due to the number of hardware threads per subslice being insufficient in SIMD16 mode. v2: Bump workgroup size limit for GLES too (Jordan). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>