summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Collapse)Author
2010-08-18r600g: emit texture level offset in CB/DB setup.Dave Airlie
8 more piglit tests pass, fbo-clearmipmap, fbo-copyteximage, fbo-generatemipmap, fbo-generatemipmap-nonsquare, fbo-generatemipmap-scissor, fbo-generatemipmap-viewport, gen-teximage, gen-texsubimage
2010-08-17ir_to_mesa: Fix implementation of ir_binop_equal, ir_binop_notequal.Eric Anholt
These binops are the vector-to-bool comparisons, not vec-to-bvec. We likely want both operations avilable as expression, since 915 and 965 FS naturally does the vector version, while 965 VS can also naturally do the scalar version. However, we can save that until later. Fixes: glsl-fs-vec4-operator-equal.shader_test glsl-fs-vec4-operator-notequal.shader_test glsl-vs-vec4-operator-equal.shader_test glsl-vs-vec4-operator-notequal.shader_test
2010-08-17i915: Add support for SSG opcode.Eric Anholt
Fixes glsl-fs-sign and many of the tests of trig builtins.
2010-08-17i915: Add support for reading output regs in the FS.Eric Anholt
Fixes glsl-unused-varying and many others, since we produce an output reg read any time gl_FragColor is written inside an if statement.
2010-08-17i915: Add support for OPCODE_DP2.Eric Anholt
Fixes glsl-fs-dot-vec2.
2010-08-17i915: Enable ARB_fragment_shader by default.Eric Anholt
Now that we have glsl2 with if flattening in place, most shaders will just work. Remaining failing shaders will mostly be due to loop unrolling (in progress), some possible if flattening failures in inlining functions (planning on fixing), and the register/instruction count limits. While the GLSL and GLSL-ES specs say that shaders shouldn't fail to compile/link due to register/instruction limits, in practice we're not the first vendor to expose GLSL on hardware with these limitations. The benefit to application developers of providing a better language for GPU programming is greater than the pain of having to handle instruction limits (which they had to for ARB_fp on this hardware anyway)
2010-08-17glsl2: Fix transpose of rows and colsIan Romanick
This error led to an assertion failure for some constructors of non-square matrices. It only occured in matrices where the number of columns was greater than the number of rows. It didn't even always occur on those. Fixes piglit glslparsertest case constructor-16.vert.
2010-08-17r600c: Handle reads from PROGRAM_OUTPUTHenri Verbeet
with glsl2, reads from outputs are legal
2010-08-17linker: Demote user-defined varyings in the VS-only caseIan Romanick
Fixes piglit test case glsl-vs-ff-frag and bugzilla #29623.
2010-08-17prog_optimize: Only merge writes to temporary registersBenjamin Segovia
In one optimization pass, register files may have been messed therefore merging instructions which use the same index in two different register files.
2010-08-17r600g: fix fake pixel outputJerome Glisse
Signed-off-by: Jerome Glisse <jglisse@redhat.com>
2010-08-17i965: Add support for DP2 in the VS.Eric Anholt
Fixes glsl-vs-dot-vec2.
2010-08-17glsl: When doing algebraic simplification, make sure the type still matches.Eric Anholt
When simplifying (vec4(1.0) / (float(x))) to rcp(float(x)), we forgot to produce a vec4, angering ir_validate when starting alien-arena. Fixes: glsl-algebraic-add-zero-2 glsl-algebraic-div-one-2 glsl-algebraic-mul-one-2 glsl-algebraic-sub-zero-3 glsl-algebraic-rcp-sqrt-2
2010-08-17glsl: Make ir_algebraic new expressions allocate out of the parent.Eric Anholt
This could reduce the amount of memory used by a shader tree after optimization, and increases consistency with other passes.
2010-08-17ir_constant: Don't assert on out-of-bounds array accessesIan Romanick
Several optimization paths, including constant folding, can lead to accessing an ir_constant array with an out of bounds index. The GLSL spec lets us produce "undefined" results, but it does not let us crash. Fixes piglit test case glsl-array-bounds-01 and glsl-array-bounds-03.
2010-08-17mesa: Dump shader source before validating the shader.Eric Anholt
This will make extracting source to produce minimal testcases for shader compile issues easier.
2010-08-17r600c: fix dword miscount in blit emit codeAlex Deucher
2010-08-17targets/egl: Link with DRI_LIB_DEPS.Chia-I Wu
Use DRI_LIB_DEPS when linking GL/GLES state trackers. This fixes missing talloc symbol errors, and is hopefully more future proof.
2010-08-17gallivm: Fix and re-enable MMX-disabling codenobled
Signed-off-by: José Fonseca <jfonseca@vmware.com>
2010-08-17mesa: fix es1/2 build hopefullyDave Airlie
needed to add cpp rules and includes properly for es1/es2
2010-08-17r300g: fix context destroy under hyperzDave Airlie
we were destroying the mm before unrefing all the objects, so segfault. Signed-off-by: Dave Airlie <airlied@redhat.com>
2010-08-17r600g: kill event type magic number in winsysDave Airlie
these events have names, use them.
2010-08-17r600g: add user clip plane support.Dave Airlie
Apart from the fact that the radeon.h/r600_states.h editing is a nightmare, this wasn't so bad. passes piglit user-clip test now also trivial tests. Signed-off-by: Dave Airlie <airlied@redhat.com>
2010-08-16i965: Use the implied move available in most brw_wm_emit brw_math() calls.Eric Anholt
This saves an extra message reg move in the program, though I'm not clear on whether it will have any performance impact other than cache footprint. It will also fix those math calls on Sandybridge, where the brw_eu_emit.c brw_math() support relies on the implied move being used.
2010-08-16i965: Add disasm for Compr4 instruction compression.Eric Anholt
2010-08-16Merge branch 'glsl2'Ian Romanick
Conflicts: src/mesa/program/prog_optimize.c
2010-08-16util: Remove check_os_katmai_support.Vinson Lee
check_os_katmai_support checks that the operating system running on a SSE-capable processor supports SSE. This is necessary for unpatched 2.2.x and earlier kernels. 2.4.x and later kernels support SSE. check_os_katmai_support will disable SSE capabilities for 32-bit x86 operating systems for which there is no code path. Currently, this function handles Linux, Windows, and several BSDs. Mac OS, Cygwin, and Solaris are several operating systems with no code paths. Rather than add code for the unhandled operating systems, remove this function altogether. This will fix SSE detection on all recent 32-bit x86 operating systems. This completely breaks functionality on unpatched 2.2.x and earlier kernels, although there are likely no Gallium3D users on such operating systems.
2010-08-17r600g: drop libdrm_radeon linkDave Airlie
2010-08-16glsl2: Refresh autogenerated file builtin_function.cpp.Kenneth Graunke
2010-08-16glsl2: Add builtins profile for GLSL 1.30.Kenneth Graunke
Many functions are currently wrapped with #if 0 since we haven't implemented them yet.
2010-08-16linker: Include compiler.h to avoid spurious warnings about INLINEIan Romanick
2010-08-16glsl2: Silence unused variable warningIan Romanick
2010-08-16translate: Move loop variable declaration outside for loop.Vinson Lee
Fixes MSVC build.
2010-08-16glcpp: Refresh autogenerated lexer and parser.Kenneth Graunke
2010-08-16glcpp: Remove spurious newline generated by #version handling.Kenneth Graunke
This was causing line numbering to be off by one. The newline comes from the NEWLINE token at the end of the line; there's no need to insert one.
2010-08-16r300g: fix assert in the rasterizer block for r3xx-r4xxMarek Olšák
Reported-by: Niels Ole Salscheider <niels_ole@salscheider-online.de>
2010-08-16ir_to_mesa: Silence unused variable warningsIan Romanick
2010-08-16ir_to_mesa: Clean up assertions in ir_to_mesa_visitor::visit(ir_texture *)Ian Romanick
2010-08-16ir_to_mesa: Support texture rectangle targetsIan Romanick
2010-08-16translate: Remove unused temporary register.José Fonseca
Assuming the side-effect of x86_make_reg is also unnecessary.
2010-08-16translate: Eliminate void pointer arithmetic.José Fonseca
Non-portable.
2010-08-17st/egl: Add support for EGL_KHR_fence_sync.Chia-I Wu
The extension is implemented by pipe_fence_handle.
2010-08-17egl: Add support for EGL_KHR_fence_sync.Chia-I Wu
Individual drivers still need to support and enable the extension.
2010-08-17st/egl: Add support for EGL_KHR_reusable_sync.Chia-I Wu
The extension is implemented by pipe_condvar.
2010-08-17egl: Add support for EGL_KHR_reusable_sync.Chia-I Wu
Individual drivers still need to support and enable the extension.
2010-08-16r600c: blit emit updatesAlex Deucher
- set VGT_MAX_VTX_INDX to a larger value - emit PA_SC_AA_CONFIG. The command checker in 2.6.36+ requires this reg.
2010-08-16draw_llvm: fix segfaults on non-SSE2 CPUs where it is disabled (v2)Luca Barbieri
Changes in v2: - Change function name Currently draw_llvm refuses to create itself on non-SSE2 CPUs due to an alleged LLVM bug. However, this is implemented improperly, because other parts of draw still attempt to access draw->llvm, resulting in segfaults. Instead, put the check in debug_get_option_draw_use_llvm, check that before calling draw_llvm_create, and then check whether draw->llvm is non-null everywhere else.
2010-08-16translate_sse: major rewrite (v5)Luca Barbieri
NOTE: Win64 is untested, and is thus currently disabled. If you have such a system, please enable it and report whether it works. To enable it, change src/gallium/auxiliary/translate/translate.c Changes in v5: - On Win64, preserve %xmm6 and %xmm7 as required by the ABI - Use _WIN64 instead of WIN64 Changes in v4: - Use x86_target() and x86_target_caps() - Enable translate_sse in x86-64, but not in Win64 Changes in v3: - Win64 support (untested) - Use u_cpu_detect.h constants instead of #ifs Changes in v2: - Minimize #ifs - Give a name to magic number CHANNELS_0001 - Add support for CPUs without SSE (only memcpy and swizzles, like non SSE2) - Fixed comments translate_sse is currently very limited to the point of being useless in essentially all cases. In particular, it only support some float32 and unorm8 formats and doesn't work on x86-64. This commit rewrites it to support: 1. Dumb memory copy for any pair of identical formats 2. All formats that are swizzles of each other 3. Converting 32/64-bit floats and all 8/16/32-bit integers to 32-bit float 4. Converting unorm8/snorm8 to snorm16 and uscaled8/sscaled8 to sscaled16 5. Support for x86-64 (doesn't take advantage of it in any way though) This new translate can even be useful to translate index buffers for cards that lack 8-bit index support. It passes the testsuite I wrote, but note that this is a major change, and more testing would be great.
2010-08-16rtasm: add minimal x86-64 support and new instructions (v5)Luca Barbieri
Changes in v5: - Add sse2_movdqa Changes in v4: - Use _WIN64 instead of WIN64 Changes in v3: - Add target and target caps functions, so that they could be different in principle from the current CPU and they don't need #ifs to check Changes in v2: - Win64 support (untested) - Use u_cpu_detect.h constants instead of #ifs This commit adds minimal x86-64 support: only movs between registers are supported for r8-r15, and x64_rexw() must be used to ask for 64-bit operations. It also adds several new instructions for the new translate_sse code. movdqa
2010-08-16translate: add support for 8/16-bit indicesLuca Barbieri
Currently, only 32-bit indices are supported, but some use cases translate needs support for all types.