summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2010-10-15llvmpipe: check shader outputs are non-null before usingKeith Whitwell
2010-10-15mesa: Add missing header to shaderobj.h.Vinson Lee
Include compiler.h for ASSERT symbol.
2010-10-15llvmpipe: fix non-sse build after recent changesKeith Whitwell
2010-10-15llvmpipe: use aligned loads/stores for plane valuesKeith Whitwell
2010-10-15llvmpipe: do plane calculations with intrinsicsKeith Whitwell
This is a step towards moving this code into the rasterizer.
2010-10-15gallium: move some intrinsics helpers to u_sse.hKeith Whitwell
2010-10-15llvmpipe: don't store plane.ei value in binned dataKeith Whitwell
Further reduce the size of a binned triangle.
2010-10-15llvmpipe: slightly shrink the size of a binned triangleKeith Whitwell
2010-10-15llvmpipe: don't pass frontfacing as a floatKeith Whitwell
2010-10-15llvmpipe: reintroduce SET_STATE binner commandKeith Whitwell
But bin lazily only into bins which are receiving geometry.
2010-10-15glsl: Fix ir validation for bit logic opsChad Versace
In ir_validate::visit_leave(), the cases for - ir_binop_bit_and - ir_binop_bit_xor - ir_binop_bit_or were incorrect. It was incorrectly asserted that both operands must be the same type, when in fact one may be scalar and the other a vector. It was also incorrectly asserted that the resultant type was the type of the left operand, which in fact does not hold when the left operand is a scalar and the right operand is a vector.
2010-10-15glsl: Implement constant expr evaluation for bitwise logic opsChad Versace
Implement by adding the following cases to ir_exporession::constant_expression_value(): - ir_binop_bit_and - ir_binop_bit_or - ir_binop_bit_xor
2010-10-15glsl: Implement constant expr evaluation for bit-shift opsChad Versace
Implement by adding the following cases to ir_expression::constant_expression_value(): - ir_binop_lshfit - ir_binop_rshfit
2010-10-15glsl: Implement constant expr evaluation for bitwise-notChad Versace
Implement by adding a case to ir_expression::constant_expression_value() for ir_unop_bit_not.
2010-10-15glsl: Implement ast-to-hir for binary shifts in GLSL 1.30Chad Versace
Implement by adding the following cases to ast_expression::hir(): - ast_lshift - ast_rshift Also, implement ir validation for the new operators by adding the following cases to ir_validate::visit_leave(): - ir_binop_lshift - ir_binop_rshift
2010-10-15glsl: Change generated file glsl_lexer.cppChad Versace
2010-10-15glsl: Add lexer rules for << and >> in GLSL 1.30Chad Versace
Commit for generated file glsl_lexer.cpp follows this commit.
2010-10-15r600g: evergreen interpolation support.Dave Airlie
On evergreen, interpolation has moved into the fragment shader, with the interpolation parmaters being passed via GPRs and LDS entries. This works out the number of interps required and reserves GPR/LDS storage for them, it also correctly routes face/position values which aren't interpolated from the vertex shader. Also if we noticed nothing is to be interpolated we always setup perspective interpolation for one value otherwise the GPU appears to lockup. This fixes about 15 piglit tests on evergreen.
2010-10-15tgsi: add scanner support for centroid inputsDave Airlie
2010-10-14glsl: Slightly change the semantic of _LinkedShadersIan Romanick
Previously _LinkedShaders was a compact array of the linked shaders for each shader stage. Now it is arranged such that each slot, indexed by the MESA_SHADER_* defines, refers to a specific shader stage. As a result, some slots will be NULL. This makes things a little more complex in the linker, but it simplifies things in other places. As a side effect _NumLinkedShaders is removed. NOTE: This may be a candidate for the 7.9 branch. If there are other patches that get backported to 7.9 that use _LinkedShader, this patch should be cherry picked also.
2010-10-14i965: Fix texturing on pre-gen5.Eric Anholt
I broke it in 06fd639c519214b6ebcbf29127b6d9ed429f8641 by only testing 2 generations of hardware :(
2010-10-14llvmpipe: code to dump bytecode to file (disabled)Brian Paul
2010-10-14gallivm: add compile-time option to emit inst addrs and/or line numbersBrian Paul
Disabling address printing is helpful for diffing.
2010-10-14mesa: remove post-convolution width/height varsBrian Paul
These were left-over bits from when convolution was removed.
2010-10-14glsl: Refresh autogenerated file builtin_function.cpp.Kenneth Graunke
2010-10-14glsl: Add support for the 1.30 round() built-in.Kenneth Graunke
This implements round() via the ir_unop_round_even opcode, rather than adding a new opcode. We may wish to add one in the future, since it might enable a small performance increase on some hardware, but for now, this should suffice.
2010-10-14i965: Add support for ir_unop_round_even via the RNDE instruction.Kenneth Graunke
2010-10-14glsl: Add front-end support for GLSL 1.30's roundEven built-in.Kenneth Graunke
Implemented using the op-code introduced in the previous commit.
2010-10-14glsl: Add a new ir_unop_round_even opcode for GLSL 1.30's roundEven.Kenneth Graunke
Also, update ir_to_mesa's "1.30 is unsupported" case to "handle" it.
2010-10-15r300g: clean up warning due to unknown cap.Dave Airlie
2010-10-15r600g: handle absolute modifier in shader translatorKeith Whitwell
This was being classed as unsupported in one place but used in others. Enabling it seems to work fine. Signed-off-by: Dave Airlie <airlied@redhat.com>
2010-10-15r600g: emit hardware linewidthKeith Whitwell
Tested with demos/pixeltest - line rasterization doesn't seem to be set up for GL conventions yet, but at least width is respected now. Signed-off-by: Dave Airlie <airlied@redhat.com>
2010-10-15r600/drm: fix segfaults in winsys create failure pathKeith Whitwell
Would try to destroy radeon->cman, radeon->kman both which were still NULL. Signed-off-by: Dave Airlie <airlied@redhat.com>
2010-10-14i965: Clean up a warning in the old fragment backend.Kenneth Graunke
Hopefully this code can just go away soon.
2010-10-14i965: Enable the new FS backend on pre-gen6 as well.Eric Anholt
It is now to the point where we have no regressing piglit tests. It also fixes Yo Frankie! and Humus DynamicBranching, probably due to the piglit bias tests that work that didn't on the Mesa IR backend. As a downside, performance takes about a 5-10% performance hit at the moment (e.g. nexuiz 19.8fps -> 18.8fps), which I plan to resolve by reintroducing 16-wide fragment shaders where possible. It is a win, though, for fragment shaders using flow control.
2010-10-14i965: Correctly emit the RNDZ instruction.Kenneth Graunke
Simply using RNDU, RNDZ, or RNDE does not produce the desired result. Rather, the RND* instructions place a value in the destination register that may be 1 less than the correct answer. They can also set per-channel "increment bits" in a flag register, which, if set, mean dest needs to be incremented by 1. A second instruction - a predicated add - completes the job. Notably, RNDD always produces the correct answer in a single instruction. Fixes piglit test glsl-fs-trunc.
2010-10-14i965: Use RNDZ for ir_unop_trunc in the new FS.Kenneth Graunke
The existing code used RNDD, which rounds down, rather than toward zero.
2010-10-14glsl: Refresh autogenerated file builtin_function.cpp.Kenneth Graunke
2010-10-14glsl: Add front-end support for the "trunc" built-in.Kenneth Graunke
2010-10-14i965: Use logical-not when emitting ir_unop_ceil.Kenneth Graunke
Fixes piglit test glsl-fs-ceil.
2010-10-14i965: Add peepholing of conditional mod generation from expressions.Eric Anholt
This cuts usually 2 out of 3 instructions for flag reg generation (if statements, conditional assignment) by producing the conditional mod in the expression representing the boolean value. Fixes glsl-fs-vec4-indexing-temp-dst-in-nested-loop-combined (register allocation no longer fails for the conditional generation proliferation)
2010-10-14i965: Add a function for handling the move of boolean values to flag regs.Eric Anholt
This will be a place to peephole comparisions directly to the flag regs, and for now avoids using MOV with conditional mod on gen6, which is now illegal.
2010-10-14Only install vtxfmt tables for OpenGLKristian Høgsberg
GLES1 and GLES2 install their own exec pointers and don't need the Save table. Also, the SET_* macros use different indices for the different APIs so the offsets used in vtxfmt.c are actually wrong for the ES APIs.
2010-10-14i965: Add a pass to the FS to split virtual GRFs to float channels.Eric Anholt
Improves nexuiz performance 0.91% (+/- 0.54%, n=8)
2010-10-14i965: Update the live interval when coalescing regs.Eric Anholt
2010-10-14i965: Set class_sizes[] for the aligned reg pair class.Eric Anholt
So far, I've only seen this be a valgrind warning and not a real failure.
2010-10-14llvmpipe: don't try to emit non-existent color outputsKeith Whitwell
2010-10-14Drop the "neutral" tnl moduleKristian Høgsberg
Just always check for FLUSH_UPDATE_CURRENT and call Driver.BeginVertices when necessary. By using the unlikely() macros, this ends up as a 10% performance improvement (for isosurf, anyway) over the old, complicated function pointer swapping.
2010-10-14st/egl: Do not finish a fence that is NULL.Chia-I Wu
i915g would dereference the NULL pointer.
2010-10-14st/egl: Access _EGLConfig directly.Chia-I Wu
Drop the use of SET_CONFIG_ATTRIB. Fix the value of EGL_SAMPLE_BUFFERS along the way.