summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2010-07-28ir_to_mesa: Add remaining state variable (builtin uniforms) support.Eric Anholt
Fixes: glsl1-GL state variable reference (diffuse product) glsl1-GL state variable reference (gl_FrontMaterial.ambient) glsl1-GL state variable reference (gl_LightSource[0].diffuse) glsl1-GL state variable reference (point attenuation) glsl1-GL state variable reference (point size) glsl1-linear fog
2010-07-28glsl2: Add the remaining builtin uniforms.Eric Anholt
2010-07-28glsl2: Size builtin arrays according to the context constants.Eric Anholt
Cleans up some of the FINISHMEs in this file.
2010-07-28glcpp: Explicitly expect 0 shift/reduce conflicts.Carl Worth
The "%expect 0" construct will make bison emit an error if any future changes to the grammar introduce shift/reduce conflicts, (without also increasing the number after "%expect").
2010-07-28glcpp: Remove 2 shift/reduce conflicts from the grammar.Carl Worth
Since we have productions to turn "defined FOO" and "defined ( FOO )" into a conditional_token we don't need to list DEFINED as an operator as well. Doing so just introduces the shift/reduce ambiguity with no benefit.
2010-07-27glsl2: When stealing var->constant_value, steal its children as well.Eric Anholt
Fixes: glsl1-GLSL 1.20 uniform array constructor
2010-07-27ir_to_mesa: Provide a restricted type size to _mesa_add_uniform.Eric Anholt
Fixes: glsl-uniform-out-of-bounds.
2010-07-27glsl2: Actually use the linked dead code eliminator.Eric Anholt
I managed to revert the change from unlinked at some point while cleaning up the changes. glsl-fs-raytrace-bug27060 drops from 389 instructions to 370.
2010-07-27glsl2: Don't dereference a NULL var in CE handling during a compile error.Eric Anholt
If an undeclared variable was dereferenced in an expression that needed constant expression handling, we would walk off a null ir->var pointer. Fixes: glsl1-TIntermediate::addUnaryMath
2010-07-27ir_to_mesa: Add support for array constants.Eric Anholt
Fixes: glsl1-GLSL 1.20 array constructor 1 glsl1-GLSL 1.20 array constructor 2 glsl1-GLSL 1.20 array.length() glsl1-GLSL 1.20 const array constructor 1 glsl1-GLSL 1.20 const array constructor 2
2010-07-27glsl2: Use ir_dead_code's linked version after linking.Eric Anholt
glsl-fs-raytrace-bug27060 goes from 485 Mesa IR instructions to 389 before Mesa IR optimization.
2010-07-27glsl2: Fix the linked version of ir_dead_code.Eric Anholt
If we don't walk into functions, we won't see any usage of variables in the functions.
2010-07-27glsl2: Make the dead code handler make its own talloc context.Eric Anholt
This way, we don't need to pass in a parse state, and the context doesn't grow with the number of passes through optimization.
2010-07-27mesa: Allow large temporary indices coming into the temporary reg allocator.Eric Anholt
This gets glsl-vs-raytrace, glsl-fs-raytrace running on the new compiler.
2010-07-27glsl2: Talloc type names.Eric Anholt
Otherwise, we end up losing structure names after compile time, and dumping IR often ends up reporting some other mysterious string.
2010-07-27ir_to_mesa: Set the swizzle on constant struct src regs.Eric Anholt
MESA_GLSL=nopt now produces believable output for glsl-fs-raytrace.
2010-07-27ir_to_mesa: Fix stray "break" that broke functions of >1 argument.Eric Anholt
2010-07-27glsl2: Add optimization pass for algebraic simplifications.Eric Anholt
This cleans up the assembly output of almost all the non-logic tests glsl-algebraic-*. glsl-algebraic-pow-two needs love (basically, flattening to a temporary and squaring it).
2010-07-27ir_validate: Check the types of expression operations.Eric Anholt
2010-07-27glsl2: Fix missing visit_continue return in ir_validate.Eric Anholt
2010-07-26ir_to_mesa: Fix up handling of void function returns.Eric Anholt
void functions have a type of glsl_type::void_type, not a null type.
2010-07-26ir_to_mesa: Actually allocate the right size for constant matrix temps.Eric Anholt
2010-07-26ir_to_mesa: Add support for structure constants.Eric Anholt
Fixes: TPPStreamCompiler::assignOperands
2010-07-26glsl2: Also steal the constant components of aggregate-typed ir_constants.Eric Anholt
2010-07-26glsl2: Steal ir_variable's constant_value field.Kenneth Graunke
Fixes a link-time crash in glsl-vs-cross-3.
2010-07-26glsl2: Remove old .gitignore file from when we were out of tree.Eric Anholt
2010-07-26Merge remote branch 'origin/master' into glsl2Eric Anholt
This pulls in multiple i965 driver fixes which will help ensure better testing coverage during development, and also gets past the conflicts of the src/mesa/shader -> src/mesa/program move. Conflicts: src/mesa/Makefile src/mesa/main/shaderapi.c src/mesa/main/shaderobj.h
2010-07-26ir_to_mesa: Flag when we use the address reg.Eric Anholt
Hardware backends will get angry otherwise.
2010-07-26glx: Remove function prototypes no longer necessaryKristian Høgsberg
2010-07-26i965: Fix reversed naming of the operations in compute-to-mrf optimization.Eric Anholt
Also fix up comments, so that the difference between the two passes is clarified.
2010-07-26i965: Clean up a few magic numbers to use brw_defines.h defs.Eric Anholt
2010-07-26i965: Use MIN2, MAX2 instead of rolling our own.Eric Anholt
2010-07-26i965: Fold the "is arithmetic" bit of 965 opcodes into the opcode list.Eric Anholt
2010-07-26i965: Remove some duped register size/count definitionsEric Anholt
2010-07-26i965: Move the GRF-to-MRF optimizations to brw_optimize.c.Eric Anholt
2010-07-26i965: Improve (i.e. remove) some grf-to-mrf unnecessary movesBenjamin Segovia
Several routines directly analyze the grf-to-mrf moves from the Gen binary code. When it is possible, the mov is removed and the message register is directly written in the arithmetic instruction Also redundant mrf-to-grf moves are removed (frequently for example, when sampling many textures with the same uv) Code was tested with piglit, warsow and nexuiz on an Ironlake machine. No regression was found there Note that the optimizations are *deactivated* on Gen4 and Gen6 since I did test them properly yet. No reason there are bugs but who knows The optimizations are currently done in branch free programs *only*. Considering branches is more complicated and there are actually two paths: one for branch free programs and one for programs with branches Also some other optimizations should be done during the emission itself but considering that some code is shader between vertex shaders (AOS) and pixel shaders (SOA) and that we may have branches or not, it is pretty hard to both factorize the code and have one good set of strategies
2010-07-26glx: Drop debug fprintf that snug in with the previous commitKristian Høgsberg
2010-07-26glx: Enable copy subbuffer patch when GLX_DIRECT_RENDERING is #definedKristian Høgsberg
Depending on __DRI_COPY_SUB_BUFFER doesn't work when we no longer include dri_interface.h. https://bugs.freedesktop.org/show_bug.cgi?id=29264
2010-07-26st/egl: Fix debug linenobled
Acked-by: Jakob Bornecrantz <jakob@vmware.com>
2010-07-26util: fix CPU detection on OS Xnobled
s/PIPE_OS_DARWIN/PIPE_OS_APPLE, since there is no PIPE_OS_DARWIN. Acked-by: Vinson Lee <vlee@vmware.com>
2010-07-26st/xorg: fix use-after-freenobled
Acked-by: Jakob Bornecrantz <jakob@vmware.com>
2010-07-26i965g: Enable llvm in dri driver if builtJakob Bornecrantz
2010-07-26i915g: Fix llvm buildnobled
Acked-by: Jakob Bornecrantz <jakob@vmware.com>
2010-07-26glx: Drop duplicate psc field in dri context structKristian Høgsberg
Same problem as fixed for drisw in 4d58b5b482d06ab8d4c4b2db33d0b48b7c82d064.
2010-07-26i965: Allow VS MOVs to use immediate constants.Eric Anholt
Clarifies program assembly, and with a little tweak to always use constant_map, we could cut down on constant buffer payload.
2010-07-26r600g: implememt the LIT instructionStephan Schmid
2010-07-26util: fix another mutex leak in mempoolMarek Olšák
By fixing one, I introduced another. Crap.
2010-07-26util: fix mutex leaks in mempoolMarek Olšák
2010-07-26r300g: fix macro substitution problemDave Airlie
isn't a problem yet, but have issues in hiz branch. Signed-off-by: Dave Airlie <airlied@redhat.com>
2010-07-25r300g: implement D24X8 texture sampling for r3xx-r4xxMarek Olšák
Because the hw can't sample it, I reinterpret the format as G16R16 and sample the G component. This gives 16 bits of precision, which should be enough for depth texturing (surprisingly, the sampled values are exactly the same as in D16 textures). This also enables EXT_packed_depth_stencil on those old chipsets, finally.