summaryrefslogtreecommitdiff
path: root/src/mesa/drivers/dri/i965/brw_fs.cpp
AgeCommit message (Collapse)Author
2010-09-09i965: Add switch cases for ir_unop_noise, which should have been lowered.Eric Anholt
Fixes compiler warnings.
2010-09-08glsl: add several EmitNo* options, and MaxUnrollIterationsLuca Barbieri
This increases the chance that GLSL programs will actually work. Note that continues and returns are not yet lowered, so linking will just fail if not supported. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
2010-09-07i965: Make pixel_xy results UW.Eric Anholt
There is a restriction on the destination of an operation involving a vector immediate being 128-bit aligned and the destination horizontal stride being equivalent to 2 bytes. Fixes bad pixel_x results from gl_FragCoord, where each pair had the same value.
2010-09-07i965: Don't bother with RNDZ for f2i.Eric Anholt
The default type conversion for MOV should be fine, and RNDZ actually requires two instructions.
2010-09-07i965: Align the start of attribute interp coefficients in FS to use PLN.Eric Anholt
2010-09-07i965: Just assert when we flagged a compile error in the FS for now.Eric Anholt
Dumping back to potentially 16-wide dispatch doesn't really work out at the moment, and hopefully I'll just be able to resolve all the failures so we never have to do this at all.
2010-09-07i965: Clean up fs_reg setup by using a helper for constructors.Eric Anholt
2010-08-30i965: Add support for loops to the new FS backend.Eric Anholt
This includes a handy little safety check to prevent the loop from going "too long", as permitted by the spec. I haven't gone out of my way to test it, though… Fixes 20 more piglit tests.
2010-08-30i965: Add "discard" support to the new FS backend.Eric Anholt
Fixes 3 testcases related to discard.
2010-08-30i965: Fix the new implementation of ir_unop_sign to match brw_wm_emit.cEric Anholt
Like the comparison operations, this suffered from CMP only setting the low bit. Doing the AND instructions would be the same instruction count as the more obvious conditional moves, so do cond moves. Fixes glsl-fs-sign and 6 other cases, like trig functions that use sign() internally.
2010-08-30i965: Add support for texturing with bias to i965 FS backend.Eric Anholt
Fixes 5 piglit tests for bias. Note that LOD is a 1.30 feature and not yet supported.
2010-08-28i965: Add initial support for texturing to the new FS backend.Eric Anholt
Fixes 11 piglit tests.
2010-08-28i965: Move libdrm/C++ hack introduced in fa2deb3d to intel_context.hCedric Vivier
Fixes build on Linux/GCC 4.4 as libdrm includes are also used by other brw_fs_*.cpp files. Bug #29855
2010-08-27i965: Don't strip negate/abs flags when assigning uniform locations.Eric Anholt
Fixes glsl-algebraic-sub-zero-4.
2010-08-27i965: Add missing handling for BRW_OPCODE_SEL.Eric Anholt
Fixes 4 piglit tests about min, max, and clamp.
2010-08-27i965: Mask out higher bits of the result of BRW_CMP producing a boolean.Eric Anholt
When it says it sets the LSB, that's not just a hint as to where the result goes. Only the LSB is modified. Fixes 20 piglit cases.
2010-08-27i965: Fix the types of immediate integer values.Eric Anholt
When we're trying to do integer ops, handing a float in doesn't help.
2010-08-27i965: Add translation for RNDD and RNDZ.Eric Anholt
Fixes: glsl-fs-any. glsl1-integer division with uniform var
2010-08-27i965: Add support for ir_binop_mod using do_mod_to_fract.Eric Anholt
Fixes glsl-fs-mod.
2010-08-27i965: Fix swapped instructions in ir_unop_abs and ir_unop_neg.Eric Anholt
Fixes glsl-fs-neg and 5 other tests.
2010-08-27i965: Add generate() handling for AND, OR, XOR.Eric Anholt
10 more piglit tests pass.
2010-08-27i965: Add support for if instructions in the new FS backend.Eric Anholt
20 more piglit tests pass.
2010-08-27i965: When encountering an unknown opcode in new FS backend, print its name.Eric Anholt
2010-08-27i965: Fix the maximum grf counting in the new FS backend.Eric Anholt
glsl-algebraic-rcp-rsq managed to use 33 registers, and we claimed to only use 32, so the write to g32 would go stomping over the precious g0 of some other thread.
2010-08-27i965: Validate the IR tree after doing our custom optimization passes.Eric Anholt
This wouldn't catch the last failure fixed in them, because we don't validate assignments well (due to the fact that we've got a pretty glaring inconsistency in how we handle assignment writemasking), but it could catch other failure we may produce.
2010-08-27i965: Add a bit of support for matrices to the new FS.Eric Anholt
2010-08-27i965: Fix destination writemasking in the new FS.Eric Anholt
2010-08-26i965: Add preliminary support for uniforms to the new FS backend.Eric Anholt
+269 piglits
2010-08-26i965: Abort on gl_FragDepth in the new FS backend for now.Eric Anholt
It hangs the GPU due to FB_WRITE handling being incomplete. There are bigger issues to handle first.
2010-08-26i965: Fix up and actually enable the NewShader and NewShaderProgram hooks.Eric Anholt
2010-08-26i965: Hack in avoidance of c++ reserved keyword in libdrm.Eric Anholt
I'm also fixing this upstream in libdrm, but this avoids new libdrm dependency for the moment.
2010-08-26i965: Add GLSL IR-level source annotation and comments to new FS debug.Eric Anholt
This should make debugging way easier, as now we have context for reading large programs.
2010-08-26i965: Use the implied move in brw_math() in the new FS.Eric Anholt
2010-08-26i965: Add support for in varyings to the new FS codegen.Eric Anholt
At least some tests, like glsl-vs-sign, now work.
2010-08-26i965: Start building the codegen visitor.Eric Anholt
This can successfully emit a real program that generates magenta now.
2010-08-26i965: Start building direct GLSL2 IR to 965 assembly codegen.Eric Anholt
Our channel-expressions and vector-splitting changes now happen into a private copy of the IR that we maintain for ourselves. Uniform assignment still happens by the core, so we continue using Mesa IR generation not just for swrast fallbacks but also for uniform values (since there's no storage for their contents other than shader_program->FragmentProgram->Parameters->ParameterValues). And most importantly, at the moment no actual codegen is hooked up other than emitting our favorite color to the framebuffer.
2010-08-26i965: Add new pass to split vectors into scalar variablesEric Anholt
Combined with the previous pass, this lets other optimization passes do their work thanks to ir_tree_grafting. Still have regression in instruction count with INTEL_NEW_FS, but register count is even better.
2010-08-26i965: Add a pass for the FS to reduce vector expressions down to scalar.Eric Anholt
This is a step towards implementing a GLSL IR backend for the 965 fragment shader. Because it has downsides with the current codegen, it is hidden under the environment variable INTEL_NEW_FS. This results in an increase in instruction count at the moment (1444 -> 1752 for glsl-fs-raytrace, 345 -> 359 on my demo), because dot products are turned into a series of multiplies and adds instead of a custom expansion of MULs and MACs, and by not splitting the variable types up we don't get tree grafting and thus there are extra moves of temporary storage. However, register count drops for the non-GLSL path (64 -> 56 on my demo shader) because the register allocator sees all the sub-operations.
2010-08-26i965: Start building 965 FS backend.Eric Anholt