summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2010-09-05gallivm: Cope with tgsi instruction reallocation failure.José Fonseca
2010-09-05ir_reader: Only validate IR when a global 'debug' flag is set.Kenneth Graunke
This extra validation is very useful when working on the built-ins, but in general overkill - the results should stay the same unless the built-ins or ir_validate have changed. Also, validating all the built-in functions in every test case makes piglit run unacceptably slow.
2010-09-05r300g,r300c: memset the compiler struct to zerosMarek Olšák
This should fix bogus reports "Too many temporaries." and maybe some others.
2010-09-04r300/compiler: Remove stray break statementTom Stellard
This fixes glsl-fs-loop-nested.
2010-09-05nvfx: support unlimited constants and immediates in fpLuca Barbieri
2010-09-05nvfx: support using blitter to copy depth/stencil resources, fix HeavenLuca Barbieri
We might want to copy them as color ones though. Also works around crash in Unigine Heaven due to failing to allocate a 64 MB temporary in GART for a CPU copy. Unigine Heaven now works on nv40, albeit with very heavy glitches (with the floating branch with render_hdr 0).
2010-09-05r300/compiler: fix the instruction limit in vertex shadersMarek Olšák
Broken with commit d774b0c710bb7d833d17bd12f5151a0176baad96. Reported by Chris Rankin.
2010-09-04nvfx: support rendering to more formatsLuca Barbieri
2010-09-04nvfx: move 2D format selection logic to 2D codeLuca Barbieri
2010-09-04nvfx: fix swizzling of high bpp surfacesLuca Barbieri
2010-09-04nvfx: fix some subrectangle copiesLuca Barbieri
Actually, we may want to get rid of the x/y coordinates for linear surfaces, and realign the origin from scratch if necessary, instead of doing this "on-demand realignment".
2010-09-04nvfx: fix inlinining in nv04_2d.cLuca Barbieri
2010-09-04nvfx: fix the temporary copying logic and add assertsLuca Barbieri
2010-09-04nvfx: prevent swizzled rendering into formats where it's not supportedLuca Barbieri
2010-09-04Revert "ir_to_mesa: Load all the STATE_VAR elements of a builtin uniform to ↵Marek Olšák
a temp." This reverts commit 5ad74779cea07cc6a19a52874cdaef8b018e2f1b. Sorry, but I had to revert this. Any commit which needlessly increases the number of temporaries is wrong. More temporaries mean less shader performance because of reduced parallelism and therefore less efficient latency hiding. In this case, there is possible performance degradation of every shader which uses GL state variables. I cannot accept this.
2010-09-04Revert "r300g: refuse to create a texture with size 0"Marek Olšák
This reverts commit 5cdedaaf295acae13ac10feeb3143d83bc53d314. https://bugs.freedesktop.org/show_bug.cgi?id=30002 Conflicts: src/gallium/drivers/r300/r300_texture.c
2010-09-04r300g: remove unnecessary assignmentsMarek Olšák
2010-09-04r300/compiler: indent printed instructions according to the branch depthMarek Olšák
2010-09-04r300g: skip draw calls with no vertex elements, fixing hardlocksMarek Olšák
2010-09-04r300/compiler: use limits from the compiler input instead of inline constantsMarek Olšák
2010-09-04r300/compiler: improve register allocation with indexable temporaries for VSMarek Olšák
Register allocation can now reallocate temporaries right after the last indexed source operand, instead of being disabled for the whole shader.
2010-09-04r300/compiler: fix handling of indexed temporaries in peepholeMarek Olšák
2010-09-04r300/compiler: disable deadcode elimination for indexed dst operandsMarek Olšák
2010-09-04r300/compiler: allocate at least FS inputs if register allocation is disabledMarek Olšák
2010-09-04r300g: add a new debug option which disables compiler optimizationsMarek Olšák
Those are: - dead-code elimination - constant folding - peephole (mainly copy propagation) - register allocation There are some bugs which I need to track down. Also fix up the descriptions of all the debug options.
2010-09-04r300/compiler: compute the final number of temporaries during translationMarek Olšák
And not during the register allocation, which may be skipped for debugging purposes. Also the predicate register is now added to the number of temps.
2010-09-04r300/compiler: make optimizations not use 0.5 swizzles in vertex shadersMarek Olšák
2010-09-04r300/compiler: use peephole and constant folding for vertex shaders tooMarek Olšák
2010-09-04r300/compiler: remove unused enum OPCODE_REPL_ALPHAMarek Olšák
We use RC_OPCODE_REPL_ALPHA instead.
2010-09-04r300/compiler: refactor fragment shader compilationMarek Olšák
This cleans up the mess in r3xx_compile_fragment_program.
2010-09-04r300/compiler: add new compiler parameter max_constantsMarek Olšák
2010-09-04r300/compiler: refactor vertex shader compilationMarek Olšák
First list compiler passes in an array, then run the new function rc_run_compiler. Every backend may need a different set of passes. This cleans up the mess in r3xx_compile_vertex_program.
2010-09-04r300/compiler: remove a redundant parameter in rc_pair_regallocMarek Olšák
2010-09-04r300/compiler: remove a redundant parameter in rc_dataflow_deadcodeMarek Olšák
&c->Base == c.
2010-09-04r300/compiler: use null-terminated array of transformation functionsMarek Olšák
I need to reduce the number of parameters of each compiler pass function. This is part of a larger cleanup.
2010-09-04r300g: only check for an empty shader if there are no compile errorsMarek Olšák
2010-09-04r300/compiler: add new compiler parameter max_alu_instsMarek Olšák
2010-09-04r300/compiler: put emulate_loop_state in radeon_compilerMarek Olšák
2010-09-04ir_reader: Run ir_validate on the generated IR.Kenneth Graunke
It's just too easy to get something wrong in hand-written IR.
2010-09-04ir_reader: Emit global variables at the top of the instruction list.Kenneth Graunke
Since functions are emitted when scanning for prototypes, functions always come first, even if the original IR listed the variable declarations first. Fixes an ir_validate error (to be turned on in the next commit).
2010-09-04ir_reader: Drop support for reading the old assignment format.Kenneth Graunke
2010-09-04glsl: Regenerate autogenerated file builtin_function.cpp.Kenneth Graunke
2010-09-04glsl/builtins: Convert assignments to new format (with write mask).Kenneth Graunke
2010-09-04ir_reader: Read the new assignment format (with write mask).Kenneth Graunke
This preserves the ability to read the old format, for momentary compatibility with all the existing IR implementations of built-ins.
2010-09-04ir_reader: Track the current function and report it in error messages.Kenneth Graunke
2010-09-04glsl/builtins: Actually print the info log if reading a builtin failed.Kenneth Graunke
2010-09-04nvfx: consolidate tiny filesLuca Barbieri
We probably want to reorganize the remaining files too, but that's for later, maybe.
2010-09-04mesa/st: add missing _mesa_set_fetch_functions in st_get_tex_imageLuca Barbieri
Fixes piglit fdo25614-genmipmap.
2010-09-04nvfx: fix vp DP2Luca Barbieri
2010-09-04nvfx: implement fp SSG properlyLuca Barbieri