android-x86-mesa.git - Android/x86 port of Mesa drivers

Age	Commit message	Author
2011-01-18	ra: Take advantage of the adjacency list in finding a node to spill.	Eric Anholt
	This revealed a bug in ra_get_spill_benefit where we only considered the benefit of the first adjacency we were to remove, explaining some of the ugly spilling I've seen in shaders. Because of the reduced spilling, it reduces the runtime of glsl-fs-convolution-1 36.9% +/- 0.9% (n=5).
2011-01-18	ra: Remove unused "name" field in regs.	Eric Anholt

2011-01-18	ra: Take advantage of the adjacency list in ra_select() too.	Eric Anholt
	Reduces runtime of glsl-fs-convolution-1 another 13.9% +/- 0.6% (n=5).
2011-01-18	ra: Add an adjacency list to trade space for time in ra_simplify().	Eric Anholt
	This was recommended in the original paper, but I figured "make it run" before "make it fast". Now we make it fast. Reduces the runtime of glsl-fs-convolution-1 by 12.7% +/- 0.6% (n=5).
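	The space-for-time trade described above can be sketched as follows. This is a minimal illustration, not Mesa's actual register allocator code: the struct layout and every function name here (graph_create, graph_add_interference, graph_live_degree, graph_destroy) are hypothetical, and allocation-failure handling is omitted for brevity. Each node keeps a dense boolean row for O(1) "do these two interfere?" queries plus a compact list of neighbours, so a degree count walks only real neighbours instead of every node in the graph.

```c
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical node layout for illustration only. */
struct node {
    bool *adjacency;           /* dense row: adjacency[j] is true if j interferes */
    unsigned *adjacency_list;  /* compact list of interfering node indices */
    unsigned adjacency_count;  /* number of valid entries in adjacency_list */
};

struct graph {
    struct node *nodes;
    unsigned count;
};

/* Allocate a graph with `count` nodes and no interferences. */
struct graph *graph_create(unsigned count)
{
    struct graph *g = malloc(sizeof(*g));
    g->nodes = calloc(count, sizeof(*g->nodes));
    g->count = count;
    for (unsigned i = 0; i < count; i++) {
        g->nodes[i].adjacency = calloc(count, sizeof(bool));
        g->nodes[i].adjacency_list = calloc(count, sizeof(unsigned));
    }
    return g;
}

void graph_destroy(struct graph *g)
{
    for (unsigned i = 0; i < g->count; i++) {
        free(g->nodes[i].adjacency);
        free(g->nodes[i].adjacency_list);
    }
    free(g->nodes);
    free(g);
}

/* Record n2 as a neighbour of n1, keeping matrix and list in sync. */
static void add_one_way(struct graph *g, unsigned n1, unsigned n2)
{
    struct node *n = &g->nodes[n1];
    if (n->adjacency[n2])
        return;                /* already recorded; avoid duplicate list entries */
    n->adjacency[n2] = true;
    n->adjacency_list[n->adjacency_count++] = n2;
}

/* Interference is symmetric, so record it in both directions. */
void graph_add_interference(struct graph *g, unsigned n1, unsigned n2)
{
    if (n1 == n2)
        return;
    add_one_way(g, n1, n2);
    add_one_way(g, n2, n1);
}

/* Count neighbours of n that have not yet been pushed on the stack.
 * The cost is proportional to the number of neighbours, not to g->count. */
unsigned graph_live_degree(const struct graph *g, unsigned n,
                           const bool *in_stack)
{
    const struct node *node = &g->nodes[n];
    unsigned degree = 0;
    for (unsigned i = 0; i < node->adjacency_count; i++) {
        if (!in_stack[node->adjacency_list[i]])
            degree++;
    }
    return degree;
}
```

	The extra memory is one index array per node; in exchange, a simplify-style loop that repeatedly asks for a node's remaining degree no longer scans a full matrix row each time.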
2011-01-18	ra: Trade off some space to get time efficiency in ra_set_finalize().	Eric Anholt
	Our use of the register allocator in i965 is somewhat unusual. Whereas most architectures would have a smaller set of registers with fewer register classes and reuse that across compilation, we have 1, 2, and 4-register classes (usually) and a variable number up to 128 registers per compile depending on how many setup parameters and push constants are present. As a result, when compiling large numbers of programs (as with glean texCombine going through ff_fragment_shader), we spent much of our CPU time in computing the q[] array. By keeping a separate list of what the conflicts are for a particular reg, we reduce glean texCombine time 17.0% +/- 2.3% (n=5). We don't expect this optimization to be useful for 915, which will have a constant register set, but it would be useful if we were to switch to this register allocator for Mesa IR.
2011-01-15	Merge branch 'draw-instanced'	Brian Paul
	Conflicts:
		src/gallium/auxiliary/draw/draw_llvm.c
		src/gallium/drivers/llvmpipe/lp_state_fs.c
		src/glsl/ir_set_program_inouts.cpp
		src/mesa/tnl/t_vb_program.c
2011-01-14	ir_to_mesa: Fix segfaults on ir_to_mesa invocation after MSVC change.	Eric Anholt

2011-01-14	mesa: Dynamically allocate acp array in ir_to_mesa_visitor::copy_propagate.	Vinson Lee
	Fixes these MSVC errors.
	ir_to_mesa.cpp(2644) : error C2057: expected constant expression
	ir_to_mesa.cpp(2644) : error C2466: cannot allocate an array of constant size 0
	ir_to_mesa.cpp(2644) : error C2133: 'acp' : unknown size
	ir_to_mesa.cpp(2646) : error C2070: 'ir_to_mesa_instruction []': illegal sizeof operand
	ir_to_mesa.cpp(2709) : error C2070: 'ir_to_mesa_instruction []': illegal sizeof operand
	ir_to_mesa.cpp(2718) : error C2070: 'ir_to_mesa_instruction *[]': illegal sizeof operand
2011-01-14	mesa: Add channel-wise copy propagation to ir_to_mesa.	Eric Anholt
	This catches more opportunities than the prog_optimize.c code on openarena's fixed function shaders turned to GLSL, mostly due to looking at multiple source instructions for copy propagation opportunities. It should also be much more CPU efficient than prog_optimize.c's code.
2011-01-09	mesa: Include mfeatures.h in program.c.	Vinson Lee
	Include mfeatures.h for feature tests.
2010-12-27	glsl: Support if-flattening beyond a given maximum nesting depth.	Kenneth Graunke
	This adds a new optional max_depth parameter (defaulting to 0) to lower_if_to_cond_assign, and makes the pass only flatten if-statements nested deeper than that. By default, all if-statements will be flattened, just like before. This patch also renames do_if_to_cond_assign to lower_if_to_cond_assign, to match the new naming conventions.
2010-12-18	mesa: Clean up header file inclusion in prog_statevars.h.	Vinson Lee

2010-12-14	mesa: more program debug code	Brian Paul

2010-12-14	mesa: Clean up header file inclusion in prog_optimize.h.	Vinson Lee

2010-12-14	mesa: Clean up header file inclusion in prog_cache.h.	Vinson Lee

2010-12-14	mesa: Clean up header file inclusion in nvvertparse.h.	Vinson Lee

2010-12-13	ir_to_mesa: Don't generate swizzles for record derefs of non-scalar/vectors	Ian Romanick
	This is the same as what the array dereference handler does. Fixes piglit test glsl-link-struct-array (bugzilla #31648). NOTE: This is a candidate for the 7.9 and 7.10 branches.
2010-12-11	mesa: Clean up header file inclusion in nvfragparse.h.	Vinson Lee

2010-12-11	mesa: Clean up header file inclusion in ir_to_mesa.h.	Vinson Lee

2010-12-10	mesa: implement system values in program interpreter	Brian Paul

2010-12-09	mesa: Clean up header file inclusion in arbprogparse.h.	Vinson Lee

2010-12-08	mesa: ir_to_mesa support for system values	Brian Paul

2010-12-08	mesa: program printing for PROGRAM_SYSTEM_VALUE	Brian Paul

2010-12-06	symbol_table: Add support for adding a symbol at top-level/global scope.	Kenneth Graunke

2010-12-06	mesa: Bump the number of bits in the register index.	José Fonseca
	More than 1023 temporaries were being used for a Cinebench shader before doing temporary optimization, causing the index value to wrap around to -1024.
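	The wrap-around described above is what two's-complement truncation does to a value that no longer fits a signed field. The sketch below emulates it portably; store_in_signed_field is a hypothetical helper, not Mesa code. The 11-bit width is inferred from the 1023 / -1024 limits in the message, and the 12-bit width is only an arbitrary example of a wider field, since the message does not state the new size.

```c
#include <stdint.h>

/* Emulate storing `value` in a signed two's-complement field that is
 * `bits` wide (1 <= bits < 32) and reading it back. */
int store_in_signed_field(int value, unsigned bits)
{
    uint32_t mask = (UINT32_C(1) << bits) - 1;  /* keep only the low `bits` bits */
    uint32_t sign = UINT32_C(1) << (bits - 1);  /* position of the sign bit */
    uint32_t raw = (uint32_t)value & mask;

    /* Sign-extend: flipping the sign bit and subtracting it maps the raw
     * field onto the range [-2^(bits-1), 2^(bits-1) - 1]. */
    return (int)(raw ^ sign) - (int)sign;
}
```

	With an 11-bit field, index 1023 survives but index 1024 reads back as -1024; with one more bit, 1024 round-trips unchanged.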
2010-12-03	mesa: update comments, remove dead code	Brian Paul

2010-12-03	mesa: remove unneeded cast	Brian Paul

2010-12-03	mesa, st/mesa: fix gl_FragCoord with FBOs in Gallium	Marek Olšák
	gl_FragCoord.y needs to be flipped upside down if a FBO is bound.
	This fixes:
	- piglit/fbo-fragcoord
	- https://bugs.freedesktop.org/show_bug.cgi?id=29420
	Here I add a new program state STATE_FB_WPOS_Y_TRANSFORM, which is set based on whether a FBO is bound. The state contains a pair of transformations. It can be either (XY=identity, ZW=transformY) if a FBO is bound, or (XY=transformY, ZW=identity) otherwise, where identity = (1, 0), transformY = (-1, height-1).
	A classic driver (or st/mesa) may, based on some other state, choose whether to use XY or ZW, thus negating the conditional "if (is a FBO bound) ...". The reason for this is that a Gallium driver is allowed to only support WPOS relative to either the lower left or the upper left corner, so we must flip the Y axis accordingly again. (the "invert" parameter in emit_wpos_inversion)
	NOTE: This is a candidate for the 7.9 branch.
	Signed-off-by: Marek Olšák <maraeo@gmail.com>
	Signed-off-by: Brian Paul <brianp@vmware.com>
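	The pair-of-transformations idea can be illustrated with a small sketch. It assumes each pair is applied as y' = y * scale + bias, which is consistent with identity = (1, 0) and transformY = (-1, height-1) but is not spelled out in the message. The struct and both function names are hypothetical and are not the Mesa state-variable code.

```c
/* Four floats laid out as (X, Y, Z, W): XY is one (scale, bias) pair,
 * ZW is the other. Hypothetical type for illustration. */
struct wpos_y_transform {
    float v[4];
};

/* Build the state value as described: XY=identity, ZW=transformY when an
 * FBO is bound, and the two pairs swapped otherwise. */
struct wpos_y_transform make_wpos_y_transform(int fbo_bound, float height)
{
    const float identity[2] = { 1.0f, 0.0f };
    const float flip[2] = { -1.0f, height - 1.0f };
    const float *xy = fbo_bound ? identity : flip;
    const float *zw = fbo_bound ? flip : identity;
    struct wpos_y_transform t = { { xy[0], xy[1], zw[0], zw[1] } };
    return t;
}

/* A driver picks one pair (XY or ZW) and applies y' = y * scale + bias.
 * Picking ZW instead of XY inverts the "is an FBO bound" decision. */
float apply_wpos_y(const struct wpos_y_transform *t, int use_zw, float y)
{
    const float *pair = use_zw ? &t->v[2] : &t->v[0];
    return y * pair[0] + pair[1];
}
```

	For a 480-pixel-high target with an FBO bound, the XY pair leaves y = 10 unchanged while the ZW pair maps it to 469; without an FBO the roles are reversed.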
2010-12-01	glsl: Lower ir_binop_pow to a sequence of EXP2 and LOG2	Ian Romanick

2010-12-01	glsl: Add a lowering pass to move discards out of if-statements.	Kenneth Graunke
	This should allow lower_if_to_cond_assign to work in the presence of discards, fixing bug #31690 and likely #31983. NOTE: This is a candidate for the 7.9 branch.
2010-12-01	ir_to_mesa: Add support for conditional discards.	Marek Olšák
	NOTE: This is a candidate for the 7.9 branch.
	Signed-off-by: Marek Olšák <maraeo@gmail.com>
	Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
2010-11-23	glsl: start restoring some geometry shader code	Brian Paul

2010-11-23	glsl: better handling of linker failures	Brian Paul
	Upon link error, exit translation loop, free program instructions. Check for null pointers in calling code.
2010-11-23	mesa: replace #defines with new gl_shader_type enum	Brian Paul

2010-11-23	mesa: _mesa_valid_register_index() to validate register indexes	Brian Paul

2010-11-23	mesa: rename, make _mesa_register_file_name() non-static	Brian Paul
	Plus remove unused parameter.
2010-11-23	glsl: use gl_register_file in a few places	Brian Paul

2010-11-23	glsl: fix off by one in register index assertion	Brian Paul

2010-11-19	ir_to_mesa: Detect and emit MOV_SATs for saturate constructs.	Eric Anholt
	The goal here is to avoid regressing performance on ir_to_mesa drivers for fixed function fragment shaders requiring saturates.
2010-11-19	glsl: Combine many instruction lowering passes into one.	Kenneth Graunke
	This should save on the overhead of tree-walking and provide a convenient place to add more instruction lowering in the future. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
2010-11-19	glsl: Add ir_quadop_vector expression	Ian Romanick
	The vector operator collects 2, 3, or 4 scalar components into a vector. Doing this has several advantages. First, it will make ud-chain tracking for components of vectors much easier. Second, a later optimization pass could collect scalars into vectors to allow generation of SWZ instructions (or similar as operands to other instructions on R200 and i915). It also enables an easy way to generate IR for SWZ instructions in the ARB_vertex_program assembler.
2010-11-19	glsl: Eliminate assumptions about size of ir_expression::operands	Ian Romanick
	This may grow in the near future.
2010-11-19	glsl: Add ir_unop_sin_reduced and ir_unop_cos_reduced	Ian Romanick
	They operate just like ir_unop_sin and ir_unop_cos except that they expect their inputs to be limited to the range [-pi, pi]. Several GPUs require this limited range for their sine and cosine instructions, so having these as operations (along with a to-be-written lowering pass) helps these architectures. These new operations also match the semantics of the GL_ARB_fragment_program SCS instruction. Having these as operations helps in generating GLSL IR directly from assembly fragment programs.
2010-11-18	ir_to_mesa: Generate smarter code for some conditional moves	Ian Romanick
	Conditional moves with a condition of (a < 0), (a > 0), (a <= 0), or (a >= 0) can be generated with "a" directly as an operand of the CMP instruction. This doesn't help much now, but it will help with assembly shaders that use the CMP instruction.
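	In GL_ARB_fragment_program, CMP selects its second operand where the first is less than zero and its third operand otherwise. The sketch below shows, for a single scalar channel, how each of the four conditions can be expressed with that one instruction by negating "a" and/or swapping the two value operands. The function names are hypothetical, and this models the idea rather than the code ir_to_mesa emits.

```c
/* Scalar model of CMP: result = (a < 0) ? b : c. */
static float cmp(float a, float b, float c)
{
    return a < 0.0f ? b : c;
}

/* (a < 0) ? b : c   maps directly onto CMP a, b, c. */
float select_lt_zero(float a, float b, float c) { return cmp(a, b, c); }

/* (a >= 0) ? b : c  is the opposite test, so swap the value operands. */
float select_ge_zero(float a, float b, float c) { return cmp(a, c, b); }

/* (a > 0) ? b : c   equals (-a < 0) ? b : c, so negate the condition. */
float select_gt_zero(float a, float b, float c) { return cmp(-a, b, c); }

/* (a <= 0) ? b : c  is the opposite of (a > 0): negate and swap. */
float select_le_zero(float a, float b, float c) { return cmp(-a, c, b); }
```

	Negation and operand order are both free in the instruction encoding, so none of the four forms needs a separate comparison instruction.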
2010-11-17	glsl: Remove the ir_binop_cross opcode.	Kenneth Graunke

2010-11-09	ir_to_mesa: Refactor code for emitting DP instructions	Ian Romanick

2010-11-02	mesa: Fix C++ includes in sampler.cpp	Chad Versace
	Some C++ header files were included in an extern "C" block. When building with Clang, this caused the build to fail due to namespace errors. (GCC did not report any errors.)
	Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
	Reviewed-by: Brian Paul <brianp@vmware.com>
2010-10-25	mesa: silence enum comparison warning	Brian Paul
	http://bugs.freedesktop.org/show_bug.cgi?id=31069
2010-10-22	mesa: move declaration before code	Brian Paul

2010-10-21	i965: Add support for register spilling.	Eric Anholt
	It can be tested with if (0) replaced with if (1) to force spilling for all virtual GRFs. Some simple tests work, but large texturing tests fail.