android-x86-mesa.git - Androïd/x86 port of Mesa drivers

Age	Commit message (Collapse)	Author
2011-01-04	i965: Correct comment for gen6 fb write control message setting	Zhenyu Wang
	Remove incorrect headless comment for gen6 fb write message. Note current SIMD16 mode has already done right for control message.
2010-12-22	i965: explicit tell header present for fb write on sandybridge	Zhenyu Wang
	Determine header present for fb write by msg length is not right for SIMD16 dispatch, and if there're more output attributes, header present is not easy to tell from msg length. This explicitly adds new param for fb write to say header present or not. Fixes many cases' hang and failure in GL conformance test.
2010-12-21	i965: Avoid using float type for raw moves, to work around SNB issue.	Eric Anholt
	The SNB alt-mode math does the denorm and inf reduction even for a "raw MOV" like we do for g0 message header setup, where we are moving values that aren't actually floats. Just use UD type, where raw MOVs really are raw MOVs. Fixes glxgears since c52adfc2e1d130effea940e75690897eb5d3ceaa, but no piglit tests had regressed(!)
2010-12-13	i965: Improve the hacks for ARB_fp scalar^scalar POW on gen6.	Eric Anholt
	This is still awful, but my ability to care about reworking the old backend so we can just get a temporary value into a POW is awfully low since the new backend does this all sensibly. Fixes: fp1-LIT test 1 fp1-LIT test 3 (case x < 0) fp1-POW test (exponentiation) fp-lit-mask
2010-12-13	i956: Fix the old FP path fragment position setup on gen6.	Eric Anholt
	Fixes fp-arb-fragment-coord-conventions-none
2010-12-08	i965: Drop KIL_NV from the ff/ARB_fp path since it was only used for GLSL.	Eric Anholt

2010-12-08	i965: Use the new pixel mask location for gen6 ARB_fp KIL instructions.	Eric Anholt
	Fixes: fp-kil fp-generic/kil-swizzle.
2010-12-08	i965: Set the render target index in gen6 fixed-function/ARB_fp path.	Eric Anholt
	Fixes: fbo-drawbuffers2-blend fbo-drawbuffers2-colormask
2010-12-07	i965: Work around gen6 ignoring source modifiers on math instructions.	Eric Anholt
	With the change of extended math from having the arguments moved into mrfs and handed off through message passing to being directly hooked up to the EU, it looks like the piece for doing source modifiers (negate and abs) was left out. Fixes: fog-modes glean/fp1-ARB_fog_exp test glean/fp1-ARB_fog_exp2 test glean/fp1-Computed fog exp test glean/fp1-Computed fog exp2 test ext_fog_coord-modes
2010-12-06	i965: Fix up 16-wide gen6 FB writes after various refactoring.	Eric Anholt

2010-12-06	i965: Move payload reg setup to compile, not lookup time.	Eric Anholt
	Payload reg setup on gen6 depends more on the dispatch width as well as the uses_depth, computes_depth, and other flags. That's something we want to decide at compile time, not at cache lookup. As a bonus, the fragment shader program cache lookup should be cheaper now that there's less to compute for the hash key.
2010-11-09	i965: Add support for math on constants in gen6 brw_wm_glsl.c path.	Eric Anholt
	Fixes 10 piglit cases that were assertion failing.
2010-11-09	i965: Allow OPCODE_SWZ to put immediates in the first arg.	Eric Anholt
	Fixes assertion failure with texture swizzling in the GLSL path when it's triggered (such as gen6 FF or ARB_fp shadow comparisons). Fixes: texdepth texSwizzle fp1-DST test fp1-LIT test 3
2010-11-03	intel: Annotate debug printout checks with unlikely().	Eric Anholt
	This provides the optimizer with hints about code hotness, which we're quite certain about for debug printouts (or, rather, while we developers often hit the checks for debug printouts, we don't care about performance while doing so).
2010-10-22	i965: Add support for pull constants to the new FS backend.	Eric Anholt
	Fixes glsl-fs-uniform-array-5, but not 6 which fails in ir_to_mesa.
2010-10-21	i965: Add support for register spilling.	Eric Anholt
	It can be tested with if (0) replaced with if (1) to force spilling for all virtual GRFs. Some simple tests work, but large texturing tests fail.
2010-10-14	i965: Clean up a warning in the old fragment backend.	Kenneth Graunke
	Hopefully this code can just go away soon.
2010-09-28	i965: fix pixel w interpolation on sandybridge	Zhenyu Wang

2010-09-28	i965: don't do calculation for delta_xy on sandybridge	Zhenyu Wang
	Sandybridge doesn't have Xstart/Ystart in payload header.
2010-09-28	i965: Fix sampler on sandybridge	Zhenyu Wang
	Sandybridge has not much change on texture sampler with Ironlake.
2010-09-28	i965: enable accumulator update in PS kernel too on sandybridge	Zhenyu Wang
	Accumulator update flag must be set for implicit update on sandybridge.
2010-09-28	i965: Add support for POW in gen6 FS.	Eric Anholt
	Fixes glsl-algebraic-pow-2 in brw_wm_glsl.c mode.
2010-09-28	i965: Add support for attribute interpolation on Sandybridge.	Eric Anholt
	Things are simpler these days thanks to barycentric interpolation parameters being handed in in the payload.
2010-09-21	i965: Share the KIL_NV implementation between glsl and non-glsl.	Eric Anholt

2010-08-22	i965: Fix 8-wide FB writes on gen6.	Eric Anholt
	My merge of Zhenyu's patch on top of my previous patches broke it by my code expecting simd16 single write and Zhenyu's simd8 path being disabled by mine. Merge the two for success.
2010-08-22	i965: Fix brw_math1 with scalar argument in gen6 FS.	Eric Anholt
	The docs claim two conflicting things: One, that a scalar source is supported. Two, source hstride must be 1 and width must be exec size. So splat a constant argument out into a full reg to operate on, since violating the second set of constraints is clearly failing. The alternative here might be to do a 1-wide exec on a constant argument for math1. It would probably save cycles too. But I'll leave that for the glsl2-965 branch. Fixes glsl-algebraic-div-one-2.shader_test.
2010-08-20	i965: Also use the SIMD8 FB writes for SIMD8 mode on non-SNB.	Eric Anholt

2010-08-20	i965: Add support for FB writes on Sandybridge.	Zhenyu Wang

2010-08-20	i965: Fix DP write channel ordering on Sandybridge.	Eric Anholt
	The SIMD16 message no longer has the goofy interleaved format that made Compr4 compression necessary before.
2010-08-16	i965: Use the implied move available in most brw_wm_emit brw_math() calls.	Eric Anholt
	This saves an extra message reg move in the program, though I'm not clear on whether it will have any performance impact other than cache footprint. It will also fix those math calls on Sandybridge, where the brw_eu_emit.c brw_math() support relies on the implied move being used.
2010-08-09	i965: More s/stderr/stdout/ for program debug.	Eric Anholt

2010-07-26	Merge remote branch 'origin/master' into glsl2	Eric Anholt
	This pulls in multiple i965 driver fixes which will help ensure better testing coverage during development, and also gets past the conflicts of the src/mesa/shader -> src/mesa/program move. Conflicts: src/mesa/Makefile src/mesa/main/shaderapi.c src/mesa/main/shaderobj.h
2010-07-26	i965: Fix reversed naming of the operations in compute-to-mrf optimization.	Eric Anholt
	Also fix up comments, so that the difference between the two passes is clarified.
2010-07-26	i965: Clean up a few magic numbers to use brw_defines.h defs.	Eric Anholt

2010-07-26	i965: Move the GRF-to-MRF optimizations to brw_optimize.c.	Eric Anholt

2010-07-26	i965: Improve (i.e. remove) some grf-to-mrf unnecessary moves	Benjamin Segovia
	Several routines directly analyze the grf-to-mrf moves from the Gen binary code. When it is possible, the mov is removed and the message register is directly written in the arithmetic instruction Also redundant mrf-to-grf moves are removed (frequently for example, when sampling many textures with the same uv) Code was tested with piglit, warsow and nexuiz on an Ironlake machine. No regression was found there Note that the optimizations are deactivated on Gen4 and Gen6 since I did test them properly yet. No reason there are bugs but who knows The optimizations are currently done in branch free programs only. Considering branches is more complicated and there are actually two paths: one for branch free programs and one for programs with branches Also some other optimizations should be done during the emission itself but considering that some code is shader between vertex shaders (AOS) and pixel shaders (SOA) and that we may have branches or not, it is pretty hard to both factorize the code and have one good set of strategies
2010-07-02	i965: Add support for the DP2 opcode, which we use for dot(vec2, vec2).	Eric Anholt
	The original glsl compiler would generate a.x * b.x + a.y * b.y, which we would do mul+mul+add for instead of this mul+mac. Fixes glsl-fs-dot-vec2.
2010-06-30	i965: Add support for OPCODE_SSG.	Eric Anholt
	The old compiler didn't use SSG, and instead emitted SGT/SGT/SUB. We can do a little better for SSG than we do for the SGT series.
2010-05-14	i965: Dump out the correct shared function for SEND on Ironlake.	Eric Anholt

2010-04-21	intel: Clean up chipset name and gen num for Ironlake	Zhenyu Wang
	Rename old IGDNG to Ironlake, and set 'gen' number for Ironlake as 5, so tracking the features with generation num instead of special is_ironlake flag. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>
2010-03-22	i965: Allow FS constants to be used as immediates instead of push/pull.	Eric Anholt
	The hope is to later take advantage of the reduced constant usage to free up regs. This only covers the GLSL path at the moment, because the brw_wm_emit path doesn't get the information as to whether a float value is a constant or a uniform.
2010-03-22	i965: Optimize OPCODE_CMP by using BRW_SEL to choose results.	Eric Anholt
	Tested with piglit glsl-fs-sqrt-branch, fp-cmp.vpfp.
2010-03-16	Revert "i965: Do FS SLT, SGT, and friends using CMP, SEL instead of CMP, ↵	Eric Anholt
	MOV, MOV." This reverts commit 46450c1f3f93bf4dc96696fc7e0f0eb808d9c08a. I was wrong about null reg behavior -- it reads undefined, not 0. And they're not kidding.
2010-03-12	i965: Clarify the roles of emit_pixel_xy(), emit_delta_xy(), emit_wpos_xy().	Eric Anholt

2010-03-12	i965: Clarify that DELTAXY always occurs for both X and Y.	Eric Anholt

2010-03-12	i965: Do FS SLT, SGT, and friends using CMP, SEL instead of CMP, MOV, MOV.	Eric Anholt

2010-03-12	i965: When doing a swizzled kill pixel, don't do redundant channel compares.	Eric Anholt
	This was obvious when looking at the compiled output of ETQW's shaders.
2010-03-12	i965: Use the SEL instruction to implement MIN and MAX.	Eric Anholt
	Saves an instruction over doing conditional moves.
2010-03-10	i965: Use the PLN instruction when possible in interpolation.	Eric Anholt
	Saves an instruction in PINTERP, LINTERP, and PIXEL_W from brw_wm_glsl.c For non-GLSL it isn't used yet because the deltas have to be laid out differently.
2010-03-10	i965: Add support for the CMP opcode in the GLSL path.	Eric Anholt
	This would be triggered by use of sqrt() along with control flow. Fixes piglit-fs-sqrt-branch and a bug in Yo Frankie!.