android-x86-mesa.git - Androïd/x86 port of Mesa drivers

Age	Commit message (Collapse)	Author
2010-03-22	i965: Remove gratuitous jump or nop from OPCODE_END to vertex emit.	Eric Anholt
	Just emit the URB write at END time. Subroutine code that sits after OPCODE_END won't be executed since we've ended the thread at the point that the URB write is done.
2010-03-16	Revert "i965: Do VS SGT, SLT, and friends using CMP, SEL instead of CMP, ↵	Eric Anholt
	MOV, MOV." This reverts commit 8ef3b1834a896927bdd4f2aea552cdb732849da9. Fixes piglit glsl-vs-if.
2010-03-12	i965: Do VS SGT, SLT, and friends using CMP, SEL instead of CMP, MOV, MOV.	Eric Anholt

2010-03-12	i965: Fix up VS DP4 sequences to avoid dependency control.	Eric Anholt
	This is recommended by the B-Spec. I wasn't able to measure any difference in ETQW.
2010-03-09	i965: Fix nested loops in the VS.	Eric Anholt
	We were patching up all the break and continues between the start of our loop and the end of our loop, even if they were breaks/continues for an inner loop. Avoiding patching already patched breaks/continues fixes piglit glsl-vs-loop-nested.
2010-03-03	i965: Fix up Sandybridge VS sizing.	Eric Anholt

2010-02-25	i965: Fix up the VUE handling for SNB, and hopefully clarify comments.	Eric Anholt

2010-02-25	i965: Lump SNB in with Ironlake for bigger VUEs.	Eric Anholt
	This gets the VS to the point of accepting vertices. \o/
2010-02-25	i965: Add SNB math opcode support.	Eric Anholt
	This is untested at this point.
2010-02-19	Replace the _mesa_*printf() wrappers with the plain libc versions	Kristian Høgsberg

2010-01-31	i965: Silence uninitialized variable warning.	Vinson Lee

2010-01-19	i965: Upload as many VS constants as possible through the push constants.	Eric Anholt
	The pull constants require sending out to an overworked shared unit and waiting for a response, while push constants are nicely loaded in for us at thread dispatch time. By putting things we access in every VS invocation there, ETQW performance improved by 2.5% +/- 1.6% (n=6).
2010-01-18	i965: Clean up constbuf handling by splitting reladdr/non-reladdr loads.	Eric Anholt
	The codepaths in the function were almost entirely different.
2010-01-18	i965: Only set up the stack register if it's going to get used.	Eric Anholt

2010-01-18	i965: Fix loads of non-relative-addr constants after a reladdr load.	Eric Anholt
	Fixes piglit vp-arl-constant-array-huge-overwritten.
2009-12-22	intel: Replace IS_965 checks with context structure usage.	Eric Anholt
	Saves another 600 bytes or so of code.
2009-12-22	intel: Replace IS_IGDNG checks with intel->is_ironlake or needs_ff_sync.	Eric Anholt
	Saves ~480 bytes of code.
2009-12-18	i965: Add support for OPCODE_CMP in the VS to fix GLSL sqrt()	Eric Anholt
	Bug #25628. Fixes piglit case glsl-vs-sqrt-zero.
2009-11-17	Merge branch 'outputswritten64'	Ian Romanick
	Add a GLbitfield64 type and several macros to operate on 64-bit fields. The OutputsWritten field of gl_program is changed to use that type. This results in a fair amount of fallout in drivers that use programs. No changes are strictly necessary at this point as all bits used are below the 32-bit boundary. Fairly soon several bits will be added for clip distances written by a vertex shader. This will cause several bits used for varyings to be pushed above the 32-bit boundary. This will affect any drivers that support GLSL. At this point, only the i965 driver has been modified to support this eventuality. I did this as a "squash" merge. There were several places through the outputswritten64 branch where things were broken. I foresee this causing difficulties later for bisecting. The history is still available in the branch. Conflicts: src/mesa/drivers/dri/i965/brw_wm.h
2009-11-13	i965: Avoid moving the current value back into the accumulator for MAD.	Eric Anholt
	This is a 2.9% (+/-.3%) performance win for my GL demo, which hits MAD sequences for matrix transforms.
2009-11-10	i965: Unalias src/dst registers for SGE and friends.	Eric Anholt
	Fixes piglit vp-sge-alias test, and the googleearth ground shader. \o/ Bug #22228
2009-11-10	i965: Allow use of PROGRAM_LOCAL constants in ARB_vp.	Eric Anholt
	Fixes piglit arl.vp.
2009-09-10	i965: Enable loops in the VS.	Eric Anholt
	Passes piglit glsl-vs-loop testcase. Bug #20171
2009-09-04	i965: Don't set the complete field when there is more VUE yet to come.	Eric Anholt
	This should help with things like lightsmark, but I don't have a testcase for this commit.
2009-08-29	i965: Support PROGRAM_ENV_PARAMs in brw_vs_emit.c	Eric Anholt

2009-08-07	i965: Replace the subroutine-skipping jump in VS with a NOP if it's a NOP.	Eric Anholt
	This showed a 1.9% (+/-.3%, n=3) improvement in OA performance with high geometry settings.
2009-08-04	i965: Fix dangerous warning I let slip in.	Eric Anholt

2009-08-04	i965: Respect CondSwizzle in OPCODE_IF.	Eric Anholt
	Fixes piglit glsl-vs-if-bool and progs/glsl/twoside, and will likely be useful for the looping code. Bug #18992
2009-08-04	i965: Emit conditional code updates as required for GLSL VS if statements.	Eric Anholt
	Previously, we'd be branching based on whatever condition code happened to be laying around.
2009-08-04	i965: Hook up the disassembler for INTEL_DEBUG={wm,vs}.	Eric Anholt
	I was getting tired of doing the dance of INTEL_DEBUG=batch, copying it out, and running intel-gen4disasm on it.
2009-08-03	i965: Even if no VS inputs are set, still load some amount of URB as required.	Eric Anholt
	See comment on Vertex URB Entry Read Length for VS_STATE. This, combined with the previous three commits, fixes #22945.
2009-08-03	i965: Make sure the VS URB size is big enough to fit a VF VUE.	Eric Anholt
	This fix is just from code and docs inspection, but it may fix hangs on some applications.
2009-07-13	i965: add support for new chipsets	Xiang, Haihao
	1. new PCI ids 2. fix some 3D commands on new chipset 3. fix send instruction on new chipset 4. new VUE vertex header 5. ff_sync message (added by Zou Nan Hai <nanhai.zou@intel.com>) 6. the offset in JMPI is in unit of 64bits on new chipset 7. new cube map layout
2009-06-30	i965: first attempt at handling URB overflow when there's too many vs outputs	Brian Paul
	If we can't fit all the VS outputs into the MRF, we need to overflow into temporary GRF registers, then use some MOVs and a second brw_urb_WRITE() instruction to place the overflow vertex results into the URB. This is hit when a vertex/fragment shader pair has a large number of varying variables (12 or more). There's still something broken here, but it seems close...
2009-06-30	i965: comments and a new assertion	Brian Paul

2009-06-19	i965: initial code for loops in vertex programs	Brian Paul

2009-06-19	i965: asst clean-ups, etc in brw_vs_emit()	Brian Paul

2009-05-08	i965: const qualifiers	Brian Paul

2009-05-07	i965: relAddr local var (to make debug/test a little easier)	Brian Paul

2009-05-01	Merge branch 'const-buffer-changes'	Brian Paul
	Conflicts: src/mesa/drivers/dri/i965/brw_curbe.c src/mesa/drivers/dri/i965/brw_vs_emit.c src/mesa/drivers/dri/i965/brw_wm_glsl.c
2009-04-27	i965: only upload constant buffer data when we actually need the const buffer	Brian Paul
	Make the use_const_buffer field per-program and only call the code which updates the constant buffer's data if the flag is set. This should undo the perf regression from 20f3497e4b6756e330f7b3f54e8acaa1d6c92052 (cherry picked from master, commit dc9705d12d162ba6d087eb762e315de9f97bc456)
2009-04-27	i965: only upload constant buffer data when we actually need the const buffer	Brian Paul
	Make the use_const_buffer field per-program and only call the code which updates the constant buffer's data if the flag is set. This should undo the perf regression from 20f3497e4b6756e330f7b3f54e8acaa1d6c92052
2009-04-22	i965: disable debug printf	Brian Paul

2009-04-22	i965: enable VS constant buffers	Brian Paul
	In the VS constants can now be handled in two different ways: 1. If there's room in the GRF, put constants there. They're preloaded from the CURBE prior to VS execution. This is the historical approach. The problem is the GRF may not have room for all the shader's constants and temps and misc registers. Hence... 2. Use a separate constant buffer which is read from using a READ message. This allows a very large number of constants and frees up GRF regs for shader temporaries. This is the new approach. May be a little slower than 1. 1 vs. 2 is chosen according to how many constants and temps the shader needs.
2009-04-17	i915: fix broken indirect constant buffer reads	Brian Paul
	The READ message's msg_control value can be 0 or 1 to indicate that the Oword should be read into the lower or upper half of the target register. It seems that the other half of the register gets clobbered though. So we read into two dest registers then use a MOV to combine the upper/lower halves.
2009-04-17	i965: updated CURBE allocation code	Brian Paul
	Now that we have real constant buffers, the demands on the CURBE are lessened. When we use real VS/WM constant buffers we only use the CURBE for clip planes.
2009-04-16	Merge branch 'register-negate'	Brian Paul

2009-04-16	i965: implement relative addressing for VS constant buffer reads	Brian Paul
	A scatter-read should be possible, but we're just using two READs for the time being.
2009-04-16	i965: handle address reg in get_dst()	Brian Paul

2009-04-16	i965: fix const buffer temp register clobbering	Brian Paul
	Calls to release_tmps() were causing the temps holding constants to get recycled.