android-x86-mesa.git - Androïd/x86 port of Mesa drivers

Age	Commit message (Collapse)	Author
2011-03-04	i965: Fix extending VB packets	Chris Wilson
	Computation of the delta of this array from the last had a silly little bug and ignored any initial delta==0 causing grief in Nexuiz and friends. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-04	i965: Handle URB_FENCE erratum for Broadwater	Chris Wilson
	There is a silicon bug which causes unpredictable behaviour if the URB_FENCE command should cross a cache-line boundary. Pad before the command to avoid such occurrences. As this command only applies to gen4/5, do the fixup unconditionally as the specs do not actually state for which chip it was fixed (and the cost is negligible)... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-04	i965: Align index to type size and flush if the type changes	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-04	i965: Prevent using a zero sized (or of unknown type) vertex array	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-03	i965: SNB GT1 has only 32k urb and max 128 urb entries.	Zou Nan hai
	Signed-off-by: Zou Nan hai <nanhai.zou@intel.com>
2011-03-02	i965: Maxinum the usage of urb space on SNB.	Zou Nan hai
	SNB has 64k urb space, we only use piece of them. The more urb space we alloc, the more concurrent vs threads we can run. push the urb space usage to the limit. Signed-off-by: Zou Nan hai <nanhai.zou@intel.com>
2011-03-01	i965: Use negative relocation deltas to minimse vertex uploads	Chris Wilson
	With relaxed relocation checking in the kernel, we can specify a negative delta (i.e. pointing outside of the target bo) in order to fake a range in a large buffer. We only then need to upload the elements used and adjust the buffer offset such that they correspond with the indices used in the DrawArrays. (Depends on libdrm 0209428b3918c4336018da9293cdcbf7f8fedfb6) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-01	i965: Undo 'continuation of vb packets'	Chris Wilson
	This breaks nexuiz for unknown reason; disable until a true fix can be found.
2011-03-01	i965: Fix uploading of shortened vertex packets	Chris Wilson
	... handle all cases and not just the interleaved upload. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-01	i965: Upload all vertices used	Chris Wilson
	... and take advantage of start_vertex_bias to trim to [min_index, max_index] where possible (i.e. when we need to upload all arrays). Fixes half_float_vertex(misc.fillmode.wireframe) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=34595 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-03-01	Revert "i965/fs: Correctly set up gl_FragCoord.w on Sandybridge."	Kenneth Graunke
	This reverts commit 4a3b28113c3d23ba21bb8b8f5ebab7c567083a6d, as it caused a regression on Ironlake (bug #34646).
2011-03-01	i965: bump VS thread number to 60 on SNB	Zou Nan hai
	Signed-off-by: Zou Nan hai <nanhai.zou@intel.com>
2011-02-25	i965/fs: Initial plumbing to support TXD.	Kenneth Graunke
	This adds the opcode and the code to convert ir_txd to OPCODE_TXD; it doesn't actually add support yet.
2011-02-25	i965/fs: Complete TXL support on gen5+.	Kenneth Graunke
	Initial plumbing existed to turn the ir_txl into OPCODE_TXL, but it was never handled.
2011-02-25	i965/fs: Complete TXL support on gen4.	Kenneth Graunke
	Initial plumbing existed to turn the ir_txl into OPCODE_TXL, but it was never handled.
2011-02-25	i965/fs: Use a properly named constant in TXB handling.	Kenneth Graunke
	The old value, BRW_SAMPLER_MESSAGE_SIMD8_SAMPLE makes it sound like we're doing a non-bias texture lookup. It has the same value as the new constant BRW_SAMPLER_MESSAGE_SIMD8_SAMPLE_BIAS_COMPARE, so there should be no functional changes.
2011-02-25	i965: Add #defines for gen4 SIMD8 TXB/TXL with shadow comparison.	Kenneth Graunke
	From volume 4, page 161 of the public i965 documentation.
2011-02-24	i965: Increase Sandybridge point size clamp in the clip state.	Kenneth Graunke
	255.875 matches the hardware documentation. Presumably this was a typo. NOTE: This is a candidate for the 7.10 branch, along with commit 2bfc23fb86964e4153f57f2a56248760f6066033. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-24	i965: Remember to pack the constant blend color as floats into the batch	Chris Wilson
	Fixes regression from aac120977d1ead319141d48d65c9bba626ec03b8. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=34597 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-24	intel: Reset the buffer offset after releasing reference to packed upload	Chris Wilson
	Fixes oglc/vbo(basic.bufferdata) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=34603 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-24	i965: Unmap the correct pointer after discontiguous upload	Chris Wilson
	Fixes piglit/fbo-depth-sample-compare: ==14722== Invalid free() / delete / delete[] ==14722== at 0x4C240FD: free (vg_replace_malloc.c:366) ==14722== by 0x84FBBFD: intel_upload_unmap (intel_buffer_objects.c:695) ==14722== by 0x85205BC: brw_prepare_vertices (brw_draw_upload.c:457) ==14722== by 0x852F975: brw_validate_state (brw_state_upload.c:394) ==14722== by 0x851FA24: brw_draw_prims (brw_draw.c:365) ==14722== by 0x85F2221: vbo_exec_vtx_flush (vbo_exec_draw.c:389) ==14722== by 0x85EF443: vbo_exec_FlushVertices_internal (vbo_exec_api.c:543) ==14722== by 0x85EF49B: vbo_exec_FlushVertices (vbo_exec_api.c:973) ==14722== by 0x86D6A16: _mesa_set_enable (enable.c:351) ==14722== by 0x42CAD1: render_to_fbo (in /home/ickle/git/piglit/bin/fbo-depth-sample-compare) ==14722== by 0x42CEE3: piglit_display (in /home/ickle/git/piglit/bin/fbo-depth-sample-compare) ==14722== by 0x42F508: display (in /home/ickle/git/piglit/bin/fbo-depth-sample-compare) ==14722== Address 0xc606310 is 0 bytes after a block of size 18,720 alloc'd ==14722== at 0x4C244E8: malloc (vg_replace_malloc.c:236) ==14722== by 0x85202AB: copy_array_to_vbo_array (brw_draw_upload.c:256) ==14722== by 0x85205BC: brw_prepare_vertices (brw_draw_upload.c:457) ==14722== by 0x852F975: brw_validate_state (brw_state_upload.c:394) ==14722== by 0x851FA24: brw_draw_prims (brw_draw.c:365) ==14722== by 0x85F2221: vbo_exec_vtx_flush (vbo_exec_draw.c:389) ==14722== by 0x85EF443: vbo_exec_FlushVertices_internal (vbo_exec_api.c:543) ==14722== by 0x85EF49B: vbo_exec_FlushVertices (vbo_exec_api.c:973) ==14722== by 0x86D6A16: _mesa_set_enable (enable.c:351) ==14722== by 0x42CAD1: render_to_fbo (in /home/ickle/git/piglit/bin/fbo-depth-sample-compare) ==14722== by 0x42CEE3: piglit_display (in /home/ickle/git/piglit/bin/fbo-depth-sample-compare) ==14722== by 0x42F508: display (in /home/ickle/git/piglit/bin/fbo-depth-sample-compare) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=34604 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-22	i965: Increase Sandybridge point size clamp.	Kenneth Graunke
	255.875 matches the hardware documentation. Presumably this was a typo. Found by inspection. Not known to fix any issues. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-22	i965/fs: Correctly set up gl_FragCoord.w on Sandybridge.	Kenneth Graunke
	pixel_w is the final result; wpos_w is used on gen4 to compute it. NOTE: This is a candidate for the 7.10 branch. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-22	i965/fs: Refactor control flow stack handling.	Kenneth Graunke
	We can't safely use fixed size arrays since Gen6+ supports unlimited nesting of control flow. NOTE: This is a candidate for the 7.10 branch. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-22	i965/fs: Avoid register coalescing away gen6 MATH workarounds.	Kenneth Graunke
	The code that generates MATH instructions attempts to work around the hardware ignoring source modifiers (abs and negate) by emitting moves into temporaries. Unfortunately, this pass coalesced those registers, restoring the original problem. Avoid doing that. Fixes several OpenGL ES2 conformance failures on Sandybridge. NOTE: This is a candidate for the 7.10 branch. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-22	i965/fs: Apply source modifier workarounds to POW as well.	Kenneth Graunke
	Single-operand math already had these workarounds, but POW (the only two operand function) did not. It needs them too - otherwise we can hit assertion failures in brw_eu_emit.c when code is actually generated. NOTE: This is a candidate for the 7.10 branch. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-22	i965: Fix shaders that write to gl_PointSize on Sandybridge.	Kenneth Graunke
	gl_PointSize (VERT_RESULT_PSIZ) doesn't take up a message register, as it's part of the header. Without this fix, writing to gl_PointSize would cause the SF to read and use the wrong attributes, leading to all kinds of random looking failure. Reviewed-by: Eric Anholt <eric@anholt.net>
2011-02-22	i965: Trim the interleaved upload to the minimum number of vertices	Chris Wilson
	... should have no impact on a properly formatted draw operation. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-22	i965: Reinstate max-index paranoia	Chris Wilson
	Don't trust the applications not to reference beyond the end of the vertex buffers. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-22	i965: Zero the offset into the vbo when uploading non-interleaved	Chris Wilson
	Fixes regression from 559435d9152acc7162e4e60aae6591c7c6c8274b. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: Fix VB packet reuse when offset for the new buffer isn't stride aligned.	Eric Anholt
	Fixes regression in scissor-stencil-clear and 5 other tests.
2011-02-21	i965: Remove spurious duplicate ADVANCE_BATCH	Chris Wilson
	... a leftover from a bad merge. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: Use compiler builtins when available	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: Micro-optimise check_state	Chris Wilson
	Replace the intermediate tests due to the logical or with the bitwise or. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: Remove unused 'next_free_page' member	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	intel: extend current vertex buffers	Chris Wilson
	If the next vertex arrays are a (discontiguous) continuation of the current arrays, such that the new vertices are simply offset from the start of the current vertex buffer definitions we can reuse those defintions and avoid the overhead of relocations and invalidations. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	intel: Use specified alignment for writes into the upload buffer	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: Clean up brw_prepare_vertices()	Chris Wilson
	Use a temporary glarray variable to replace the numerous input->glarray. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	intel: combine short memcpy using a temporary allocated buffer	Chris Wilson
	Using a temporary buffer for large discontiguous uploads into the common buffer and a single buffered upload is faster than performing the discontiguous copies through a mapping into the GTT. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: upload normal arrays as interleaved	Chris Wilson
	Upload the non-vbo arrays into a single interleaved buffer object, and so need to just emit a single vertex buffer relocation. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: interleaved vbo	Chris Wilson
	If the user passed in several arrays interleaved in the same vbo, only emit a single vertex buffer and relocation. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: emit one vb packet per vbo	Chris Wilson
	Track reuse of the vertex buffer objects and so minimise the number of vertex buffers used by the hardware (and their relocations). Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: upload transient indices into the same discontiguous buffer	Chris Wilson
	As we now pack the indices into a common upload buffer, we can reuse a single CMD_INDEX_BUFFER packet and translate each invocation with a start vertex offset. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: suppress repeat-emission of identical vertex elements	Chris Wilson
	Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: Move repeat-instruction-suppression to batchbuffer core	Chris Wilson
	Move the tracking of the last emitted instructions into the core batchbuffer routines and take advantage of the shadow batch copy to avoid extra memory allocations and copies. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	intel: use pwrite for batch	Chris Wilson
	It's faster. Not only is the memcpy more efficiently performed in the kernel (making up for the system call overhead), but by not using mmap we remove the greater overhead of tracking the vma of every batch. And it means we can read back from the batch buffer without incurring the cost of a uncached read through the GTT. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: drop state_bo references to batch_bo	Chris Wilson
	As we use state relocations and we know that all the state belongs to the same bo, we can drop the multiple references to the same bo. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: directly write wm state to batch	Chris Wilson
	As we write directly into the batch in system memory, we do not need to write first to the stack (as was to avoid read back through the GTT) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: write cc straight to batch	Chris Wilson
	As we write directly into the batch in system memory, we do not need to write first to the stack (as was to avoid read back through the GTT) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2011-02-21	i965: switch gen6 to use its own cc state bo	Chris Wilson
	In preparation for a greater change, use the color_calc_state_bo already provisioned for this purpose. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>