path: root/src/mesa/drivers/dri
Age | Commit message | Author
2010-03-16 | i965: Unalias all GLSL source regs from the destination regs used. | Eric Anholt
We were doing it ad-hoc before, as instructions with potential aliasing problems were identified. But thanks to swizzling basically anything can have aliasing, so just do it generally at source reg setup time. This is somewhat inefficient, because sometimes an operation doesn't need unaliasing protection if the swizzling is safe, but the unaliasing before didn't cover those cases either. Fixes piglit glsl-fs-loop. (cherry picked from commit 6b194dab6b4d9f12cdd54c699b23c0d3420a49c2)
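As an illustration of why swizzling makes aliasing possible almost anywhere, here is a minimal sketch that assumes an instruction is expanded into one write per channel. The `vec4` type and both helper names are hypothetical and are not taken from the driver. When the destination is also the source, a swizzled move reads channels it has already overwritten, while copying the source to a temporary first gives the intended result.

```c
#include <assert.h>

/* Hypothetical 4-channel register; not a driver type. */
typedef struct { float c[4]; } vec4;

/* Swizzled move emitted one channel at a time: dst.c[i] = src.c[swz[i]].
 * If dst and src are the same register, later channels may read values
 * that earlier channels have already overwritten. */
static void mov_swizzled(vec4 *dst, const vec4 *src, const int swz[4])
{
    for (int i = 0; i < 4; i++) {
        assert(swz[i] >= 0 && swz[i] < 4);
        dst->c[i] = src->c[swz[i]];
    }
}

/* Same move with the source copied to a temporary first, so the
 * destination can no longer alias it. This costs an extra copy even
 * when the swizzle would have been safe. */
static void mov_swizzled_unaliased(vec4 *dst, const vec4 *src, const int swz[4])
{
    vec4 tmp = *src;
    mov_swizzled(dst, &tmp, swz);
}
```

With the register holding (1, 2, 3, 4) and a .yxwz swizzle, the aliased move produces (2, 2, 4, 4), whereas the unaliased one produces (2, 1, 4, 3).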
2010-03-16 | i965: Fix nested loops in the VS. | Eric Anholt
We were patching up all the break and continues between the start of our loop and the end of our loop, even if they were breaks/continues for an inner loop. Avoiding patching already patched breaks/continues fixes piglit glsl-vs-loop-nested. (cherry picked from commit f6f547d87ea68f44c50a0b0231b7360ca94b2975)
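The idea of skipping already patched breaks/continues can be sketched as follows. This is a simplified model, not the driver code: the `insn` struct, the opcode names, and the convention that a jump distance of 0 means "not yet patched" are all assumptions made for the example. Because an inner loop is closed before its enclosing loop, its breaks and continues are patched first, and the outer loop then leaves them alone.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified instruction stream. */
enum op { OP_OTHER, OP_DO, OP_BREAK, OP_CONTINUE, OP_WHILE };

struct insn {
    enum op op;
    int jump; /* 0 = not yet patched; otherwise a relative jump distance */
};

/* Called when the loop spanning [do_idx, while_idx] is closed.
 * Only breaks/continues that are still unpatched belong to this loop;
 * anything already patched was claimed by an inner loop. */
static void patch_loop(struct insn *prog, size_t do_idx, size_t while_idx)
{
    assert(prog[do_idx].op == OP_DO && prog[while_idx].op == OP_WHILE);

    for (size_t i = do_idx + 1; i < while_idx; i++) {
        if (prog[i].jump != 0)
            continue; /* already patched for an inner loop */

        if (prog[i].op == OP_BREAK)
            prog[i].jump = (int)(while_idx - i) + 1; /* just past WHILE */
        else if (prog[i].op == OP_CONTINUE)
            prog[i].jump = (int)(while_idx - i);     /* to the WHILE */
    }
}
```

Without the `jump != 0` check, closing the outer loop would overwrite the inner loop's jump distances, which is the failure mode the commit message describes.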
2010-03-16 | i965: Fix up VP constbuf leak on program delete. | Eric Anholt
(cherry picked from commit 7f6d2754d586545ab6c970acffdd897294879039)
2010-03-16 | i965: Fix up the handling of point sprite coordinate replacement. | Eric Anholt
The code was walking over the regs of pairs of attributes and checking whether the attribute with a given reg index had point sprite enabled. So the point sprite setup code was rarely even getting executed. Instead, we need to determine which channels of a reg need point sprite coordinate replacement. In addition, it was multiplying the attribute by 1/w, when it's supposed to cover (0, 1) in each direction regardless of w, and it wasn't filling in the Z and W components of the texcoord as specified. Fixes piglit point-sprite and the spriteblast demo. Bug #24431, #22245. (cherry picked from commit bc632d04370566c1156cbd0345fe303834f0b910)
2010-03-16 | i965: Add support for the CMP opcode in the GLSL path. | Eric Anholt
This would be triggered by use of sqrt() along with control flow. Fixes piglit-fs-sqrt-branch and a bug in Yo Frankie!. (cherry picked from commit 48dca99feb394febc3af44e14f23fb12a9cc9204)
2010-03-16 | i965: Fix the response len of masked sampler messages for 8-wide dispatch. | Eric Anholt
The bad response length would hang the GPU with a masked sample in a shader using control flow. For 8-wide, the response length is always 4, and masked slots are just not written to. brw_wm_glsl.c already allocates registers in the right locations. Fixes piglit glsl-fs-bug25902 (fd.o bug #25902). (cherry picked from commit f6d210c284751ac50a8d6358de7e75a1ff1e4ac7) (cherry picked from commit dc8c0359448cdae7b367552ba58783c04b199778)
2010-03-16 | Correct GL_EQUIV code in r67/7xx. | Matthew W. S. Bell
These seem to be documented in <http://www.svgopen.org/2003/papers/RasterOperationsUsingFilterElements/index.html>.
2010-03-16 | radeon: Fix buffer object unmap to be called only once for dma buffers. | Pauli Nieminen
If a flush happens inside radeonRefillCurrentDmaRegion, the last DMA buffer would be unmapped twice. Unmapping the buffer when it is moved to the wait list fixes the mapping error.
2010-03-12 | dri/r700: include shader/programopt.h instead of programopt.c. | Luc Verhaegen
Signed-off-by: Brian Paul <brianp@vmware.com>
2010-03-12 | Grammar and spelling fixes | Jeff Smith
Signed-off-by: Jeff Smith <whydoubt@yahoo.com> Signed-off-by: Brian Paul <brianp@vmware.com>
2010-03-11 | radeon: Fix memory leaks from early return. | Vinson Lee
2010-03-08 | radeon: don't attempt to use alternate blit formats for depth buffers | Alex Deucher
Noticed by fredrikh on IRC.
2010-03-08 | r600: enable GL_ARB_pixel_buffer_object | Alex Deucher
2010-03-08 | radeon/r200/r600: allow src and dst BOs to be placed in GTT during blit | Alex Deucher
2010-03-08 | radeon/r200/r600: reset bos when validating buffers during blit | Alex Deucher
2010-03-08 | r600: recalculate point size, if point min/max size changes | Alex Deucher
2010-03-08 | r600: no need to flush on context init | Alex Deucher
2010-03-08 | r300: don't enable EXT_packed_depth_stencil | Maciej Cencora
R300 hw doesn't support sampling from Z24_S8 or S8_Z24 formats.
2010-03-08 | radeon: no need to emit full state twice after flush | Maciej Cencora
2010-03-08 | r300: allow src and dst BOs to be placed in GTT during blit | Maciej Cencora
Fixes some relocation failures.
2010-03-08 | r300: reset bos when validating buffers during blit | Maciej Cencora
2010-03-08 | radeon: fallback to software in glCopyTexImage if blit isn't available | Maciej Cencora
2010-03-08 | r300: VAP flush is needed only when vertex program or constants are changed | Maciej Cencora
2010-03-08 | r300: recalculate point size, if point min/max size changes | Maciej Cencora
Fixes two Wine d3d9 unit tests.
2010-03-08 | r300: no need to flush on context init | Maciej Cencora
2010-03-08 | r300: remove unnecessary code | Maciej Cencora
_tnl_UpdateFixedFunctionProgram is already called in r300_draw.c
2010-03-05 | i965: Only use the effective subset of the WM sampler key. | Eric Anholt
The whole structure is 836 bytes, but if only the first one or two samplers are used (as is common), the part that matters is only 56 or 108 bytes. By using just that subset as the key (since the key size is also part of the key), we improve firefox-talos-gfx performance by 1%.
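The technique of keying on only the used prefix of a structure can be sketched like this. The struct layout below is an assumption for illustration only: a 4-byte count followed by sixteen 52-byte entries happens to reproduce the 56, 108, and 836 byte figures quoted above (on a platform with a 4-byte `int`), but the real key layout is not given here. The function names are hypothetical as well.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MAX_SAMPLERS 16

/* Hypothetical layout; the opaque 52-byte entry stands in for
 * whatever per-sampler state the real key carries. */
struct sampler_entry { unsigned char state[52]; };

struct sampler_key {
    int sampler_count;
    struct sampler_entry sampler[MAX_SAMPLERS];
};

/* Number of leading bytes that actually matter for this key. */
static size_t effective_key_size(const struct sampler_key *key)
{
    assert(key->sampler_count >= 0 && key->sampler_count <= MAX_SAMPLERS);
    return offsetof(struct sampler_key, sampler)
         + (size_t)key->sampler_count * sizeof(struct sampler_entry);
}

/* Two keys match only if their effective sizes match and the
 * effective bytes are identical; the size acts as part of the key. */
static int keys_equal(const struct sampler_key *a, const struct sampler_key *b)
{
    size_t size_a = effective_key_size(a);

    if (size_a != effective_key_size(b))
        return 0;
    return memcmp(a, b, size_a) == 0;
}
```

Because the size takes part in the comparison, a key with one active sampler can never be confused with a two-sampler key that happens to share the same first entry, and bytes beyond the active samplers are never hashed or compared.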
2010-03-05 | intel: Remove support for the DRI1 TFP extension. | Eric Anholt
2010-03-05 | i965: Remove the paths for making a const surface object for a NULL BO. | Eric Anholt
2010-03-05 | i965: Remove the unused SDC BO pointer from the sampler state key. | Eric Anholt
Since it's a BO pointer, it already lives outside the key in brw->wm.sdc_bo[] which is used for sampler state lookup and setup.
2010-03-05 | i965: Rename SNB brw_*_key structs to gen6_*_key structs. | Eric Anholt
2010-03-05 | i965: Pack gen6 blend_state_key structure. | Eric Anholt
2010-03-05 | intel: Only do batchbuffer debug if --enable-debug is used. | Eric Anholt
This saves 6.6KB on the 965 driver, and appears to speed firefox-talos-gfx up by 1-2%. Unlike many other asserts in the driver, when we make a mistake that would trigger one of these it generally shows up all the time for developers, so turning it off for release seems fine.
2010-03-05 | intel: Replace batch macro contents with function calls. | Eric Anholt
This manages to cut down another 3800 bytes.
2010-03-05 | intel: Move the assertions about reloc delta from the macros to the function. | Eric Anholt
Cuts another 1800 bytes from the driver.
2010-03-05 | intel: Check that the batch is mapped per BEGIN, not each OUT. | Eric Anholt
Shaves 800 bytes off the driver.
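The "check per BEGIN, not each OUT" pattern from the batch commits above can be sketched roughly as follows. The `batch` struct and the three function names are hypothetical stand-ins, not the driver's actual macros or fields. The point is that validation happens once when space is reserved, so the per-dword emit path stays a plain store and the check is not duplicated at every call site.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical batch buffer; field names are illustrative only. */
struct batch {
    uint32_t *map;       /* CPU mapping of the buffer, NULL if unmapped */
    size_t used;         /* dwords written so far */
    size_t size;         /* capacity in dwords */
    size_t reserved_end; /* end of the range reserved by batch_begin() */
};

/* Reserve n dwords. The mapping and space checks live here, once. */
static void batch_begin(struct batch *b, size_t n)
{
    assert(b->map != NULL);
    assert(b->used + n <= b->size);
    b->reserved_end = b->used + n;
}

/* Emit one dword. No per-dword checks: batch_begin() already did them. */
static void batch_out(struct batch *b, uint32_t dword)
{
    b->map[b->used++] = dword;
}

/* Close the sequence and verify exactly the reserved amount was written. */
static void batch_advance(struct batch *b)
{
    assert(b->used == b->reserved_end);
}
```

A typical sequence is `batch_begin(&b, 2)`, two `batch_out()` calls, then `batch_advance(&b)`. Since `assert` compiles away under `NDEBUG`, a release build pays nothing for the checks, which is in the same spirit as the debug-only batchbuffer checks mentioned earlier.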
2010-03-05 | i965: Try to assign SNB SF inputs to outputs. | Eric Anholt
2010-03-05 | i965: Fix the SNB SF input size. | Eric Anholt
2010-03-05 | Merge branch 'object-purgeable' | Chris Wilson
Acked-by: Brian Paul <brianp@vmware.com>
2010-03-05 | APPLE_object_purgeable: intel | Chris Wilson
Implement support for purgeable objects by using the GEM madvise ioctl. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2010-03-04 | intel: Remove non-kernel-exec-fencing support. | Eric Anholt
Shaves 60k off the driver from removing the broken spans code. This means we now require 2.6.29, which seems fair given that it's a year old and we've removed support for non-KMS already in the last release of 2D.
2010-03-04 | intel: Remove the unused s8 spans code. Not hit during no_rast piglit. | Eric Anholt
Shaves 5.5k off of the driver.
2010-03-04 | dri/nouveau: Remove some CI remnants. | Francisco Jerez
2010-03-04 | dri/nouveau: Trivially add GL_ARB_texture_mirrored_repeat. | Andrew Randrianasulu
Signed-off-by: Francisco Jerez <currojerez@riseup.net>
2010-03-04 | dri/nouveau: Enable GL_EXT_stencil_wrap. | Andrew Randrianasulu
Signed-off-by: Francisco Jerez <currojerez@riseup.net>
2010-03-04 | dri/nouveau: Add GL_EXT_stencil_wrap for nv04. | Andrew Randrianasulu
Signed-off-by: Francisco Jerez <currojerez@riseup.net>
2010-03-04 | dri/nouveau: Fix rb->DataType for z24s8 renderbuffers. | Francisco Jerez
2010-03-04 | dri/nouveau: Try to keep client buffers smaller than the scratch VBO length. | Francisco Jerez
2010-03-04 | dri/nouveau: Pack client arrays as they're copied to the real BO. | Francisco Jerez
2010-03-04 | i915: Enable texture tiling by default. | Eric Anholt
This now passes piglit testcases, and shows performance improvements on openarena. Blit-heavy apps may see degradation, but the expectation is that the common case of 3D rendering will see significant wins. This regresses gen-teximage on my 855, and no piglit tests on my 945.