Age | Commit message | Author | |
---|---|---|---|
2010-10-09 | gallivm: faster iround implementation for sse2 | Roland Scheidegger | |
sse2 supports round to nearest directly (or rather, assuming default nearest rounding mode in MXCSR). Use intrinsic to use this rather than round (sse41) or bit manipulation whenever possible. | |||
2010-10-09 | gallivm: fix trunc/itrunc comment | Roland Scheidegger | |
trunc of -1.5 is -1.0 not 1.0... | |||
2010-10-08 | gallivm: Remove unnecessary header. | Vinson Lee | |
2010-10-08 | gallivm: Help for combined extraction and broadcasting. | José Fonseca | |
Doesn't change generated code quality, but saves some typing. | |||
2010-10-08 | llvmpipe: First minify the texture size, then broadcast. | José Fonseca | |
2010-10-08 | gallivm: Move into the as much of the second level code as possible. | José Fonseca | |
Also, pass more stuff through the sample build context, instead of arguments. | |||
2010-10-08 | gallivm: Warn when doing inefficient integer comparisons. | José Fonseca | |
2010-10-08 | gallivm: round rather than truncate in new 4x4f->1x16ub conversion path | Keith Whitwell | |
2010-10-08 | gallivm: Use the wrappers for SSE pack intrinsics. | José Fonseca | |
Fixes assertion failures on LLVM 2.6. | |||
2010-10-08 | gallivm: special case conversion 4x4f to 1x16ub | Keith Whitwell | |
Nice reduction in the number of operations required for final color output in many shaders. | |||
2010-10-08 | gallivm: Implement brilinear filtering. | José Fonseca | |
2010-10-08 | gallivm: Fix copy'n'paste typo in previous commit. | José Fonseca | |
2010-10-08 | gallivm: Clamp mipmap level and zero mip weight simultaneously. | José Fonseca | |
2010-10-08 | gallivm: Use lp_build_ifloor_fract for lod computation. | José Fonseca | |
Forgot this one before. | |||
2010-10-08 | gallivm: Don't compute the second mipmap level when frac(lod) == 0 | José Fonseca | |
2010-10-08 | gallivm: Simplify lp_build_mipmap_level_sizes' interface. | José Fonseca | |
2010-10-08 | gallivm: Do not do mipfiltering when magnifying. | José Fonseca | |
If lod < 0, then it invariably follows that ilevel0 == ilevel1 == 0. | |||
2010-10-07 | gallivm: Vectorize the rho computation. | José Fonseca | |
2010-10-06 | util: Cleanup util_pack_z_stencil and friends. | José Fonseca | |
- Handle PIPE_FORMAT_Z32_FLOAT packing correctly. - In the integer version z shouldn't be passed as a double. - Make it clear that the integer versions should only be used for masks. - Make integer type sizes explicit (uint32_t for now, although uint64_t will be necessary later to encode f32_s8_x24). | |||
2010-10-06 | gallivm: Compute lod as integer whenever possible. | José Fonseca | |
More accurate/faster results for PIPE_TEX_MIPFILTER_NEAREST. Less FP <-> SI conversion overall. | |||
2010-10-06 | gallivm: Only apply min/max_lod when necessary. | José Fonseca | |
2010-10-06 | gallivm: don't apply zero lod_bias | Keith Whitwell | |
2010-10-06 | gallivm: Combined ifloor & fract helper. | José Fonseca | |
The only way to ensure we don't do redundant FP <-> SI conversions. | |||
2010-10-06 | gallivm: Fast implementation of iround(log2(x)) | José Fonseca | |
Not tested yet, but should be correct. | |||
2010-10-06 | gallivm: Use a faster (and less accurate) log2 in lod computation. | José Fonseca | |
2010-10-06 | gallivm: Take the type signedness into consideration in round/ceil/floor. | José Fonseca | |
2010-10-05 | pb: fix numDelayed accounting | Dave Airlie | |
we weren't decreasing when removing from the list. | |||
2010-10-05 | pb: don't keep checking buffers after first busy | Dave Airlie | |
If we assume busy buffers are added to the list in order, it's unlikely we'd find one after the first busy one that isn't busy. | |||
2010-09-30 | draw: check for null sampler pointers | Brian Paul | |
http://bugs.freedesktop.org/show_bug.cgi?id=30516 | |||
2010-09-30 | gallivm: added some comments | Brian Paul | |
2010-09-30 | Revert "Prefer intrinsics to handrolled atomic ops." | Tom Fogal | |
This reverts commit 5f66b340aa49c6bc8d0acb2d1a6f8e9a7ef2cb2e, quickly fixing 30514. | |||
2010-09-30 | Prefer intrinsics to handrolled atomic ops. | Tom Fogal | |
2010-09-30 | Implement x86_64 atomics for compilers w/o intrinsics. | Tom Fogal | |
Really old gcc's (3.3, at least) don't have support for the intrinsics we need. This implements a fallback for that case. | |||
2010-09-30 | gallivm: check for level=0 case in lp_build_minify() | Brian Paul | |
This lets us avoid the shift and max() operations. | |||
2010-09-30 | gallivm: More comprehensive border usage logic. | José Fonseca | |
2010-09-30 | gallium/util: remove duplicated include | Nicolas Kaiser | |
Remove duplicated include. Signed-off-by: Brian Paul <brianp@vmware.com> | |||
2010-09-30 | gallium/rtasm: remove duplicated include | Nicolas Kaiser | |
Remove duplicated include. Signed-off-by: Brian Paul <brianp@vmware.com> | |||
2010-09-29 | gallivm: Use SSE4.1's ROUNDSS/ROUNDSD for scalar rounding. | José Fonseca | |
2010-09-29 | draw: pass sampler state down to llvm jit state | Brian Paul | |
Fixes a regression caused by the change to make min/max lod dynamic state. https://bugs.freedesktop.org/show_bug.cgi?id=30437 | |||
2010-09-27 | llvmpipe: fix swizzling of texture border color | Brian Paul | |
The pipe_sampler_view's swizzle terms also apply to the texture border color. Simply move the apply_sampler_swizzle() call after we fetch the border color. Fixes many piglit texwrap failures. | |||
2010-09-27 | ureg: support centroid interpolation | Luca Barbieri | |
2010-09-26 | util/u_blitter: fix leak | Joakim Sindholt | |
2010-09-25 | gallivm: fix repeat() function for NPOT textures | Brian Paul | |
The trick of casting the coord to an unsigned value only works for POT textures. Add a bias instead. This fixes a few piglit texwrap failures. | |||
2010-09-25 | gallivm: Remove dead experimental code. | José Fonseca | |
2010-09-25 | gallivm: Fetch the lod from the dynamic state when min_lod == max_lod. | José Fonseca | |
2010-09-25 | draw: Fulfill the new min_lod/max_lod/lod_bias/border_color dynamic state | José Fonseca | |
2010-09-25 | gallivm: optimize yuv decoding | Roland Scheidegger | |
This is more a proof to show that vector shifts on x86 with per-element shift count are evil. Since we can avoid the shift with a single compare/select, use that instead. Replaces more than 20 instructions (and slow ones at that) with about 3, and cuts compiled shader size with mesa's yuvsquare demo by over 10% (no performance measurements done - but selection is blazing fast). Might want to revisit that for future cpus - unfortunately AVX won't have vector shifts either, but AMD's XOP will, but even in that case using selection here is probably not slower. | |||
2010-09-25 | gallivm: don't use URem/UDiv when calculating offsets for blocks | Roland Scheidegger | |
While it's true that llvm can and will indeed replace this with bit arithmetic (since block height/width is POT), it does so (llvm 2.7) by element and hence extracts/shifts/reinserts each element individually. This costs about 16 instructions (and extract is not really fast) vs. 1... | |||
2010-09-25 | gallivm: fix copy&paste bug | Roland Scheidegger | |
looks like pot_depth should be used, not pot_height (found by accident, not verified) | |||
2010-09-25 | util: fix util_pack_color for B4G4R4A4 | Marek Olšák | |
NOTE: This is a candidate for the 7.9 branch. |
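
The 2010-09-25 entry "gallivm: fix repeat() function for NPOT textures" notes that casting the coordinate to an unsigned value only works for power-of-two (POT) sizes, and that a bias is added instead. The scalar sketch below illustrates why, under stated assumptions: the function names `wrap_repeat_cast` and `wrap_repeat_bias` and the bias factor of 1024 are hypothetical and are not taken from the Mesa source, which generates LLVM IR rather than plain C.

```c
#include <stdint.h>

/* Hypothetical illustration, not the gallivm code.
 * Wrap by reinterpreting the signed coordinate as unsigned.
 * A negative coord becomes 2^32 - |coord|; the remainder is only
 * correct when size divides 2^32, i.e. when size is a power of two. */
static uint32_t wrap_repeat_cast(int32_t coord, uint32_t size)
{
    return (uint32_t)coord % size;
}

/* Wrap by first adding a bias that is a multiple of size, so the
 * value is non-negative before the unsigned remainder is taken.
 * Assumption: coord > -(size * 1024) and coord + bias does not overflow. */
static uint32_t wrap_repeat_bias(int32_t coord, uint32_t size)
{
    const int32_t bias = (int32_t)size * 1024;
    return (uint32_t)(coord + bias) % size;
}
```

With a size of 4, both functions map a coordinate of -1 to texel 3. With a size of 3, the cast version maps -1 to texel 0, while the biased version gives the expected texel 2.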