Age | Commit message (Collapse) | Author | |
---|---|---|---|
2010-06-13 | r300g: turn clip state into a CB | Marek Olšák | |
2010-06-13 | r300g: turn blend color into a CB | Marek Olšák | |
2010-06-13 | r300g: turn blend state into a CB | Marek Olšák | |
2010-06-13 | r300g: add API for building command buffers | Marek Olšák | |
The idea is to build a hardware command buffer for every CSO and memcpy the buffer to a command stream at bind time (or dirty-state-emission time, to be precise). | |||
2010-06-13 | r300g: inline FLUSH_CS | Marek Olšák | |
The fewer macros, the better. | |||
2010-06-13 | r300g: reorder CS macros and document them a little | Marek Olšák | |
2010-06-13 | r300g: drop DBG_CS | Marek Olšák | |
I'd like the CS macros to be as lightweight as possible for performance reasons. | |||
2010-06-13 | r300g: inline CHECK_CS | Marek Olšák | |
2010-06-13 | r300g: replace r300_cs_info with simplier get_cs_free_dwords | Marek Olšák | |
2010-06-13 | r300g: fix multiple render targets | Marek Olšák | |
This fixes tests/drawbuffers. | |||
2010-06-13 | r300g: remove r300_state.h | Marek Olšák | |
2010-06-13 | r300g: move two-sided stencilref fallback to its own file | Marek Olšák | |
2010-06-13 | r300g: move index buffer translate functions to their new home | Marek Olšák | |
2010-06-13 | r300g: add fallback for unaligned/unsupported vertex stride/offset/format | Marek Olšák | |
There is a problem though, the translate module cannot emit half float vertices. | |||
2010-06-13 | r300g: upload only vertex buffers referenced by vertex elements | Marek Olšák | |
2010-06-12 | i965: Fix gen6 front cull mode. | Eric Anholt | |
2010-06-12 | i965: Use the new message header format for FF_SYNC on gen6. | Zhenyu Wang | |
2010-06-12 | i965: Add support for math instructions in the gen6 WM. | Zhenyu Wang | |
2010-06-12 | i965: Set the correct WM GRF start reg on gen6. | Zhenyu Wang | |
2010-06-12 | i965: Update gen6 paths for the streaming rework. | Eric Anholt | |
2010-06-12 | i965: Stream out CC unit state. | Eric Anholt | |
before: [ # ] backend test min(s) median(s) stddev. count [ 0] gl firefox-talos-gfx 31.791 32.287 1.11% 6/6 after: [ 0] gl firefox-talos-gfx 31.198 31.675 0.96% 6/6 | |||
2010-06-12 | draw/gs: copy the outputs only if we emitted something | Zack Rusin | |
2010-06-12 | softpipe: small cleanup | Zack Rusin | |
2010-06-12 | r300/compiler: fix scons build | Joakim Sindholt | |
2010-06-12 | i965: Remove unnecessary header. | Vinson Lee | |
2010-06-11 | scons: Disable i965g build if using MSVC. | Vinson Lee | |
i965g uses C99 constructs that are not supported by MSVC. | |||
2010-06-11 | scons: Disable i915g build if using MSVC. | Vinson Lee | |
i915g uses C99 constructs that are not supported by MSVC. | |||
2010-06-11 | r300/compiler: Handle more complex conditionals in loops. | Tom Stellard | |
2010-06-11 | r300/compiler: Fix warning. | Tom Stellard | |
2010-06-11 | r300/compiler: Handle SGT and SLE at the beginning of loops. | Tom Stellard | |
2010-06-11 | r300/compiler: Verify assumptions about opcode types. | Tom Stellard | |
2010-06-11 | r300/compiler: Unroll loops that decrement the counter. | Tom Stellard | |
e.g. for(i=10; i>0; i--) | |||
2010-06-11 | r300/compiler: Unroll loops that have a constant number of iterations. | Tom Stellard | |
This only works with for loops that increment the counter. e.g. for(i=0; i<10; i++) | |||
2010-06-11 | r300/compiler: Implement simple loop emulation | Tom Stellard | |
The loop emulation unrolls loops as may times as possbile while still keeping the shader program below the maximum instruction limit. At this point, there are no checks for constant conditionals. This is only enabled for fragment shaders. | |||
2010-06-11 | i965: Remove the surface key used to generate constant surfaces. | Eric Anholt | |
We had to fill out all that junk when using the cache, but no more. | |||
2010-06-11 | i965: Warning fixes from the i965-streaming merge. | Eric Anholt | |
2010-06-11 | gallium/softpipe/draw: support samplers in geometry shaders | Zack Rusin | |
2010-06-11 | tgsi: support 2d indirect addressing | Zack Rusin | |
2010-06-11 | i965: Use the state base address to avoid relocations. | Eric Anholt | |
This makes the binding table code simpler, and is required for gen6, which requires binding table addresses to be under 64k offset from the surface state base addr. No significant change in performance on firefox-talos-gfx. | |||
2010-06-11 | i965: GC the last two arguments to brw_cache_data. | Eric Anholt | |
Now that the binding table is streamed indirect state, they were always NULL/0. | |||
2010-06-11 | i965: Remove brw_state_cache_bo_delete now that it's unused again. | Eric Anholt | |
2010-06-11 | i965: Remove caching of surface state objects. | Eric Anholt | |
It turns out that computing a 56 byte key to look up a 20-byte object out of a hash table was some sort of a bad idea. Whoops. before: [ # ] backend test min(s) median(s) stddev. count [ 0] gl firefox-talos-gfx 37.799 38.203 0.39% 6/6 after: [ 0] gl firefox-talos-gfx 34.761 34.784 0.17% 5/6 | |||
2010-06-11 | i965: Convert the binding table to streamed indirect state. | Eric Anholt | |
This slightly reduces reduces cairo-gl firefox-talos-gfx runtime on my Ironlake: before: [ # ] backend test min(s) median(s) stddev. count [ 0] gl firefox-talos-gfx 38.236 38.383 0.43% 5/6 after: [ 0] gl firefox-talos-gfx 37.799 38.203 0.39% 6/6 It turns out the cost of caching these objects and looking them up in the cache again is greater than the cost of just computing the object again, particularly when the overhead of having a separate BO to pin is removed. (Those that are paying close attention will note that this is a reversal of the path I was moving the driver in a couple of years ago. The major thing that has changed is that back then all state was recomputed when we wrapped the streaming state buffer, including recompiling our precious programs. Now, we're uncaching just the objects that are cheap to compute, and retaining caching of expensive objects) | |||
2010-06-11 | i965: Split constant buffer setup from its surface state/binding state. | Eric Anholt | |
This was bothering me when redoing the binding tables. | |||
2010-06-11 | i965: Add support for streaming indirect state rather than caching objects. | Eric Anholt | |
2010-06-11 | i965: Set the CC VP state immediately on state change. | Eric Anholt | |
The cache lookup of these two little floats was .12% of total CPU time on firefox-talos-gfx because we did it any time commonly-changed state changed. On the other hand, updating the CC VP bo immediately whenver CC VP state changes is a .07% overhead due to putting a driver hoook in glEnable(). | |||
2010-06-11 | i965: Update old comment about state cache sizing. | Eric Anholt | |
2010-06-11 | i965: Move no_batch_wrap assertion out across the area we're trying to verify. | Eric Anholt | |
It's more likely that we wrap badly in state setup than in the little primitive packet. | |||
2010-06-10 | i965: remove UseProgram driver callback | Brian Paul | |
It just duplicated the default/core Mesa behaviour. | |||
2010-06-11 | docs: Update EGL doc. | Chia-I Wu | |
Update for recent removal of demos and additions of new displays and functions. |