Age | Commit message | Author
2010-09-13 | loop_controls: fix analysis of already analyzed loops | Luca Barbieri
The loop_controls pass didn't look at the counter values it put in ir_loop on previous iterations, so while the first iteration worked, subsequent ones couldn't determine max_iterations.
2010-09-13 | i965: Request that returns be lowered in shader main | Ian Romanick
Fixes piglit tests glsl-vs-main-return and glsl-fs-main-return.
2010-09-13 | glsl: call ir_lower_jumps according to compiler options | Luca Barbieri
2010-09-13 | glsl: add continue/break/return unification/elimination pass (v2) | Luca Barbieri
Changes in v2:
- Base class renamed to ir_control_flow_visitor
- Tried to comply with coding style

This is a new pass that supersedes ir_if_return and "lowers" jumps to if/else structures.

Currently it causes no regressions on softpipe and nv40, but I'm not sure whether the piglit glsl tests are thorough enough, so consider this experimental.

It can be asked to:
1. Pull jumps out of ifs where possible
2. Remove all "continue"s, replacing them with an "execute flag"
3. Replace all "break" with a single conditional one at the end of the loop
4. Replace all "return"s with a single return at the end of the function, for the main function and/or other functions

This gives several great benefits:
1. All functions can be inlined after this pass
2. nv40 and other pre-DX10 chips without "continue" can be supported
3. nv30 and other pre-DX10 chips with no control flow at all are better supported

Note that for full effect we should also teach the unroller to unroll loops with a fixed maximum number of iterations but with the canonical conditional "break" that this pass will insert if asked to.

Continues are lowered by adding a per-loop "execute flag", initialized to TRUE, that when cleared inhibits all execution until the end of the loop.

Breaks are lowered to continues, plus setting a "break flag" that is checked at the end of the loop, and trigger the unique "break".

Returns are lowered to breaks/continues, plus adding a "return flag" that causes loops to break again out of their enclosing loops until all the loops are exited: then the "execute flag" logic will ignore everything until the end of the function.

Note that "continue" and "return" can also be implemented by adding a dummy loop and using break. However, this is bad for hardware with limited nesting depth, and prevents further optimization, and thus is not currently performed.
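A minimal sketch of the continue and break lowering described in this message, written as plain C++ rather than GLSL IR. The function names and the flag variable names are illustrative assumptions, not code from the pass. Only the idea comes from the message: an execute flag that suppresses the rest of the iteration, and a break flag tested by one conditional break at the end of the loop body.

```cpp
#include <cstddef>
#include <vector>

// Original form: "continue" and "break" appear in the middle of the body.
// Sums the positive values that come before the first zero.
int sum_original(const std::vector<int> &values)
{
   int sum = 0;
   for (std::size_t i = 0; i < values.size(); i++) {
      if (values[i] < 0)
         continue;
      if (values[i] == 0)
         break;
      sum += values[i];
   }
   return sum;
}

// Lowered form (hypothetical hand translation): no "continue", and a
// single conditional "break" as the last statement of the loop body.
int sum_lowered(const std::vector<int> &values)
{
   int sum = 0;
   for (std::size_t i = 0; i < values.size(); i++) {
      bool execute_flag = true;   // cleared to skip the rest of this iteration
      bool break_flag = false;    // set when the original code would break

      if (values[i] < 0)
         execute_flag = false;    // was: continue

      if (execute_flag && values[i] == 0) {
         break_flag = true;       // was: break
         execute_flag = false;    // a break also skips the rest of the body
      }

      if (execute_flag)
         sum += values[i];

      if (break_flag)
         break;                   // the one remaining break
   }
   return sum;
}
```

Both functions should return the same value for any input; the lowered one trades jumps for flag tests, which is the form the message says hardware without "continue" can execute.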
2010-09-13 | glsl: add ir_control_flow_visitor | Luca Barbieri
This is just a subclass of ir_visitor with empty implementations of all the visit methods for non-control flow nodes. Used to avoid duplicating that in ir_visitor subclasses. ir_hierarchical_visitor is another way to solve this, but is less natural for some applications.
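The pattern can be sketched on a toy node hierarchy. Everything below (Node, Visitor, ControlFlowVisitor, JumpCounter, count_jumps) is a hypothetical stand-in and not the actual ir_visitor interface; it only shows how a base class with empty visit methods for non-control-flow nodes keeps subclasses short.

```cpp
#include <vector>

struct Assignment;
struct Constant;
struct Loop;
struct Jump;

// Full visitor interface: every node type needs a visit method.
struct Visitor {
   virtual ~Visitor() {}
   virtual void visit(Assignment &) = 0;
   virtual void visit(Constant &) = 0;
   virtual void visit(Loop &) = 0;
   virtual void visit(Jump &) = 0;
};

struct Node {
   virtual ~Node() {}
   virtual void accept(Visitor &v) = 0;
};

struct Assignment : Node { void accept(Visitor &v) override { v.visit(*this); } };
struct Constant : Node { void accept(Visitor &v) override { v.visit(*this); } };
struct Jump : Node { void accept(Visitor &v) override { v.visit(*this); } };
struct Loop : Node {
   std::vector<Node *> body;
   void accept(Visitor &v) override { v.visit(*this); }
};

// Empty implementations for the non-control-flow nodes, written once.
// Control-flow nodes stay pure virtual so subclasses must handle them.
struct ControlFlowVisitor : Visitor {
   void visit(Assignment &) override {}
   void visit(Constant &) override {}
};

// A subclass now only spells out the control-flow cases.
struct JumpCounter : ControlFlowVisitor {
   int jumps = 0;
   void visit(Loop &loop) override {
      for (Node *n : loop.body)
         n->accept(*this);
   }
   void visit(Jump &) override { jumps++; }
};

int count_jumps(Node &root)
{
   JumpCounter counter;
   root.accept(counter);
   return counter.jumps;
}
```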
2010-09-13 | llvmpipe: Fix non SSE2 builds. | José Fonseca
Should fix fdo 30168.
2010-09-13 | r300g/swtcl: unlock VBO after draw_flush | Marek Olšák
https://bugs.freedesktop.org/show_bug.cgi?id=29901 https://bugs.freedesktop.org/show_bug.cgi?id=30132
2010-09-13 | llvmpipe: Change asm to __asm__. | Witold Baryluk
According to the gcc documentation the two are equivalent; the second is preferred because the first can conflict with existing symbols.
Signed-off-by: José Fonseca <jfonseca@vmware.com>
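A small sketch of the spelling in question. The GCC manual recommends `__asm__` for code that may be built with `-ansi` or a strict `-std=` option, where plain `asm` is not treated as a keyword in C. The helper names below are hypothetical and not taken from llvmpipe; the empty asm statement with a "memory" clobber is a common compiler-only barrier and emits no instructions.

```cpp
// Compiler-level memory barrier using the alternate keyword spelling.
#if defined(__GNUC__)
static inline void compiler_barrier()
{
   // Empty instruction string; the "memory" clobber only stops the
   // compiler from reordering memory accesses across this point.
   __asm__ __volatile__("" : : : "memory");
}
#else
static inline void compiler_barrier()
{
   // No GCC-style inline asm available. This fallback is NOT a barrier;
   // a real port would use the compiler's own intrinsic here.
}
#endif

// Trivial caller so the helper is exercised.
int barrier_demo(int x)
{
   int y = x + 1;
   compiler_barrier();
   return y;
}
```

The `__GNUC__` guard also covers the case of building with a compiler that does not accept GCC inline asm syntax at all.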
2010-09-13 | EGL DRI2: 0xa011 is Pineview not Ironlake | Jesse Barnes
Point about needing a better way to do this validated.
2010-09-13 | r600c: const buffer sizes must be a multiple of 16 consts | Alex Deucher
This applies to r6xx/r7xx/evergreen
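One way to satisfy such a constraint is to round the constant count up to the next multiple of 16. This is only an illustration of the arithmetic; the helper name and the choice to round up (rather than reject the size) are assumptions, and the commit's actual change is not shown here.

```cpp
#include <cstdint>

// Granularity stated in the commit subject: sizes are counted in
// constants and must be a multiple of 16.
constexpr std::uint32_t kConstGranularity = 16;

// Hypothetical helper: round a constant count up to the next multiple
// of 16. Works because the granularity is a power of two.
constexpr std::uint32_t align_const_count(std::uint32_t count)
{
   return (count + kConstGranularity - 1) & ~(kConstGranularity - 1);
}
```

For example, a shader using 17 constants would be given a buffer sized for 32 under this scheme.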
2010-09-13 | EGL DRI2: add PCI ID for Ironlake mobile | Jesse Barnes
Allows KMS EGL driver to load. We need a better way of doing this.
2010-09-13 | r600c/eg: remove obselete comment | Alex Deucher
2010-09-13 | r600c/eg: remove unused emit timestamp function | Alex Deucher
2010-09-13 | r600c/eg: emit CB_BLEND_ALPHA with the other blend values | Alex Deucher
saves a few dwords
2010-09-13 | r600c: remove redundant state emit on evergreen | Alex Deucher
r700start3d already emits the context control packets
2010-09-13 | mesa: Revert accidentally committed vertex code chunk | Kristian Høgsberg
2010-09-13 | r600c: eg: fix typo | Andre Maasikas
probably copy/paste error
2010-09-13 | r600c: eg: 256 float4 constants may need more than 256 bytes | Andre Maasikas
2010-09-13 | r600c: eg - fix uninitialized variable | Andre Maasikas
2010-09-13 | glx: Don't destroy DRI2 drawables for legacy glx drawables | Kristian Høgsberg
For GLX 1.3 drawables, we can destroy the DRI2 drawable when the GLX drawable is destroyed. However, for legacy drawables, there is no good way of knowing when the application is done with it, so we just let the DRI2 drawable linger on the server. The server will destroy the DRI2 drawable when it destroys the X drawable or the client exits anyway. https://bugs.freedesktop.org/show_bug.cgi?id=30109
2010-09-13 | r300g: fix SWTCL | Marek Olšák
https://bugs.freedesktop.org/show_bug.cgi?id=29901
2010-09-13 | llvmpipe: Unbreak rasterization on 64bit. | José Fonseca
2010-09-13 | gallium: Change the resource_copy_region semantics to allow copies between different yet compatible formats | José Fonseca
2010-09-13 | r600g: evergreen fixup dsa state for running query. | Dave Airlie
evergreen is always the same as r700 here.
2010-09-13 | r600c: remove stray unmap call | Andre Maasikas
no idea how/why it got there
2010-09-13 | llvmpipe: use gcc asm only with gcc | José Fonseca
2010-09-13 | r300g: print unassigned FS inputs for DBG_RS | Marek Olšák
2010-09-13 | r300g: fix map_buffer | Marek Olšák
https://bugs.freedesktop.org/show_bug.cgi?id=30145
2010-09-13 | r300/compiler: fix warnings | Marek Olšák
2010-09-13 | r300g: add new debug options for dumping scissor regs and disabling CBZB clear | Marek Olšák
2010-09-13 | r300g: skip rendering if CS space validation fails | Marek Olšák
radeon_cs_space_check flushes the pipe context on failure, retries the validation, and returns -1 if it fails again. At that point, there is nothing we can do, so let's skip draw operations instead of getting stuck in an infinite loop. This code path ideally should never be hit.
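The control flow described above can be modelled with a small stand-in. FakeContext, fake_space_check and draw are hypothetical names; fake_space_check only imitates the behaviour the message attributes to radeon_cs_space_check (flush on failure, retry once, return -1 if the retry also fails) and is not the real function.

```cpp
// Toy model of a pipe context for the purpose of this sketch.
struct FakeContext {
   int failing_attempts;   // how many upcoming validation attempts will fail
   int flushes;            // number of flushes performed
   int draws_emitted;      // number of draws that actually went out
};

// One validation attempt; consumes one scheduled failure if any remain.
static bool try_validate(FakeContext &ctx)
{
   if (ctx.failing_attempts > 0) {
      ctx.failing_attempts--;
      return false;
   }
   return true;
}

// Stand-in for the space check: 0 on success, -1 if validation still
// fails after one flush and one retry.
int fake_space_check(FakeContext &ctx)
{
   if (try_validate(ctx))
      return 0;
   ctx.flushes++;
   return try_validate(ctx) ? 0 : -1;
}

// The draw path: on a hard failure, skip the draw instead of retrying
// forever. Returns true if the draw was emitted.
bool draw(FakeContext &ctx)
{
   if (fake_space_check(ctx) != 0)
      return false;
   ctx.draws_emitted++;
   return true;
}
```

Returning early costs one dropped draw call in a path the message says should ideally never be hit, in exchange for guaranteed forward progress.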
2010-09-13 | r300g: remove u_upload_flush from r300_draw_arrays | Marek Olšák
This is probably a leftover and is unnecessary, since we flush u_upload_mgr in r300_flush.
2010-09-12 | nvfx: Remove unused variables. | Vinson Lee
2010-09-12 | nvfx: Move declaration before code. | Vinson Lee
Fixes SCons build.
2010-09-12 | llvmpipe: introduce tri_3_4 for tiny triangles | Keith Whitwell
2010-09-12 | llvmpipe: allow tri_3_16 at any 4-aligned location within a tile | Keith Whitwell
Doesn't require 16-alignment, so catch more cases.
2010-09-12 | llvmpipe: refactor tri_3_16 | Keith Whitwell
Keep step array as a set of four m128i's and reuse throughout the rasterization.
2010-09-12 | llvmpipe: pass linear masks to fragment shader | Keith Whitwell
Fragment shader can extract the correct bits for each quad.
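As an illustration of extracting per-quad bits from a linear mask, the sketch below assumes a 16-bit mask for a 4x4 pixel block with bit index y*4 + x, and 2x2 quads addressed by (qx, qy). That layout, the output bit order and the function name are all assumptions made for the example; the actual llvmpipe mask layout is not given in this log.

```cpp
#include <cstdint>

// Extract the 4 coverage bits of one 2x2 quad from a row-major 4x4
// block mask (assumed layout: bit index = y * 4 + x).
// Result bits: 0 = top-left, 1 = top-right, 2 = bottom-left, 3 = bottom-right.
unsigned quad_mask(std::uint16_t block_mask, unsigned qx, unsigned qy)
{
   const unsigned x0 = qx * 2;   // left column of the quad
   const unsigned y0 = qy * 2;   // top row of the quad

   // Two adjacent bits from the quad's top row, then from its bottom row.
   const unsigned top = (block_mask >> (y0 * 4 + x0)) & 0x3u;
   const unsigned bottom = (block_mask >> ((y0 + 1) * 4 + x0)) & 0x3u;

   return top | (bottom << 2);
}
```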
2010-09-12 | llvmpipe: fix warnings on both 32 and 64 bit builds | Keith Whitwell
2010-09-12 | llvmpipe: fix wierd performance regression in isosurf | Keith Whitwell
I really don't understand the mechanism behind this, but it seems like the way data blocks for a scene are malloced, and in particular whether we treat them as a stack or a queue, and whether we retain the most recently allocated or least recently allocated, has a real effect (~5%) on isosurf framerates... This is probably specific to my distro or even just my machine, but nonetheless, it's nicer not to see the framerates go in the wrong direction.
2010-09-12 | pb: Fix the build, and add notes. | José Fonseca
2010-09-12 | llvmpipe: Only generate the whole shader specialization for opaque shaders. | José Fonseca
If not opaque, then the color buffer will have to be read any way, therefore the specialization is pointless.
2010-09-12 | pb: add void * for flush ctx to mapping functions | Dave Airlie
If the buffer we are attempting to map is referenced by the unsubmitted command stream for this context, we need to flush the command stream. To do that, however, we need to be able to access the context in the lowest-level map function. Currently we set the buffer in the toplevel map, but this is racy between contexts (we probably have a lot more issues than that). I'll look into a proper solution as suggested by jrfonseca when I get some time.
2010-09-11 | nv30: fix breakage due to 10 texcoord support on nv40 | Luca Barbieri
2010-09-12 | Add missing files to the tarball file lists. | Chia-I Wu
2010-09-12 | mesa: Fix depend.es[12] generation when LLVM is enabled. | Chia-I Wu
"llvm-config --cflags" outputs -f options, which conflict with makedepend. Clean up compiler flags and append LLVM_CFLAGS to the new xxx_CFLAGS instead of xxx_CPPFLAGS, where xxx may be MESA, ES1, or ES2.
2010-09-11 | r600g: Undo bo placement change. | Tilman Sauerbeck
This reverts a part of e795ca8f3175fa6fd97b6b2ef2775e3f8803012a that causes artefacts and a performance drop.
Signed-off-by: Tilman Sauerbeck <tilman@code-monkey.de>
2010-09-11 | llvmpipe: Silence some warnings. | José Fonseca
2010-09-11 | gallivm: nr_channels is only valid for formats with plain layout. | José Fonseca
This is erroneously throwing non plain formats out of the faster AoS sampling path. Doing 8bit interpolation for single channels such as L8 should be no worse than with floating point. But this may need more investigation.
2010-09-11 | gallivm: Use const keyword on swizzles. | José Fonseca