summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2010-11-20r600g: Fix location for clip plane registersOwen W. Taylor
The stride between the different clip plane registers was incorrect. https://bugs.freedesktop.org/show_bug.cgi?id=31788 agd5f: fix evergreen as well.
2010-11-20r300g: fix rendering with no vertex elementsMarek Olšák
Fixes glsl-vs-point-size, although I meant to fix glsl-novertexdata. Since swrast fails glsl-novertexdata too, I guess it's a core issue.
2010-11-19i965: Remove duplicate MRF writes in the FS backend.Eric Anholt
This is quite common for multitexture sampling, and not only cuts down on the second and later set of MOVs, but typically also allows compute-to-MRF on the first set. No statistically siginficant performance difference in nexuiz (n=3), but it reduces instruction count in one of its shaders and seems like a good idea.
2010-11-19i965: Improve compute-to-mrf.Eric Anholt
We were skipping it if the instruction producing the value we were going to compute-to-mrf used its result reg as a source reg. This meant that the typical "write interpolated color to fragment color" or "texture from interpolated texcoord" shader didn't compute-to-MRF. Just don't check for the interference cases until after we've checked if this is the instruction we wanted to compute-to-MRF. Improves nexuiz high-settings performance on my laptop 0.48% +- 0.08% (n=3).
2010-11-19ir_to_mesa: Detect and emit MOV_SATs for saturate constructs.Eric Anholt
The goal here is to avoid regressing performance on ir_to_mesa drivers for fixed function fragment shaders requiring saturates.
2010-11-19i965: Recognize saturates and turn them into a saturated mov.Eric Anholt
On pre-gen6, this turns 4 instructions into 1. We could still do better by folding the saturate into the instruction generating the value if nobody else uses it, but that should be a separate pass.
2010-11-19glsl: Add a helper function for determining if an rvalue could be a saturate.Eric Anholt
Hardware pretty commonly has saturate modifiers on instructions, and this can be used in codegen to produce those, without everyone else needing to understand clamping other than min and max.
2010-11-19i965: Fold constants into the second arg of BRW_SEL as well.Eric Anholt
This hits a common case with min/max operations.
2010-11-19i965: Remove extra \n at the end of every instruction in INTEL_DEBUG=wm.Eric Anholt
2010-11-19i965: Just use memset() to clear most members in FS constructors.Eric Anholt
This should make it a lot harder to forget to zero things.
2010-11-19i965: Fix compute_to_mrf to not move a MRF write up into another live range.Eric Anholt
Fixes glsl-fs-copy-propagation-texcoords-1.
2010-11-19mesa: Include C++ files in the makedepend of DRI drivers.Eric Anholt
2010-11-19glsl: Fix type of label 'default' in switch statement.Vinson Lee
2010-11-19glsl: Add lower_vector.cpp to SConscript.Vinson Lee
2010-11-19glsl: Fix matrix constructors with vector parametersIan Romanick
When the semantics of write masks in assignments were changed, this code was not correctly updated. Fixes piglit test glsl-mat-from-vec-ctor-01.
2010-11-19glsl: Combine many instruction lowering passes into one.Kenneth Graunke
This should save on the overhead of tree-walking and provide a convenient place to add more instruction lowering in the future. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>
2010-11-19glsl: Simplify a type check by using type->is_integer().Kenneth Graunke
2010-11-19glsl: Add ir_quadop_vector expressionIan Romanick
The vector operator collects 2, 3, or 4 scalar components into a vector. Doing this has several advantages. First, it will make ud-chain tracking for components of vectors much easier. Second, a later optimization pass could collect scalars into vectors to allow generation of SWZ instructions (or similar as operands to other instructions on R200 and i915). It also enables an easy way to generate IR for SWZ instructions in the ARB_vertex_program assembler.
2010-11-19glsl: Add unary ir_expression constructorIan Romanick
2010-11-19glsl: Add ir_rvalue::is_negative_one predicateIan Romanick
2010-11-19glsl: Eliminate assumptions about size of ir_expression::operandsIan Romanick
This may grow in the near future.
2010-11-19glsl: Add ir_unop_sin_reduced and ir_unop_cos_reducedIan Romanick
The operate just like ir_unop_sin and ir_unop_cos except that they expect their inputs to be limited to the range [-pi, pi]. Several GPUs require this limited range for their sine and cosine instructions, so having these as operations (along with a to-be-written lowering pass) helps this architectures. These new operations also matche the semantics of the GL_ARB_fragment_program SCS instruction. Having these as operations helps in generating GLSL IR directly from assembly fragment programs.
2010-11-19r600g: use full range of VS resources for vertex samplersAlex Deucher
Now that we have fetch shaders, the full range of VS resources can be used for sampling.
2010-11-19r600g: use meaningful defines for chiprevAlex Deucher
Makes the code much clearer.
2010-11-19r600g: translate ARR instruction for evergreenAlex Deucher
evergreen variant of: 9f7ec103e26c67cb077fd7d94d2fb68562b86c40
2010-11-19r600g: add fetch shader capabilitiesJerome Glisse
Use fetch shader instead of having fetch instruction in the vertex shader. Allow to restrict shader update to a smaller part when vertex buffer input layout changes. Signed-off-by: Jerome Glisse <jglisse@redhat.com>
2010-11-19r600g: All EVENT_WRITE packets need the EVENT_INDEX fieldAlex Deucher
6xx-evergreen
2010-11-19dri/nouveau: Clean up magic numbers in get_rt_formatViktor Novotný
Signed-off-by: Viktor Novotný <noviktor@seznam.cz> Signed-off-by: Francisco Jerez <currojerez@riseup.net>
2010-11-19r600g: fix occlusion query on evergreen (avoid lockup)Jerome Glisse
Occlusion query on evergreen need the event index field to be set otherwise we endup locking up the GPU. Signed-off-by: Jerome Glisse <jglisse@redhat.com>
2010-11-19llvmpipe: twoside for specular color alsoKeith Whitwell
2010-11-19llvmpipe: fix up twoside after recent changesKeith Whitwell
Fix my slot/attr confusion.
2010-11-19llvmpipe: fix such that offset/twoside function only does in-place modificationHui Qi Tay
2010-11-18ir_to_mesa: Generate smarter code for some conditional movesIan Romanick
Condiation moves with a condition of (a < 0), (a > 0), (a <= 0), or (a >= 0) can be generated with "a" directly as an operand of the CMP instruction. This doesn't help much now, but it will help with assembly shaders that use the CMP instruction.
2010-11-18glsl: Make is_zero and is_one virtual methods of ir_rvalueIan Romanick
This eliminates the need in some cames to validate that an rvalue is an ir_constant before checking to see if it's 0 or 1.
2010-11-18mesa: pass gl_format to _mesa_init_teximage_fields()Brian Paul
This should prevent the field going unset in the future. See bug http://bugs.freedesktop.org/show_bug.cgi?id=31544 for background. Also remove unneeded calls to clear_teximage_fields(). Finally, call _mesa_set_fetch_functions() from the _mesa_init_teximage_fields() function so callers have one less thing to worry about.
2010-11-18scons: Use inline wrap helpers more consistently.José Fonseca
2010-11-18gallium/noop: report GL 2.1Dave Airlie
this should at least make app use the same paths as they would for a real driver.
2010-11-17glsl: Fix 'control reaches end of non-void function' warning.Vinson Lee
Fix this GCC warning. ir.cpp: In static member function 'static unsigned int ir_expression::get_num_operands(ir_expression_operation)': ir.cpp:199: warning: control reaches end of non-void function
2010-11-18mesa: Clean up core.h.Chia-I Wu
Remove version.h and context.h from core.h.
2010-11-18st/glx: Replace MESA_VERSION_STRING by xmesa_get_name.Chia-I Wu
xmesa_get_name returns the name of the st_api, which is the same as MESA_VERSION_STRING.
2010-11-18st/wgl: Use st_context_iface::share for DrvShareLists.Chia-I Wu
2010-11-18gallium: Add st_context_iface::share to st_api.Chia-I Wu
It will be used to implement wglShareLists. Fill st_context_iface::copy for glXCopyContext as well.
2010-11-18gallium: Add st_api::name.Chia-I Wu
It is the name of the rendering API. This field is informative.
2010-11-18st/vega: Do not wait NULL fences.Chia-I Wu
2010-11-18i965: Eliminate dead code more aggressively.Eric Anholt
If an instruction writes reg but nothing later uses it, then we don't need to bother doing it. Before, we were just killing code that was never read after it was ever written. This removes many interpolation instructions for attributes with only a few comopnents used. Improves nexuiz high-settings performance .46% +/- .12% (n=3) on my Ironlake.
2010-11-17mesa: upgrade to glext.h version 66Brian Paul
The type of the num/count parameter to glProgramParameters4[df]vNV() changed so some API dispatch code needed updates too.
2010-11-17r600g: fix buffer alignmentAlex Deucher
This should fix the remaining buffer alignment issues in r600g.
2010-11-18i965: Fail on loops on gen6 for now until we write the EU emit code for it.Eric Anholt
2010-11-18i965: Add dumping of the sampler default color.Eric Anholt
2010-11-18i965: Add state dumping for sampler state.Eric Anholt