summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2010-08-16translate_sse: remove useless generated function wrappersLuca Barbieri
Currently translate_sse puts two trivial wrappers in the translate vtable. These slow it down and enlarge the source code for no gain, except perhaps the ability to set a breakpoint there, so remove them. Breakpoints can be set on the caller of the translate functions, with no loss of functionality.
2010-08-16translate_generic: factor out common code between linear and indexedLuca Barbieri
This moves the common code into a separate ALWAYS_INLINE function.
2010-08-16translate_generic: use memcpy if possible (v3)Luca Barbieri
Changes in v3: - If we can do a copy, don't try to get an emit func, as that can assert(0) Changes in v2: - Add comment regarding copy_size When used in GPU drivers, translate can be used to simultaneously perform a gather operation, and convert away from unsupported formats. In this use case, input and output formats will often be identical: clearly it would make sense to use a memcpy in this case. Instead, translate will insist to convert to and from 32-bit floating point numbers. This is not only extremely expensive, but it also loses precision for 32/64-bit integers and 64-bit floating point numbers. This patch changes translate_generic to just use memcpy if the formats are identical, non-blocked, and with an integral number of bytes per pixel (note that all sensible vertex formats are like this).
2010-08-16drwa: Fix polygon edge flags.Chia-I Wu
Fix a copy-and-paste error introduced by f141abdc8fdbff41e16b0ce53fa3fa8fba32a7f9.
2010-08-16draw: No need to make max_vertices even.Chia-I Wu
Triangle strip alternates the front/back orientation of its triangles. max_vertices was made even so that varray never splitted a triangle strip at the wrong positions. It did not work with triangle strips with adjacencies. And it is no longer relevant with vsplit.
2010-08-16draw: Remove DRAW_PIPE_MAX_VERTICES and DRAW_PIPE_FLAG_MASK.Chia-I Wu
The higher bits of draw elements are no longer used for the stipple or edge flags.
2010-08-16drwa: Add PRIMITIVE macro to vsplit.Chia-I Wu
PRIMITIVE is used by the indexed path to flush the entire primitive with custom vertex count checks. It replaces the existing fast path.
2010-08-16draw: last_vertex_last is always true for GS and SO.Chia-I Wu
That is, OpenGL decomposition rule is assumed. There should be a pipe_context state to specify the rules.
2010-08-16draw: Remove varray and vcache.Chia-I Wu
They have been deprecated by vsplit.
2010-08-16draw: Replace vcache by vsplit.Chia-I Wu
vcache decomposes primitives while vsplit splits primitives. Splitting is generally easier to do and is faster. More importantly, vcache depends on flatshade_first to decompose. The outputs may have incorrect vertex order which is significant to GS.
2010-08-16draw: Replace varray by vsplit.Chia-I Wu
vsplit is a superset of varray. It sets the split flags comparing to varray.
2010-08-16draw: Add vsplit frontend.Chia-I Wu
vsplit is based on varray. It sets the split flags when a primitive is splitted. It also has support for indexed primitives. For indexed primitives, unlike vcache, vsplit splits the primitives instead of decomposes them.
2010-08-16draw: Add new util function draw_pt_trim_count.Chia-I Wu
draw_pt_trim_count is renamed from trim in draw_pt.c.
2010-08-16draw: Simplify frontend interface a little.Chia-I Wu
The run method is simplified to take the start vertex and the vertex count.
2010-08-16draw: Add prim flags to middle ends.Chia-I Wu
Update the middle end interface to pass the primitive flags from the frontends to the pipeline. No frontend sets the flags yet.
2010-08-16draw: Add flags to draw_prim_info.Chia-I Wu
A primitive may be splitted in frontends. The splitted primitives should convey certain flag bits so that the decomposer can correctly decide the stipple or edge flags. This commit adds flags to draw_prim_info and updates the decomposer to honor the flags. Frontends and middle ends will be updated later.
2010-08-16st/mesa: test for FEATURE definesnobled
'struct dd_function_table' only conditionally contains the function pointer NewFramebuffer and friends based on FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h) Fixes the build when the features are disabled and the vfuncs don't exist.
2010-08-16dri/nouveau: test for FEATURE definesnobled
'struct dd_function_table' only conditionally contains the function pointer NewFramebuffer and friends based on FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h) Fixes the build when the features are disabled and the vfuncs don't exist.
2010-08-16dri/radeon: test for FEATURE definesnobled
'struct dd_function_table' only conditionally contains the function pointer NewFramebuffer and friends based on FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h) Fixes the build when the features are disabled and the vfuncs don't exist.
2010-08-16x86: Remove unnecessary header from sse.h.Vinson Lee
2010-08-15x86: Include missing headers in mmx.h.Vinson Lee
Include compiler.h for _ASMAPI symbol. Include mtypes.h for GLcontext symbol.
2010-08-15x86: Remove unnecessary header from 3dnow.h.Vinson Lee
2010-08-15glsl: Clean up header file inclusion in slang_typeinfo.h.Vinson Lee
Remove imports.h, mtypes.h, and slang_vartable.h. Include glheader.h for GL symbols.
2010-08-15glsl: Include missing header in slang_codegen.h.Vinson Lee
Include slang_vartable.h for slang_var_table symbol.
2010-08-15glsl: Include missing header in slang_compile_operation.h.Vinson Lee
Include compiler.h for INLINE symbol.
2010-08-15glsl: Clean up header file inclusion in slang_storage.h.Vinson Lee
Remove slang_compile.h. Include glheader.h for GL symbols. Include slang_compile_function.h for slang_function_scope symbol. Include slang_compile_struct.h for slang_struct_scope symbol. Include slang_compile_variable.h for slang_variable_scope symbol. Include slang_typeinfo.h for slang_type_specifier symbol. Include slang_utility.h for slang_atom_pool symbol.
2010-08-15glsl: Clean up header file inclusion in slang_mem.h.Vinson Lee
slang_mem.h Remove imports.h. Include glheader.h for GL symbols. slang_label.c Include imports.h now that slang_mem.c does not include it.
2010-08-15glsl: Clean up header file inclusion in slang_ir.h.Vinson Lee
Remove imports.h and slang_compile.h. Include glheader.h for GL symbols. Include slang_compile_variable.h for slang_variable symbol.
2010-08-15glsl: Include missing header in slang_link.c.Vinson Lee
Include slang_compile.h for _slang_compile function.
2010-08-15glsl: Include missing headers in slang_builtin.c.Vinson Lee
Include slang_typeinfo.h for slang_type_specifier symbol. Include slang_compiler_struct.h for slang_struct_ symbol.
2010-08-16r300g: fix an invalid pointer in freeMarek Olšák
2010-08-16r300g: Let hyperz init failnobled
Signed-off-by: Marek Olšák <maraeo@gmail.com>
2010-08-16r300g: Fix leaks in failed context creationnobled
This changes r300_destroy_context() so it can be called on a partially-initialized context, and uses it when r300_create_context() hits a fatal error. This makes sure r300_create_context() doesn't leak memory or neglect to call r300_update_num_contexts() when it fails. Signed-off-by: Marek Olšák <maraeo@gmail.com>
2010-08-16r300g: Fix macronobled
This fixes a potential bug if (has_hyperz) is false (it would still init the atom as if has_hyperz were true). Signed-off-by: Marek Olšák <maraeo@gmail.com>
2010-08-16r300/compiler: implement DP2 opcodeMarek Olšák
2010-08-16r300/compiler: implement SSG opcodeMarek Olšák
2010-08-16r300/compiler: fix allocation of temporaries in radeonTransformTEXMarek Olšák
2010-08-15llvmpipe: special case triangles which fall in a single 16x16 blockKeith Whitwell
Check for these and route them to a dedicated handler with one fewer levels of recursive rasterization.
2010-08-15llvmpipe: consolidate several loops in lp_rast_triangleKeith Whitwell
2010-08-15llvmpipe: remove all traces of step arrays, pos_tablesKeith Whitwell
No need to calculate these values any longer, nor to store them in the bin data. Improves isosurf a bit more, 115->123 fps.
2010-08-15llvmpipe: eliminate last usage of step array in rast_tmp.hKeith Whitwell
For 16 and 64 pixel levels, calculate a mask which is linear in x and y (ie not in the swizzle layout). When iterating over full and partial masks, figure out position by manipulating the bit number set in the mask, rather than relying on postion arrays. Similarly, calculate the lower-level c values from dcdx, dcdy and the position rather than relying on the step array.
2010-08-15llvmpipe: don't refer to plane->step when dcdx or dcdy would doKeith Whitwell
2010-08-15llvmpipe: also use build_mask at 16, 64 pixel levelsKeith Whitwell
2010-08-15llvmpipe: version of block4 which doesn't need the full step arrayKeith Whitwell
No noticable slowdown with isosurf.
2010-08-15llvmpipe: reorganize block4 loop, nice speedupKeith Whitwell
isosurf 95->115 fps just by exchanging the two inner loops in this function...
2010-08-15gallium/docs: Cleanup debugging. Spelling, grammar, organization.Corbin Simpson
2010-08-15gallium/docs: Add formatting for envvar role; change debugging.Corbin Simpson
Per Jakob's request. Not super-pretty, but it's a good point for modding later.
2010-08-15glsl: Fix self inclusion in slang_compile_function.h.Vinson Lee
Fix self inclusion introduced by commit 4fef77c7c5455b983daa93cacf90d2b3baa3967e.
2010-08-15glsl: Include missing header in slang_ir.h.Vinson Lee
Include prog_instruction.h for gl_inst_opcode symbol.
2010-08-15glsl: Clean up header file inclusion in slang_link.h.Vinson Lee
Remove slang_compile.h. Include mtypes.h for GLcontext symbol.