Age | Commit message | Author | Files | Lines
2009-10-20 | intel: Only call clock_gettime once per unreference_final. [HEAD, master] | Eric Anholt | 1 | -15/+32
Notably when freeing a batchbuffer, we often end up freeing many of the buffers it points at as well. Avoiding repeated calls brings us a 9% CPU win for cairo-gl.
[ # ]  backend  test               min(s)  median(s)  stddev.  count
before:
[  0]  gl       firefox-talos-gfx  58.941  58.966     0.75%    3/3
after:
[  0]  gl       firefox-talos-gfx  54.186  54.195     0.49%    3/3
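The idea in this commit can be sketched as follows: read the clock once at the outermost unreference and hand the timestamp down to every buffer freed as a consequence. This is a minimal illustration, not the libdrm_intel code; `struct sketch_bo`, `sketch_now`, `sketch_unreference_final`, `sketch_unreference`, and the `sketch_clock_calls` counter are hypothetical names, and each buffer is assumed to have at most one relocation target to keep the example short.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 199309L   /* for clock_gettime() under strict -std modes */
#endif
#include <assert.h>
#include <stddef.h>
#include <time.h>

/* Hypothetical, simplified buffer object; not the real drm_intel_bo_gem. */
struct sketch_bo {
	int refcount;
	time_t free_time;          /* when the BO was released */
	struct sketch_bo *target;  /* at most one relocation target, for brevity */
};

static int sketch_clock_calls;     /* counts clock reads, for illustration only */

static time_t sketch_now(void)
{
	struct timespec ts;

	sketch_clock_calls++;
	clock_gettime(CLOCK_MONOTONIC, &ts);
	return ts.tv_sec;
}

/* Takes the timestamp as a parameter instead of reading the clock itself. */
static void sketch_unreference_final(struct sketch_bo *bo, time_t now)
{
	bo->free_time = now;
	if (bo->target != NULL && --bo->target->refcount == 0)
		sketch_unreference_final(bo->target, now);
}

/* Entry point: reads the clock once, however long the chain of frees is. */
static void sketch_unreference(struct sketch_bo *bo)
{
	assert(bo != NULL && bo->refcount > 0);
	if (--bo->refcount == 0)
		sketch_unreference_final(bo, sketch_now());
}
```

Releasing the head of a three-buffer chain through `sketch_unreference` reads the clock a single time, and all three buffers end up with the same `free_time`.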
2009-10-20 | intel: Improve bo_references performance by skipping the tree walk. | Eric Anholt | 1 | -9/+15
If the target we're asking about hasn't ever been used as a relocation target, then it obviously hasn't been used as a target by the batch's reloc tree. This is the common case for good GL programming where you only map fresh buffers, and gives us a 5% win in cairo-gl.
[ # ]  backend  test               min(s)  median(s)  stddev.  count
before:
[  0]  gl       firefox-talos-gfx  64.680  64.756     0.06%    3/3
after:
[  0]  gl       firefox-talos-gfx  60.816  60.970     0.29%    3/3
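A minimal sketch of that fast path, under stated assumptions: each buffer carries a flag that is set the first time anything relocates to it, and the reference check returns early when the flag is clear. The structure layout, the fixed-size target array, and every `sketch_` name are hypothetical and only stand in for the real libdrm_intel data structures.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define SKETCH_MAX_RELOCS 4

/* Hypothetical, simplified BO; the real libdrm_intel structures differ. */
struct sketch_bo {
	struct sketch_bo *reloc_targets[SKETCH_MAX_RELOCS];
	int reloc_count;
	bool used_as_reloc_target;  /* set once anything relocates to this BO */
};

/* Record that `bo` points at `target`, and flag the target. */
static int sketch_emit_reloc(struct sketch_bo *bo, struct sketch_bo *target)
{
	assert(bo != NULL && target != NULL);
	if (bo->reloc_count >= SKETCH_MAX_RELOCS)
		return -1;
	bo->reloc_targets[bo->reloc_count++] = target;
	target->used_as_reloc_target = true;
	return 0;
}

/* Slow path: depth-first walk of the relocation tree rooted at `bo`. */
static bool sketch_walk(const struct sketch_bo *bo,
			const struct sketch_bo *target)
{
	for (int i = 0; i < bo->reloc_count; i++) {
		if (bo->reloc_targets[i] == target ||
		    sketch_walk(bo->reloc_targets[i], target))
			return true;
	}
	return false;
}

static bool sketch_bo_references(const struct sketch_bo *bo,
				 const struct sketch_bo *target)
{
	/* Fast path: a BO that was never a relocation target cannot be
	 * anywhere in this tree, so the walk is skipped entirely. */
	if (!target->used_as_reloc_target)
		return false;
	return sketch_walk(bo, target);
}
```

A freshly allocated buffer answers the query in constant time, which matches the "only map fresh buffers" case the commit message describes.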
2009-10-20 | nouveau: retry if pushbuf ioctl interrupted by signal | Ben Skeggs | 1 | -0/+6
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
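The general pattern behind a fix like this is a loop that reissues the call while it fails with `EINTR`. The sketch below shows that pattern only; it does not reproduce the nouveau code. `sketch_submit_fn`, `sketch_submit_retry`, and the `sketch_flaky_submit` test double are hypothetical names, and the callback stands in for the real submission ioctl.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical callback type standing in for the real submission ioctl.
 * It follows the ioctl convention: -1 on failure with errno set. */
typedef int (*sketch_submit_fn)(void *arg);

/* Reissue the call while it is interrupted by a signal.
 * Returns the call's result on success, or a negative errno on failure. */
static int sketch_submit_retry(sketch_submit_fn submit, void *arg)
{
	int ret;

	assert(submit != NULL);
	do {
		ret = submit(arg);
	} while (ret == -1 && errno == EINTR);

	return ret == -1 ? -errno : ret;
}

/* Test double: fails with EINTR until the counter in `arg` reaches zero. */
static int sketch_flaky_submit(void *arg)
{
	int *remaining = arg;

	if (*remaining > 0) {
		(*remaining)--;
		errno = EINTR;
		return -1;
	}
	return 0;
}
```

With the counter set to 3, `sketch_submit_retry(sketch_flaky_submit, &counter)` makes four calls in total and returns 0.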
2009-10-19 | nouveau: function to return status of bo being referenced in pushbuf | Ben Skeggs | 2 | -0/+21
Will be used to implement gallium's is_{texture,buffer}_referenced() callbacks properly. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
2009-10-13 | nouveau: nv30: add render target logbase2 width,height | Patrice Mandin | 1 | -0/+4
2009-10-13 | intel: Add the defines for the kernel overlay support landing in 2.6.33. | Eric Anholt | 1 | -0/+71
2009-10-13 | intel: Fallback to atomic-ops.h [libatomic-ops-dev] | Chris Wilson | 2 | -1/+26
Use the external implementation for atomic operations across a wide range of architectures. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-13 | configure: Typo in error message. | Chris Wilson | 1 | -1/+1
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-12 | configure: Conditionally build libdrm_intel | Chris Wilson | 1 | -5/+14
Only build libdrm_intel automatically if we have support for atomic operations. To force the build, pass --enable-intel, which makes configure fail with an error if no support is found. Or pass --disable-intel to explicitly prevent libdrm_intel from being built. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-08 | Bump to 2.4.15 for release. [2.4.15] | Eric Anholt | 1 | -1/+1
2009-10-08 | intel: Remove the asserts about the ignored alignment parameter. | Eric Anholt | 1 | -2/+0
I slipped it in with the alloc_tiled changes, since we were explicitly throwing the parameter away. It caught some bogus released code, which we've now fixed, so remove the asserts to keep old drivers working.
2009-10-06 | intel: Add a bo_alloc function for tiled BOs. | Jesse Barnes | 5 | -8/+178
This simplifies driver code in handling object allocation, and also gives us an opportunity to possibly cache tiled buffers if it turns out to be a win. [anholt: This is chopped out of the execbuf2 patch, as it seems to be useful separately and cleans up the execbuf2 changes to be more obvious]
2009-10-06 | intel: Fix up some stale doxygen comments. | Eric Anholt | 2 | -5/+5
2009-10-06 | intel: Reformat to the kernel coding style. Welcome to the 8-space future. | Eric Anholt | 8 | -2682/+2769
This is done with:
Lindent *.[ch]
perl -pi -e 's|drm_intel_bo \* |drm_intel_bo *|g' *.[ch]
perl -pi -e 's|drm_intel_bufmgr \* |drm_intel_bufmgr *|g' *.[ch]
perl -pi -e 's|drm_intel_bo_gem \* |drm_intel_bo_gem *|g' *.[ch]
perl -pi -e 's|drm_intel_bufmgr_gem \* |drm_intel_bufmgr_gem *|g' *.[ch]
perl -pi -e 's|_fake \* |_fake *|g' *.[ch]
hand-editing to whack indented comments into line and other touchups.
2009-10-06 | intel: Don't allocate more relocation entries than the BO could support. | Eric Anholt | 1 | -2/+6
This saves 32k of relocation entry storage for many 965 state buffers. No noticeable impact on performance for cairo-gl firefox.
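The sizing rule can be illustrated with a small helper that caps the relocation array at the number of entries the buffer could physically hold. This is a sketch under an explicit assumption, namely that each relocation patches one 4-byte slot in the buffer, so a buffer of N bytes can need at most N/4 entries. `sketch_max_relocs` and `SKETCH_BYTES_PER_RELOC_SLOT` are hypothetical names, not libdrm identifiers.

```c
#include <assert.h>
#include <stddef.h>

/* Assumption for this sketch: one relocation patches one 4-byte slot. */
#define SKETCH_BYTES_PER_RELOC_SLOT 4u

/* Number of relocation entries to allocate for a BO of `bo_size` bytes,
 * given the buffer manager's default per-BO limit `default_max`. */
static size_t sketch_max_relocs(size_t bo_size, size_t default_max)
{
	size_t fit = bo_size / SKETCH_BYTES_PER_RELOC_SLOT;

	assert(default_max > 0);
	return fit < default_max ? fit : default_max;
}
```

Small state buffers get a proportionally small relocation array, while large buffers still use the default limit.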
2009-10-03 | tests: Disable intel-specific tests with --disable-intel | Chris Wilson | 1 | -5/+18
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-03 | intel: Add a configure option to *disable* building libdrm-intel | Chris Wilson | 2 | -16/+29
In conjunction with the atomic operation patch, it may be more convenient for some people to disable building libdrm-intel and its dependencies upon the atomic intrinsics than it is for them to use a supported compiler. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-03 | intel: report errno | Chris Wilson | 1 | -2/+2
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2009-10-03 | libdrm_radeon: Update RADEON_TILING_* flags to what's in current kernels. | Michel Dänzer | 1 | -5/+6
2009-10-03 | libdrm_radeon: Zero-initialize structures to silence valgrind warnings | Nicolai Hähnle | 1 | -0/+8
Signed-off-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-10-02 | intel: Use atomic refcounters | Chris Wilson | 4 | -64/+141
As the target architecture for Intel GPUs is the x86, we can presume to have reasonable compiler support for Intel atomic intrinsics, i.e. gcc, and so use those in preference to pulling in a complicated mess of fragile assembly.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
[anholt: hand-resolved against my previous commit. This brings cairo-gl firefox-talos-gfx time from 65 seconds back down to 62 seconds.]
Signed-off-by: Eric Anholt <eric@anholt.net>
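A reference counter built on compiler atomic intrinsics might look like the sketch below. It uses the GCC `__sync_fetch_and_add` and `__sync_sub_and_fetch` builtins, which also exist in Clang; whether the commit uses these exact builtins is not stated here, and `sketch_atomic_t`, `sketch_ref`, and `sketch_unref` are hypothetical names.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical wrapper type, so the counter is only touched atomically. */
typedef struct {
	int value;
} sketch_atomic_t;

/* Take a reference. */
static void sketch_ref(sketch_atomic_t *rc)
{
	(void)__sync_fetch_and_add(&rc->value, 1);
}

/* Drop a reference. Returns true when the caller released the last one,
 * which makes that caller responsible for freeing the object. */
static bool sketch_unref(sketch_atomic_t *rc)
{
	int remaining = __sync_sub_and_fetch(&rc->value, 1);

	assert(remaining >= 0);
	return remaining == 0;
}
```

Because the decrement and the read of the new value happen in one atomic step, only one thread can ever observe the count reaching zero.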
2009-10-02 | intel: Mark cached bo as purgeable | Chris Wilson | 2 | -2/+63
Set the DONTNEED flag on cached buffers so that the kernel is free to discard those when under memory pressure.
[anholt: This takes firefox-talos-gfx time from ~62 seconds to ~65 seconds on my GM965, but it seems like a hit worth taking for the improved functionality from saving memory]
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Eric Anholt <eric@anholt.net>
2009-10-02 | intel: Don't free the reloc list when putting a freed BO in the cache. | Eric Anholt | 1 | -4/+3
This takes firefox-talos-gfx from 74 seconds to 70 seconds on my GM965.
2009-10-01 | intel: Add a new function to check if a BO's reloc tree references some BO. | Eric Anholt | 4 | -2/+37
There are a bunch of places in GL where if we can't do this we have to flush the batchbuffer, and the cost of lookups here is outweighed by flush savings.
2009-09-21 | Bump to 2.4.14 for release. [libdrm-2.4.14] | Eric Anholt | 1 | -1/+1
2009-09-21 | intel: Remove the max_entries stuff that complicated bo reuse. | Eric Anholt | 1 | -25/+5
I thought I was going to do all sorts of crazy experiments with it. I never did, and it turned out the free-after-a-few-seconds plan is working out fine.
2009-09-21 | intel: Remove the old swrast flag for reducing cache flushing. | Eric Anholt | 1 | -37/+25
It hasn't been doing anything effective since 52e5d24fae4af6f2f4a5304a516c8c5ab347a11b, and we pretty much don't bo_map pinned buffers any more anyway.
2009-09-17 | drmModeFreeConnector: free encoders and properties | Keith Packard | 1 | -0/+3
These were leaking. Signed-off-by: Keith Packard <keithp@keithp.com>
2009-09-15 | radeon: fix 32/64 bit issue with sign extension | Dave Airlie | 1 | -6/+6
Not sure what intptr_t was up to here. Reported and tested by: Kevin DeKorte Signed-off-by: Dave Airlie <airlied@redhat.com>
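The class of bug named in the commit title can be shown in isolation: on a 32-bit build, widening a pointer to 64 bits through a signed type such as `intptr_t` copies the sign bit into the upper half whenever the address has bit 31 set. The sketch below models a 32-bit `intptr_t` with `int32_t` so it behaves the same on any host. The function names are hypothetical and the code is not taken from the radeon fix.

```c
#include <stdint.h>

/* Widen through a signed 32-bit type, as a 32-bit intptr_t would.
 * The uint32_t -> int32_t conversion is implementation-defined for values
 * above INT32_MAX; GCC and Clang wrap it to a negative number. */
static uint64_t sketch_widen_via_signed(uint32_t ptr_bits)
{
	int32_t as_intptr = (int32_t)ptr_bits;

	return (uint64_t)as_intptr;  /* sign bit is copied into bits 32..63 */
}

/* Widen through an unsigned type, as uintptr_t would: upper bits stay 0. */
static uint64_t sketch_widen_via_unsigned(uint32_t ptr_bits)
{
	return (uint64_t)ptr_bits;
}
```

For an address such as `0x80001000`, the signed route yields `0xFFFFFFFF80001000` while the unsigned route yields `0x0000000080001000`; for addresses below `0x80000000` the two agree, which is why a bug of this kind can go unnoticed.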
2009-09-07 | libdrm_intel: include B43 chipset check | Zhenyu Wang | 1 | -1/+4
Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>
2009-09-04 | libdrm_intel: add new pci ids | Zhenyu Wang | 1 | -1/+4
New ids for G41, Clarkdale and Arrandale. Make sure we don't need to count fence also on new chips. Signed-off-by: Zhenyu Wang <zhenyuw@linux.intel.com>
2009-08-28 | Bump to version 2.4.13 for release. [libdrm-2.4.13] | Eric Anholt | 1 | -1/+1
2009-08-28 | Add drm_intel_bo_busy to query whether mapping a BO would block. | Eric Anholt | 4 | -10/+34
2009-08-28 | Revert "libdrm_radeon: Always wait for BO idle in bo_map." | Michel Dänzer | 1 | -1/+4
This reverts commit 0a732983f059c353b267b6bf877e1f0eea4e033f. Pauli Nieminen and Dave Airlie pointed out on IRC that this shouldn't be necessary. I was seeing visual corruption in X before I made this change, but I can't reproduce that anymore so it was probably an unrelated issue.
2009-08-27 | libdrm_radeon: Always wait for BO idle in bo_map. | Michel Dänzer | 1 | -4/+1
This allows users to eliminate explicit bo_wait calls before bo_map calls.
2009-08-25 | Kill last remnant of replacefb ioctl | Jakob Bornecrantz | 1 | -1/+0
Kernels don't expose this ioctl.
2009-08-24 | radeon: pull in z pipe changes from kernel | Alex Deucher | 1 | -0/+2
2009-08-22 | radeon: add support for busy/domain check interface. | Pauli Nieminen | 3 | -1/+31
airlied: modified the interface to drop busy return value, just return it normally, also fixed int->uint32_t for domain Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-19 | nouveau: fix a thinko in copyless pushbuf ioctl | Ben Skeggs | 1 | -0/+2
No idea why G80 doesn't hit this, but, this fixes at least one NV40 card.
2009-08-18 | libdrm_radeon: Optimize copy of table to cs buffer with specialized call. | Pauli Nieminen | 1 | -0/+9
Using this call in OUT_BATCH_TABLE reduces radeonEmitState CPU usage from 9% to 5%, and emit_vpu goes from 7% to 1.5%. I used callgrind to profile gears for CPU hotspots with an r500 card. Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-18 | libdrm_radeon: Fix loops so that compiler can optimize them. | Pauli Nieminen | 1 | -6/+6
GCC warned that optimization was not possible because of a possible infinite loop. Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-18 | libdrm/radeon: Update head of linked list not to point freed memory. | Pauli Nieminen | 1 | -0/+1
Signed-off-by: Pauli Nieminen <suokkos@gmail.com>
2009-08-18 | nouveau: support for copy-less pushbuf ioctl | Ben Skeggs | 3 | -16/+143
2009-08-18 | nouveau: for the moment, assert if we exceed some reloc limits | Ben Skeggs | 1 | -2/+8
Nasty, but nicer than silently not writing into the pushbuf
2009-08-17 | radeon: fix bo wait at map time. | Dave Airlie | 1 | -6/+6
2009-08-16 | Filter radeon pkgconfig file as do intel and nouveau | Christoph Brill | 1 | -0/+1
2009-08-15 | radeon: fix GTT writing space check | Dave Airlie | 1 | -1/+1
Noticed by vehemens on irc. Signed-off-by: Dave Airlie <airlied@redhat.com>
2009-08-05 | link libdrm_intel with -lrt for new use of clock_gettime(). | Anssi Hannula | 1 | -1/+1
Signed-off-by: Eric Anholt <eric@anholt.net>
2009-08-01 | libdrm_radeon: add tiling support | Dave Airlie | 3 | -15/+92
2009-08-01 | radeon: don't map/unmap explicitly. | Dave Airlie | 1 | -5/+14
This caches the mapping and just uses mapping as a sync point.