summaryrefslogtreecommitdiff
path: root/src
diff options
context:
space:
mode:
authorChuanbo Weng <chuanbo.weng@intel.com>2014-12-03 15:32:43 +0800
committerZhigang Gong <zhigang.gong@intel.com>2014-12-03 15:55:51 +0800
commitb9d1080b709cd0178edb93796e5c3dcdc212f97a (patch)
tree87deed9806faa0148548d451f52e237d729b920f /src
parentda07d7a43d1fecd57a7ed2d35aa4670b7da7c8c8 (diff)
Change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from 8 to 16.
Because accessing global memory by uchar16/char16 will fully utilize memory bandwidth, so change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from 8 to 16. Three OpenCV cases will speedup from this patch: OCL_ThreshFixture_Threshold, 25% improvement OCL_MaxFixture_Max, 105% improvement OCL_MinFixture_Min, 105% improvement. Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
Diffstat (limited to 'src')
-rw-r--r--src/cl_gt_device.h2
1 files changed, 1 insertions, 1 deletions
diff --git a/src/cl_gt_device.h b/src/cl_gt_device.h
index 37abfd28..ed19f109 100644
--- a/src/cl_gt_device.h
+++ b/src/cl_gt_device.h
@@ -24,7 +24,7 @@
.max_1d_global_work_sizes = {1024 * 1024 * 256, 1, 1},
.max_2d_global_work_sizes = {8192, 8192, 1},
.max_3d_global_work_sizes = {8192, 8192, 2048},
-.preferred_vector_width_char = 8,
+.preferred_vector_width_char = 16,
.preferred_vector_width_short = 8,
.preferred_vector_width_int = 4,
.preferred_vector_width_long = 2,