author Chuanbo Weng <chuanbo.weng@intel.com> 2014-12-03 15:32:43 +0800
committer Zhigang Gong <zhigang.gong@intel.com> 2015-01-12 09:30:27 +0800
commit 7c9def15e1b2001fe0c9e562157640fc27bd47a0
tree f8c3fbe48be6ba7c0ef91195d1c4fd39232564b6
parent 09b3d6a179e7f18337da9a377618285659706cec
Change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from 8 to 16.
Accessing global memory as uchar16/char16 fully utilizes memory bandwidth, so change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from 8 to 16.

Three OpenCV cases speed up with this patch:
OCL_ThreshFixture_Threshold, 25% improvement
OCL_MaxFixture_Max, 105% improvement
OCL_MinFixture_Min, 105% improvement

Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com>
Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
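The arithmetic behind the wider access can be sketched in plain C. This is only an illustration, assuming each work-item issues exactly one vector load of `width` char elements; the function names are hypothetical, and the sketch does not model the hardware or reproduce the measured speedups.

```c
#include <stddef.h>

/* Bytes moved by one vector load of `width` char elements.
 * sizeof(char) is 1 by definition, so char16 moves 16 bytes. */
size_t bytes_per_access(size_t width)
{
    return width * sizeof(char);
}

/* Work-items needed to cover n_bytes when each work-item handles
 * one vector of `width` chars (rounded up for a partial tail). */
size_t work_items_needed(size_t n_bytes, size_t width)
{
    return (n_bytes + width - 1) / width;
}
```

For a 1920x1080 single-channel 8-bit image (2073600 bytes), `work_items_needed` gives 259200 work-items at width 8 and 129600 at width 16, so each work-item moves twice as many bytes per load.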
-rw-r--r-- src/cl_gt_device.h 2
1 file changed, 1 insertion, 1 deletion
diff --git a/src/cl_gt_device.h b/src/cl_gt_device.h
index 2ec00871..09503273 100644
--- a/src/cl_gt_device.h
+++ b/src/cl_gt_device.h
@@ -24,7 +24,7 @@
 .max_1d_global_work_sizes = {1024 * 1024 * 256, 1, 1},
 .max_2d_global_work_sizes = {8192, 8192, 1},
 .max_3d_global_work_sizes = {8192, 8192, 2048},
-.preferred_vector_width_char = 8,
+.preferred_vector_width_char = 16,
 .preferred_vector_width_short = 8,
 .preferred_vector_width_int = 4,
 .preferred_vector_width_long = 2,
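
Applications normally read this value through `clGetDeviceInfo` with `CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR` and pick a kernel vector type accordingly. The following is a minimal sketch of that selection step without the OpenCL runtime. The struct `device_sketch`, the table `gt_device_sketch`, and the helper `char_vector_type` are hypothetical stand-ins; only the four field names and their values are taken from the diff above.

```c
#include <stddef.h>

/* Hypothetical stand-in for the driver's device description.
 * Only the field names and values come from the diff. */
struct device_sketch {
    unsigned preferred_vector_width_char;
    unsigned preferred_vector_width_short;
    unsigned preferred_vector_width_int;
    unsigned preferred_vector_width_long;
};

/* Values as they stand after this patch. */
const struct device_sketch gt_device_sketch = {
    .preferred_vector_width_char = 16,
    .preferred_vector_width_short = 8,
    .preferred_vector_width_int = 4,
    .preferred_vector_width_long = 2,
};

/* Hypothetical helper: map the preferred char width to the matching
 * OpenCL C vector type name. Widths not listed fall back to scalar. */
const char *char_vector_type(const struct device_sketch *dev)
{
    switch (dev->preferred_vector_width_char) {
    case 16: return "char16";
    case 8:  return "char8";
    case 4:  return "char4";
    case 2:  return "char2";
    default: return "char";
    }
}
```

With the patched table, `char_vector_type(&gt_device_sketch)` returns `"char16"`; before the patch the same lookup would have returned `"char8"`.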