diff options
author | Chuanbo Weng <chuanbo.weng@intel.com> | 2014-12-03 15:32:43 +0800 |
---|---|---|
committer | Zhigang Gong <zhigang.gong@intel.com> | 2015-01-12 09:30:27 +0800 |
commit | 7c9def15e1b2001fe0c9e562157640fc27bd47a0 (patch) | |
tree | f8c3fbe48be6ba7c0ef91195d1c4fd39232564b6 | |
parent | 09b3d6a179e7f18337da9a377618285659706cec (diff) |
Change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from 8 to 16.
Because accessing global memory by uchar16/char16 will fully utilize
memory bandwidth, so change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from
8 to 16. Three OpenCV cases will speedup from this patch:
OCL_ThreshFixture_Threshold, 25% improvement
OCL_MaxFixture_Max, 105% improvement
OCL_MinFixture_Min, 105% improvement.
Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com>
Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
-rw-r--r-- | src/cl_gt_device.h | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/src/cl_gt_device.h b/src/cl_gt_device.h index 2ec00871..09503273 100644 --- a/src/cl_gt_device.h +++ b/src/cl_gt_device.h @@ -24,7 +24,7 @@ .max_1d_global_work_sizes = {1024 * 1024 * 256, 1, 1}, .max_2d_global_work_sizes = {8192, 8192, 1}, .max_3d_global_work_sizes = {8192, 8192, 2048}, -.preferred_vector_width_char = 8, +.preferred_vector_width_char = 16, .preferred_vector_width_short = 8, .preferred_vector_width_int = 4, .preferred_vector_width_long = 2, |