Vectorize inner loop of get_tiles and out_transform with SIMD #3

yolanda15 · 2017-06-25T08:01:23Z

I'm ICC compiler TCE. I tried to add omp simd to vectorize the inner loop in get_tiles and out_transform functions which is much more effective than current partial auto-vectorization. Measured the overall performance can improve 10% by this. It also makes layer 3 perform better over MKL DNN.

Add omp simd to vectorize the inner loop in get_tiles and out_transform functions which is more effective than current partial auto-vectorization. Measured the overall performance can improve 10% by this. It also makes layer 3 perform better over MKL DNN.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vectorize inner loop of get_tiles and out_transform with SIMD #3

Vectorize inner loop of get_tiles and out_transform with SIMD #3

Uh oh!

yolanda15 commented Jun 25, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Vectorize inner loop of get_tiles and out_transform with SIMD #3

Are you sure you want to change the base?

Vectorize inner loop of get_tiles and out_transform with SIMD #3

Uh oh!

Conversation

yolanda15 commented Jun 25, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant