128 elements is still useful for when you have an inner loop that has a small vector operation. Most of the time this is not the case though.
128 elements is still useful for when you have an inner loop that has a small vector operation. Most of the time this is not the case though.