Available on x86 and target feature `sse4.1` only.
Rounds the lower single-precision (32-bit) floating-point element in `b`
down to an integer value, stores the result as a single-precision
floating-point element in the lower element of the intrinsic result,
and copies the upper 3 packed elements from `a` to the upper elements
of the intrinsic result.
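
The lane behavior described above can be illustrated with a minimal sketch. It assumes the intrinsic being documented is `_mm_floor_ss` from `std::arch` (the page text does not name it), and the helpers `floor_lower` and `floor_lower_sse41` are hypothetical names introduced only for this example. Arrays are loaded with `_mm_loadu_ps`, so index 0 is the lower element.

```rust
#[cfg(target_arch = "x86")]
use std::arch::x86::*;
#[cfg(target_arch = "x86_64")]
use std::arch::x86_64::*;

/// Hypothetical helper: returns `[floor(b[0]), a[1], a[2], a[3]]`,
/// or `None` when SSE4.1 is not available at runtime.
#[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
pub fn floor_lower(a: [f32; 4], b: [f32; 4]) -> Option<[f32; 4]> {
    if !is_x86_feature_detected!("sse4.1") {
        return None;
    }
    // SAFETY: SSE4.1 support was verified by the runtime check above.
    Some(unsafe { floor_lower_sse41(a, b) })
}

/// Fallback for other architectures, where the intrinsic does not exist.
#[cfg(not(any(target_arch = "x86", target_arch = "x86_64")))]
pub fn floor_lower(_a: [f32; 4], _b: [f32; 4]) -> Option<[f32; 4]> {
    None
}

#[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
#[target_feature(enable = "sse4.1")]
unsafe fn floor_lower_sse41(a: [f32; 4], b: [f32; 4]) -> [f32; 4] {
    let mut out = [0.0f32; 4];
    unsafe {
        // Unaligned loads: array index 0 becomes the lower element.
        let va = _mm_loadu_ps(a.as_ptr());
        let vb = _mm_loadu_ps(b.as_ptr());
        // Lower lane: floor of b's lower lane. Upper three lanes: copied from a.
        let r = _mm_floor_ss(va, vb);
        _mm_storeu_ps(out.as_mut_ptr(), r);
    }
    out
}
```

On a CPU with SSE4.1, `floor_lower([1.5, 2.5, 3.5, 4.5], [-0.5, 9.0, 9.0, 9.0])` returns `Some([-1.0, 2.5, 3.5, 4.5])`: only the lower element of `b` is rounded (toward negative infinity), and its upper elements are ignored.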