pub fn urcrsa16(a: usize, b: usize) -> usize
stdsimd
Cross halves of subtracts and adds packed 16-bit unsigned numbers, dropping least bits