pub fn smaqa(t: usize, a: usize, b: usize) -> usize
stdsimd
Multiply signed 8-bit elements and add 16-bit elements on results for packed 32-bit chunks