Vector Instructions. Part I

Instructions and registers

Figure 1: Scalar and vector computations


Figure 2: Names of intrinsics in SSE2 (a) and ARM NEON (b)
Table 1: Data type designations for x86 intrinsics
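
The contrast between scalar and vector computation can be sketched in a few lines of C. This is a minimal illustration that assumes an x86 target with SSE2; the function names add_scalar and add_sse2 are hypothetical and are not taken from the article. The intrinsic name _mm_add_epi32 follows the x86 naming scheme: the _mm_ prefix marks a 128-bit operation, add is the operation, and epi32 says the register is treated as packed 32-bit integers.

```c
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Scalar version: one 32-bit addition per loop iteration. */
void add_scalar(const int32_t *a, const int32_t *b, int32_t *out, int n)
{
    for (int i = 0; i < n; i++)
        out[i] = a[i] + b[i];
}

/* Vector version: four 32-bit additions per _mm_add_epi32.
 * Assumption for brevity: n is a multiple of 4 (no tail handling). */
void add_sse2(const int32_t *a, const int32_t *b, int32_t *out, int n)
{
    for (int i = 0; i < n; i += 4) {
        /* Unaligned loads and stores, so the arrays need no special alignment. */
        __m128i va = _mm_loadu_si128((const __m128i *)(a + i));
        __m128i vb = _mm_loadu_si128((const __m128i *)(b + i));
        _mm_storeu_si128((__m128i *)(out + i), _mm_add_epi32(va, vb));
    }
}
```

Both functions produce the same result; the vector version simply issues one add instruction for every four elements. Real code would also need a scalar loop for the leftover elements when n is not a multiple of 4.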

Essential vector instructions

Figure 3: Horizontal addition
Figure 4: The _mm_madd_epi16 instruction
Figure 5: Copying by mask
Figure 6: Shuffling
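
As one possible illustration of _mm_madd_epi16, the sketch below computes a dot product of two 16-bit integer arrays. It assumes an x86 target with SSE2; the function name dot_i16_sse2 is hypothetical. The instruction multiplies eight pairs of signed 16-bit values and adds adjacent products, giving four 32-bit sums. The final reduction uses two shuffles and two additions as an SSE2-only stand-in for a dedicated horizontal-add instruction.

```c
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Dot product of two int16 arrays.
 * Assumptions for brevity: n is a multiple of 8, and the running
 * sums fit in 32 bits (the accumulator wraps on overflow). */
int32_t dot_i16_sse2(const int16_t *a, const int16_t *b, int n)
{
    __m128i acc = _mm_setzero_si128();
    for (int i = 0; i < n; i += 8) {
        __m128i va = _mm_loadu_si128((const __m128i *)(a + i));
        __m128i vb = _mm_loadu_si128((const __m128i *)(b + i));
        /* Lane k of the result is a[2k]*b[2k] + a[2k+1]*b[2k+1]. */
        acc = _mm_add_epi32(acc, _mm_madd_epi16(va, vb));
    }
    /* Reduce the four 32-bit lanes to one value.
     * First swap the 64-bit halves and add, then swap neighbours and add. */
    acc = _mm_add_epi32(acc, _mm_shuffle_epi32(acc, _MM_SHUFFLE(1, 0, 3, 2)));
    acc = _mm_add_epi32(acc, _mm_shuffle_epi32(acc, _MM_SHUFFLE(2, 3, 0, 1)));
    return _mm_cvtsi128_si32(acc);  /* lane 0 now holds the total */
}
```

Keeping the accumulator in a register and reducing it once after the loop avoids a horizontal operation on every iteration.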

Where do I get information on vector instructions?



