Add support for new Arm memcpy instructions (FEAT_MOPS)
Armv8.8-A/Armv9.3-A introduces FEAT_MOPS, which adds memcpy, memmove, and memset instructions. In addition to adding them to core DR's decoder, we need memtrace-specific work to instrument them, probably by unrolling the loops as we do for x86 rep string operations.
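
To make the unrolling idea concrete, here is a rough, standalone sketch of the records an expanded copy might produce: one load and one store per iteration instead of a single opaque instruction. Everything in it is hypothetical: `mem_ref_t`, `expand_copy_refs`, and the `chunk` parameter are illustration-only names, not DR or memtrace APIs, and it models only a forward copy with a fixed per-iteration size, which the real instructions do not guarantee.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical trace record; not DR's actual trace entry layout. */
typedef struct {
    uintptr_t addr; /* start address of the access */
    size_t size;    /* bytes accessed */
    int is_write;   /* 0 = load, 1 = store */
} mem_ref_t;

/* Models an unrolled forward copy of len bytes from src to dst.
 * Each iteration emits a load record followed by a store record covering
 * at most chunk bytes (chunk is an assumed per-iteration size).
 * Returns the number of records written; stops early if out is full.
 */
static size_t
expand_copy_refs(uintptr_t dst, uintptr_t src, size_t len, size_t chunk,
                 mem_ref_t *out, size_t out_cap)
{
    size_t n = 0;
    assert(chunk > 0);
    while (len > 0) {
        /* The final iteration may cover fewer than chunk bytes. */
        size_t step = len < chunk ? len : chunk;
        if (n + 2 > out_cap)
            break;
        out[n++] = (mem_ref_t){ src, step, 0 };
        out[n++] = (mem_ref_t){ dst, step, 1 };
        src += step;
        dst += step;
        len -= step;
    }
    return n;
}
```

For example, a 20-byte copy with an assumed 8-byte chunk yields three load/store pairs covering 8, 8, and 4 bytes. A real implementation would also have to handle backward copies for memmove, the store-only memset case, and the fact that each operation is split across several instructions.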