TalkVectorization in C++: From Inline Assembly to Portable Performance with std::simdYuly TarasovSyntacore