Last week when launching the AMD EPYC 9005 "Turin" processors, on the same day AOCC 5.0 was quietly released as the newest version of AMD's Zen-focused compiler derived from LLVM/Clang. With not only adding AMD Zen 5 "znver5" support but also additional vectorization improvements and other performance optimizations, I was eager to run some benchmarks of AOCC 5.0 against the open-source GCC and LLVM/Clang compilers. Here are those initial benchmarks using dual AMD EPYC 9755 128-core Zen 5 processors.
Intel engineers have released the oneAPI Deep Neural Network Library "oneDNN" version 3.6 release that serves as the building blocks for deep learning software like ONNX Runtime, OpenVINO, Apache MXNet, Apache SIGNA, and optionally by PyTorch and TensorFlow with Intel's extensions...