Open-source News

Intel Releases x86-simd-sort 5.0 With 4~5x Faster C++ Object Sorting Using AVX-512

Phoronix - Tue, 02/13/2024 - 05:03
It's been nearly one year to the day since outlining intel's AVX-512 powered sorting library to offer blazing fast sort speeds. Over the past year has brought the 1.0 release, new algorithms in v2.0, AVX2 support and more AVX-512 optimizations in v4.0, and now today Intel is out with x86-simd-sort 5.0 with yet more performance improvements...

AMD Quietly Funded A Drop-In CUDA Implementation Built On ROCm: It's Now Open-Source

Phoronix - Mon, 02/12/2024 - 22:00
While there have been efforts by AMD over the years to make it easier to port codebases targeting NVIDIA's CUDA API to run atop HIP/ROCm, it still requires work on the part of developers. The tooling has improved such as with HIPIFY to help in auto-generating but it isn't any simple, instant, and guaranteed solution -- especially if striving for optimal performance. Over the past two years AMD has quietly been funding an effort though to bring binary compatibility so that many NVIDIA CUDA applications could run atop the AMD ROCm stack at the library level -- a drop-in replacement without the need to adapt source code. In practice for many real-world workloads, it's a solution for end-users to run CUDA-enabled software without any developer intervention. Here is more information on this "skunkworks" project that is now available as open-source along with some of my own testing and performance benchmarks of this CUDA implementation built for Radeon GPUs.

Pages