Intel's plan for running the AVX-512 instruction set on E-cores: AVX10.2
source link: https://blog.gslin.org/archives/2023/08/23/11310/intel-%e5%b0%8d%e6%96%bc%e5%9c%a8-e-cores-%e4%b8%8a%e9%9d%a2%e5%8f%af%e4%bb%a5%e8%b7%91-avx-512-%e6%8c%87%e4%bb%a4%e9%9b%86%e7%9a%84%e8%a8%88%e7%95%ab%ef%bc%9aavx10-2/
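I came across "Intel AVX10.2 ISA to enable AVX-512 capabilities on E-cores", which points to Intel's technical document "The Converged Vector ISA: Intel® Advanced Vector Extensions 10". The document lays out Intel's upcoming plans for AVX-512.
The key part is this figure (see the source post), which shows that AVX10.2 is planned to support E-cores:
We still have to wait, though; the document only marks this as "future":
The current rumor is that AVX10.1 will show up on Xeon in 2024 or 2025: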
Intel says that version 1 of the AVX10 vector ISA (AVX10.1) will first be implemented on Intel Xeon “Granite Rapids” processors that, according to some media reports, are expected to launch by 2024 or 2025, so it will likely take a long while before AVX10.2 is implemented on processors with E-cores.
But AVX10.1 does not yet provide the ability to run AVX-512 on E-cores, so AVX10.2 will presumably come even later...
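To make the AVX10.1 versus AVX10.2 distinction more concrete, here is a minimal sketch of how software might interpret the AVX10 enumeration data. It assumes the layout in Intel's AVX10 document as I understand it at the time of writing: CPUID leaf 0x24, sub-leaf 0, register EBX carries the AVX10 version in bits 7:0 and the supported vector lengths in bits 16 (128-bit), 17 (256-bit) and 18 (512-bit). The names `Avx10Info` and `decode_avx10_ebx` are hypothetical, the sample register values are made up for illustration, and the code only decodes an integer; it does not execute CPUID itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Avx10Info:
    """Decoded view of the (assumed) AVX10 enumeration register."""
    version: int    # AVX10 version number, e.g. 1 for AVX10.1
    has_128: bool   # 128-bit vectors supported
    has_256: bool   # 256-bit vectors supported
    has_512: bool   # 512-bit vectors supported


def decode_avx10_ebx(ebx: int) -> Avx10Info:
    """Decode a raw EBX value from CPUID leaf 0x24, sub-leaf 0.

    Assumed layout (from Intel's AVX10 document, not verified here):
    bits 7:0 = version, bit 16 = 128-bit, bit 17 = 256-bit, bit 18 = 512-bit.
    """
    return Avx10Info(
        version=ebx & 0xFF,
        has_128=bool((ebx >> 16) & 1),
        has_256=bool((ebx >> 17) & 1),
        has_512=bool((ebx >> 18) & 1),
    )


# Made-up example values, not read from real hardware.
full_width = decode_avx10_ebx(0x00070001)  # version 1, all vector lengths
no_512 = decode_avx10_ebx(0x00030002)      # version 2, 128/256-bit only
```

Under this assumed layout, a core that reports a version of 2 with only the 128-bit and 256-bit flags set would correspond to the kind of E-core support the figure describes, while a version of 1 with the 512-bit flag set would match a P-core Xeon.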
Author: Gea-Suan Lin. Posted on August 23, 2023. Categories: Computer, Hardware, Murmuring. Tags: avx, avx-512, avx10.2, avx512, cores, cpu, e, intel