[HN Gopher] Contrasting Intel AMX and Apple AMX
       ___________________________________________________________________
        
       Contrasting Intel AMX and Apple AMX
        
       Author : ingve
       Score  : 28 points
       Date   : 2022-09-05 20:27 UTC (2 hours ago)
        
 (HTM) web link (www.corsix.org)
 (TXT) w3m dump (www.corsix.org)
        
       | corsix wrote:
       | The elephant in the room contrast: Apple has been shipping this
       | for years, whereas Intel _might_ ship theirs in something later
       | this year.
        
         | smoldesu wrote:
         | Hasn't Intel been shipping scalar instructions for more than a
         | decade?
        
           | timdorr wrote:
           | Intel's AMX specifically hasn't yet shipped, but will be
           | coming with their new Sapphire Rapids Xeon chips this year.
        
       | Someone wrote:
       | The article would have been much better for me if it drew
       | conclusions about the usefulness of the two (partial) instruction
       | sets.
       | 
       | What can you do easily and what's hard?
       | 
       | I also expected to read something about relative performance of
       | the two.
        
         | corsix wrote:
         | Performance comparison is hard given that the Intel one hasn't
         | shipped yet.
        
           | dougall wrote:
           | Yeah, I think Intel's only number so far is "2048 int8
           | operations/cycle/core" (as opposed to VNNI's 256):
           | https://www.servethehome.com/wp-
           | content/uploads/2021/09/Inte...
           | 
           | Which (assuming 1 multiply-add = 2 operations) is the same
           | int8 operations/cycle as the Apple M1's float16
           | operations/cycle. Intel's 16-bit operations might be the same
           | rate, but I'd guess half? That'll almost certainly be at a
           | higher clock-speed, and one-per-core rather than one-per-
           | four-P-cores. (And I think Apple might have doubled their
           | throughput in M2. As you said, performance comparison is
           | hard.)
        
       | enzanki_ars wrote:
       | Seems to be related to
       | https://news.ycombinator.com/item?id=32722510
        
       ___________________________________________________________________
       (page generated 2022-09-05 23:00 UTC)