Posts tagged "Neural Networks"

Optimizing Neural Network Performance

blog August 2, 2024

Numpy can multiply two 1024x1024 matrices on a 4-core Intel CPU in ~8ms. This is incredibly fast, considering this boils down to 18 FLOPs / core / cycle, with a cycle taking a third of a nanosecond. Numpy does this using a highly optimized BLAS implementation.

Neural Networks Performance Optimization NumPy BLAS Linear Algebra Machine Learning CPU Optimization

QUICK ACTIONS

NAVIGATION

Posts tagged "Neural Networks"

Optimizing Neural Network Performance