We’ve been wasting our processing power to train neural networks that are ten times too big.
by Karen Hao May 10, 2019
Neural networks are the core software of deep learning. Even though they’re so widespread, however, they’re really poorly understood. Researchers have observed their emergent properties without actually understanding why they work the way they do.
Now a new paper out of MIT has taken a major step toward answering this question. And in the proces