I think what’s really depressing here is just how effective scaling seems to be....

Buttons840 · on April 7, 2023

My understanding is that transformers are now favored over RNNs because they parallelize better.
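To make the parallelization point concrete, here is a toy sketch in plain Python. It is not how any real model is implemented. The function names, the scalar weights `w_h` and `w_x`, and the use of the raw input as query, key, and value are all simplifying assumptions for illustration. The recurrent version needs the previous hidden state at every step, while each attention output depends only on the inputs.

```python
import math

def rnn_states(xs, w_h=0.5, w_x=1.0):
    """Toy scalar RNN: h_t = tanh(w_h * h_{t-1} + w_x * x_t). Weights are made up."""
    h = 0.0
    states = []
    for x in xs:
        # Step t cannot start until step t-1 has produced h, so this loop is sequential.
        h = math.tanh(w_h * h + w_x * x)
        states.append(h)
    return states

def attention_output(xs, t):
    """Toy scalar causal self-attention at position t (x doubles as query, key, value)."""
    scores = [xs[t] * xs[j] for j in range(t + 1)]
    m = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    return sum(w * xs[j] for j, w in enumerate(weights)) / total

def attention_states(xs):
    # Each position reads only the inputs, never another position's output,
    # so these calls could run in any order or all at once.
    return [attention_output(xs, t) for t in range(len(xs))]
```

Computing the attention outputs in reverse order gives the same values, which is the property that lets training use all sequence positions at once. The RNN loop has no such freedom.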

It's hard to imagine, but I wonder if there are some non-parallelizable machine learning algorithms which might outperform these massive models? It seems improbable, but it's a small hope I've had. The greatest intellects we're aware of (ourselves) do not scale very well, and maybe the same will ultimately apply to AI?

machiaweliczny · on April 7, 2023

I remember seeing a theoretical analysis that compared the computational power of transformers, LSTMs and RNNs, and I think it concluded that RNNs are theoretically better (they can learn more complex functions). I can't find it now.

rlt · on April 7, 2023

I wonder if there’s an incentive for a large group of companies to fund open source models, sort of like Linux.