I think that, on top of compute, available data will also be a problem. A lot of companies are putting their content behind paywalls, and then there are entities like Reddit that sell their data. I doubt these open-source projects can easily get their hands on that data.