LLM generated text can also be easily detected provided you can figure out which model it came from and the weights within it. For people training models, this won’t be hard to do.
I agree with the take that getting better and better datasets for training is going to get easier over time, rather than harder. The story of AlphaZero is a good example of this too - the best chess AI quickly trounced any AI trained on human games simply by playing against itself. To me, that suggests that training on LLM output will lead to even better results, since you can generate so much more of it.
Advertisers need to see more articles like this. Pulling the ad dollars away from Twitter is the last lever anyone has to try to change it, and it should have happened long ago at this point.