I don't imagine that minor code optimisations would make as much difference as shortening the time the user spends on their device.
The goal would then be to make websites as concise as possible, toss out all ads, comment sections, social media tie-ins, videos, needlessly long-winded articles, anything that slows the visit down, and kick the user out as soon as they have what they were searching for.
This is a very interesting test. It seems to me that, aside from the prompt bias, it may be detecting "yo be real" category questions by the low statistical probability of the words occurring in sequence. Those would be especially low for gibberish words that had an occurrence of zero in its data, which is why it's so good at spotting those.
One could also try and detect nonsense with quad-grams using the same principle, but in any case this metric would tend to mark unique questions and foreign names or brandnames as nonsense.
So far most of this pollution is effectively tagged through accompanying mentions of "GPT", but filtering that from future training data would mean GPT can never learn about itself.
I know people who talk like this, and the user history checks out. Either way, the comment is misinforming: Those quotes are from Descartes, not Turing, whose views as to whether machines could think were opposite.
https://plato.stanford.edu/entries/turing-test
I don't think it's just coincidence. A lot of "A or B" questions tend to have B as the correct answer with the first option just being there to confuse the reader. I find it likely that a statistical algorithm would pick up on that.
Good observation. It is possible that GPT picked up on this statistical tendency. I participated in seven Turing tests, and most of the "X or Y" questions had the latter as the correct answer, so regularly that I set my program to pick it as default when it didn't know. As GPT picks up on statistical word orders, I find this likely to be the case here as well.
Turing only proposed his game as a hypothetical argument, so he barely specified any rules on how to actually perform such a test in practice. He referred to the judging party either as "the man" or "the average interrogator", he never specified a number. Official Turing test events have had anywhere between four to hundreds of people judging.
The goal would then be to make websites as concise as possible, toss out all ads, comment sections, social media tie-ins, videos, needlessly long-winded articles, anything that slows the visit down, and kick the user out as soon as they have what they were searching for.
But capitalism doesn't approve of that.