The experiment employed a three-party design where participants engaged in simultaneous five-minute conversations with both a human and an AI system before determining which was which
Two AI models pass benchmark Turing Test, blurring line between human and machine
