New Delhi, April 4 -- OpenAI's GPT-4.5 and Meta's Llama-3.1 models have passed the Turing Test, a benchmark proposed by Alan Turing in the 1950s to assess whether machines can exhibit intelligent behaviour indistinguishable from humans that has always been held up as a sort of tipping point on the maturity and sophistication of Artificial Intelligence (AI).
Researchers Cameron R. Jones and Benjamin K. Bergen from the University of California San Diego, found that GPT-4.5 performed so convincingly that judges identified it as human 73% of the time-significantly more often than they correctly identified actual human participants. Meta's Llama-3.1-405B achieved a 56% success rate, essentially matching human performance (around 50%), while bas...