AI struggles to weed out gibberish, study finds

2023-09-15 HKT 01:11

OpenAI's ChatGPT scored better than some other models, but researchers are still cautious about its uses. File image: Shutterstock

The AI models that power chatbots and other applications still have difficulty distinguishing between nonsense and natural language, according to a study released on Thursday.

The researchers at Columbia University in the United States said their work revealed the limitations of current AI models and suggested it was too early to let them loose in legal or medical settings.

They put nine AI models through their paces, firing hundreds of pairs of sentences at them and asking which sentence in each pair was more likely to be heard in everyday speech.

They asked 100 people to make the same judgement on pairs of sentences like: "A buyer can own a genuine product also / One versed in circumference of highschool I rambled."

The research, published in the Nature Machine Intelligence journal, then weighed the AI answers against the human answers and found dramatic differences.

Sophisticated models like GPT-2, an earlier version of the model that powers viral chatbot ChatGPT, generally matched the human answers.

Other, simpler models did less well. But the researchers highlighted that all the models made mistakes.

"Every model exhibited blind spots, labelling some sentences as meaningful that human participants thought were gibberish," said psychology professor Christopher Baldassano, an author of the report.

"That should give us pause about the extent to which we want AI systems making important decisions, at least for now." (AFP)
