Every Rose Has Its Thorns, Including AI. Is The Quality of Chatbots' Threatened?

Published 7. 12. 2023

It is expected that the popularity of AI is going to rise over time. In the future, we will not be able to distinguish whether certain output was generated by AI or created by a human being.

Nowadays, we use AI in many areas of our lives. And already, we struggle to identify what was made by AI and humans. Often, such content is created by AI versions 0.1 or earlier that are deeply rooted in many technologies we use daily.

It is bittersweet that generative AI has been sensational for many. Such excitement proves that some of us are still not ready for the grand arrival of artificial intelligence.

What terrifies many even more is the fact that AI is not ready for itself. Some experts warn about the finite number of natural data available.

Natural data plays a key role in the economics of artificial intelligence. They are vital for the AI models to function properly and to produce good quality content. The more natural data AI models train on (e.g. human-made), the more useful it gets.

Unfortunately, the amount of natural data is limited.

robot

Rita Matulionyte who teaches IT law at Macquarie University writes in her essay for The Conversation “AI researchers have been sounding the dwindling-data-supply-alarm-bells for nearly a year. One study last year by researchers at the AI forecasting organization Epoch AI estimated that AI companies could run out of high-quality textual training data by as soon as 2026, while low-quality text and image data wells could run dry anytime between 2030 and 2060.”

Her article is available here.

We have the option to use synthetic data or AI-generated data. But such solutions needn’t be viable. Why? There is a possibility that synthetic data might destroy the AI models completely. Research on training data shows that data trained on AI-generated content causes the effects of inbreeding - an increase in genetic disorders.

DNA

Due to the current omnipresence of AI, there is more and more synthetic content produced. Paradoxically, the synthetic content can be the biggest threat to generative AI. In other words, by using its own data, AI can become dumb.

I came across this issue for the first time this year in February. I read a comment written by a data researcher Jathan Sadowski from Monash University “a system that is so heavily trained on the outputs of other generative AI’s that it becomes an inbred mutant, likely with exaggerated, grotesque features.”

Sina Alemohammad and Josue Casco-Rodriguez, the machine learning researchers and Ph.D. students in Rice University’s Electrical and Computer Engineering department have dived into this issue quite thoroughly, too. In collaboration with their supervisor, Richard G. Baraniuk, and researchers at Stanford, they wrote an article titled Self-Consuming Generative Models Go MAD (not peer-reviewed yet). MAD is an abbreviation for Model Autophagy Disorder.

You’ll find the interview with them here.

scientists

Baraniuk explains “Say there are companies that, for whatever reason - maybe it’s cheaper to use synthetic data, or they just don’t have enough real data - and they just throw caution to the wind. They say, ‘we’re going to use synthetic data.’”

He describes “What they don’t realize is that if they do this generation after generation, one thing that’s going to happen is the artifacts are going to be amplified. Your synthetic data is going to start to drift away from reality. That’s the thing that’s really the most dangerous, and you might not even realize it’s happening.”

Baraniuk's concerns about the usage of synthetic data are well-founded. He expresses “And by drift away from reality, I mean you’re generating images that are going to become increasingly, like, monotonous and dull. The same thing will happen for text as well if you do this — the diversity of the generated images is going to steadily go down. In one experiment that we ran, instead of artifacts getting amplified, the pictures all converge into basically the same person. It’s totally freaky.”

You might like

Video Marketing Trends 2023: How AI Is Changing the Game

Sales & Marketing

The importance of online marketing videos has been growing for a long time and statistics confirm this trend: 91% of…

Ondrej Svoboda

27. 4. 2023

Phishing in 2024: Don’t Get Caught Up in Nets

Technology

[playht_player width="100%" height="90px" voice="en-US-JaneNeural"] What is phishing? It is an attempt to steal one’s personal data such as payment…

Dagmar Kylarová

5. 4. 2024

Enhancing Work Efficiency with Microsoft 365 Copilot: Another Revolutionary AI Tool

Technology

Employees are realizing AI's potential and seeking ways to enhance productivity. To meet this demand, Microsoft has created Microsoft 365…

Jan Lalinsky

23. 5. 2023

CustomGPTs 101: How To Build One That Will Max Out Your Creativity

Technology

OpenAI just recently equipped ChatGPT with the ability to create CustomGPTs inside the GPT builder and everyone's going crazy for…

Mustaffa Qasim

28. 11. 2023

New SEO tricks: Boost Your Web Visibility With ChatGPT

Technology

Think of ChatGPT as your smart assistant, helping you make your website more attractive to search engines and your audience.…

Antonín Nguyen

23. 10. 2023

Windows 11 'Moment 2' Update: A Game-Changer in the Era of AI?

Technology

What do 'Moments' refer to in Windows 11? In contrast to Windows 10, which used to receive two significant feature…

Antonín Nguyen

8. 3. 2023

Lure People to Your Website. Use TTS in WordPress: A Step-by-step Guide

Technology

[playht_player width="100%" height="90px" voice="en-US-JennyNeural"] People love to listen to content these days. And podcasts are proof of that. Podcasts…

Dagmar Kylarová

15. 1. 2024

Achieve Financial Growth: 5 Must-Try Techniques for Skyrocketing Earnings with AI Tools

Business

The only problem is, 99% of people don’t know how to use ChatGPT and are missing out on a huge…

Mustaffa Qasim

22. 6. 2023

The Future of Work: How Automation and Artificial Intelligence Are Changing the Job Market

Business

Well, let’s start the debate on the Future of Work. Machines can perform actions hundreds or thousands of times faster…

Jan Lalinsky

11. 4. 2023