Generative AI Plagiarism Reaches 60%, Copyleaks Research Shows

StartupHub.ai Staff

Generative AI Plagiarism

Generative AI plagiarism is becoming more contentious for content creation, with recent investigations by Israeli startup, Copyleaks, exposing low authenticity of AI-created content. Their comprehensive research unveils that a significant 60% of outputs from GPT-3.5 manifest elements of plagiarized content.

To discern the level of originality in AI-generated content, Copyleaks analyzed 1,045 outputs from GPT-3.5, spread over 26 diverse subjects. The detailed analysis revealed a high prevalence of Generative AI plagiarism, varying considerably among subjects. For instance, Physics manifested the highest levels of identical text, amounting to 27%.

gpt-3.5 plagiarism
Copyleaks gpt-3.5 plagiarism research, identical text ranking by subject. Credit: Copyleaks.

Conversely, Computer Science displayed the highest instances of paraphrased text, 80.7%. Both Physics and Psychology surfaced as subjects with the highest percentages of minor changes at 25.2%. It implies a challenge of unauthorized content modifications and attributions.

Employing an advanced scoring system, Copyleaks gauged content originality, where a 0% score corresponds to absolute originality and 100% denotes a complete lack thereof. Physics stood out with the highest average Similarity Score at 31.3%. Meanwhile, disciplines like Theater and Humanities showcased the least similarity scores.

Summary of findings by Copyleaks’ research on plagiarism by GPT-3.5 LLM. Credit: Copyleaks. Note: “Identical Text” denotes the most blatant form, where one simply copies another person’s words verbatim and represents them as their own. “Minor Changes” involve making slight modifications to the source material, like altering a verb, while still failing to provide proper credit. Lastly, “Paraphrased Text” involves rephrasing someone else’s ideas in one’s own words without acknowledging the original source.

“As Generative AI adoption grows, it’s important to realize that there are major copyright and IP implications for using AI-generated content,” commented Alon Yamin, CEO and co-founder of Copyleaks. “As our research demonstrates, high percentage of AI content is plagiarized and could be copyrighted which is why it is crucial to identify Generative AI content presence and authenticate it for originality.”

With predictions pointing towards a near future where nearly 90% of online content will be generated by AI, the sanctity of original ideas and critical thinking is in peril. Addressing Generative AI plagiarism is paramount to safeguarding intellectual integrity and originality in this new digital era.

Explore thousands of AI startups.

Browse Startups by Filters

Founder Military Unit

Homepage Military Unit

AI Technology

Homepage AI Tech

Sector Tags

Homepage Sectors Filter

Continue Reading

Mentioned in the article

Market Statistic

0%
of online content may be synthetically generated by 2026.

You cannot copy content of this page

Add or Claim Your Profile

Submit the form below to add or claim an existing profile. 

Within 24 hours, we’ll review your request and connect you to your existing profile for full editing.

Database Search

Filter by:

In Memory of The Murdered

In the early morning on Saturday, October 7th, 2023, terrorist group Hamas infiltrated southern Israel and massacred innocent Israelis, from infants to seniors.

They were mercilessly killed, in unspeakable ways.

StartupHub.ai and the entire technology community pause to honor the murdered. We remember them. We honor their memory and the brightest futures they were supposed to live.

May their families and friends find solace. We grieve with you.

We pray for the release of hostages currently held in Gaza.

Pro Lite Subscription Required

Unlock all search results

With a Pro Lite subscription, you can enjoy unlimited profile viewing. Power your innovation efforts, uninterrupted.

$20/month

cancel anytime

Pro Lite includes: