New Open‑Source Tool to Check If AI Trained on Your Work

New Open‑Source Tool to Check If AI Trained on Your Work

If you’re a writer, artist, or publisher, you can now verify whether your content was used to train large language models—without a data center or a PhD.

In “Copyright Detection in Large Language Models,” David Szczecina, Senan Gaffori, and Edmond Li introduce an open-source platform that makes copyright checks practical and transparent.

  • One-click checks: Upload your content to see potential matches in LLM training sets.
  • Smarter similarity: Improved detection methods to catch near-duplicates and paraphrases.
  • Faster & cheaper: 10–30% lower compute via efficient API calls.
  • Scalable backend: Built to handle big datasets as usage grows.
  • User-friendly: Clear results for creators, publishers, and legal teams.

Why it matters: As legal scrutiny intensifies, creators need accessible proof of use. This platform helps raise the bar for ethical, transparent AI—supporting responsible development and easier copyright enforcement.

Read the research: https://arxiv.org/abs/2511.20623v1

Authors: David Szczecina, Senan Gaffori, Edmond Li (cs.AI)

Paper: https://arxiv.org/abs/2511.20623v1

Register: https://www.AiFeta.com

AI ethics copyright LLM transparency openSource creators research

Read more