New Open‑Source Tool to Check If AI Trained on Your Work
If you’re a writer, artist, or publisher, you can now verify whether your content was used to train large language models—without a data center or a PhD.
In “Copyright Detection in Large Language Models,” David Szczecina, Senan Gaffori, and Edmond Li introduce an open-source platform that makes copyright checks practical and transparent.
- One-click checks: Upload your content to see potential matches in LLM training sets.
- Smarter similarity: Improved detection methods to catch near-duplicates and paraphrases.
- Faster & cheaper: 10–30% lower compute via efficient API calls.
- Scalable backend: Built to handle big datasets as usage grows.
- User-friendly: Clear results for creators, publishers, and legal teams.
Why it matters: As legal scrutiny intensifies, creators need accessible proof of use. This platform helps raise the bar for ethical, transparent AI—supporting responsible development and easier copyright enforcement.
Read the research: https://arxiv.org/abs/2511.20623v1
Authors: David Szczecina, Senan Gaffori, Edmond Li (cs.AI)