Plagiarism Detection in arXiv

dc.contributor.authorSorokina, Daria
dc.date.accessioned2021-05-15T07:35:22Z
dc.date.available2021-05-15T07:35:22Z
dc.date.issued2006-08-11
dc.descriptionplagarism articalsen_US
dc.description.abstractWe describe a large-scale application of methods for finding plagiarism in research document collections. The methods are applied to a collection of 284,834 documents collected by arXiv.org over a 14 year period, covering a few different research disciplines. The methodology efficiently detects a variety of problematic author behaviors, and heuristics are developed to reduce the number of false positives. The methods are also efficient enough to implement as a real-time submission screen for a collection many times largeren_US
dc.identifier.urihttp://hdl.handle.net/10673/789
dc.language.isoenen_US
dc.subjectplagarismen_US
dc.subjectplagarismen_US
dc.titlePlagiarism Detection in arXiven_US

Files