Plagiarism Detection in arXiv
dc.contributor.author | Sorokina, Daria | |
dc.date.accessioned | 2021-05-15T07:35:22Z | |
dc.date.available | 2021-05-15T07:35:22Z | |
dc.date.issued | 2006-08-11 | |
dc.description | plagarism articals | en_US |
dc.description.abstract | We describe a large-scale application of methods for finding plagiarism in research document collections. The methods are applied to a collection of 284,834 documents collected by arXiv.org over a 14 year period, covering a few different research disciplines. The methodology efficiently detects a variety of problematic author behaviors, and heuristics are developed to reduce the number of false positives. The methods are also efficient enough to implement as a real-time submission screen for a collection many times larger | en_US |
dc.identifier.uri | http://hdl.handle.net/10673/789 | |
dc.language.iso | en | en_US |
dc.subject | plagarism | en_US |
dc.subject | plagarism | en_US |
dc.title | Plagiarism Detection in arXiv | en_US |