suspicious document. In PAN 2013 source retrieval subtask the main goal was to\r
identify web pages which have been used as a source of plagiarism for test corpus creation.\r
\r
-The test corpus contained 58 documents each discussing only one theme.\r
+The test corpus contained 58 documents each discussing one topic only.\r
Those documents were created intentionally by\r
semiprofessional writers, thus they featured nearly realistic plagiarism cases~\cite{plagCorpus}.\r
Resources were looked up in the ClueWeb\footnote{\url{http://lemurproject.org/clueweb09.php/}} corpus.\r