KIAM Main page Web Library  •  Publication Searh  Русский 
Publication

KIAM Preprint № 27, Moscow, 2013
Authors: Borisov L.A., Orlov Y. N., Osminin K.P.
Identification of a text author by the letter frequency empirical distribution
Abstract:
The distances distributions between empirical triplet distributions are investigated. The accuracy estimation of these distributions is obtained depending on the length of the text. The method of author identification is examined on the broad class of literature texts. The stabilization length of triplet distributions is approximately equal to one half of the text without dependence on author and text length. The example of cluster method is given for E.I. Roerich philosophical texts.
Keywords:
empirical probability, minimal text length, author identification
Publication language: russian,  pages: 26
Research direction:
Mathematical modelling in actual problems of science and technics
Russian source text:
List of publications citation:
Export link to publication in format:   RIS    BibTeX
View statistics (updated once a day)
over the last 30 days — 9 (+2), total hit from 01.09.2019 — 1055
About authors:
  • Borisov L.A.,  leonidborisoff@gmail.com,  МФТИ
  • Orlov Yurii Nikolaevich,  ov31509f@yandex.ruorcid.org/0000-0002-1356-5137KIAM RAS
  • Osminin K.P.,  osminik@yandex.ru,  Компания Courant