Scientists tap books to isolate literary ‘fingerprint’
December 11th, 2009 - 12:12 pm ICT by IANS
London, Dec 11 (IANS) Based on the writing styles of Thomas Hardy, D.H. Lawrence and Herman Melville, physicists have developed a formula to detect the literary “fingerprints” of different authors.
New research from a group of Swedish physicists at Umeå University describes a new concept, the “meta book”. It uses the frequency with which authors introduce new words in their work to discern distinct patterns in their writing styles.
For more than 75 years, George Kingsley Zipf’s maxim, based on a carefully selected compilation of American English called the Brown Corpus, has suggested a universal pattern for the frequency of new words used by authors.
Zipf’s law suggests that how often a word occurs is inversely proportional to its frequency ranking. The new research suggests, however, that the truth behind word frequency is less universal than Zipf asserted, and is linked more with the author’s linguistic ability than with any over-arching linguistic rule.
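For readers who want the relationship spelled out, the classical form of Zipf’s law is usually written as below. This is the textbook statement, not the formula developed by the Umeå researchers, and the symbols f (how often a word occurs) and r (its rank when words are sorted from most to least frequent) are notation assumed here rather than taken from the study.

```latex
f(r) \propto \frac{1}{r}
```

Under this idealised form, the second most common word in a text would appear about half as often as the most common one, and the third about a third as often.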
The researchers first found that the occurrence of new words in the texts by Hardy, Lawrence and Melville did begin to drop off as the books got longer, despite new settings and plot twists.
Their evidence also shows, however, that the rate at which new words drop off varies between authors and, most significantly, is consistent across the entire works of any one of the three authors they analysed.
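The drop-off in new words can be measured in a simple way. The following Python sketch is an illustration only, not the researchers’ method: the function names `tokenize`, `vocabulary_growth` and `new_word_rate` are hypothetical, and the rule for splitting text into words is a simplifying assumption.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into word tokens (a simplifying assumption)."""
    return re.findall(r"[a-z']+", text.lower())


def vocabulary_growth(tokens):
    """Return a list whose i-th entry counts the distinct words among the first i+1 tokens."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok)
        growth.append(len(seen))
    return growth


def new_word_rate(tokens, window):
    """Return, for each consecutive window of tokens, the fraction not seen earlier in the text."""
    seen = set()
    rates = []
    for start in range(0, len(tokens) - window + 1, window):
        new = 0
        for tok in tokens[start:start + window]:
            if tok not in seen:
                seen.add(tok)
                new += 1
        rates.append(new / window)
    return rates
```

Run over the text of a whole novel with, say, `new_word_rate(tokenize(text), 1000)`, the returned fractions would be expected to fall as the book goes on. Comparing how quickly they fall for different authors is the kind of contrast the study describes.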
The statistical analysis was applied to entire novels, sections of novels, complete works and amalgamations of different works by the same author - in every case, each author showed a unique word-frequency “fingerprint”.
Using the statistical patterns evident from their study, the researchers have pondered the idea of a meta book - a code for each author that could represent their entire work, completed or in the mental pipeline, says an Umeå University release.
“These findings lead us towards the meta book concept - the writing of a text can be described by a process where the author pulls a piece of text out of a large mother book (the meta book) and puts it down on paper,” write the study authors.
These findings were published in the Thursday edition of the New Journal of Physics.