Functions Overview

TextDNA performs several statistical manipulations on text input. It can be difficult to understand how these manipulations apply to raw text as opposed to n-gram input. This page will offer a digestible example of how nine lines of text are manipulated through the below functions. Comparing the function output to the original output in this example can help you gain an understanding as to how TextDNA arranges text input.

Text Sequences

Pretend the following nine lines each represent a sequence in TextDNA. These lines are taken from a scene in Hamlet.

  1. Ay, my lord.
  2. It might, my lord.
  3. Not a jot more, my lord.
  4. Ay, my lord, and of calf-skins too.
  5. What's that, my lord?
  6. E'en so.
  7. E'en so, my lord.
  8. 'Twere to consider too curiously, to consider so.
  9. That is Laertes, a very noble lord: mark.

Word Frequency

Sequence Frequency

Sequence Co-Occurrence

7654321
seq. 1lord (1,2,3,4,5,7,9)my (1,2,3,4,5,7)                                    ay (1,4)
seq. 2lord (1,2,3,4,5,7,9)my (1,2,3,4,5,7)it (1), might (1)
seq. 3lord (1,2,3,4,5,7,9)my (1,2,3,4,5,7)a (3,9)jot (1), more (1), not (1)
seq. 4lord (1,2,3,4,5,7,9)my (1,2,3,4,5,7)ay (1, 4), too (4,8)and (1), calf-skins (1), of (1)
seq. 5lord (1,2,3,4,5,7,9)my (1,2,3,4,5,7)that (5,9)what (5)
seq. 6so (6,7,8)e'en (6,7)
seq. 7lord (1,2,3,4,5,7,9)my (1,2,3,4,5,7)so (6,7,8)e'en (6,7)
seq. 8so (6,7,8)too (4,8)consider (1), curiously (1), to (1), 'twere (1)
seq. 9lord (1,2,3,4,5,7,9)a (3,9)is (1), Laertes (1), mark (1), noble (1), very (1)


TextDNA Home

Email danielle.szafir@colorado.edu for more information.