Using text analytic tools allows the reader to have concrete information about what they are reading quickly. For instance, a book can be searchable while in an ebook form, but with the right tools a visualization from the text is more accessible, than simply being able to search through data. These tools can be used for interpreting data to help support a thesis.
“TAPoR is a gateway to the tools used in sophisticated text analysis and retrieval. Please contribute!
The project is led by Geoffrey Rockwell, Stefan Sinclair, and Kirsten C. Uszkalo and housed at the University of Alberta.
Support for this project came from the Canadian Institute for Research Computing in the Arts, University of Alberta, the Social Science and Humanities Research Council and the Canada Foundation for Innovation” (http://www.tapor.ca/docs?name=about_tapor)
“Hermeneuti.ca is a collaborative project by Stéfan Sinclair & Geoffrey Rockwell to think through some foundations of contemporary text analysis, including issues related to the electronic texts used, the tools and methodologies available, and the various forms that can take the expression of results from text analysis” (http://hermeneuti.ca)
Have an electronic text you’d like to explore through text analytics. It should be a Plain Text UTF-8 file, so as not to interfere with any of the tools available on TAPoR or Voyant. You can find texts without copyright restrictions from Project Gutenberg.
In the URL search box, I copied and pasted the URL from the Plain Text version of Moby Dick, and click “Reveal.”
A screen like this should pop up:
From the “Frequency” I checked the first word “whale” then this prompts Voyant to open another window:
Here in this screen you can select and deselect the words you’d like to see charted together, or separately. The Voyant Corpus Term Frequency tool allows you to use multiple Voyant tools at once like Cirrus: the Word Cloud program, or Term Frequency: which allows you to see the frequency graphed throughout the text.
For more text analysis tools there is a more complete list here: DiRT (Digital Research Tools Wiki)