Case Study Analysis
This week we will explore:
Case Study Analysis;
Quantitative Data;
Cross-Case Synthesis;
Generalising; and
Analysis Tools.
Voyant Tools provides a range of analysis and visualisation tools for analysing text:
https://voyant-tools.org/docs/#!/guide/tools
These have now been consolidated into a single interface, but the following provides a summary:
Bubblelines visualizes the frequency and repetition of a word’s use in a corpus. Each document in the corpus is represented as a horizontal line and divided into segments of equal length. Each selected word is represented as a bubble with the size of the bubble indicating the word’s frequency in the corresponding segment of text. The larger the bubble’s radius the more frequently the word occurs.
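The segment-based counting that Bubblelines describes can be sketched in a few lines of Python. This is an illustrative approximation only, not Voyant's own implementation: the function name `segment_counts`, the simple word-splitting rule, and the sample sentence are all assumptions made for the example.

```python
import re


def segment_counts(text, word, segments=4):
    """Count how often `word` occurs in each of `segments` equal-length slices.

    Hypothetical helper for illustration; Voyant's tokenisation may differ.
    """
    tokens = re.findall(r"\w+", text.lower())
    counts = [0] * segments
    for position, token in enumerate(tokens):
        if token == word.lower():
            # Map the token's position in the document onto a segment index.
            index = position * segments // len(tokens)
            counts[index] += 1
    return counts


sample = "the cat sat on the mat and the dog sat by the door"
print(segment_counts(sample, "sat", segments=4))
```

Each number in the returned list would correspond to the size of one bubble along the document's line: a larger count means a larger bubble in that segment.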
Bubbles represents the relative frequency of words in a corpus through a cloud of bubbles. Each word is represented by a bubble, where the size of the bubble is proportionate to the frequency with which the word occurs in the corpus. The larger the bubble’s radius the more frequently the word occurs.
Cirrus is a word cloud displaying the frequency of words appearing in a corpus. Words occurring more frequently appear larger.
Corpus Grid provides an overview of a corpus, displaying each document’s title, total number of words (word tokens), number of unique words (word types), and lexical density (the ratio of tokens to types).
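To make the token, type and density figures concrete, the following Python sketch computes them for a tiny two-document corpus. It is not Voyant's code: the function `document_stats`, the sample texts, and the word-splitting rule are assumptions, and the density is calculated here as unique words divided by total words (the type-token ratio), which is an assumption about how the ratio is oriented.

```python
import re


def document_stats(title, text):
    """Return token count, type count and density for one document.

    Hypothetical helper; density is taken as types / tokens (an assumption).
    """
    tokens = re.findall(r"\w+", text.lower())
    types = set(tokens)  # unique word forms
    density = len(types) / len(tokens) if tokens else 0.0
    return {
        "title": title,
        "tokens": len(tokens),
        "types": len(types),
        "density": round(density, 3),
    }


# Invented sample corpus, for illustration only.
corpus = {
    "Case A": "the interview data shows the same theme",
    "Case B": "each case shows a different theme",
}

for title, text in corpus.items():
    print(document_stats(title, text))
```

Under this definition a value close to 1 means few words are repeated, while a lower value means the same words recur often.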
Corpus Summary is a tool that provides a simple, textual overview of the current corpus. This includes number of words, number of unique words, longest and shortest documents, highest and lowest vocabulary density, most frequent words, notable peaks in frequency, and distinctive words.
Corpus Type Frequencies Grid provides an ordered list of the frequencies of all the words appearing in a corpus. Additional columns can be toggled to show other statistical information, including a small line graph of term frequency across the corpus.
Document Input Add allows the user to dynamically add additional documents to a corpus after importing the initial texts.
Document Type Collocate Frequencies Grid provides an ordered list of word collocations for a specified word and document.
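As a rough illustration of what a collocate list contains, the Python sketch below counts the words that appear within a fixed window around a chosen keyword and orders them by frequency. The function name `collocates`, the window size, and the sample sentence are assumptions for the example; Voyant's own collocation settings may differ.

```python
import re
from collections import Counter


def collocates(text, keyword, window=1):
    """Return (word, count) pairs for words within `window` tokens of `keyword`.

    Hypothetical helper for illustration; results are ordered by count.
    """
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter()
    for i, token in enumerate(tokens):
        if token != keyword.lower():
            continue
        start = max(0, i - window)
        # Words just before and just after this occurrence of the keyword.
        neighbours = tokens[start:i] + tokens[i + 1:i + 1 + window]
        counts.update(neighbours)
    return counts.most_common()


text = "a single case study and a multiple case study both need a case protocol"
print(collocates(text, "case", window=1))
```

In this sample "study" is the most frequent collocate of "case", because it follows the keyword twice.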
Document Type Frequencies Grid provides an ordered list of word frequencies along with other statistical data for each document in a corpus.
Document Type KWICs Grid displays a table contextualizing a selected word with the phrases or paragraphs of text that directly precede and follow each instance of the word throughout the corpus.
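A keyword-in-context (KWIC) table of this kind can be approximated with a short Python sketch. The function name `kwic`, the context width, and the sample text are assumptions made for illustration, and the output is a plain list of rows rather than Voyant's interactive grid.

```python
import re


def kwic(text, keyword, width=2):
    """Return (left context, keyword, right context) rows for each occurrence.

    Hypothetical helper; `width` is the number of words kept on each side.
    """
    tokens = re.findall(r"\w+", text)
    rows = []
    for i, token in enumerate(tokens):
        if token.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            rows.append((left, token, right))
    return rows


sample = "Each case was coded twice. The second case differed from the first."
for left, word, right in kwic(sample, "case", width=2):
    print(f"{left:>12} | {word} | {right}")
```

Reading the rows side by side makes it easier to compare how the same word is used in different places across the cases.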
Knots represents a corpus as a series of twisted lines. Each line depicts a selected term over the length of the corpus. The extent to which lines overlap indicates the level of correspondence or linkage between the terms.
Lava displays multiple levels of a corpus in a three-dimensional environment. Clicking on a document within the corpus expands the Lava visualization in a ring to further explore terms within their context.
Links represents the collocation of terms in a corpus by depicting them in a network through the use of a force-directed graph. In this graph the frequency of a word is indicated by the relative size of the term.
Mandala allows the importing of “textual” files to perform analysis on the frequency and linkage of terms. For example, importing a play would allow the user to find the linkage and frequency between a term and its speaker.
Reader provides a viewing window to allow the user to read the full text of the corpus that they have imported into Voyant Tools.
RezoViz visualizes the relationships between people, locations and organizations in a collection of documents. Links are created between every pair of people, locations and organizations that occur in the same document.
ScatterPlot displays the correspondence of word use in a corpus. This visualization relies on a statistical analysis that takes the word’s correspondence from each document (where each document represents a dimension) and reduces it to a three-dimensional space to easily visualize the data through a scatterplot.
Termometer depicts the change in the frequency of a word across a corpus spread over time. It provides a more compact version of the tool TermsRadio. Unlike TermsRadio, the temporal dimension of the tool is not expanded across the x-axis. Instead, the change in frequency of the word is captured through movement along the y-axis.
TermsRadio provides a scrolling line graph that can depict the change in the frequency of a word across a corpus spread over time.
Type Frequencies Chart shows a line graph depicting the distribution of a word’s occurrence across a corpus.
Word Count Fountain visualizes word frequencies as a fountain. Each stream represents a unique word, and its height represents the frequency with which the term occurs in the corpus.