This annotated bibliography is the first step in a proposed project using work that I completed in INFO 5709 as a pilot study. This project will not use the box office data that the previous project relied upon, instead exploring several methods of classifying and clustering the textual data using TF-IDF as an important feature. I will also explore creative methods for visualizing large amounts of textual data. To this end, I’ve included many sources that explore word frequency as a metric – what does it mean? How is it used in creating wordlists? I’ve also included several textual visualization projects that innovate new standards for the visual representation of textual data.