<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0" xml:base="https://dilac.iac.gatech.edu/">
  <channel>
    <title>Topic Modeling</title>
    <link>https://dilac.iac.gatech.edu/</link>
    <description/>
    <language>en</language>
    
    <item>
  <title>TMI: A Curatorial Approach to Finding Data in a Literary Corpus</title>
  <link>https://dilac.iac.gatech.edu/dilac-projects/tmi</link>
  <description>&lt;span class="field field--name-title field--type-string field--label-hidden"&gt;TMI: A Curatorial Approach to Finding Data in a Literary Corpus&lt;/span&gt;

            &lt;div class="clearfix text-formatted field field--name-body field--type-text-with-summary field--label-hidden field__item"&gt;&lt;p&gt;&lt;img alt="Text Analysis Dashboard" data-entity-type="file" data-entity-uuid="fa7d0b4c-0d10-4eed-9112-66a31142d51f" src="https://dilac.iac.gatech.edu/sites/default/files/inline-images/Dashboard-page.png" class="align-right" width="450" height="349" loading="lazy"&gt;Principal Investigator: Brad Rittenhouse&lt;br&gt;
Project Team: Sudeep Agarwal, Taha Merghani, Madison McRoy, Nate Knauf, Sidharth Potdar, and Kevin Kusuma&lt;/p&gt;

&lt;p&gt;Informationally-dense literature, sometimes referred to as “encyclopedic narrative,” has often been prized by scholars, and afforded a prestigious place in the literary canon. However, these works and the prestige that comes with them tend to be overwhelmingly male: books like Thomas Pynchon’s &lt;em&gt;Gravity’s Rainbow&lt;/em&gt; and Herman Melville’s &lt;em&gt;Moby Dick&lt;/em&gt;, for instance, assemble knowledge on topics like ballistics and whaling. This project explores methods for agnostically identifying instances of information aggregation across a literary corpus, sidestepping human biases that overvalue data from mathematics and the hard sciences.&lt;/p&gt;

&lt;p&gt;Working on the Wright American Fiction corpus, which includes nearly every work of American fiction written between 1850 and 1875, we have developed a curatorial process that points to specific passages where material information—the people, products, and print that proliferated during this period of American history—accretes. Manifested as noun density, this measure allows us to quantify literary data at a suitable level of specificity: it allows us to find writers who are struggling to represent their newly dense material reality aesthetically, but stops short of proscribing specific types of material data. Writers who catalogue whales, like Melville, are counted equally to writers who may concern themselves more with, say, household items.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Moving forward, we hope to refine our algorithm to detect not just the presence of information, but its absence. African-American writers, for instance, often struggled with assembling hereditary information, which was often kept from them, or in representing experiences too traumatic to be told. We are also working on interactive frameworks for visualizing our data and allowing others to explore it.&lt;/p&gt;
&lt;/div&gt;
      &lt;span class="field field--name-uid field--type-entity-reference field--label-hidden"&gt;&lt;span&gt;morangi3&lt;/span&gt;&lt;/span&gt;
&lt;span class="field field--name-created field--type-created field--label-hidden"&gt;&lt;time datetime="2018-01-16T11:10:37-05:00" title="Tuesday, January 16, 2018 - 11:10" class="datetime"&gt;Tue, 01/16/2018 - 11:10&lt;/time&gt;
&lt;/span&gt;

  &lt;div class="field field--name-field-project-year field--type-list-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Year&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;2017-18&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-project-leaders field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Leads&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;Brad Rittenhouse&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-students field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Students&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;Sudeep Agarwal&lt;/div&gt;
          &lt;div class="field__item"&gt;Taha Merghani&lt;/div&gt;
          &lt;div class="field__item"&gt;Madison McRoy&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-tags field--type-entity-reference field--label-above"&gt;
    &lt;div class="field__label"&gt;Genre Tags&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/16" hreflang="en"&gt;Text Analysis&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/14" hreflang="en"&gt;Machine Learning&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/17" hreflang="en"&gt;Topic Modeling&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/27" hreflang="en"&gt;Digital Humanities&lt;/a&gt;&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-project-url field--type-link field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Site&lt;/div&gt;
              &lt;div class="field__item"&gt;&lt;a href="http://tmi.gatech.edu"&gt;TMI Project Site&lt;/a&gt;&lt;/div&gt;
          &lt;/div&gt;

  &lt;div class="field field--name-field-contactemail field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Contact Email&lt;/div&gt;
              &lt;div class="field__item"&gt;bcrittenhouse@gatech.edu&lt;/div&gt;
          &lt;/div&gt;
</description>
  <pubDate>Tue, 16 Jan 2018 16:10:37 +0000</pubDate>
    <dc:creator>morangi3</dc:creator>
    <guid isPermaLink="false">50 at https://dilac.iac.gatech.edu</guid>
    </item>

  </channel>
</rss>
