<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0" xml:base="https://dilac.iac.gatech.edu/">
  <channel>
    <title>Machine Learning</title>
    <link>https://dilac.iac.gatech.edu/</link>
    <description/>
    <language>en</language>
    
    <item>
  <title>TMI: A Curatorial Approach to Finding Data in a Literary Corpus</title>
  <link>https://dilac.iac.gatech.edu/dilac-projects/tmi</link>
  <description>&lt;span class="field field--name-title field--type-string field--label-hidden"&gt;TMI: A Curatorial Approach to Finding Data in a Literary Corpus&lt;/span&gt;

            &lt;div class="clearfix text-formatted field field--name-body field--type-text-with-summary field--label-hidden field__item"&gt;&lt;p&gt;&lt;img alt="Text Analysis Dashboard" data-entity-type="file" data-entity-uuid="fa7d0b4c-0d10-4eed-9112-66a31142d51f" src="https://dilac.iac.gatech.edu/sites/default/files/inline-images/Dashboard-page.png" class="align-right" width="450" height="349" loading="lazy"&gt;Principal Investigator: Brad Rittenhouse&lt;br&gt;
Project Team: Sudeep Agarwal, Taha Merghani, Madison McRoy, Nate Knauf, Sidharth Potdar, and Kevin Kusuma&lt;/p&gt;

&lt;p&gt;Informationally-dense literature, sometimes referred to as “encyclopedic narrative,” has often been prized by scholars, and afforded a prestigious place in the literary canon. However, these works and the prestige that comes with them tend to be overwhelmingly male: books like Thomas Pynchon’s &lt;em&gt;Gravity’s Rainbow&lt;/em&gt; and Herman Melville’s &lt;em&gt;Moby Dick&lt;/em&gt;, for instance, assemble knowledge on topics like ballistics and whaling. This project explores methods for agnostically identifying instances of information aggregation across a literary corpus, sidestepping human biases that overvalue data from mathematics and the hard sciences.&lt;/p&gt;

&lt;p&gt;Working on the Wright American Fiction corpus, which includes nearly every work of American fiction written between 1850 and 1875, we have developed a curatorial process that points to specific passages where material information—the people, products, and print that proliferated during this period of American history—accretes. Manifested as noun density, this measure allows us to quantify literary data at a suitable level of specificity: it allows us to find writers who are struggling to represent their newly dense material reality aesthetically, but stops short of proscribing specific types of material data. Writers who catalogue whales, like Melville, are counted equally to writers who may concern themselves more with, say, household items.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Moving forward, we hope to refine our algorithm to detect not just the presence of information, but its absence. African-American writers, for instance, often struggled with assembling hereditary information, which was often kept from them, or in representing experiences too traumatic to be told. We are also working on interactive frameworks for visualizing our data and allowing others to explore it.&lt;/p&gt;
&lt;/div&gt;
      &lt;span class="field field--name-uid field--type-entity-reference field--label-hidden"&gt;&lt;span&gt;morangi3&lt;/span&gt;&lt;/span&gt;
&lt;span class="field field--name-created field--type-created field--label-hidden"&gt;&lt;time datetime="2018-01-16T11:10:37-05:00" title="Tuesday, January 16, 2018 - 11:10" class="datetime"&gt;Tue, 01/16/2018 - 11:10&lt;/time&gt;
&lt;/span&gt;

  &lt;div class="field field--name-field-project-year field--type-list-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Year&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;2017-18&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-project-leaders field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Leads&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;Brad Rittenhouse&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-students field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Students&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;Sudeep Agarwal&lt;/div&gt;
          &lt;div class="field__item"&gt;Taha Merghani&lt;/div&gt;
          &lt;div class="field__item"&gt;Madison McRoy&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-tags field--type-entity-reference field--label-above"&gt;
    &lt;div class="field__label"&gt;Genre Tags&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/16" hreflang="en"&gt;Text Analysis&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/14" hreflang="en"&gt;Machine Learning&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/17" hreflang="en"&gt;Topic Modeling&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/27" hreflang="en"&gt;Digital Humanities&lt;/a&gt;&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-project-url field--type-link field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Site&lt;/div&gt;
              &lt;div class="field__item"&gt;&lt;a href="http://tmi.gatech.edu"&gt;TMI Project Site&lt;/a&gt;&lt;/div&gt;
          &lt;/div&gt;

  &lt;div class="field field--name-field-contactemail field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Contact Email&lt;/div&gt;
              &lt;div class="field__item"&gt;bcrittenhouse@gatech.edu&lt;/div&gt;
          &lt;/div&gt;
</description>
  <pubDate>Tue, 16 Jan 2018 16:10:37 +0000</pubDate>
    <dc:creator>morangi3</dc:creator>
    <guid isPermaLink="false">50 at https://dilac.iac.gatech.edu</guid>
    </item>
<item>
  <title>TOME: Interactive TOpic Model and MEtadata Visualization</title>
  <link>https://dilac.iac.gatech.edu/dilac-projects/topic-model-metadata-visualization</link>
  <description>&lt;span class="field field--name-title field--type-string field--label-hidden"&gt;TOME: Interactive TOpic Model and MEtadata Visualization&lt;/span&gt;

            &lt;div class="clearfix text-formatted field field--name-body field--type-text-with-summary field--label-hidden field__item"&gt;&lt;p&gt;TOME is a tool to support the interactive exploration and visualization of text-based archives, supported by a Digital Humanities Startup Grant from the National Endowment for the Humanities (Lauren Klein and Jacob Eisenstein, co-PIs). Drawing upon the technique of topic modeling—a computational method for identifying themes that recur across a collection—our tool allows humanities scholars to trace the evolution and circulation of these themes across printing networks and over time.&lt;/p&gt;

&lt;p&gt;Publications related to this project include:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;
	&lt;p&gt;Klein, L., J. Eisenstein, and I. Sun. “Exploratory Thematic Analysis for Digitized Archival Collections.” Digital Scholarship in the Humanities 30.1 (2015).&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;Eisenstein, J., I. Sun and L. Klein. “Exploratory Text Analysis for Large Document Archives.’ Proceedings of Digital Humanities 2014. Hamburg: Univ. of Hamburg, 2014.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;Eisenstein, J. and L. Klein. “Reading Thomas Jefferson with TopicViz: Towards a Thematic Method for Exploring Large Cultural Archives.” Proceedings of the Implementing New Knowledge Environments (INKE) Annual Meeting 2012. Vancouver: Scholarly and Research Communication, 2013.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;Presentations related to this project include:&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;“Developing and Sustaining Collaborative Research.” Roundtable. Modern Language Association, Austin, TX, January 2016.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;“The Carework and Codework of Nineteenth-Century Abolitionist Newspapers.” The Digital Antiquarian, American Antiquarian Society, May 2015.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;“Beyond the Digital Surrogate: Discovery and Analysis of Digital Collections.” Roundtable. Digital Library Federation Forum, Atlanta, GA, October 2014.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;“The Best-Laid Schemes: Reflections on Three Years of the NEH ODH Data Management Plan Requirement.” Roundtable. Digital Library Federation Forum, Atlanta, GA, October 2014.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;“Exploratory Thematic Analysis for Historical Newspaper Archives.” Institute for Quantitative Theory and Methods. Emory University, April 2015. Also presented at Digital Humanities, Hamburg, Germany, July 2014.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;“Towards a Thematic Method for Exploring Large Cultural Archives,” with Jacob Eisenstein. Research Foundations for Understanding Books and Reading in the Digital Age, Havana, Cuba, December 2012.&lt;/p&gt;
	&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
      &lt;span class="field field--name-uid field--type-entity-reference field--label-hidden"&gt;&lt;span&gt;morangi3&lt;/span&gt;&lt;/span&gt;
&lt;span class="field field--name-created field--type-created field--label-hidden"&gt;&lt;time datetime="2017-10-29T16:06:17-04:00" title="Sunday, October 29, 2017 - 16:06" class="datetime"&gt;Sun, 10/29/2017 - 16:06&lt;/time&gt;
&lt;/span&gt;

  &lt;div class="field field--name-field-project-year field--type-list-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Year&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;2016-17&lt;/div&gt;
          &lt;div class="field__item"&gt;2017-18&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-project-leaders field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Leads&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;Lauren Klein&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-tags field--type-entity-reference field--label-above"&gt;
    &lt;div class="field__label"&gt;Genre Tags&lt;/div&gt;
          &lt;div class="field__items"&gt;
              &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/13" hreflang="en"&gt;Info viz&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/16" hreflang="en"&gt;Text Analysis&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/14" hreflang="en"&gt;Machine Learning&lt;/a&gt;&lt;/div&gt;
          &lt;div class="field__item"&gt;&lt;a href="https://dilac.iac.gatech.edu/taxonomy/term/27" hreflang="en"&gt;Digital Humanities&lt;/a&gt;&lt;/div&gt;
              &lt;/div&gt;
      &lt;/div&gt;

  &lt;div class="field field--name-field-project-url field--type-link field--label-above"&gt;
    &lt;div class="field__label"&gt;Project Site&lt;/div&gt;
              &lt;div class="field__item"&gt;&lt;a href="http://tome.lmc.gatech.edu"&gt;TOME Project Site&lt;/a&gt;&lt;/div&gt;
          &lt;/div&gt;

  &lt;div class="field field--name-field-contactemail field--type-string field--label-above"&gt;
    &lt;div class="field__label"&gt;Contact Email&lt;/div&gt;
              &lt;div class="field__item"&gt;lauren.klein@lmc.gatech.edu&lt;/div&gt;
          &lt;/div&gt;
</description>
  <pubDate>Sun, 29 Oct 2017 20:06:17 +0000</pubDate>
    <dc:creator>morangi3</dc:creator>
    <guid isPermaLink="false">46 at https://dilac.iac.gatech.edu</guid>
    </item>

  </channel>
</rss>
