<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "https://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.2" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">1832</journal-id>
      <journal-title-group>
        <journal-title>Journal of Cultural Analytics</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2371-4549</issn>
      <publisher>
        <publisher-name>Center for Digital Humanities, Princeton University</publisher-name>
      </publisher>
      <self-uri xlink:href="https://culturalanalytics.org/">Website: Journal of Cultural Analytics</self-uri>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">13147</article-id>
      <article-id pub-id-type="doi">10.22148/001c.13147</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Data Set</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>NovelTM Datasets for English-Language Fiction, 1700-2009</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Underwood</surname>
            <given-names>Ted</given-names>
          </name>
          <xref ref-type="aff" rid="author-aff-1">
            <sup>1</sup>
          </xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Kimutis</surname>
            <given-names>Patrick</given-names>
          </name>
          <xref ref-type="aff" rid="author-aff-1">
            <sup>1</sup>
          </xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Witte</surname>
            <given-names>Jessica</given-names>
          </name>
          <xref ref-type="aff" rid="author-aff-1">
            <sup>1</sup>
          </xref>
        </contrib>
      </contrib-group>
      <aff id="author-aff-1">
        <label>1</label>
        <institution-wrap>
          <institution content-type="edu">University of Illinois, Urbana-Champaign</institution>
        </institution-wrap>
      </aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2020-05-28">
        <day>28</day>
        <month>5</month>
        <year>2020</year>
      </pub-date>
      <pub-date publication-format="electronic" date-type="collection" iso-8601-date="2021-09-02">
        <year>2020</year>
      </pub-date>
      <volume>5</volume>
      <issue seq="7">2</issue>
      <issue-title>Articles in 2020</issue-title>
      <elocation-id>13147</elocation-id>
      <permissions>
        <license license-type="open-access">
          <ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/">
              http://creativecommons.org/licenses/by/4.0
            </ali:license_ref>
          <license-p>
              This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0">Creative Commons Attribution License (4.0)</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
            </license-p>
        </license>
      </permissions>
      <self-uri content-type="pdf" xlink:href="https://culturalanalytics.org/article/13147.pdf"/>
      <self-uri content-type="xml" xlink:href="https://culturalanalytics.org/article/13147.xml"/>
      <self-uri content-type="json" xlink:href="https://culturalanalytics.org/article/13147.json"/>
      <self-uri content-type="html" xlink:href="https://culturalanalytics.org/article/13147"/>
      <abstract>
        <p>This report accompanies a collection of 210,305 volumes, predicted to be fiction, that researchers are encouraged to borrow for their own work. We divide the collection into seven subsets with different emphases (for instance, one where books written by men and women are represented equally, and one composed of only the most prominent and widely-held books). Comparing the pictures produced by these different subsets allows us to assess the resilience or fragility of recent quantitative arguments about literary history. Readers can also simply browse the report as a description of English-language fiction in HathiTrust Digital Library.</p>
      </abstract>
      <kwd-group>
        <kwd>data</kwd>
        <kwd>literary history</kwd>
        <kwd>english literature</kwd>
        <kwd>literature</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
