<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "https://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.2" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">1832</journal-id>
      <journal-title-group>
        <journal-title>Journal of Cultural Analytics</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2371-4549</issn>
      <publisher>
        <publisher-name>Center for Digital Humanities, Princeton University</publisher-name>
      </publisher>
      <self-uri xlink:href="https://culturalanalytics.org/">Website: Journal of Cultural Analytics</self-uri>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">11831</article-id>
      <article-id pub-id-type="doi">10.22148/001c.11831</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Gender Dynamics and Critical Reception: A Study of Early 20th-century Book Reviews from The New York Times</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Lavin</surname>
            <given-names>Matthew J.</given-names>
          </name>
          <xref ref-type="aff" rid="author-aff-1">
            <sup>1</sup>
          </xref>
        </contrib>
      </contrib-group>
      <aff id="author-aff-1">
        <label>1</label>
        <institution-wrap>
          <institution content-type="edu">University of Pittsburgh</institution>
        </institution-wrap>
        <institution-wrap>
          <institution-id institution-id-type="ROR">https://ror.org/01an3r305</institution-id>
        </institution-wrap>
      </aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2020-01-30">
        <day>30</day>
        <month>1</month>
        <year>2020</year>
      </pub-date>
      <pub-date publication-format="electronic" date-type="collection" iso-8601-date="2021-05-03">
        <year>2020</year>
      </pub-date>
      <volume>5</volume>
      <issue seq="7">1</issue>
      <issue-title>Articles in 2020</issue-title>
      <elocation-id>11831</elocation-id>
      <permissions>
        <license license-type="open-access">
          <ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/">
              http://creativecommons.org/licenses/by/4.0
            </ali:license_ref>
          <license-p>
              This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0">Creative Commons Attribution License (4.0)</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
            </license-p>
        </license>
      </permissions>
      <self-uri content-type="pdf" xlink:href="https://culturalanalytics.org/article/11831.pdf"/>
      <self-uri content-type="xml" xlink:href="https://culturalanalytics.org/article/11831.xml"/>
      <self-uri content-type="json" xlink:href="https://culturalanalytics.org/article/11831.json"/>
      <self-uri content-type="html" xlink:href="https://culturalanalytics.org/article/11831"/>
      <abstract>
        <p>This paper focuses on book reviews at the turn-of-the century United States in order to underline fundamental compatibilities between large-scale, computational methods and book historical approaches. It analyzes a dataset of approximately 2,800 book reviews published in The New York Times between January 1, 1905 and December 31, 1925. Several machine learning scenarios are employed to investigate how the underlying reviews constructed gendered norms for reading and readership. Logistic regression models are trained and tested to evaluate how effectively lemma frequencies predict the perceived or presumed gender of an author under review. The paper discusses four different feature selection scenarios, as follows: (1) No terms removed, (2) Stop words removed, (3) Stop words, gender nouns, and titles removed, and (4) Stop words, gender nouns, titles, and common forenames removed. For each scenario, the top lemma coefficients are discussed and interpreted. Tracing the norms (gendered and gendering) of The New York Times Book Review in the early twentieth century demonstrates that even the summary-driven book reviews played an important role in mediating hierarchies of taste and distinction. Further, the paper seeks to demonstrate that cultural analytics methods can be used to investigate a range of research questions related to authorship, publishing, circulation, and reception.</p>
      </abstract>
      <kwd-group>
        <kwd>machine learning</kwd>
        <kwd>reviews</kwd>
        <kwd>gender</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
