<article xmlns:ns0="http://www.w3.org/1999/xlink" xmlns:ns1="http://www.niso.org/schemas/ali/1.0/" xmlns:ns2="http://www.w3.org/1998/Math/MathML" article-type="research-article" dtd-version="1.2" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">1832</journal-id>
      <journal-title-group>
        <journal-title>Journal of Cultural Analytics</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2371-4549</issn>
      <publisher>
        <publisher-name>Center for Digital Humanities, Princeton University</publisher-name>
      </publisher>
      <self-uri ns0:href="https://culturalanalytics.org/">Website: Journal of Cultural Analytics</self-uri>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">116368</article-id>
      <article-id pub-id-type="doi">10.22148/001c.116368</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Exploring Gender Differences in Fatwa through Machine Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Mohamed</surname>
            <given-names>Emad</given-names>
          </name>
          <xref ref-type="aff" rid="author-aff-1">
            <sup>1</sup>
          </xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Sarwar</surname>
            <given-names>Raheem</given-names>
          </name>
          <xref ref-type="aff" rid="author-aff-2">
            <sup>2</sup>
          </xref>
        </contrib>
      </contrib-group>
      <aff id="author-aff-1">
        <label>1</label>
        <institution-wrap>
          <institution content-type="edu">Nazarbayev University</institution>
        </institution-wrap>
        <institution-wrap>
          <institution-id institution-id-type="ROR">https://ror.org/052bx8q98</institution-id>
        </institution-wrap>
      </aff>
      <aff id="author-aff-2">
        <label>2</label>
        <institution-wrap>
          <institution content-type="edu">Manchester Metropolitan University</institution>
        </institution-wrap>
        <institution-wrap>
          <institution-id institution-id-type="ROR">https://ror.org/02hstj355</institution-id>
        </institution-wrap>
      </aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2024-06-17">
        <day>17</day>
        <month>6</month>
        <year>2024</year>
      </pub-date>
      <pub-date publication-format="electronic" date-type="collection" iso-8601-date="2024-06-17">
        <year>2024</year>
      </pub-date>
      <volume>9</volume>
      <issue seq="3">3</issue>
      <issue-title>The Potential and Limits of Arabic Digital Humanities</issue-title>
      <elocation-id>116368</elocation-id>
      <history>
        <date date-type="received" iso-8601-date="2024-01-08">
          <day>8</day>
          <month>1</month>
          <year>2024</year>
        </date>
        <date date-type="accepted" iso-8601-date="2024-02-12">
          <day>12</day>
          <month>2</month>
          <year>2024</year>
        </date>
      </history>
      <permissions>
        <license license-type="open-access">
          <ns1:license_ref>
              http://creativecommons.org/licenses/by/4.0
            </ns1:license_ref>
          <license-p>
              This is an open access article distributed under the terms of the <ext-link ext-link-type="uri" ns0:href="http://creativecommons.org/licenses/by/4.0">Creative Commons Attribution License (4.0)</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
            </license-p>
        </license>
      </permissions>
      <self-uri content-type="pdf" ns0:href="https://culturalanalytics.org/article/116368.pdf" />
      <self-uri content-type="xml" ns0:href="https://culturalanalytics.org/article/116368.xml" />
      <self-uri content-type="json" ns0:href="https://culturalanalytics.org/article/116368.json" />
      <self-uri content-type="html" ns0:href="https://culturalanalytics.org/article/116368" />
      <abstract>
        <p>This paper focuses on exploring the differences in inquiries made by men and women within a religious context. Additionally, we aim to ascertain whether it’s feasible to forecast the popularity of answers and the factors contributing to their popularity. To achieve this, we compile a new dataset comprising 40,000 question-answer pairs categorized by gender and popularity. These are sourced from online question-and-answer platforms. Our methodology involves comprehensive experimental analysis, utilizing advanced Arabic text preprocessing alongside machine learning algorithms. We concentrate on two primary objectives: predicting the gender of the questioner and forecasting the popularity of answers. Furthermore, we delve into thematic variations based on gender and address pivotal research queries that offer new perspectives within this domain. These include investigating the differences between questions posed by women versus men, exploring the potential for automated classification of queries by gender, predicting the popularity of fatwas, and identifying the contributing factors to their popularity. Our experimental findings demonstrate a 98% accuracy in gender prediction, precise predictions of popularity with minimal margin for error, and the identification of topics and their associations that are more inclined towards either men or women. We intend to share both the dataset and the source code openly with the research community.</p>
      </abstract>
      <kwd-group>
        <kwd>fatwa analysis</kwd>
        <kwd>gender and religion</kwd>
        <kwd>machine learning</kwd>
        <kwd>topic modeling</kwd>
        <kwd>classification</kwd>
        <kwd>regression</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec>
      <title>1. Background and Introduction</title>
      <p>Research on Muslim women has recently grown rapidly <xref ref-type="bibr" rid="ref-303845 ref-303844 ref-303843">(Faiz et al.; Khan and Mollah; Kloos and Ismah)</xref>. This may be due to shifting attention after women in the industrialized world gained considerable rights <xref ref-type="bibr" rid="ref-303846 ref-303868 ref-303852 ref-303853 ref-303855 ref-303854">(Maftuhin; READ and BARTKOWSKI; Nikjoo et al.; Baboolal; Murrar et al.; Abu-Ras and Itzhaki-Braun)</xref>. Most prominent among the issues of Muslim women are how (and why) Muslim women wear the hijab <xref ref-type="bibr" rid="ref-303869 ref-303870 ref-303871">(Abu-Lughod; Acker; Brenner)</xref> and Muslim women’s political participation <xref ref-type="bibr" rid="ref-303872 ref-303873 ref-303874">(Finlay and Hopkins; Akbarzadeh and Roose; Bhimji)</xref>. A recent study of Islam in the English Wikipedia has found that the tenth most salient collocate of the adjectives Muslim/Islamic is the noun woman, ahead of such common collocates as conquest, jurisprudence, state, art, philosophy, terrorism, fundamentalism, and even prophet <xref ref-type="bibr" rid="ref-303856">(Mohamed)</xref>, but it remains true that “studies on Muslim women’s online activities remain few and far between” <xref ref-type="bibr" rid="ref-303859">(Piela)</xref>. Most of these studies focus on Muslim women’s online activism and include their attempts to challenge male dominance. Alexandros Sakellariou <xref ref-type="bibr" rid="ref-303860">(Sakellariou)</xref> used discourse analysis tools to examine female Greek converts to Islam in their digital presence, using their conversion stories as a means of understanding their social milieu as well as their digital and non-digital identities. Rahman, Fung, and Yeo used a small corpus of 1480 online comments to study Canadian attitudes towards the Hijab (the Muslim women’s headscarf). The authors made use of computational tools, namely sentiment analysis, as well as content analysis for their investigation <xref ref-type="bibr" rid="ref-303861">(Rahman et al.)</xref>. One common trait of these studies is that they used small samples and focused on specific issues<xref ref-type="bibr" rid="ref-303851 ref-303850 ref-303849">(Al-Ghadir et al.; Al-Ghadir and Azmi; Al-Sarem and Emara)</xref>.</p>
      <p>In this paper, we introduce a new dataset containing 172,000 question-answer pairs written in Arabic from a religious question-answering platform and show its usefulness to answer important questions such as what are the differences between questions posed by men and women, what makes an answer popular? The answers to such questions add new insights to the existing knowledge in fields of social sciences, cultural analytics, personal law, economics, technology, education, medicine, religious studies, information retrieval, or digital text forensics. Although we limit our investigation to the aforementioned questions, the novel dataset introduced in this paper can be used for a variety of purposes including (1) studying linguistic gender variation since the questions are marked for the gender of the questioner, (2) tracking a specific issue as the questions span 17 years, (3) tracking the rise and fall of specific themes, thanks to the view numbers associated with different dates, (4) Natural Language Processing tasks like summarisation and document similarity in a religious domain, and (5) tracking authority shift in the Muslim world.</p>
      <p>Islamic Question Answering sites such as <ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/">IslamWeb</ext-link> is similar to other community question-answering platforms by not only allowing people to post a wide variety of questions but also by offering qualified legal scholars a chance to browse and answer any of them. Typically, in these services, new questions can be formulated at any moment, and receive several responses from different qualified legal scholars <xref ref-type="bibr" rid="ref-303858">(Figueroa)</xref></p>
      <p>To describe our dataset we first need to illustrate the concept of Fatwa in Islam. Britannica provides an accurate definition of fatwa <xref ref-type="bibr" rid="ref-303875">(<italic>Fatwa</italic>)</xref>:</p>
      <disp-quote>
        <p>” Fatwa, in Islam, is a formal ruling or interpretation on a point of Islamic law given by a qualified legal scholar (known as a mufti). Fatwas are usually issued in response to questions from individuals or Islamic courts. Though considered authoritative, fatwas are generally not treated as binding judgments; a requester who finds a fatwa unconvincing is permitted to seek another opinion. ”</p>
      </disp-quote>
      <p>While usually directed to scholars of Islam, fatwas span a wide range of topics including politics, financial matters, family problems, and even medical issues, among many others <xref ref-type="bibr" rid="ref-303857 ref-303842 ref-303848 ref-303847">(Agrama; Ismail and Baharuddin; Dahlan et al.; Adel and Numan)</xref>. Traditionally, fatwas were issued by institutions and institutional scholars, but now most fatwas are issued online, either by individual scholars or on web portals belonging to religious institutions. Two main websites (fatwa portals) dominate the fatwa market:</p>
      <list list-type="bullet">
        <list-item>
          <p><ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/">IslamWeb</ext-link> is a comprehensive website with fatwas, articles, videos, and many other features for Muslims. According to <ext-link ext-link-type="uri" ns0:href="https://www.similarweb.com/website/islamweb.net/#overview">SimilarWeb</ext-link>, it ranks 3,927 globally and is visited by 20.23 million viewers a day with the top countries being Egypt, Saudi Arabia, Algeria, Morocco, and France. The website is owned by the Qatari Ministry of Religious Affairs. It is worth investigating why this website is so popular among Muslims from so many Muslim countries through social science research methods. This study is beyond the scope of this article.</p>
        </list-item>
        <list-item>
          <p><ext-link ext-link-type="uri" ns0:href="https://islamqa.info/">Islam Questions and Answers</ext-link> has a global rank of 6,181 and has 13.66 million daily visits. The website was founded and run by the Syrian/Saudi scholar Muhammad Salih Al-Munajjid. The top countries visiting IslamQA are Saudi Arabia, Egypt, the United States, the United Kingdom, and the United Arab Emirates.<xref ref-type="fn" rid="fn1">1</xref></p>
        </list-item>
      </list>
      <p>We attribute the popularity of these websites partially to the fact that they handle many questions as they answer around 200 questions every day and publish sensitive material while maintaining questioner anonymity. The anonymity guarantees that questioners can experiment freely, talk about their mistakes without being embarrassed, address wrongs without facing the consequences, and be judged based on the presented facts and not on external information <xref ref-type="bibr" rid="ref-303867">(Jordan)</xref>. A full investigation of the factors, social and otherwise, that belie the popularity of these websites is worthy of future research.</p>
      <p>Fatwas raises many questions that experts in Islamic Studies and the sociology of Islam may find interesting. Our newly created dataset should thus be of interest to these scholars and others in computational humanities, digital humanities, computational linguistics, and computational social science.</p>
      <p><bold>Research Questions:</bold> In this paper, we present a new massive dataset and the transformations it went through. In addition, we showcase the usefulness of the dataset in answering three pertinent questions, which add new insights to the existing knowledge:</p>
      <list list-type="order">
        <list-item>
          <p>What, if any, are the differences between the questions posed by women and those asked by men?</p>
        </list-item>
        <list-item>
          <p>Can we automatically classify the questions as either male or female?</p>
        </list-item>
        <list-item>
          <p>Can we predict fatwa popularity? And what makes a fatwa popular?</p>
        </list-item>
      </list>
      <p>The questions follow a logical sequence in which question one focuses on examining existing gendered data, question two applies classification when no gender labels exist, thus discovering more questions asked by women and men, and question three assigns a measure of importance to the questions to which gender has been assigned. The questions thus seek to maximize the benefits of this dataset to the researchers in the fields of Islamic studies, women’s studies, and gender studies.</p>
      <p>The rest of this paper goes as follows: in section 2, we describe the dataset; in section 3 we describe preprocessing and the methods we use to answer the questions; in section 4 we present the results with a discussion of important issues; and in section 5, we conclude the study and outline some future research.</p>
    </sec>
    <sec>
      <title>2. Dataset</title>
      <p>The data used in this study is extracted from the fatwa portal <ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/">Islam Web</ext-link>, which is one of the largest fatwa portals online. According to the <ext-link ext-link-type="uri" ns0:href="https://www.similarweb.com/website/islamweb.net/#overview">SimilarWeb</ext-link>, IslamWeb has 17.97M daily visits, with a global rank of 3,897, and it ranks first in the category of <italic>Community and Society &gt; Faith and Beliefs</italic>. Islam Web publishes 200 fatwas a day on average on a variety of topics with questions coming from around the world, but the portal does not publish any demographic data, which makes it hard to sort the questions and the answers based on the gender of the questioner. Since gender is one of the major demographic indicators and is of main interest to the authors, we have utilized the morphological nature of Arabic in search of gendered linguistic inflexion. Fortunately, Arabic, the main language of Islam Web, is a morphologically rich language inflected for gender, so in some cases, when the questioner speaks in a personal way, or when the scholar addresses the questioner personally, it becomes possible to identify the gender of the questioner. We use linguistic cues to extract the gender-identifiable fatwas from the fatwa collection.</p>
      <sec>
        <title>2.1. Linguistic cues for gender identification</title>
        <p>With Arabic being a grammatical gender language, speakers use gendered pronouns, nouns, verbs, and adjectives to refer to themselves and others. The word for doctor in Arabic is either <italic>tabib</italic> (male doctor) or <italic>tabiba</italic> (female doctor), so if a question has the expression <italic>ana tabib</italic>, this indicates the questioner is male. We also use the answers for the same purpose. <italic>kama ta’alm</italic> means <italic>as you (male) know</italic>, while the female version is <italic>kama ta’lamin</italic>. We use these linguistic cues to assign gender to questions. We have a four-step approach as follows:</p>
        <list list-type="bullet">
          <list-item>
            <p>starts with a seed list of gendered expressions, including pronouns, nouns, adjectives, and verbs. The list also includes regular expressions in the form <italic>I gendered_noun</italic> and <italic>you gendered_verb</italic>.</p>
          </list-item>
          <list-item>
            <p>extracts all the fatwas that are either male or female.</p>
          </list-item>
          <list-item>
            <p>check the intersection of the male and female fatwas and remove any common fatwas from the sets</p>
          </list-item>
          <list-item>
            <p>examine five hundred fatwas manually to check how accurate the method is.</p>
          </list-item>
        </list>
        <p>After applying the four steps, we found 40458 with uncontested gender information out of 172,000 fatwas. The manual checks found no incorrectly assigned fatwas.</p>
      </sec>
      <sec>
        <title>2.2. The resulting dataset</title>
        <p>The resulting dataset comprises 40458 questions with 17221 asked by women and 23237 asked by men. Questions vary considerably in length with the average number of words per question at 116.83, with a standard deviation of 120.25, a median of 80, a minimum of 3, and a maximum of 3019. There are differences between the lengths of questions by men and women. For men, the average is 109.5 words (std = 117, min = 3, max = 3019, median = 74) while for women the average is 126.79 (std = 123.9, min = 4, max = 2109, median = 89). The dataset is distributed in a JSON streaming file with the keys: <italic>question</italic>, <italic>answer</italic>, <italic>date</italic> indicating the date of publication, <italic>classification</italic>, which is a coarse classification of the question provided by the website and <italic>views</italic>, which is the number of times the fatwa has been viewed. The dataset (40458 questions) has been randomly divided into a training set (85%, 34389 questions) and a test set (15%, 6069 questions). 10% of the training set has also been dedicated as a development set. This division is the same across all the experiments below. All the dataset metadata is available, directly or indirectly in the html. In <xref ref-type="fig" rid="attachment-223594">Figure 1</xref>, we have a screenshot of a recent fatwa with numbers marking the different pieces of the annotation:</p>
        <fig id="attachment-223594">
          <object-id pub-id-type="publisher-id">223594</object-id>
          <label>Figure 1.</label>
          <caption>
            <title>A screenshot of a fatwa and its information: [1] is a categorization of the fatwa, and in this specific case it is about the rules on honesty, [2] is a title that reads the rule on the personal use of work tools, [3] is the fatwa ID number, [4] is the date of publication (up) and how many stars the fatwa received (down), [5] is the number of views, [6] is the question, and [7] is the answer. The red underlines indicate morphological information marking the gender of the questioner (female) and the blue underline is a link to another relevant fatwa.</title>
          </caption>
          <graphic ns0:href="culturalanalytics_2024_9_3_116368_223594.png" />
        </fig>
        <sec>
          <title>2.2.1. Questions and Answers</title>
          <p>The dataset includes questions by the public and their answers by Muslim scholars on various aspects of Islam. The average length of questions is 116.83 words, with a standard deviation of 120.25, a minimum of 3, and a maximum of 3019 (and a median of 80). The average length of answers is 223.3, with a std of 163.8, a min of 0, and a max of 5809 (median = 185). There does not seem to be a strong correlation between question length and answer length as the Pearson correlation is 0.21. In the current work, we use the questions and the answers to predict both gender and views based on textual data, but the questions and answers could be used to study various issues in Islamic studies and the sociology of Islam. Linguistic gender variation is also a potential research question that is made possible by this dataset.</p>
        </sec>
        <sec>
          <title>2.2.2. Titles</title>
          <p>A title is usually a summary of the questions styled as a journalistic headline and is meant to attract attention. Although we do not use titles in this paper, mainly because they don’t have the gender cues and because their content is included in the body of the Fatwa, they hold quite some potential as they can be used in Natural Language Processing research for summarisation and title generation. They can also be used in text similarity experiments and in short text classification.</p>
        </sec>
        <sec>
          <title>2.2.3. Views</title>
          <p>The website records the HTML hits each page receives. This is extremely useful information for fatwa popularity as it could tell us how many people are interested in this specific question, and with some abstraction, how important the theme or category of the fatwa is. There is a strong, but not perfect, correlation between the number of questions in a category and the total number of views per category (Pearson <italic>r</italic> = 0.79).</p>
          <p>The questions and answers in the gender-labeled data have been viewed 180,738,210 times, with an average of 4465 per question, but this is not evenly distributed as the standard deviation is 10124.7, with a minimum of 12 and a maximum of 441706. The median is 2485, which indicates a right-skewed distribution. The 5 top-viewed questions in the corpus are about prayer (441706 views), masturbation (432674), concubinage (318502), sexual excitation and genital cleanliness (314662), and divorce (304675).</p>
          <p>The counts above are based on the views as they were recorded on 8 February 2016, but the data also includes the views as recorded on 8 July 2020. The 2020 counts are available for only 34714 fatwas in the corpus since the website deletes fatwas from time to time. It is not clear why the website deletes some Fatwas. According to the 2020 counts, the 34714 fatwas have been viewed 371,142,364 times, with an average of 10691 views per fatwa. We use views in a regression experiment in which we try to predict the number of views based on the textual content of the question. The purpose of prediction is two-fold: (i) it is used in the regression model to explain the contribution of each theme to the number of views, which helps rank those themes in terms of importance, and (ii) the model can be used to predict how popular a new question may become in future.</p>
        </sec>
        <sec>
          <title>2.2.4. Categories</title>
          <p>Each fatwa is categorized according to a hierarchical set of labels. For example, a question on whether it is Islamically legitimate to work on improving the Arabic Wikipedia is assigned the label <bold>Main <ns2:math display="inline" id="mml-equation-3a022935-185d-4426-8f0e-f7ee7a92922e"><ns2:mo>−</ns2:mo><ns2:mo>&gt;</ns2:mo></ns2:math> Thought, Politics and Art <ns2:math display="inline" id="mml-equation-3141f06b-bac9-45d5-bf25-8aec947f7f42"><ns2:mo>−</ns2:mo><ns2:mo>&gt;</ns2:mo></ns2:math> Culture and Thought</bold><xref ref-type="fn" rid="fn2">2</xref> while a question from a young woman complaining against her father who does not let her drive her own car is classified as <bold>Main <ns2:math display="inline" id="mml-equation-5fbdd0d6-52eb-4850-8db0-86a35a5168d1"><ns2:mo>−</ns2:mo><ns2:mo>&gt;</ns2:mo></ns2:math> Family Matters <ns2:math display="inline" id="mml-equation-409de75b-9e3c-4db0-965d-47f4609c8a02"><ns2:mo>−</ns2:mo><ns2:mo>&gt;</ns2:mo></ns2:math> Women’s Issues</bold>.<xref ref-type="fn" rid="fn3">3</xref> These labels are useful in obtaining a coarse-grained idea of the range of issues raised on these platforms. This could be used in Natural Language Processing and Machine Learning research for learning (hierarchical) classifications.</p>
        </sec>
        <sec>
          <title>2.2.5. Textual evidence</title>
          <p>When the muftis provide answers, they usually support their answers with textual evidence from the Qur’an, the Prophetic traditions, quotes by prominent scholars, or scientific research. These could be useful in understanding the sources governing Muslim thinking. They could also be used in machine learning and digital humanities research for intertextuality detection and text reuse.</p>
          <p>The resulting dataset, as shown in <xref ref-type="fig" rid="attachment-223595">Figure 2</xref> spans 17 years, from 1999 to 2015, with the number of views per fatwa available for February 2016 and July 2020, which could be used for both tracking fatwa popularity through time and regression analysis.</p>
          <fig id="attachment-223595">
            <object-id pub-id-type="publisher-id">223595</object-id>
            <label>Figure 2.</label>
            <caption>
              <title>Fatwa counts per year from 1999 to 2015</title>
            </caption>
            <graphic ns0:href="culturalanalytics_2024_9_3_116368_223595.png" />
          </fig>
        </sec>
      </sec>
    </sec>
    <sec>
      <title>3. Methodology</title>
      <p>To answer the first question: <italic>What, if any, are the differences between the questions posed by women and those asked by men?</italic>, we use topic modelling to find the themes of the questions raised by men and women and the odds ratios to determine which themes are more female and which tend to be more male. To answer the second question: <italic>Can we automatically classify the questions as either male or female?</italic>, we use text classification, mainly through automatic machine learning, and we also try to find which lexical items are more associated with men and women. For the third question: <italic>Can we predict fatwa popularity? and what makes a fatwa popular?</italic> , we use text regression using two popular algorithms: linear regression and random forests regression combined with topic modelling, which is used for explanation. As Arabic is a morphologically rich language, we use stemming throughout.</p>
      <sec>
        <title>3.1. Stemming</title>
        <p>A white space-delimited unit in Arabic is usually made up of zero or more prefixes, a lexical item, and zero or more suffixes. Common prefixes are conjunctions, prepositions, and the definite article while common suffixes are possessive and object pronouns. For example, the orthographic unit <italic>fsykfykhm</italic>, depicted in <xref ref-type="fig" rid="attachment-223593">Figure 3</xref>, is made up of the conjunction <italic>f</italic>, the future particle <italic>s</italic>, the verb <italic>ykfy</italic>, the direct object <italic>k</italic> and the indirect object <italic>hm</italic>. The verb itself is made up of the prefix <italic>n</italic> and the verb stem <italic>kfy</italic>. In most of what we do in this paper, we are mostly concerned with the stem.</p>
        <fig id="attachment-223593">
          <object-id pub-id-type="publisher-id">223593</object-id>
          <label>Figure 3.</label>
          <caption>
            <title>The morphological structure of the Arabic orthographic unit fsykfykhm. The path to the stem is marked by maroon-coloured arrows.</title>
          </caption>
          <graphic ns0:href="culturalanalytics_2024_9_3_116368_223593.png" />
        </fig>
        <p>For stemming to happen, the text is first run through morphological segmentation, which sets segment boundaries within words. The segments are then passed through a part of the speech tagger that assigns grammatical tags (e.g. NOUN, VERB, ADJECTIVE) to these segments. The stem is the main lexical unit in the word and must have a lexical tag (NOUN, VERB, ADJECTIVE). All other tags are discarded in the topic modeling experiments while in lexical experiments they are retained since they are useful for style differentiation. Stemming is achieved through the use of <italic>ArabicSOS</italic>, which is specialized in segmentation, orthographic standardization, and stemming of classical and religious Arabic <xref ref-type="bibr" rid="ref-303862">(Mohamed and Sayyed)</xref>. The stemming effect on the dataset is dramatic as the number of words in the questions is 4729279 with 168827 unique words. The number of segments is 7868382 with 48553 unique segments. The unique segments are hardly 29% of the original unique words.</p>
      </sec>
      <sec>
        <title>3.2. Topic modeling</title>
        <p>Topic modelling is a way of summarising text documents into collections of thematically related words called topics. Due to the nature of Arabic, there is a high type-token ratio as there are too many unique words, and running topic modelling with this is not very useful. For this reason, we use the stems as input to the topic modelling software. We use Mallet <xref ref-type="bibr" rid="ref-303864">(McCallum)</xref> for running the topic modelling and we are interested in two Mallet outputs: (1) the keys of the topics, or the lexical items constituting the topic, and (2) the probabilities associated with the topics, especially the probability of each topic in each document as we use these probabilities as input to the machine learning classifiers below. For consistency in topic modelling, we use the same 50 topics for all the experiments below.</p>
        <sec>
          <title>3.2.1. Classification</title>
          <p>For gender prediction, we use supervised machine learning <xref ref-type="bibr" rid="ref-303877">(Sarwar and Mohamed)</xref>. There are two main settings in these classification experiments:</p>
          <list list-type="bullet">
            <list-item>
              <p><italic>text classification</italic>, in which the dependent variable is the gender (male or female) and the predictive variables are made up of the textual content of the fatwa. The textual content could be word unigrams (each variable is a single word) or word bigrams (each variable is either a unigram or a bigram. For example, in the sentence: <italic>The sky is blue</italic>, the unigrams are [‘the’, ‘sky’, ‘is’, ‘blue’] and the bigrams are [‘the sky’, ‘sky is’, ‘is blue’]. The values of these variables will be the frequencies of these n-grams in each document. We do not use raw frequency, but we use <bold>TFIDF</bold> (Term Frequency Inverse Document Frequency), which computes the most distinctive lexical items for each document and is thus more conducive to accurate classification.</p>
            </list-item>
            <list-item>
              <p><italic>topic classification</italic>, in which we use the output of topic modelling for classification. When we run topic modelling, we obtain, for each document in the corpus, the probabilities of each topic being in that document. If we use 50 topics, then, for each document, we have 50 probabilities corresponding to the 50 topics. These probabilities can then be used to predict the gender of the document. Refer to the section below for results.</p>
            </list-item>
          </list>
        </sec>
      </sec>
      <sec>
        <title>3.3. Regression</title>
        <p>The purpose of regression is to predict, for each question, how popular the question may be. Just like with classification, we use both the textual content and the topics for regression. In both regression and classification, we use a variety of algorithms that will be detailed below. For both regression and classification, we use 80% of the data for training and 20% for testing. We mainly use the <italic>scikit-learn</italic> <xref ref-type="bibr" rid="ref-303865 ref-303876">(Pedregosa et al.; Mohamed and Sarwar)</xref> library, which has a wide range of algorithms.</p>
      </sec>
    </sec>
    <sec>
      <title>4. Discussion of Results and Implications</title>
      <p>In this section, we discuss the results of our experimental studies and their implications. Recall that, in this paper, we present a new massive dataset and the transformations it went through. In addition, we showcase the usefulness of the dataset in answering three pertinent questions, which add new insights to the existing knowledge.</p>
      <sec>
        <title>4.1. Answer to Question 1</title>
        <p>What, if any, are the differences between the questions posed by women and those asked by men?</p>
        <p>In order to find out what the differences may be between the questions asked by women and those asked by men, we use topic modelling. We then use Odds Ratios (see Tables <xref ref-type="table" rid="attachment-223620">3</xref> and <xref ref-type="table" rid="attachment-223621">4</xref>) to find out which topics are more correlated with men and which are more characteristic of women. <xref ref-type="table" rid="attachment-223618">Table 1</xref> lists the 50 topics produced by Mallet ordered by their odds ratios. Since female-centric questions were assigned the zero class, odds ratios less than one are more associated with women while odds ratios larger than one are more characteristic of male-centric questions.</p>
        <table-wrap id="attachment-223618">
          <object-id pub-id-type="publisher-id">223618</object-id>
          <label>Table 1.</label>
          <caption>
            <title>The 50 topics produced by Mallet topic modelling ranked by Odds Ratios from most female (topic 4) to most male (topic 47)</title>
          </caption>
          <table>
            <thead>
              <tr>
                <th>
                  <bold>
                    <bold>OR</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Topic</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Theme</bold>
                  </bold>
                </th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td>0.0</td>
                <td>4</td>
                <td>husband married family home divorce children problems refusal right</td>
              </tr>
              <tr>
                <td>0.0031</td>
                <td>36</td>
                <td>period menstruation</td>
              </tr>
              <tr>
                <td>0.0627</td>
                <td>49</td>
                <td>tradition problem talk people hope return</td>
              </tr>
              <tr>
                <td>0.0951</td>
                <td>0</td>
                <td>love heart feel person problem haram</td>
              </tr>
              <tr>
                <td>0.1069</td>
                <td>3</td>
                <td>soul feel life fear problem Quran pray depression</td>
              </tr>
              <tr>
                <td>0.1424</td>
                <td>29</td>
                <td>mother sister brother father uncle</td>
              </tr>
              <tr>
                <td>0.1707</td>
                <td>27</td>
                <td>women dress hijab hair religious beard</td>
              </tr>
              <tr>
                <td>0.2459</td>
                <td>22</td>
                <td>call tell family refuse to agree to give</td>
              </tr>
              <tr>
                <td>0.2718</td>
                <td>42</td>
                <td>marriage man woman engage proposal refuse family religious decent</td>
              </tr>
              <tr>
                <td>0.2727</td>
                <td>10</td>
                <td>obsessive_compulsive_disorder devil thoughts</td>
              </tr>
              <tr>
                <td>0.2765</td>
                <td>41</td>
                <td>Ramadan fasting expiation feeding_the_poor</td>
              </tr>
              <tr>
                <td>0.3315</td>
                <td>13</td>
                <td>hit talk anger in_front_of people problem treatment insult</td>
              </tr>
              <tr>
                <td>0.352</td>
                <td>18</td>
                <td>internet social friend love telephone</td>
              </tr>
              <tr>
                <td>0.3925</td>
                <td>20</td>
                <td>ablution urine secretions semen prayer</td>
              </tr>
              <tr>
                <td>0.4183</td>
                <td>19</td>
                <td>magic sleep Quran jinn evil_eye</td>
              </tr>
              <tr>
                <td>0.6063</td>
                <td>38</td>
                <td>money salary haram amount expenses help</td>
              </tr>
              <tr>
                <td>0.6888</td>
                <td>1</td>
                <td>prayer subsistence bless</td>
              </tr>
              <tr>
                <td>0.6954</td>
                <td>16</td>
                <td>illness doctor-patient hospital psychiatrist fetus</td>
              </tr>
              <tr>
                <td>0.6993</td>
                <td>14</td>
                <td>university school student test diploma graduation</td>
              </tr>
              <tr>
                <td>0.9859</td>
                <td>5</td>
                <td>oath vow Mushaf lie</td>
              </tr>
              <tr>
                <td>1.0329</td>
                <td>31</td>
                <td>wash water ablution foot hair</td>
              </tr>
              <tr>
                <td>1.3784</td>
                <td>39</td>
                <td>question thank_you answer bless_you</td>
              </tr>
              <tr>
                <td>1.383</td>
                <td>17</td>
                <td>life problem big family try leave</td>
              </tr>
              <tr>
                <td>1.4215</td>
                <td>12</td>
                <td>time years months week hour</td>
              </tr>
              <tr>
                <td>1.4435</td>
                <td>24</td>
                <td>please answer question</td>
              </tr>
              <tr>
                <td>1.4598</td>
                <td>33</td>
                <td>clothes water dirty wash urine bathroom floor</td>
              </tr>
              <tr>
                <td>1.7441</td>
                <td>28</td>
                <td>marriage certificate dowry agent conditions</td>
              </tr>
              <tr>
                <td>1.8853</td>
                <td>43</td>
                <td>reading Quran prayer miss</td>
              </tr>
              <tr>
                <td>1.9005</td>
                <td>44</td>
                <td>divorce anger return home dispute</td>
              </tr>
              <tr>
                <td>2.1441</td>
                <td>26</td>
                <td>home husband visit family</td>
              </tr>
              <tr>
                <td>2.2113</td>
                <td>37</td>
                <td>haram software game song music film draw tv</td>
              </tr>
              <tr>
                <td>2.3708</td>
                <td>45</td>
                <td>unbelief religion unbeliever insult mockery heart repent</td>
              </tr>
              <tr>
                <td>2.4811</td>
                <td>7</td>
                <td>time prayer athan masjid congregation dawn noon sunset</td>
              </tr>
              <tr>
                <td>2.9943</td>
                <td>8</td>
                <td>food drink wine alcohol pork restaurant</td>
              </tr>
              <tr>
                <td>3.1749</td>
                <td>21</td>
                <td>pilgrimage</td>
              </tr>
              <tr>
                <td>3.4045</td>
                <td>34</td>
                <td>zakat alms</td>
              </tr>
              <tr>
                <td>3.4308</td>
                <td>2</td>
                <td>inheritance estate</td>
              </tr>
              <tr>
                <td>4.1313</td>
                <td>9</td>
                <td>flat land house rent property</td>
              </tr>
              <tr>
                <td>4.4059</td>
                <td>35</td>
                <td>married children another_marriage divorce daughter</td>
              </tr>
              <tr>
                <td>4.7719</td>
                <td>46</td>
                <td>paradise hell judgment chastisement</td>
              </tr>
              <tr>
                <td>5.5224</td>
                <td>30</td>
                <td>country travel city egypt saudi work holiday</td>
              </tr>
              <tr>
                <td>5.9501</td>
                <td>40</td>
                <td>repent masturbation sex lust sin forgive haram</td>
              </tr>
              <tr>
                <td>7.2141</td>
                <td>32</td>
                <td>opinion scholars question disagreement law evidence</td>
              </tr>
              <tr>
                <td>7.484</td>
                <td>15</td>
                <td>car accident court rights compensation claim</td>
              </tr>
              <tr>
                <td>7.9649</td>
                <td>11</td>
                <td>company employee service office salary government</td>
              </tr>
              <tr>
                <td>8.4999</td>
                <td>6</td>
                <td>Islam religion young_man live Christian france language foreign</td>
              </tr>
              <tr>
                <td>12.4295</td>
                <td>25</td>
                <td>sale company buy price shop dollar proceeds trade commodity commission</td>
              </tr>
              <tr>
                <td>14.3603</td>
                <td>23</td>
                <td>verse Quran book exegesis story chapter</td>
              </tr>
              <tr>
                <td>19.339</td>
                <td>48</td>
                <td>bank loan usury installment interest haram</td>
              </tr>
              <tr>
                <td>49.4263</td>
                <td>47</td>
                <td>masjid people congregation innovation sermon heresy</td>
              </tr>
              <tr>
                <td />
                <td />
                <td />
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>We can see from the table that there are clear differences between the questions asked by women and those raised by men. The top concern for women in this dataset is family matters (children, marriage, divorce). We can also see religious ritual questions concerning menstruation and fasting as menstruating women do not have to observe the obligatory fasting in the lunar Islamic month of Ramadan. The current study focuses on a convenience sample of Muslim men and women who are mostly in the Middle East, and the results may thus be hard to generalize. The interest in the family may echo women’s interests in other cultures as well. A study that examined a corpus of happiness moments found out that women and men have different sources of well-being <xref ref-type="bibr" rid="ref-303863">(Mohamed and Mostafa)</xref>, as men’s top source of happiness was related to games and sports while women’s was more related to family and shopping.</p>
        <p>While differences between Men and Women may not be news in general, the details of these differences within the religious domain are of interest. For example, Men seem to be interested in totally different things, such as ensuring that the job or work they have is conformant to the rules of Islam, working and residing in other countries, especially Western countries, and obtaining their residences and citizenship, marriage and divorce consequences, the differences among scholars of Islam, life, death, and the hereafter, the relationships between Muslims and non-Muslims, banking and usury, especially concerning getting a bank loan and whether this is permissible in Islam, trade and transactions and rules on how to make profit and the permissibility of commissions, and questions about praying in mosques vs. praying at home. For example, a representative fatwa for topic 48, the second most male topic reads:</p>
        <disp-quote>
          <p>I’m a 32-year-old young man. I suffer from weak memory (I’m very stupid), and I have failed to hold any job due to this memory problem. Is this a license for me to keep my money in an Islamic bank, knowing that the three Islamic banks in Egypt (Faisal, Al-Baraka and Abu Dhabi) deal in treasury bills? Thank you!<xref ref-type="fn" rid="fn4">4</xref></p>
        </disp-quote>
        <p>While a representative fatwa for topic 4, the most female topic reads:</p>
        <disp-quote>
          <p>My husband is a drug addict, and we’ve had three children together, the youngest being six years old. He wants more kids, but I do not feel like it. I do not use contraceptives, but his addiction stands in the way. Am I committing a sin by not seeking to get pregnant? Please reply quickly. I do not know who else to ask. It’s been months, and I cannot find a solution.<xref ref-type="fn" rid="fn5">5</xref></p>
        </disp-quote>
        <p>The use of topic models also gives us the chance to examine which topics go together. <xref ref-type="fig" rid="attachment-223592">Figure 5</xref> shows the relationships between the topics. To create this network, we only considered the most probable topic pairs. To illustrate, let’s examine topic 4, the most common topic in the questions asked by women. When we have topic 4 as the most probable theme in a certain document, topics 13 and 26 are the most probable second topics in that document, so we connect them together. For topic 4, this results in the graph shown in <xref ref-type="fig" rid="attachment-223596">Figure 4</xref>. What this indicates is that if a question involves matters of problems with their husband, family, children, and divorce, then it is also very likely to mention that the husband hits the questioner, insults her in front of other people, is mad at her and treats her badly (topic 13), and that this may come in the context of a discussion about visiting family (Topic 26). Topic 4 is also very likely to be invoked in the context of topic 44 (divorce anger return home dispute), topic 38 (money salary haram amount expenses help), topic 49 (tradition problem talk people hope to return), and topic 17 (life problem big family tries to leave).</p>
        <fig id="attachment-223596">
          <object-id pub-id-type="publisher-id">223596</object-id>
          <label>Figure 4.</label>
          <caption>
            <title>The topics connecting to and from Topic 4</title>
          </caption>
          <graphic ns0:href="culturalanalytics_2024_9_3_116368_223596.png" />
        </fig>
        <fig id="attachment-223592">
          <object-id pub-id-type="publisher-id">223592</object-id>
          <label>Figure 5.</label>
          <caption>
            <title>A network of the topics discovered by Mallet. Only the second most probable topic per document is connected to the main topic</title>
          </caption>
          <graphic ns0:href="culturalanalytics_2024_9_3_116368_223592.png" />
        </fig>
      </sec>
      <sec>
        <title>4.2. Answer to Question 2: Predicting Gender</title>
        <p>Can we automatically classify the questions as either male or female?</p>
        <p>In our specific case, we want to predict the gender of the author of the question even though we may use the answer as a means to this prediction. If the answer is addressed to a female, based on the morphology, then it is certain that the question was asked by a female. If the addressee is male then the question was asked by a male. In the cases where the morphology does not help identify the gender of the questioner, we do not use that specific Fatwa in training the machine learning classifier.</p>
        <p>We run several experiments to predict gender. In all cases, we use the Term Frequency Inverse Document Frequency (TF-IDF) for feature extraction and an n-gram range of (1, 2), which means we use both individual words and two consecutive words as features. we use three algorithms in this task: Logistic Regression, Random Forests, and Support Vector Machines. As far as the input text is concerned, we vary the input to be either the question alone, the answer alone, or a combination of the answer and the question.</p>
        <table-wrap id="attachment-223619">
          <object-id pub-id-type="publisher-id">223619</object-id>
          <label>Table 2.</label>
          <caption>
            <title>Predicting gender using the questions, answers, or a combination thereof.</title>
          </caption>
          <table>
            <thead>
              <tr>
                <th>
                  <bold>
                    <bold>Input</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>algorithm</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Precision</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Recall</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>F1</bold>
                  </bold>
                </th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td>
                  <bold>Questions</bold>
                </td>
                <td>Logistic Regression</td>
                <td>0.84</td>
                <td>0.83</td>
                <td>0.83</td>
              </tr>
              <tr>
                <td />
                <td>Random Forests</td>
                <td>0.81</td>
                <td>0.80</td>
                <td>0.79</td>
              </tr>
              <tr>
                <td />
                <td>Support Vector Machines</td>
                <td>0.82</td>
                <td>0.82</td>
                <td>0.82</td>
              </tr>
              <tr>
                <td />
                <td>Logistic Regression, Topic models</td>
                <td>0.71</td>
                <td>0.68</td>
                <td>0.69</td>
              </tr>
              <tr>
                <td>
                  <bold>Answers</bold>
                </td>
                <td>Logistic Regression</td>
                <td>0.94</td>
                <td>0.94</td>
                <td>0.94</td>
              </tr>
              <tr>
                <td />
                <td>Random Forests</td>
                <td>0.92</td>
                <td>0.92</td>
                <td>0.92</td>
              </tr>
              <tr>
                <td />
                <td>Support Vector Machines</td>
                <td>0.96</td>
                <td>0.96</td>
                <td>0.96</td>
              </tr>
              <tr>
                <td>
                  <bold>Questions + Answers</bold>
                </td>
                <td>Logistic Regression</td>
                <td>0.94</td>
                <td>0.93</td>
                <td>0.93</td>
              </tr>
              <tr>
                <td />
                <td>Random Forests</td>
                <td>0.92</td>
                <td>0.91</td>
                <td>0.91</td>
              </tr>
              <tr>
                <td />
                <td>Support Vector Machines</td>
                <td>
                  <bold>0.98</bold>
                </td>
                <td>
                  <bold>0.98</bold>
                </td>
                <td>
                  <bold>0.98</bold>
                </td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p><xref ref-type="table" rid="attachment-223619">Table 2</xref> lists the results of these experiments. Using both the questions and answers as input with the Support Vector Machines algorithms yields the best results with precision, recall, and an F1 score of 0.98, which is a very high number, and means that we can predict the gender of the one who asked the question with 98% score. Using topic probabilities in gender classification did not yield good results, compared to lexical input, as the best result was an <ns2:math display="inline" id="mml-equation-5e108118-f7ea-4c6b-a4ca-08fe8ef0e7a6"><ns2:mi>F</ns2:mi><ns2:mn>1</ns2:mn></ns2:math> score of 0.68 using Logistic Regression.</p>
        <p>While logistic regression is not the best-performing prediction algorithm, one major advantage of its use is its high interpretability. One can use the coefficients of the features (i.e. the lexical items) to see which words are more likely to be used by males and which are more often used by females. This also applies to the words and phrases more commonly used by the muftis when the questioner is male or female. This also helps with the possibility of generalization. We have extracted 40,000 out of 170,000 fatwas based on morphological clues. If these morphological clues distinguish male from female fatwas, then the possibility of an accurate classifier that generalizes beyond this small corpus is limited. If, on the other hand, the features are not limited to the morphological cues then we can use the classifier to predict much more than the small dataset. Examining the top 100 male lexical features and their equivalent female ones shows that the classifier is capable of generalization. The top 10 male features are not morphological.</p>
        <table-wrap id="attachment-223620">
          <object-id pub-id-type="publisher-id">223620</object-id>
          <label>Table 3.</label>
          <caption>
            <title>Odds ratios for the most male lexical items. Male words are mostly generic while female words are predominately morphologically feminine.</title>
          </caption>
          <table>
            <thead>
              <tr>
                <th>
                  <bold>
                    <bold>Arabic</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>English</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>OR</bold>
                  </bold>
                </th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td>إلا بإذن</td>
                <td>except with permission</td>
                <td>179</td>
              </tr>
              <tr>
                <td>ملزم</td>
                <td>obligatory, committed</td>
                <td>178</td>
              </tr>
              <tr>
                <td>لا تعود</td>
                <td>does not return</td>
                <td>176</td>
              </tr>
              <tr>
                <td>مقبل على</td>
                <td>about to</td>
                <td>168</td>
              </tr>
              <tr>
                <td>مع بعض</td>
                <td>with some, together</td>
                <td>165</td>
              </tr>
              <tr>
                <td>تنصح</td>
                <td>advise</td>
                <td>164</td>
              </tr>
              <tr>
                <td>يجيء أحدهم</td>
                <td>one of them comes</td>
                <td>163</td>
              </tr>
              <tr>
                <td>المالية</td>
                <td>the financial</td>
                <td>155</td>
              </tr>
              <tr>
                <td>عملي هذا</td>
                <td>this work of mine</td>
                <td>148</td>
              </tr>
              <tr>
                <td>هل أنا</td>
                <td>Am I</td>
                <td>147</td>
              </tr>
              <tr>
                <td>وراجع الفتاوى</td>
                <td>And review the fatwas</td>
                <td>146</td>
              </tr>
              <tr>
                <td>الباطل</td>
                <td>falsehood</td>
                <td>145</td>
              </tr>
              <tr>
                <td>وأكثر من</td>
                <td>And more than</td>
                <td>144</td>
              </tr>
              <tr>
                <td>لخطبتها</td>
                <td>For her engagement</td>
                <td>141</td>
              </tr>
              <tr>
                <td>وخطيبتي</td>
                <td>And my fiancee</td>
                <td>141</td>
              </tr>
              <tr>
                <td>كنت تقوم</td>
                <td>you used to do</td>
                <td>139</td>
              </tr>
              <tr>
                <td>أنا متأكد</td>
                <td>I am certain</td>
                <td>134</td>
              </tr>
              <tr>
                <td>من رأسه</td>
                <td>of his own accord</td>
                <td>131</td>
              </tr>
              <tr>
                <td>للفتوى</td>
                <td>for the fatwa</td>
                <td>118</td>
              </tr>
              <tr>
                <td>أن تبادر</td>
                <td>to take the initiative</td>
                <td>117</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <table-wrap id="attachment-223621">
          <object-id pub-id-type="publisher-id">223621</object-id>
          <label>Table 4.</label>
          <caption>
            <title>Odds ratios for the most frequent female lexical items. Male words are mostly generic while female words are predominately morphologically feminine.</title>
          </caption>
          <table>
            <thead>
              <tr>
                <th>
                  <bold>
                    <bold>Arabic</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>English</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>OR</bold>
                  </bold>
                </th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td>وانظري</td>
                <td>and look</td>
                <td>000</td>
              </tr>
              <tr>
                <td>وراجعي</td>
                <td>and review</td>
                <td>000</td>
              </tr>
              <tr>
                <td>انظري</td>
                <td>look</td>
                <td>000</td>
              </tr>
              <tr>
                <td>أنا فتاة</td>
                <td>I’m a young woman</td>
                <td>000</td>
              </tr>
              <tr>
                <td>واعلمي</td>
                <td>and know that</td>
                <td>000</td>
              </tr>
              <tr>
                <td>السائلة</td>
                <td>the questioner</td>
                <td>000</td>
              </tr>
              <tr>
                <td>وتكفيك</td>
                <td>and it will suffice you</td>
                <td>000</td>
              </tr>
              <tr>
                <td>أنا أم</td>
                <td>I’m a mother</td>
                <td>000</td>
              </tr>
              <tr>
                <td>تعديه</td>
                <td>his aggression</td>
                <td>000</td>
              </tr>
              <tr>
                <td>تحسين</td>
                <td>you feel</td>
                <td>000</td>
              </tr>
              <tr>
                <td>تأتيك</td>
                <td>it comes to you</td>
                <td>000</td>
              </tr>
              <tr>
                <td>فاعلمي</td>
                <td>then know</td>
                <td>000</td>
              </tr>
              <tr>
                <td>زوجي</td>
                <td>my husband</td>
                <td>000</td>
              </tr>
              <tr>
                <td>تراضي</td>
                <td>make (someone) happy</td>
                <td>000</td>
              </tr>
              <tr>
                <td>تعلمين</td>
                <td>you know</td>
                <td>000</td>
              </tr>
              <tr>
                <td>أنا صاحبة</td>
                <td>I am the one with</td>
                <td>000</td>
              </tr>
              <tr>
                <td>وزوجي</td>
                <td>and my husband</td>
                <td>000</td>
              </tr>
              <tr>
                <td>ترضيه</td>
                <td>you make him happy</td>
                <td>000</td>
              </tr>
              <tr>
                <td>تحسين الوضع</td>
                <td>making things better</td>
                <td>000</td>
              </tr>
              <tr>
                <td>أنا امرأة</td>
                <td>I’m a woman</td>
                <td>000</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
      </sec>
      <sec>
        <title>4.3. Answer to question 3: fatwa popularity</title>
        <p><italic>Can we predict the popularity of a fatwa based on its content?</italic> To answer this question, we perform text regression where the predictor variables are the lexical items of the questions, and the response variable is the number of views each question receives.</p>
        <p>To perform the regression, we convert the text into vectors. We do so through TFIDF as explained above. When we compute the number of views, we do so based on the number of views per day. Since the fatwas appeared online on different dates, it may not be fair to compare the raw views. This is the reason we divide the number of views by the number of days a fatwa is online, so we may obtain daily views. The daily views, rather than the raw ones, are what we want to predict. As shown in <xref ref-type="fig" rid="attachment-223595">Figure 2</xref>, the fatwa dates range between 1999 and 2015, but the views were recorded in February 2016 and July 2020. Since dates of publication are available for each fatwa, we subtract the publication date from 8/7/2020, the days the fatwa was downloaded, to obtain the number of days the fatwa was online. We then divide the number of views by the number of days to obtain the views per day. The views per day is the number we predict using the various regression models. We use different algorithms and compare the results to see which one best predicts the daily views per question. We also examine the feature ranking for features that affect the regression model. In terms of evaluation, we evaluate the regression model based on two things: (1) the <ns2:math display="inline" id="mml-equation-447f6f1c-0737-40f4-9a59-6d674bf87043"><ns2:msup><ns2:mi>R</ns2:mi><ns2:mn>2</ns2:mn></ns2:msup></ns2:math> value and (2) how good it is at predicting the views in a test set based on the Mean Absolute Error (MAE). The <ns2:math display="inline" id="mml-equation-b9eb2745-3d0a-4112-9cb1-02557de40c7e"><ns2:msup><ns2:mi>R</ns2:mi><ns2:mn>2</ns2:mn></ns2:msup></ns2:math> is a measure of goodness-of-fit and is usually interpreted as how much variation is explained by the independent variables. An <ns2:math display="inline" id="mml-equation-c5ba59b0-8d82-4063-bdfc-abf3b850bd20"><ns2:msup><ns2:mi>R</ns2:mi><ns2:mn>2</ns2:mn></ns2:msup></ns2:math> of 0.6, for example, indicates that 60% of the variation in the dependent variable can be explained by the independent variables. The <ns2:math display="inline" id="mml-equation-678667e6-52b3-4435-a808-22dfe59907f1"><ns2:msup><ns2:mi>R</ns2:mi><ns2:mn>2</ns2:mn></ns2:msup></ns2:math> is known to be sensitive to the number of independent variables as it increases proportionally with the increase in the number of independent variables. This is a problem in text regression as the number of independent variables is very large, which may lead to inflated <ns2:math display="inline" id="mml-equation-75d51fec-6645-4cab-a1a8-afd1e0c8a0e0"><ns2:msup><ns2:mi>R</ns2:mi><ns2:mn>2</ns2:mn></ns2:msup></ns2:math> values. For this reason, we also use prediction to evaluate the regression model. The MAE is a measure of the average distance between the actual values and the values predicted by the model and can thus be useful in comparing various models.</p>
        <table-wrap id="attachment-223622">
          <object-id pub-id-type="publisher-id">223622</object-id>
          <label>Table 5.</label>
          <caption>
            <title>Regression results where the number of daily views is the dependent variable. Experiments report on using whole words vs segments, with unigrams or unigrams and bigrams, with the algorithm being either linear regression or Random Forests regression</title>
          </caption>
          <table>
            <thead>
              <tr>
                <th>
                  <bold>
                    <bold>Algorithm</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Features</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>
                      <ns2:math display="inline" id="mml-equation-2c32aeb1-4e20-4fdc-a2bf-1521132e22e2">
                        <ns2:msup>
                          <ns2:mi>R</ns2:mi>
                          <ns2:mn>2</ns2:mn>
                        </ns2:msup>
                      </ns2:math>
                    </bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>MAE</bold>
                  </bold>
                </th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td>
                  <bold>Linear regression (2016)</bold>
                </td>
                <td>word unigrams + bigrams</td>
                <td>0.99953</td>
                <td>5.62</td>
              </tr>
              <tr>
                <td />
                <td>word unigrams</td>
                <td>0.99947</td>
                <td>16.42</td>
              </tr>
              <tr>
                <td />
                <td>segment unigrams</td>
                <td>0.8675</td>
                <td>52.34</td>
              </tr>
              <tr>
                <td />
                <td>segment unigrams + bigrams</td>
                <td>0.9995</td>
                <td>7.83</td>
              </tr>
              <tr>
                <td>
                  <bold>Random Forests (2016)</bold>
                </td>
                <td>word unigrams + bigrams</td>
                <td>0.8647</td>
                <td>3.49</td>
              </tr>
              <tr>
                <td />
                <td>word unigrams</td>
                <td>0.8516</td>
                <td>3.733</td>
              </tr>
              <tr>
                <td />
                <td>segment unigrams</td>
                <td>0.8529</td>
                <td>3.82</td>
              </tr>
              <tr>
                <td />
                <td>segment unigrams + bigrams</td>
                <td>0.863</td>
                <td>3.61</td>
              </tr>
              <tr>
                <td>
                  <bold>Linear Regression (2020)</bold>
                </td>
                <td>word unigrams + bigrams</td>
                <td>0.999</td>
                <td>5.584</td>
              </tr>
              <tr>
                <td />
                <td>word unigrams</td>
                <td>0.999</td>
                <td>15.58</td>
              </tr>
              <tr>
                <td>
                  <bold>Random Forests (2020)</bold>
                </td>
                <td>word unigrams</td>
                <td>0.85</td>
                <td>2.9</td>
              </tr>
              <tr>
                <td />
                <td>unigrams + bigrams</td>
                <td>0.86</td>
                <td>2.67</td>
              </tr>
              <tr>
                <td />
                <td>topic probabilities</td>
                <td>0.87</td>
                <td>3.1</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>The results of regression, as shown in <xref ref-type="table" rid="attachment-223622">Table 5</xref> indicate that when we use a combination of word unigrams and bigrams, both linear regression and Random Forests regression do a good job predicting views per day given the question as input. Linear regression has an <ns2:math display="inline" id="mml-equation-042b2265-ef20-4963-8f0a-3c6bc1b460cd"><ns2:msup><ns2:mi>R</ns2:mi><ns2:mn>2</ns2:mn></ns2:msup></ns2:math> of 0.99953, which is near perfect but may also indicate over-fitting for the training data. The mean absolute error is 5.62 on the test set. This is a good MAE given that the standard deviation of the test set values is 9.55. Random Forests do an even better job at prediction, probably due to their non-over-fitting, with an MAE of 3.49. This indicates that predicting the views per day based on the textual content is feasible. The experiments also show that segmentation does not help as every whole-word experiment yields better results than its segmented counterpart. We attribute this to the availability of data. With our large data set, there is no need for segmentation, whose main purpose is to combat data sparseness.</p>
        <p>The Random Forests algorithm is superior to the linear regression one in prediction. One other nice facet of RF is that it also produces a list of the top features used in regression. Perhaps one should examine what features in the input are responsible for these views. For this, we use the most important features (lexical n-grams) as produced by the RF algorithm. An examination of the top 100 features in the unigram +bigram model reveals that the most important concepts that trigger views are related to sex, cleanliness, and prayer. For example, the top 10 grams can be translated into: <italic>through the condom</italic>, <italic>is foreplay permissible</italic>, <italic>caressing the</italic>, <italic>anal</italic>, <italic>condom</italic>, <italic>inserting part of</italic>, <italic>intercourse</italic>, <italic>prayer</italic>, <italic>at the right time</italic>, <italic>masturbation</italic>, and <italic>orgasm</italic>.</p>
        <p>Perhaps a more effective way for finding out popularity triggers is to find what themes are more viewed. For this purpose, we use the probabilities of the topics produced by topic modelling as features and the views per day as the dependent variable, to rank the topics in terms of their importance to the Random Forest regression algorithm. This ranking shows how much each topic contributes to the number of views per day. <xref ref-type="table" rid="attachment-223623">Table 6</xref> lists the five most important topics, and we can see that questions about marriage and engagement top the list. A common theme here is the problem of the family refusing the prospective fiance(e), and the question is usually what one should do in such a case. This is followed by the question of (premarital and extra-marital) sexual activity and how one should repent. We then have topics on cleanliness and prayer, fasting the month of Ramadan, and what one should do if they miss days of required fasting, and then we have financial questions most of which are about whether some financial transaction is halal (permissible in Islam) or haram (impermissible according to Islamic law). The prominence of sexual topics may be related to the fact that Islam is very restrictive in terms of premarital sex <xref ref-type="bibr" rid="ref-303866">(Adamczyk and Hayes)</xref>.</p>
        <table-wrap id="attachment-223623">
          <object-id pub-id-type="publisher-id">223623</object-id>
          <label>Table 6.</label>
          <caption>
            <title>The topics contributing the most to fatwa popularity.</title>
          </caption>
          <table>
            <thead>
              <tr>
                <th>
                  <bold>
                    <bold>Rank</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Topic</bold>
                  </bold>
                </th>
                <th>
                  <bold>
                    <bold>Keywords</bold>
                  </bold>
                </th>
              </tr>
            </thead>
            <tbody>
              <tr>
                <td>1</td>
                <td>42</td>
                <td>marriage, man, woman, engage, proposal, refuse, family, religious, decent</td>
              </tr>
              <tr>
                <td>2</td>
                <td>40</td>
                <td>repent, masturbation, sex, lust, sin, forgive, haram</td>
              </tr>
              <tr>
                <td>3</td>
                <td>20</td>
                <td>ablution, urine, secretions, semen, prayer</td>
              </tr>
              <tr>
                <td>4</td>
                <td>41</td>
                <td>Ramadan, fasting, expiation, feeding (the poor)</td>
              </tr>
              <tr>
                <td>5</td>
                <td>25</td>
                <td>sale, company, buy, price, shop, dollar, proceeds, trade, commodity, commission</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
      </sec>
    </sec>
    <sec>
      <title>5. Conclusion</title>
      <p>In this paper, we have introduced a versatile dataset with textual content and metadata that make it suitable for research in computational linguistics, Islamic Studies, and Computational Social Science. We have also shown use case examples of the dataset about questions of thematic analysis, text classification, and text regression. The dataset, with the use cases provided, has great potential for further formal and content-based research.</p>
      <p>We found that there are clear differences between the questions asked by women and those raised by men. The top concern for women in this dataset is family matters (children, marriage, divorce). We can also see religious ritual questions concerning menstruation and fasting as menstruating women do not have to observe the obligatory fasting in the lunar Islamic month of Ramadan. We also examined the top 100 male lexical features and their equivalent female ones to show that the classifier is capable of generalization. The top 10 male features are not morphological. Our experimental findings demonstrate a 98% accuracy in gender prediction, precise predictions of popularity with minimal margin for error, and the identification of topics and their associations that are more inclined towards either men or women.</p>
      <p>In the future, we are planning to investigate further questions including: (1) How do the muftis formulate their answers, and do they speak differently to men and women? (2) What is the authority frame that the muftis use to convince the reader of the soundness of their answers? and (3) What are the linguistic differences between male and female-centric questions? The dataset is ripe for investigation and we hope other researchers will find it useful for their research.</p>
      <p>Data repository: <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.7910/DVN/ASAJ4Y">https://doi.org/10.7910/DVN/ASAJ4Y</ext-link></p>
    </sec>
  </body>
  <back>
    <fn-group>
      <fn id="fn1">
        <label>1</label>
        <p>The numbers were obtained on 15 April 2021, and they reflect the immediate period preceding this date. The numbers get updated regularly, so the readers may not get the exact numbers reported here.</p>
      </fn>
      <fn id="fn2">
        <label>2</label>
        <p>
          <ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/fatwa/385525/">https://www.islamweb.net/ar/fatwa/385525/</ext-link>
        </p>
      </fn>
      <fn id="fn3">
        <label>3</label>
        <p>
          <ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/fatwa/430699/">https://www.islamweb.net/ar/fatwa/430699/</ext-link>
        </p>
      </fn>
      <fn id="fn4">
        <label>4</label>
        <p>
          <ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/fatwa/294250">https://www.islamweb.net/ar/fatwa/294250</ext-link>
        </p>
      </fn>
      <fn id="fn5">
        <label>5</label>
        <p>
          <ext-link ext-link-type="uri" ns0:href="https://www.islamweb.net/ar/fatwa/138500">https://www.islamweb.net/ar/fatwa/138500</ext-link>
          <xref ref-type="bibr" rid="ref-303865" />
          <xref ref-type="bibr" rid="ref-303848" />
          <xref ref-type="bibr" rid="ref-303852" />
          <xref ref-type="bibr" rid="ref-303855" />
          <xref ref-type="bibr" rid="ref-303845" />
          <xref ref-type="bibr" rid="ref-303861" />
          <xref ref-type="bibr" rid="ref-303851" />
          <xref ref-type="bibr" rid="ref-303868" />
          <xref ref-type="bibr" rid="ref-303863" />
          <xref ref-type="bibr" rid="ref-303877" />
          <xref ref-type="bibr" rid="ref-303876" />
          <xref ref-type="bibr" rid="ref-303843" />
          <xref ref-type="bibr" rid="ref-303844" />
          <xref ref-type="bibr" rid="ref-303842" />
          <xref ref-type="bibr" rid="ref-303872" />
          <xref ref-type="bibr" rid="ref-303866" />
          <xref ref-type="bibr" rid="ref-303850" />
          <xref ref-type="bibr" rid="ref-303849" />
          <xref ref-type="bibr" rid="ref-303873" />
          <xref ref-type="bibr" rid="ref-303847" />
          <xref ref-type="bibr" rid="ref-303854" />
          <xref ref-type="bibr" rid="ref-303862" />
          <xref ref-type="bibr" rid="ref-303859" />
          <xref ref-type="bibr" rid="ref-303856" />
          <xref ref-type="bibr" rid="ref-303864" />
          <xref ref-type="bibr" rid="ref-303846" />
          <xref ref-type="bibr" rid="ref-303858" />
          <xref ref-type="bibr" rid="ref-303860" />
          <xref ref-type="bibr" rid="ref-303867" />
          <xref ref-type="bibr" rid="ref-303871" />
          <xref ref-type="bibr" rid="ref-303874" />
          <xref ref-type="bibr" rid="ref-303853" />
          <xref ref-type="bibr" rid="ref-303857" />
          <xref ref-type="bibr" rid="ref-303870" />
          <xref ref-type="bibr" rid="ref-303869" />
          <xref ref-type="bibr" rid="ref-303875" />
        </p>
      </fn>
    </fn-group>
    <ref-list>
      <ref id="ref-303869">
        <element-citation publication-type="article-journal">
          <article-title>Veiled sentiments: honor</article-title>
          <source>Modesty, and Poetry in a Bedouin Society (Berkeley: University of California Press, in press) Abu-LughodVeiled Sentiments: Honor, Modesty, and Poetry in a Bedouin Society</source>
          <person-group person-group-type="author">
            <name>
              <surname>Abu-Lughod</surname>
              <given-names>Lila</given-names>
            </name>
          </person-group>
          <date>
            <year>1986</year>
          </date>
        </element-citation>
      </ref>
      <ref id="ref-303854">
        <element-citation publication-type="article-journal">
          <article-title>The complex role of the community in the determination of well-being and hope among divorced muslim women</article-title>
          <source>Journal of Community Psychology</source>
          <person-group person-group-type="author">
            <name>
              <surname>Abu-Ras</surname>
              <given-names>Ruba</given-names>
            </name>
            <name>
              <surname>Itzhaki-Braun</surname>
              <given-names>Yael</given-names>
            </name>
          </person-group>
          <publisher-name>Wiley Online Library</publisher-name>
          <date>
            <year>2023</year>
          </date>
        </element-citation>
      </ref>
      <ref id="ref-303870">
        <element-citation publication-type="article-journal">
          <article-title>Hierarchies, jobs, bodies: A theory of gendered organizations</article-title>
          <source>Gender &amp; Society</source>
          <person-group person-group-type="author">
            <name>
              <surname>Acker</surname>
              <given-names>Joan</given-names>
            </name>
          </person-group>
          <publisher-name>SAGE Publications</publisher-name>
          <date>
            <month>6</month>
            <year>1990</year>
          </date>
          <volume>4</volume>
          <issue>2</issue>
          <fpage>139</fpage>
          <lpage>158</lpage>
          <issn>0891-2432</issn>
          <pub-id pub-id-type="doi">10.1177/089124390004002002</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1177/089124390004002002">https://doi.org/10.1177/089124390004002002</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303866">
        <element-citation publication-type="article-journal">
          <article-title>Religion and sexual behaviors: Understanding the influence of islamic cultures and religious affiliation for explaining sex outside of marriage</article-title>
          <source>American Sociological Review</source>
          <person-group person-group-type="author">
            <name>
              <surname>Adamczyk</surname>
              <given-names>Amy</given-names>
            </name>
            <name>
              <surname>Hayes</surname>
              <given-names>Brittany E.</given-names>
            </name>
          </person-group>
          <date>
            <year>2012</year>
          </date>
          <volume>77</volume>
          <issue>5</issue>
          <fpage>723</fpage>
          <lpage>746</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303847">
        <element-citation publication-type="article-journal">
          <article-title>Online fatwas in pakistan using social networking platforms</article-title>
          <source>Ulumuna</source>
          <person-group person-group-type="author">
            <name>
              <surname>Adel</surname>
              <given-names>Samiullah</given-names>
            </name>
            <name>
              <surname>Numan</surname>
              <given-names>Muhammad</given-names>
            </name>
          </person-group>
          <publisher-name>State Islamic University (UIN) Mataram</publisher-name>
          <date>
            <day>17</day>
            <month>6</month>
            <year>2023</year>
          </date>
          <volume>27</volume>
          <issue>1</issue>
          <fpage>201</fpage>
          <lpage>226</lpage>
          <issn>2775-2453</issn>
          <pub-id pub-id-type="doi">10.20414/ujis.v27i1.689</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.20414/ujis.v27i1.689">https://doi.org/10.20414/ujis.v27i1.689</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303857">
        <element-citation publication-type="article-journal">
          <article-title>Ethics, tradition, authority: Toward an anthropology of the fatwa</article-title>
          <source>American Ethnologist</source>
          <person-group person-group-type="author">
            <name>
              <surname>Agrama</surname>
              <given-names>Hussein Ali</given-names>
            </name>
          </person-group>
          <publisher-name>Wiley</publisher-name>
          <date>
            <day>28</day>
            <month>1</month>
            <year>2010</year>
          </date>
          <volume>37</volume>
          <issue>1</issue>
          <fpage>2</fpage>
          <lpage>18</lpage>
          <issn>0094-0496</issn>
          <pub-id pub-id-type="doi">10.1111/j.1548-1425.2010.01238.x</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1111/j.1548-1425.2010.01238.x">https://doi.org/10.1111/j.1548-1425.2010.01238.x</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303873">
        <element-citation publication-type="article-journal">
          <article-title>Muslims, multiculturalism and the question of the silent majority</article-title>
          <source>Journal of Muslim Minority Affairs</source>
          <person-group person-group-type="author">
            <name>
              <surname>Akbarzadeh</surname>
              <given-names>Shahram</given-names>
            </name>
            <name>
              <surname>Roose</surname>
              <given-names>Joshua M.</given-names>
            </name>
          </person-group>
          <publisher-name>Informa UK Limited</publisher-name>
          <date>
            <month>9</month>
            <year>2011</year>
          </date>
          <volume>31</volume>
          <issue>3</issue>
          <fpage>309</fpage>
          <lpage>325</lpage>
          <issn>1360-2004</issn>
          <pub-id pub-id-type="doi">10.1080/13602004.2011.599540</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1080/13602004.2011.599540">https://doi.org/10.1080/13602004.2011.599540</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303851">
        <element-citation publication-type="article-journal">
          <article-title>Gender inference for arabic language in social media</article-title>
          <source>Discrimination and Diversity</source>
          <person-group person-group-type="author">
            <name>
              <surname>Al-Ghadir</surname>
              <given-names>Abdul Rahman I.</given-names>
            </name>
            <name>
              <surname>Alabdullatif</surname>
              <given-names>Abdullatif</given-names>
            </name>
            <name>
              <surname>Azmi</surname>
              <given-names>Aqil M.</given-names>
            </name>
          </person-group>
          <publisher-name>IGI Global</publisher-name>
          <fpage>811</fpage>
          <lpage>821</lpage>
          <pub-id pub-id-type="doi">10.4018/978-1-5225-1933-1.ch037</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.4018/978-1-5225-1933-1.ch037">https://doi.org/10.4018/978-1-5225-1933-1.ch037</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303850">
        <element-citation publication-type="article-journal">
          <article-title>A study of arabic social media users—posting behavior and author’s gender prediction</article-title>
          <source>Cognitive Computation</source>
          <person-group person-group-type="author">
            <name>
              <surname>Al-Ghadir</surname>
              <given-names>Abdulrahman I</given-names>
            </name>
            <name>
              <surname>Azmi</surname>
              <given-names>Aqil M</given-names>
            </name>
          </person-group>
          <publisher-name>Springer</publisher-name>
          <date>
            <year>2019</year>
          </date>
          <volume>11</volume>
          <fpage>71</fpage>
          <lpage>86</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303849">
        <element-citation publication-type="article-journal">
          <article-title>Analysis the arabic authorship attribution using machine learning methods: Application on islamic fatwā</article-title>
          <source>Advances in Intelligent Systems and Computing</source>
          <person-group person-group-type="author">
            <name>
              <surname>Al-Sarem</surname>
              <given-names>Mohammed</given-names>
            </name>
            <name>
              <surname>Emara</surname>
              <given-names>Abdel-Hamid</given-names>
            </name>
          </person-group>
          <publisher-name>Springer International Publishing</publisher-name>
          <date>
            <day>9</day>
            <month>9</month>
            <year>2018</year>
          </date>
          <fpage>221</fpage>
          <lpage>229</lpage>
          <issn>2194-5357</issn>
          <isbn>9783319990064</isbn>
          <pub-id pub-id-type="doi">10.1007/978-3-319-99007-1_21</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1007/978-3-319-99007-1_21">https://doi.org/10.1007/978-3-319-99007-1_21</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303853">
        <element-citation publication-type="article-journal">
          <article-title>(Under) cover and uncovered: Muslim women’s resistance to islamophobic violence</article-title>
          <source>Victims &amp; Offenders</source>
          <person-group person-group-type="author">
            <name>
              <surname>Baboolal</surname>
              <given-names>Aneesa A</given-names>
            </name>
          </person-group>
          <publisher-name>Taylor &amp; Francis</publisher-name>
          <date>
            <year>2023</year>
          </date>
          <fpage>1</fpage>
          <lpage>21</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303874">
        <element-citation publication-type="book">
          <source>British asian muslim women, multiple spatialities and cosmopolitanism</source>
          <person-group person-group-type="author">
            <name>
              <surname>Bhimji</surname>
              <given-names>Fazila</given-names>
            </name>
          </person-group>
          <publisher-name>Palgrave Macmillan UK</publisher-name>
          <date>
            <year>2012</year>
          </date>
          <isbn>9781349436736</isbn>
          <pub-id pub-id-type="doi">10.1057/9781137013873</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1057/9781137013873">https://doi.org/10.1057/9781137013873</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303871">
        <element-citation publication-type="article-journal">
          <article-title>Reconstructing self and society: Javanese muslim women and “the veil”</article-title>
          <source>American ethnologist</source>
          <person-group person-group-type="author">
            <name>
              <surname>Brenner</surname>
              <given-names>Suzanne</given-names>
            </name>
          </person-group>
          <publisher-name>Wiley</publisher-name>
          <date>
            <month>11</month>
            <year>1996</year>
          </date>
          <volume>23</volume>
          <issue>4</issue>
          <fpage>673</fpage>
          <lpage>697</lpage>
          <issn>0094-0496</issn>
          <pub-id pub-id-type="doi">10.1525/ae.1996.23.4.02a00010</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1525/ae.1996.23.4.02a00010">https://doi.org/10.1525/ae.1996.23.4.02a00010</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303848">
        <element-citation publication-type="article-journal">
          <article-title>Al-buti’s thoughts on maslāhah and its application in the fatwa of world fatwa institutions</article-title>
          <source>Samarah: Jurnal Hukum Keluarga dan Hukum Islam</source>
          <person-group person-group-type="author">
            <name>
              <surname>Dahlan</surname>
              <given-names>Abdurrahman</given-names>
            </name>
            <name>
              <surname>Qodsiyah</surname>
              <given-names>Bagus Haziratul</given-names>
            </name>
            <name>
              <surname>Azizah</surname>
              <given-names>Azizah</given-names>
            </name>
            <name>
              <surname>Asmawi</surname>
              <given-names>Asmawi</given-names>
            </name>
            <name>
              <surname>Hejazziey</surname>
              <given-names>Djawahir</given-names>
            </name>
          </person-group>
          <date>
            <year>2023</year>
          </date>
          <volume>7</volume>
          <issue>2</issue>
          <fpage>1148</fpage>
          <lpage>1170</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303845">
        <element-citation publication-type="article-journal">
          <article-title>Challenging the status quo: Khaled m. Abou el fadl’s perspectives on islamic legal authority and the restrictive fatwa on women’s solo travel</article-title>
          <source>JIL: Journal of Islamic Law</source>
          <person-group person-group-type="author">
            <name>
              <surname>Faiz</surname>
              <given-names>Muhammad Fauzinudin</given-names>
            </name>
            <name>
              <surname>Rohmatulloh</surname>
              <given-names>Dawam Multazamy</given-names>
            </name>
            <name>
              <surname>Solikhudin</surname>
              <given-names>Muhammad</given-names>
            </name>
          </person-group>
          <publisher-name>IAIN Pontianak</publisher-name>
          <date>
            <day>23</day>
            <month>2</month>
            <year>2023</year>
          </date>
          <volume>4</volume>
          <issue>1</issue>
          <fpage>47</fpage>
          <lpage>66</lpage>
          <issn>2721-5040</issn>
          <pub-id pub-id-type="doi">10.24260/jil.v4i1.1071</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.24260/jil.v4i1.1071">https://doi.org/10.24260/jil.v4i1.1071</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303875">
        <element-citation publication-type="webpage">
          <source>Fatwa</source>
          <ext-link ext-link-type="uri" ns0:href="https://www.britannica.com/topic/fatwa">https://www.britannica.com/topic/fatwa</ext-link>
          <comment>Accessed: 2021-05-04</comment>
        </element-citation>
      </ref>
      <ref id="ref-303858">
        <element-citation publication-type="article-journal">
          <article-title>Male or female: What traits characterize questions prompted by each gender in community question answering?</article-title>
          <source>Expert Systems with Applications</source>
          <person-group person-group-type="author">
            <name>
              <surname>Figueroa</surname>
              <given-names>Alejandro</given-names>
            </name>
          </person-group>
          <publisher-name>Elsevier BV</publisher-name>
          <date>
            <month>12</month>
            <year>2017</year>
          </date>
          <volume>90</volume>
          <fpage>405</fpage>
          <lpage>413</lpage>
          <issn>0957-4174</issn>
          <pub-id pub-id-type="doi">10.1016/j.eswa.2017.08.037</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1016/j.eswa.2017.08.037">https://doi.org/10.1016/j.eswa.2017.08.037</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303872">
        <element-citation publication-type="article-journal">
          <article-title>Young Muslim women's political participation in Scotland: Exploring the intersections of gender, religion, class and place</article-title>
          <source>Political Geography</source>
          <person-group person-group-type="author">
            <name>
              <surname>Finlay</surname>
              <given-names>Robin</given-names>
            </name>
            <name>
              <surname>Hopkins</surname>
              <given-names>Peter</given-names>
            </name>
          </person-group>
          <publisher-name>Elsevier BV</publisher-name>
          <date>
            <month>10</month>
            <year>2019</year>
          </date>
          <volume>74</volume>
          <fpage>102046</fpage>
          <issn>0962-6298</issn>
          <pub-id pub-id-type="doi">10.1016/j.polgeo.2019.102046</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1016/j.polgeo.2019.102046">https://doi.org/10.1016/j.polgeo.2019.102046</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303842">
        <element-citation publication-type="article-journal">
          <article-title>Moderation in fatwas and ijtihad: An analysis of fatwas issued by the MKI malaysia concerning the covid-19 pandemic</article-title>
          <source>AHKAM: Jurnal Ilmu Syariah</source>
          <person-group person-group-type="author">
            <name>
              <surname>Ismail</surname>
              <given-names>Abdul Manan</given-names>
            </name>
            <name>
              <surname>Baharuddin</surname>
              <given-names>Ahmad Syukran</given-names>
            </name>
          </person-group>
          <publisher-name>Universitas Islam Negeri Syarif Hidayatullah Jakar</publisher-name>
          <date>
            <year>2022</year>
          </date>
        </element-citation>
      </ref>
      <ref id="ref-303867">
        <element-citation publication-type="article-journal">
          <article-title>Does online anonymity undermine the sense of personal responsibility?</article-title>
          <source>Media, Culture &amp; Society</source>
          <person-group person-group-type="author">
            <name>
              <surname>Jordan</surname>
              <given-names>Tim</given-names>
            </name>
          </person-group>
          <date>
            <year>2019</year>
          </date>
          <volume>41</volume>
          <issue>4</issue>
          <fpage>572</fpage>
          <lpage>577</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303844">
        <element-citation publication-type="article-journal">
          <article-title>Protection through constitutional guarantees: The case of women, children, and backward sections of the people</article-title>
          <source>The Constitutional Law of Bangladesh</source>
          <person-group person-group-type="author">
            <name>
              <surname>Khan</surname>
              <given-names>Borhan Uddin</given-names>
            </name>
            <name>
              <surname>Mollah</surname>
              <given-names>Md Al Ifran Hossain</given-names>
            </name>
          </person-group>
          <publisher-name>Springer Nature Singapore</publisher-name>
          <date>
            <year>2023</year>
          </date>
          <fpage>213</fpage>
          <lpage>228</lpage>
          <isbn>9789819925780</isbn>
          <pub-id pub-id-type="doi">10.1007/978-981-99-2579-7_12</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1007/978-981-99-2579-7_12">https://doi.org/10.1007/978-981-99-2579-7_12</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303843">
        <element-citation publication-type="article-journal">
          <article-title>Siting islamic feminism: The indonesian congress of women islamic scholars and the challenge of challenging patriarchal authority</article-title>
          <source>History and Anthropology</source>
          <person-group person-group-type="author">
            <name>
              <surname>Kloos</surname>
              <given-names>David</given-names>
            </name>
            <name>
              <surname>Ismah</surname>
              <given-names>Nor</given-names>
            </name>
          </person-group>
          <publisher-name>Taylor &amp; Francis</publisher-name>
          <date>
            <year>2023</year>
          </date>
          <fpage>1</fpage>
          <lpage>26</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303846">
        <element-citation publication-type="article-journal">
          <article-title>Islamic law, disability, and women in indonesia: The cases of nahdlatul ulama and muhammadiyah</article-title>
          <source>Journal of Disability &amp; Religion</source>
          <person-group person-group-type="author">
            <name>
              <surname>Maftuhin</surname>
              <given-names>Arif</given-names>
            </name>
          </person-group>
          <publisher-name>Informa UK Limited</publisher-name>
          <date>
            <day>9</day>
            <month>9</month>
            <year>2023</year>
          </date>
          <volume>28</volume>
          <issue>1</issue>
          <fpage>13</fpage>
          <lpage>27</lpage>
          <issn>2331-2521</issn>
          <pub-id pub-id-type="doi">10.1080/23312521.2023.2255860</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1080/23312521.2023.2255860">https://doi.org/10.1080/23312521.2023.2255860</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303864">
        <element-citation publication-type="article-journal">
          <article-title>MALLET: A machine learning for language toolkit.</article-title>
          <person-group person-group-type="author">
            <name>
              <surname>McCallum</surname>
              <given-names>Andrew Kachites</given-names>
            </name>
          </person-group>
          <date>
            <year>2002</year>
          </date>
          <issue>3</issue>
        </element-citation>
      </ref>
      <ref id="ref-303856">
        <element-citation publication-type="article-journal">
          <article-title>Jewish, christian and islamic in the english wikipedia</article-title>
          <source>Online-Heidelberg Journal of Religions on the Internet</source>
          <person-group person-group-type="author">
            <name>
              <surname>Mohamed</surname>
              <given-names>Emad</given-names>
            </name>
          </person-group>
          <date>
            <year>2016</year>
          </date>
          <volume>11</volume>
        </element-citation>
      </ref>
      <ref id="ref-303863">
        <element-citation publication-type="article-journal">
          <article-title>Computing happiness from textual data</article-title>
          <source>Stats</source>
          <person-group person-group-type="author">
            <name>
              <surname>Mohamed</surname>
              <given-names>Emad</given-names>
            </name>
            <name>
              <surname>Mostafa</surname>
              <given-names>Sayed A.</given-names>
            </name>
          </person-group>
          <publisher-name>MDPI AG</publisher-name>
          <date>
            <day>3</day>
            <month>7</month>
            <year>2019</year>
          </date>
          <volume>2</volume>
          <issue>3</issue>
          <fpage>347</fpage>
          <lpage>370</lpage>
          <issn>2571-905X</issn>
          <pub-id pub-id-type="doi">10.3390/stats2030025</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.3390/stats2030025">https://doi.org/10.3390/stats2030025</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303876">
        <element-citation publication-type="article-journal">
          <article-title>Linguistic features evaluation for hadith authenticity through automatic machine learning</article-title>
          <source>Digital Scholarship in the Humanities</source>
          <person-group person-group-type="author">
            <name>
              <surname>Mohamed</surname>
              <given-names>Emad</given-names>
            </name>
            <name>
              <surname>Sarwar</surname>
              <given-names>Raheem</given-names>
            </name>
          </person-group>
          <publisher-name>Oxford University Press (OUP)</publisher-name>
          <date>
            <day>13</day>
            <month>11</month>
            <year>2021</year>
          </date>
          <volume>37</volume>
          <issue>3</issue>
          <fpage>830</fpage>
          <lpage>843</lpage>
          <issn>2055-7671</issn>
          <pub-id pub-id-type="doi">10.1093/llc/fqab092</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1093/llc/fqab092">https://doi.org/10.1093/llc/fqab092</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303862">
        <element-citation publication-type="paper-conference">
          <source>Arabic-SOS: Segmentation, stemming, and orthography standardization for classical and pre-modern standard arabic</source>
          <person-group person-group-type="author">
            <name>
              <surname>Mohamed</surname>
              <given-names>Emad</given-names>
            </name>
            <name>
              <surname>Sayyed</surname>
              <given-names>Zeeshan Ali</given-names>
            </name>
          </person-group>
          <date>
            <year>2019</year>
          </date>
          <fpage>27</fpage>
          <lpage>32</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303855">
        <element-citation publication-type="article-journal">
          <article-title>Predictors of perceived discrimination in medical settings among muslim women in the USA</article-title>
          <source>Journal of Racial and Ethnic Health Disparities</source>
          <person-group person-group-type="author">
            <name>
              <surname>Murrar</surname>
              <given-names>Sohad</given-names>
            </name>
            <name>
              <surname>Baqai</surname>
              <given-names>Benish</given-names>
            </name>
            <name>
              <surname>Padela</surname>
              <given-names>Aasim I.</given-names>
            </name>
          </person-group>
          <publisher-name>Springer Science and Business Media LLC</publisher-name>
          <date>
            <day>9</day>
            <month>1</month>
            <year>2023</year>
          </date>
          <volume>11</volume>
          <issue>1</issue>
          <fpage>150</fpage>
          <lpage>156</lpage>
          <issn>2197-3792</issn>
          <pub-id pub-id-type="doi">10.1007/s40615-022-01506-0</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1007/s40615-022-01506-0">https://doi.org/10.1007/s40615-022-01506-0</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303852">
        <element-citation publication-type="article-journal">
          <article-title>The contribution of all-women tours to well-being in middle-aged muslim women</article-title>
          <source>Gender and tourism sustainability</source>
          <person-group person-group-type="author">
            <name>
              <surname>Nikjoo</surname>
              <given-names>Adel</given-names>
            </name>
            <name>
              <surname>Zaman</surname>
              <given-names>Mustafeed</given-names>
            </name>
            <name>
              <surname>Salehi</surname>
              <given-names>Shima</given-names>
            </name>
            <name>
              <surname>Hernández-Lara</surname>
              <given-names>Ana Beatriz</given-names>
            </name>
          </person-group>
          <publisher-name>Routledge</publisher-name>
          <date>
            <day>18</day>
            <month>1</month>
            <year>2023</year>
          </date>
          <fpage>269</fpage>
          <lpage>284</lpage>
          <isbn>9781003329541</isbn>
          <pub-id pub-id-type="doi">10.4324/9781003329541-17</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.4324/9781003329541-17">https://doi.org/10.4324/9781003329541-17</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303865">
        <element-citation publication-type="article-journal">
          <article-title>Scikit-learn: Machine learning in Python</article-title>
          <source>Journal of Machine Learning Research</source>
          <person-group person-group-type="author">
            <name>
              <surname>Pedregosa</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Varoquaux</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Gramfort</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Michel</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Thirion</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Grisel</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Blondel</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Prettenhofer</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Weiss</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Dubourg</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Vanderplas</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Passos</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Cournapeau</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Brucher</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Perrot</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Duchesnay</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <date>
            <year>2011</year>
          </date>
          <volume>12</volume>
          <fpage>2825</fpage>
          <lpage>2830</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303859">
        <element-citation publication-type="article-journal">
          <article-title>Muslim women speak online: Religion, conversion, activism, and art</article-title>
          <source>Hawwa</source>
          <person-group person-group-type="author">
            <name>
              <surname>Piela</surname>
              <given-names>Anna</given-names>
            </name>
          </person-group>
          <publisher-name>Brill</publisher-name>
          <publisher-loc>Leiden, The Netherlands</publisher-loc>
          <date>
            <day>15</day>
            <month>10</month>
            <year>2015</year>
          </date>
          <volume>13</volume>
          <issue>3</issue>
          <fpage>271</fpage>
          <lpage>278</lpage>
          <issn>1569-2078</issn>
          <pub-id pub-id-type="doi">10.1163/15692086-12341287</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1163/15692086-12341287">https://doi.org/10.1163/15692086-12341287</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303861">
        <element-citation publication-type="article-journal">
          <article-title>Exploring the Meanings of<italic>Hijab</italic>through Online Comments in Canada</article-title>
          <source>Journal of Intercultural Communication Research</source>
          <person-group person-group-type="author">
            <name>
              <surname>Rahman</surname>
              <given-names>Osmud</given-names>
            </name>
            <name>
              <surname>Fung</surname>
              <given-names>Benjamin</given-names>
            </name>
            <name>
              <surname>Yeo</surname>
              <given-names>Alexia</given-names>
            </name>
          </person-group>
          <publisher-name>Informa UK Limited</publisher-name>
          <date>
            <day>8</day>
            <month>4</month>
            <year>2016</year>
          </date>
          <volume>45</volume>
          <issue>3</issue>
          <fpage>214</fpage>
          <lpage>232</lpage>
          <issn>1747-5759</issn>
          <pub-id pub-id-type="doi">10.1080/17475759.2016.1171795</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1080/17475759.2016.1171795">https://doi.org/10.1080/17475759.2016.1171795</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303868">
        <element-citation publication-type="article-journal">
          <article-title>TO VEIL OR NOT TO VEIL?: A case study of identity negotiation among muslim women in austin, texas</article-title>
          <source>Gender &amp; Society</source>
          <person-group person-group-type="author">
            <name>
              <surname>READ</surname>
              <given-names>JEN’NAN GHAZAL</given-names>
            </name>
            <name>
              <surname>BARTKOWSKI</surname>
              <given-names>JOHN P.</given-names>
            </name>
          </person-group>
          <date>
            <year>2000</year>
          </date>
          <volume>14</volume>
          <issue>3</issue>
          <fpage>395</fpage>
          <lpage>417</lpage>
        </element-citation>
      </ref>
      <ref id="ref-303860">
        <element-citation publication-type="article-journal">
          <article-title>Female converts from greek orthodoxy to islam and their digital religious identity</article-title>
          <source>Hawwa</source>
          <person-group person-group-type="author">
            <name>
              <surname>Sakellariou</surname>
              <given-names>Alexandros</given-names>
            </name>
          </person-group>
          <publisher-name>Brill</publisher-name>
          <publisher-loc>Leiden, The Netherlands</publisher-loc>
          <date>
            <day>15</day>
            <month>10</month>
            <year>2015</year>
          </date>
          <volume>13</volume>
          <issue>3</issue>
          <fpage>422</fpage>
          <lpage>439</lpage>
          <issn>1569-2078</issn>
          <pub-id pub-id-type="doi">10.1163/15692086-12341291</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1163/15692086-12341291">https://doi.org/10.1163/15692086-12341291</ext-link>
        </element-citation>
      </ref>
      <ref id="ref-303877">
        <element-citation publication-type="article-journal">
          <article-title>Author verification of <italic>Nahj Al-Balagha</italic></article-title>
          <source>Digital Scholarship in the Humanities</source>
          <person-group person-group-type="author">
            <name>
              <surname>Sarwar</surname>
              <given-names>Raheem</given-names>
            </name>
            <name>
              <surname>Mohamed</surname>
              <given-names>Emad</given-names>
            </name>
          </person-group>
          <publisher-name>Oxford University Press (OUP)</publisher-name>
          <date>
            <day>20</day>
            <month>1</month>
            <year>2022</year>
          </date>
          <volume>37</volume>
          <issue>4</issue>
          <fpage>1210</fpage>
          <lpage>1222</lpage>
          <issn>2055-7671</issn>
          <pub-id pub-id-type="doi">10.1093/llc/fqab103</pub-id>
          <ext-link ext-link-type="uri" ns0:href="https://doi.org/10.1093/llc/fqab103">https://doi.org/10.1093/llc/fqab103</ext-link>
        </element-citation>
      </ref>
    </ref-list>
  </back>
</article>