Automatic Identification of Political Ideology in Online News Articles
This research examines text analysis approaches for automatically categorizing news articles by political ideology. Here, ideology is defined as a writer expressing either a liberal or a conservative point of view. Classification is performed at both the document level and the phrase level, because previous research indicates that doing so improves classifier performance over a “bag of words” approach. Linguistic features related to lexical richness are extracted from the articles with Python, and features related to emotions and values are extracted with the Linguistic Inquiry and Word Count (LIWC) software. The machine learning software Weka is then used to apply various classification algorithms to the resulting numeric features. Additionally, Amazon Mechanical Turk is used to measure human accuracy and inter-rater agreement in identifying the ideology of the same texts. Overall, the trained classifiers perform well above the baseline and outperform the human annotators on the same tasks.
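As a rough illustration of what Python-based extraction of lexical richness features might look like, the sketch below computes three common measures for a single text. The abstract does not say which measures were used, so the function name `lexical_richness`, the regex tokenizer, and the choice of type-token ratio, hapax ratio, and mean word length are all assumptions made for illustration, not the authors' implementation.

```python
import re
from collections import Counter


def lexical_richness(text: str) -> dict:
    """Compute a few simple lexical richness measures for one text.

    Hypothetical helper: the specific measures are assumptions,
    not the feature set used in the study.
    """
    # Lowercase and keep alphabetic tokens, allowing one internal apostrophe.
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    if not tokens:
        return {
            "n_tokens": 0,
            "type_token_ratio": 0.0,
            "hapax_ratio": 0.0,
            "mean_word_length": 0.0,
        }

    counts = Counter(tokens)
    n = len(tokens)
    # Hapax legomena: word types that occur exactly once in the text.
    hapaxes = sum(1 for c in counts.values() if c == 1)

    return {
        "n_tokens": n,
        "type_token_ratio": len(counts) / n,  # distinct words / total words
        "hapax_ratio": hapaxes / n,           # once-only words / total words
        "mean_word_length": sum(len(t) for t in tokens) / n,
    }


features = lexical_richness("the cat saw the dog")
```

Each article would yield one such dictionary of numeric values, which could then be written out as one row of a feature table for a classifier to consume.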
Published in RAIS Journal of Social Sciences under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal. Authors retain copyright.