Linguistic feature selection for personality trait identification from textual data

Angad Singh, Priti Maheshwary, Nitin Kumar Mishra, Timothy Malche

Abstract


Personality identification is a common and central problem in text processing. Sensing personality is helpful for various purposes; for example, estimating users' personalities before providing them with any service is necessary. Individuality is essential in a person's nature in every outlook, for instance, in text writing. But, this remains a core challenge because of the low accuracy achieved. The proposed study solves this problem and presents a big five trait identification technique from text data, which applies a feature selection method to increase accuracy. This technique is called linguistic feature selection for personality trait identification (LFSPTI). This technique first finds features based on mutual information (MI), F-statistic, principal component analysis (PCA), and chi-square, then uses the genetic algorithm (GA) to select high-ranked features from all feature subsets. These four parameters provide various forms of the dataset. The experimental results exhibit that the LFSPTI method enhances the classification accuracy against the best of the competing methods by 1.18%, 0.83%, 1.61%., 1.15%, 1.82%, and 1.39% for extraversion (EXT), neuroticism (NEU), agreeableness (AGR), conscientiousness (CON), openness (OPN), and mean overall personality traits, respectively.

Keywords


Chi-square; Feature selection; F-statistic; Genetic algorithm; Mutual information; Personality trait identification; Principal component analysis

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v37.i3.pp1976-1984

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics