A robust gender inference model for online social networks and its application to LinkedIn and Twitter

Athanasios Kokkos, Theodoros Tzouramanis

Abstract


Online social networking services have come to dominate the dot com world: Countless online communities coexist on the social Web. Some typically characteristic user attributes, such as gender, age group, sexual orientation, are not automatically part of the profile information. In some cases user attributes can even be deliberately and maliciously falsified. This paper examines automated inference of gender on online social networks by analyzing written text with a combination of natural language processing and classification techniques. Extensive experimentation on LinkedIn and Twitter has yielded accuracy of this gender identification technique of up to 98.4 percent.


Keywords


Text Mining in Online Social Networking Services; Psycho-linguistic analysis; Support Vector Machine; Digital Investigation; Attribute Disclosure; Gender Inference; LinkedIn; Twitter

Full Text:

HTML


DOI: http://dx.doi.org/10.5210/fm.v19i9.5216



A Great Cities Initiative of the University of Illinois at Chicago University Library.

© First Monday, 1995-2017. ISSN 1396-0466.