Explainable classification of wiki streams
DATE:
2024-02-16
UNIVERSAL IDENTIFIER: http://hdl.handle.net/11093/7171
EDITED VERSION: https://link.springer.com/10.1007/978-3-031-45642-8_7
UNESCO SUBJECT: 6308 Comunicaciones Sociales
DOCUMENT TYPE: conferenceObject
ABSTRACT
Web 2.0 platforms, like wikis and social networks, rely on crowdsourced data and, as such, are prone to data manipulation by illintended contributors. This research proposes the transparent identification of wiki manipulators through the classification of contributors as benevolent or malevolent humans or bots, together with the explanation of the attributed class labels. The system comprises: (i) stream-based data pre-processing; (ii) incremental profiling; and (iii) online classification, evaluation and explanation. Particularly, the system profiles contributors and contributions by combining features directly collected with content- and side-based engineered features. The experimental results obtained with a real data set collected from Wikivoyage – a popular travel wiki – attained a 98.52 % classification accuracy and 91.34 % macro Fmeasure. In the end, this work seeks to address data reliability to prevent information detrimental and manipulation.
Files in this item
- Name:
- 2024_garcia_wiki_streams.pdf
- Size:
- 800.8Kb
- Format:
- Description:
- Embargo ata 16-02-2025