Improvements for research data repositories: The case of text spam

Vázquez, Ismael; Novo Lourés, María (UVIGO author); Pavón Rial, Maria Reyes (UVIGO author); Laza Fidalgo, Rosalía (UVIGO author); Méndez Reboredo, José Ramón (UVIGO author); Ruano Ordás, David Alfonso (UVIGO author)
DATE: 2023-04
UNIVERSAL IDENTIFIER: http://hdl.handle.net/11093/7491
PUBLISHED VERSION: https://journals.sagepub.com/doi/10.1177/0165551521998636
UNESCO SUBJECT: 1203.17 Computer Science (Informática)
DOCUMENT TYPE: article

ABSTRACT

Current research has evolved in such a way that scientists must not only adequately describe the algorithms they introduce and the results of their application, but also ensure that those results can be reproduced and compared with results obtained through other approaches. In this context, public data sets (sometimes shared through repositories) are one of the most important elements for the development of experimental protocols and test benches. This study has analysed a significant number of CS/ML (Computer Science/Machine Learning) research data repositories and data sets and detected some limitations that hamper their utility. In particular, we identify and discuss the following much-needed functionalities for repositories: (1) building customised data sets for specific research tasks, (2) facilitating the comparison of different techniques that use dissimilar pre-processing methods, (3) ensuring the availability of software applications to reproduce the pre-processing steps without using the repository functionalities and (4) providing protection mechanisms for licensing issues and user rights. To demonstrate these functionalities, we created the STRep (Spam Text Repository) web application, which implements our recommendations adapted to the field of spam text repositories. In addition, we launched an instance of STRep at the URL https://rdata.4spam.group to facilitate understanding of this study.

Files in this item

Name: 2023_mendez_improvements_resea ...
Size: 1.606Mb
Format: PDF
Description: Indefinite embargo due to copyright
