Skip to main content

Main navigation

  • About ITEFI
  • Research
  • Formación y empleo
  • OpenLab
  • Servicios científico técnicos
  • Staff Directory

Advancing the Use of Information Compression Distances in Authorship Attribution

authorship recognition
cyber-attribution
normalised compression distance
Muñoz, S.P., Oliva, C., Lago-Fernández, L.F., Arroyo, D.
Spezzano, F., Amaral, A., Ceolin, D., Fazio, L., Serra, E. (eds) Disinformation in Open Online Media. MISDOOM 2022. Lecture Notes in Computer Science, vol 13545 . Springer, Cham.
https://doi.org/10.1007/978-3-031-18253-2_8

Detecting unreliable information in social media is an open challenge, in part as a result of the difficulty to associate a piece of information to known and trustworthy actors. The identification of the origin of sources can help society deal with unverified, incomplete, or even false information. In this work we tackle the problem of associating a piece of information to a certain politician. The use of inaccurate information is of great relevance in the case of politicians, since it affects social perception and voting behavior. Moreover, misquotation can be weaponized to hinder adversary reputation. We consider the task of applying a compression-based metric to conduct authorship attribution in social media, namely in Twitter. In specific, we leverage the Normalized Compression Distance (NCD) to compare an author’s text with other authors’ texts. We show that this methodology performs well, obtaining 80.3% accuracy in a scenario with 6 different politicians.

Acknowledgements

This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement No. 872855 (TRESCA project), from Grant PLEC2021-007681 (project XAI-DisInfodemics) funded by MCIN/AEI/ 10.13039/501100011033 and by European Union NextGeneration EU/PRTR, from Comunidad de Madrid (Spain) under the project CYNAMON (no. P2018/TCS-4566), cofunded with FSE and FEDER EU funds, and from Spanish projects MINECO/FEDER TIN2017-84452-R and PID2020-114867RB-I00

GiCSI

proyecto/s relacionado/s

  • IA explicable para desinformación y detección de conspiración durante infodemias. XAIDisInfodemics
    Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020, Programa Estatal de I+D+i Orientada a los Retos de la Sociedad (AEI)
  • Trustworthy, Reliable and Engaging Scientific Communication Approaches. TRESCA
    Programa HORIZONTE'2020 (UE)
Acoustics and Non Destructive Evaluation (DAEND)
  • Environmental Acoustics (GAA)
  • G Carma: Materials Characterization by Non Destructive Evaluation
  • ULAB, Ultrasounds for Liquid Analysis and Bioengineering
Information and Communication Technologies (TIC)
  • Cybersecurity and Privacy Protection Research Group (GiCP)
  • Research group on Cryptology and Information Security (GiCSI)
    • Quantum Communications Laboratory (LCQE)
  • Multichannel Ultrasonic Signal Processing Group (MUSP)
Sensors and Ultrasonic Systems (DSSU)
  • Ultrasonic Systems and Technologies (USTG)
  • Nanosensors and Smart Systems (NoySi)
  • Ultrasonic Resonators for cavitation and micromanipulation (RESULT)
  • Advanced Sensor Technology (SENSAVAN)
  • Quantum Electronics (QE)
Laboratorios
  • Laboratorio de Acústica
  • Laboratorio de Metrología Ultrasónica Médica (LMUM)
  • Laboratorio de Comunicaciones Cuánticas
  • Laboratory for International Collaboration in Advanced Biophotonics Imaging

Instituto de Tecnologías Físicas y de la Información Leonardo Torres Quevedo  - ITEFI
C/ Serrano, 144. 28006 - Madrid • Tel.: (+34) 91 561 88 06  Contacto  •  Intranet
EDIFICIO PARCIALMENTE ACCESIBLE POR PERSONAS CON MOVILIDAD REDUCIDA