Language resources Archives

LIRNEasia discussion series: “Tackling the Information Disorder in Asia”

Posted by Milindu Tissera on May 23, 2022 / 0 Comments

LIRNEasia will host the online discussion series "Tackling the Information Disorder in Asia" June 8 (7:30 AM UTC - 9:30 AM UTC) and June 9 (7:30 AM UTC - 10:00 AM UTC), 2022. This event is free and open to the public. Prior registration mandatory.

Tackling Disinformation: Can AI augment human efforts in fact-checking?

Posted by Milindu Tissera on January 17, 2022 / 0 Comments

LIRNEasia conducted a forum in mid-December 2021 focusing on tackling disinformation.

A Corpus and Machine Learning Models for Fake News Classification in Bengali

Posted by Milindu Tissera on October 13, 2021 / 0 Comments

We present a dataset consisting of 3468 documents in Bengali, drawn from Bangladeshi news websites and factchecking operations, annotated as CREDIBLE, FALSE, PARTIAL or UN-CERTAIN. The dataset has markers for the content of the document, the classification, the web domain from which each document was retrieved, and the date on which the document was published. We also present the results of misinformation classification models built for the Bengali language, as well as comparisons to prior work in English and Sinhala.

A Corpus and Machine Learning Models for Fake News Classification in Sinhala

Posted by Milindu Tissera on July 16, 2021 / 0 Comments

We present a dataset consisting of 3576 documents in Sinhala, drawn from Sri Lankan news websites and factchecking operations, annotated as CREDIBLE, FALSE, PARTIAL or UN- CERTAIN. The dataset has markers for the content of the document, the classification, the web domain from which each document was retrieved, and the date on which the document was published. We also present the results of misinformation classification models built for the Sinhala language, as well as comparisons to English benchmarks, and suggest that for smaller media ecosystems it may make more practical sense to model uncertainty instead of truth vs falsehood binaries.