misinformation Archives — LIRNEasia


We present a dataset consisting of 3576 documents in Sinhala, drawn from Sri Lankan news websites and factchecking operations, annotated as CREDIBLE, FALSE, PARTIAL or UN- CERTAIN. The dataset has markers for the content of the document, the classification, the web domain from which each document was retrieved, and the date on which the document was published. We also present the results of misinformation classification models built for the Sinhala language, as well as comparisons to English benchmarks, and suggest that for smaller media ecosystems it may make more practical sense to model uncertainty instead of truth vs falsehood binaries.
The problem with regulating information is its inherent slipperyness. In 2018, when invited to speak on the subject I quoted a Deputy Minister of the Malaysian Government, speaking in Parliament: Datuk Jailani Johari, the Deputy Communications and Multimedia Minister, explained that fake news is information that is confirmed to be untrue, especially by the authorities or parties related to the news. He said that 1MDB has been investigated by the police and Attorney-General and the reports have been presented to Parliament’s Public Accounts Committee (PAC), which is made up of lawmakers from both sides of the divide. Jailani added that recommendations from the PAC report have been accepted and been implemented by the Government. .