A whitepaper distilling LIRNEasia's current thoughts on the possibilities and issues with the computational extraction of syntactic and semantic language from digital text.
The study by our BD4D team built on the Social Connectedness Index concept introduced by Michael Bailey (the team lead for economics research at Facebook) and others.
A personal reflection on the people of CPRsouth
On the 13th of February, a team from LIRNEasia – comprising Professor Rohan Samarajiva, Dr. Sujata Gamage, and myself – presented some of our research at the Trivedi Center at Ashoka University in Delhi, India. Ashoka, for those who are not familiar with it, is a private university that focuses on the liberal arts; its capital stems from philanthropic contributions. The Trivedi audience were a mix of high-level academics and students, most with a base degree in computer science. Trivedi is dedicated to putting together datasets on Indian politics.
By Keshan de Silva and Yudhanjaya Wijeratne
One of the most useful datasets we have is a collection of pseudoanonymized call data records (CDRs) for all of Sri Lanka, largely from the year 2013. Given that Sri Lanka has extremely high cell coverage and subscription rates (we’re actually oversubscribed: there are more subscribers than people in the country, an artifact of people owning multiple SIMs), this dataset is ripe for analysis at a big data scale. We recently used it to examine attendance at the annual Nallur festival in Jaffna, Sri Lanka. Using the CDRs, we were able to analyze the increase in the region's population during the festival. A lengthy writeup on Medium describes the work, explaining the importance of the festival and the logic for picking it.
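To make the idea concrete, here is a minimal sketch of how a festival-time increase might be estimated from CDRs. It is not the method used in the study: the record layout (subscriber ID, date, region), the sample rows, the dates, and the function names `daily_unique_subscribers` and `relative_increase` are all assumptions made for illustration. It counts distinct subscribers seen in a region each day, then compares the average on event days against the average on the remaining days.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

# Hypothetical, simplified CDR rows: (pseudonymized subscriber ID,
# date of the call, region of the serving tower). Real CDRs carry more
# fields; these values and dates are placeholders, not real data.
SAMPLE_RECORDS = [
    ("a1", date(2013, 8, 1), "Jaffna"),
    ("a2", date(2013, 8, 1), "Jaffna"),
    ("a1", date(2013, 8, 1), "Jaffna"),   # second call, same person, same day
    ("a1", date(2013, 8, 2), "Jaffna"),
    ("a2", date(2013, 8, 2), "Jaffna"),
    ("b1", date(2013, 8, 2), "Colombo"),
    ("a1", date(2013, 8, 10), "Jaffna"),
    ("a2", date(2013, 8, 10), "Jaffna"),
    ("b1", date(2013, 8, 10), "Jaffna"),
    ("b2", date(2013, 8, 10), "Jaffna"),
    ("a1", date(2013, 8, 11), "Jaffna"),
    ("a2", date(2013, 8, 11), "Jaffna"),
    ("b1", date(2013, 8, 11), "Jaffna"),
    ("b3", date(2013, 8, 11), "Jaffna"),
]


def daily_unique_subscribers(records, region):
    """Count distinct subscriber IDs observed in `region` on each day."""
    seen = defaultdict(set)
    for subscriber_id, day, record_region in records:
        if record_region == region:
            seen[day].add(subscriber_id)
    return {day: len(ids) for day, ids in seen.items()}


def relative_increase(daily_counts, event_days):
    """Mean count on event days relative to the mean on all other days."""
    event = [n for day, n in daily_counts.items() if day in event_days]
    baseline = [n for day, n in daily_counts.items() if day not in event_days]
    if not event or not baseline:
        raise ValueError("need at least one event day and one baseline day")
    return mean(event) / mean(baseline) - 1.0


counts = daily_unique_subscribers(SAMPLE_RECORDS, "Jaffna")
event_days = {date(2013, 8, 10), date(2013, 8, 11)}  # placeholder dates
increase = relative_increase(counts, event_days)
print(f"Estimated increase during the event: {increase:.0%}")
```

Distinct active subscribers are only a proxy for population: people with multiple SIMs are counted more than once, and anyone who makes no calls on a given day is not counted at all, so a figure like this is better read as a relative change than as a headcount.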