Yuan Ze University is a leading private university in Taiwan. In keeping with its strategy of differentiation and internationalization, it has invited LIRNEasia to interact with its new Center of Excellence known as the Innovation Center for Big Data and Convergence. Even though Taiwan is not a country of focus for LIRNEasia, we have had considerable interactions with Taiwan academics over the years within the framework of CPRsouth. Professor Yu-li Liu was one of the founder members of the CPRsouth Board and helped us establish relationships in the Peoples Republic. Professor Yuntsai Chou of Yuan Ze in currently on the CPRsouth Board.
Bangladesh’s leading English language newspaper, the Daily Star, has carried a story on what that country could gain from analysis of mobile network big data. He said a lot of people are now talking about big data, but the current models of data analysis do not cover all the people, especially the poor. Mobile networks’ big data can be used to reach this segment, he added. Big data can give organisations access to more data than they have experienced before, and thus give them the opportunity to discover data correlations and patterns. Access to more accurate information can influence their business in many ways.
The specialized unit of the UN dealing with big data for development, UN Global Pulse, has constituted a data privacy advisory group. LIRNEasia’s chair and advisor to the big data for development team, Rohan Samarajiva, is a member. Among the other members are MIT’s Sandy Pentland and Umar Saif, Chair of the Punjab IT Board and Secretary IT of the Government of Punjab.
Reading the interview with Viktor Mayer-Schonberger from which the quotation below is taken, I was reminded of an exchange we had in Doha earlier this month, at ITU Telecom World. What do you have to say to those who are still unsure about ‘Big Data’? To them I say: The only way that you can be in power and not be ‘controlled’ by data is to first understand its power and then use it to your own advantage. Otherwise, you will always be at the receiving end in the balance of power between the ‘big guys’ who analyse data and the ‘every man’ who supplies data – whether consciously or inadvertently. The question was from Dr Shiv Bakhshi, now with Ericsson and whose association with me goes back to the 1990s.
This debate about access to data about railway delays in Britain has interesting implications in other fields such as electricity. She told me part of the problem is when public services are provided by the private sector; such firms claim that selling data is a revenue stream for them, and ask for public-funded subsidies to make it open. For example, the state-owned Ordnance Survey mapping company has made much of its data open, but takes a £10 million annual subsidy to do so, according to The Independent. “It’s particularly frustrating when organisations aren’t making much money from it,” Tennison added, noting that the costs of selling data—including lawyers for licensing and enforcing terms—often outstrip any revenue. Tennison hopes that’s not the case.
Over the past weeks, Sriganesh Lokanathan and I made multiple presentations on the above subject to potential funders and data donors in multiple countries, using the slideset given here. In an ideal world, we would be using our energy making presentations to those could make better informed policy decisions as a result (we have done such presentations and plan to do more in January 2015), but these efforts are also necessary. Without data and without money, this kind of public-interest research cannot be continued.
An unexpectedly detailed description of our big data session was included in the Day 3 highlights: Big data is usually in the headlines for the wrong reasons – surveillance, exploitation of personal data for commercial or governmental ends, intrusion of privacy – but can also serve a valid and immensely exciting social purpose for development. Kicking off a fascinating, packed and highly-interactive session, moderator Rohan Samarajiva, Founding Chair and CEO, LIRNEasia, set out this contradiction in perception of big data as a “competition of imaginations” between hype and pessimism, reminding us that big data is “of interest to all of us, as we are the creators of this data, the originators of this data”. Our mobile telephones, and by extension we ourselves, are permanently in communication with the nearest towers, sending out details of our whereabouts and activities in an ever-growing, highly personal call record. This session aimed to “talk not about the imagination, but about what has been done”, exploring current and future trends in the use of big data for development.
Much of what is discussed as “big data” does not include the poor, because smartphone penetration is still low, social media are not used by all classes and datafied records are rare in developing countries. Therefore, the session focused on research that has been/is being done on pseudonymized mobile network big data in developing countries. Instead the usual “battle of imaginations” which posits the optimistic scenarios that tend toward hype against the pessimistic scenarios that imagine all sorts of bad things that could happen, we began with reality. What had been actually done on the ground in countries as different as Namibia, Afghanistan and Sri Lanka were presented by data scientists who knew the ins and outs of data cleaning, pseudonymization, and what software needs to be used to analyze petabytes of data at a time. The active audience raised a range of questions.
As I move from several productive conversations about big data for development in London to Doha where we will be exploring the potential of mobile network big data in the context of three presentations on research insights that have been drawn from big data, the question that preoccupies me is whether we can afford to let these data go waste, or be only used for narrow commercial ends. In economies with high consumer spending power there will be enough incentive to extract value from the data. But in our countries, where the dominant business model does not leave a lot of room for R&D, will we be left to mercy of off-the-shelf data analytics packages, if any?
Following the plenary in 2013 at which Viktor Mayer-Schonberger introduced big data to ITU Telecom World attendees, there will be a panel discussion at the 2014 edition in Doha, Qatar. What is novel is that we will have three presentations by those who have actually got their hands dirty with big data, including Linus Bengtsson on Flowminder who will talk about their most recent work in helping track Ebola in West Africa, and our own Sriganesh Lokanathan and Joshua Blumenstock. Big Data for Development Tuesday, December 09, 2014, 11:00 AM – 12:30 PM, Meeting Room 104 Companies are increasingly relying on business analytics to extract value from the large volumes of computer-readable and analyzable (or “datafied”) data in their possession. For example, telecom operators are using these techniques to identify customers likely to exit so as to manage churn. Big data for development (BD4D) seeks to apply these techniques to big data held by both government and private entities to answer development-related questions.
I am not sure surveying current smartphone users, especially in countries where smartphone penetration is still low, is the best way to gauge the demand for smart-city services, but it is a useful input. Here are some key findings from an Ericsson study that is available on the web. The report – which surveyed over 9,000 smartphone users in nine cities (including Beijing, Delhi and Tokyo) – found that 76% of respondents would use traffic volume maps, while 70% would use energy usage monitors and 66% would use apps to check water quality. “These are services that consumers will expect cities to make available via the internet,” says Michael Bjorn, Ericsson ConsumerLab’s head of research. Bjorn adds that demand for smart-city services could also drive future concepts such as interactive road navigation, social bike/car sharing, indoor maps, as well as healthcare concepts like heart-rate monitoring rings, posture sensors and a digital health network of medical data accessible by physicians.
At one time, transaction-generated data (TGD) was the by product. E commerce or retailing services provided over the web was the main product. But if analysis of the TGD is used to give the company leverage in other sectors, resulting in acquisitions or entry .. .
The spread of infectious diseases is affected by the movement of people. We were thinking how this could be tracked using mobile network big data. Others are already doing it. All strength to them. The people of West Africa, and the world, need all the help they can get.
Technology, especially measuring and monitoring technology, does not exist in a power vacuum. As we struggle with getting our hands on data and finding the best ways of extracting insights, we should also give some thought to power dynamics. Reading this may get the process started. Life in a smart city is a frictionless; free of traffic congestion, optimally lit, with everything from bins to buildings constantly reporting their status and managing their interactions with residents. The smart slum is still a peripheral idea, but we can speculate on the likely impact of extending this ‘smartness’ to slums and make two competing claims.
Two weeks back I was invited to give a guest lecture by the Department of Management Studies at IIT Delhi. The topic of my lecture was based on our ongoing work in using mobile network big data for development in Sri Lanka. Attended by 60+ graduate students and faculty from various departments (Management, Economics and Computer Science), the lecture garnered a large amount of interest from people trying to understand how big data can be used in various domains (both public and private). Whilst the focus of my talk was very much on development, there are still many implications and cross-over learnings for businesses and this came out more in the discussion following the lecture. The issue for many though (and which will remain for sometime) is getting access to big data rather than the tools.