Scientists can now monitor and forecast diseases around the globe more effectively by analyzing views of Wikipedia articles, according to a team from Los Alamos National Laboratory.
“A global disease-forecasting system will improve the way we respond to epidemics,” scientist Sara Del Valle said. “In the same way we check the weather each morning, individuals and public health officials can monitor disease incidence and plan for the future based on today’s forecast.”
Del Valle and her team were able to successfully monitor influenza in the United States, Poland, Japan and Thailand, dengue fever in Brazil and Thailand, and tuberculosis in China and Thailand. The team was also able to forecast all but one of these, tuberculosis in China, at least 28 days in advance. This means, Del Valle said, that people start searching for disease-related information on Wikipedia before they seek medical attention.
The paper also shows the potential to transfer models across different regions; that is, one can “train” a computer model using public health data in one location and use it in another. For example, researchers could create models using data from Japan to track and forecast disease in Thailand. This is particularly important for countries that do not offer reliable disease data.
“The goal of this research is to build an operational disease monitoring and forecasting system with open data and open source code,” Del Valle said. “This paper shows we can achieve that goal.”
Read the study in PLoS Computational Biology: Global disease monitoring and forecasting with Wikipedia.