
    espace - Curtin’s institutional repository


    Web search activity data accurately predict population chronic disease risk in the USA

    Access Status
    Fulltext not available
    Authors
    Nguyen, T.
    Tran, The Truyen
    Luo, W.
    Gupta, S.
    Rana, S.
    Phung, D.
    Nichols, M.
    Millar, L.
    Venkatesh, S.
    Allender, S.
    Date
    2015
    Type
    Journal Article
    
    Citation
    Nguyen, T., Tran, T., Luo, W., Gupta, S., Rana, S., Phung, D., Nichols, M. et al. 2015. Web search activity data accurately predict population chronic disease risk in the USA. Journal of Epidemiology and Community Health. 69 (7): pp. 693–699.
    Source Title
    Journal of Epidemiology and Community Health
    DOI
    10.1136/jech-2014-204523
    ISSN
    0143-005X
    School
    Multi-Sensor Proc & Content Analysis Institute
    URI
    http://hdl.handle.net/20.500.11937/45061
    Collection
    • Curtin Research Publications
    Abstract

    Background: The WHO framework for non-communicable disease (NCD) describes risks and outcomes comprising the majority of the global burden of disease. These factors are complex and interact at biological, behavioural, environmental and policy levels, presenting challenges for population monitoring and intervention evaluation. This paper explores the utility of machine learning methods applied to population-level web search activity behaviour as a proxy for chronic disease risk factors.

    Methods: Web activity output for each element of the WHO's Causes of NCD framework was used as a basis for identifying relevant web search activity from 2004 to 2013 for the USA. Multiple linear regression models with regularisation were used to generate predictive algorithms, mapping web search activity to Centers for Disease Control and Prevention (CDC) measured risk factor/disease prevalence. Predictions for subsequent target years not included in the model derivation were tested against CDC data from population surveys using Pearson correlation and Spearman's r.

    Results: For 2011 and 2012, predicted prevalence was very strongly correlated with measured risk data ranging from fruits and vegetables consumed (r=0.81; 95% CI 0.68 to 0.89) to alcohol consumption (r=0.96; 95% CI 0.93 to 0.98). Mean difference between predicted and measured differences by State ranged from 0.03 to 2.16. Spearman's r for state-wise predicted versus measured prevalence varied from 0.82 to 0.93.

    Conclusions: The high predictive validity of web search activity for NCD risk has potential to provide real-time information on population risk during policy implementation and other population-level NCD prevention efforts.
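
The evaluation step described in the Methods (comparing predicted with measured prevalence using Pearson correlation and Spearman's r) can be sketched as follows. This is a minimal illustration, not the authors' code: the function names and the sample numbers are made up, and the Fisher z-transformation used for the 95% confidence interval is an assumption, since the abstract does not say how its intervals were computed.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_r(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


def fisher_ci(r, n, z_crit=1.96):
    """Approximate 95% CI for r via the Fisher z-transformation (assumed method)."""
    z = math.atanh(r)
    se = 1 / math.sqrt(n - 3)
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)


if __name__ == "__main__":
    # Hypothetical prevalence values (%), one entry per region; not study data.
    measured = [10.0, 12.5, 15.0, 17.5, 20.0, 22.5]
    predicted = [11.0, 12.0, 16.0, 17.0, 21.0, 22.0]

    r = pearson_r(measured, predicted)
    rho = spearman_r(measured, predicted)
    low, high = fisher_ci(r, len(measured))
    print(f"Pearson r = {r:.3f} (95% CI {low:.3f} to {high:.3f})")
    print(f"Spearman r = {rho:.3f}")
```

Pearson's coefficient measures how closely predictions track measured values on a linear scale, while Spearman's coefficient only checks whether regions are ordered the same way, which is why reporting both is informative.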

    Related items

    Showing items related by title, author, creator and subject.

    • Burden of disease and benefits of exercise in fixed airway obstruction asthma
      Turner, Sian Elizabeth (2009)
      Background and research questions. The characterization of chronic persistent asthma in an older adult population is not well defined. This is due to the difficulties in separating the diagnosis of asthma from that of ...
    • The role of functional, radiological and self-reported measures in predicting clinical outcome in spondylotic cervical radiculopathy
      Agarwal, Shabnam (2011)
      Background: Cervical radiculopathy (CR) results in significant disability and pain and is commonly treated conservatively with satisfactory clinical outcomes. However, a considerable number of patients require surgery to ...
    • Effectiveness of general practice nurse interventions in cardiac risk factor reduction amongst adults
      Halcomb, E.; Moujalli, S.; Griffiths, R.; Davidson, Patricia (2007)
      Background: Cardiovascular disease is the leading cause of death for adults in Australia. In recent years there has been a shift in health service delivery from institutional to community-based care for chronic conditions, ...