A journal of IEEE and CAA, publishes high-quality papers in English on original theoretical/experimental research and development in all areas of automation
Volume 8 Issue 1
Jan.  2021

IEEE/CAA Journal of Automatica Sinica

  • JCR Impact Factor: 6.171, Top 11% (SCI Q1)
    CiteScore: 11.2, Top 5% (Q1)
    Google Scholar h5-index: 51, TOP 8
Sohail Imran, Tariq Mahmood, Ahsan Morshed and Timos Sellis, "Big Data Analytics in Healthcare — A Systematic Literature Review and Roadmap for Practical Implementation," IEEE/CAA J. Autom. Sinica, vol. 8, no. 1, pp. 1-22, Jan. 2021. doi: 10.1109/JAS.2020.1003384

Big Data Analytics in Healthcare — A Systematic Literature Review and Roadmap for Practical Implementation

doi: 10.1109/JAS.2020.1003384
Funds: This work was supported by two research grants provided by the Karachi Institute of Economics and Technology (KIET) and the Big Data Analytics Laboratory at the Institute of Business Administration (IBA-Karachi).
  • Healthcare information management systems (HIMSs) continue to produce large volumes of healthcare data for patient care and for compliance and regulatory requirements at a global scale. Analyzing this big data opens up many opportunities for knowledge discovery. Big data analytics (BDA) in healthcare can, for instance, help determine the causes of diseases, generate effective diagnoses, enhance quality-of-service (QoS) guarantees by increasing the efficiency of healthcare delivery and the effectiveness and viability of treatments, generate accurate predictions of readmissions, enhance clinical care, and pinpoint opportunities for cost savings. However, BDA implementations in any domain are generally complicated and resource-intensive, with a high failure rate and no roadmap or success strategies to guide practitioners. In this paper, we present a comprehensive roadmap for deriving insights from BDA in the healthcare (patient care) domain, based on the results of a systematic literature review. We first determine big data characteristics for healthcare and then review BDA applications to healthcare in academic research, focusing particularly on NoSQL databases. We also identify the limitations and challenges of these applications and justify the potential of NoSQL databases to address these challenges and further enhance BDA healthcare research. We then propose and describe a state-of-the-art BDA architecture for the healthcare domain, called Med-BDA, which solves all current BDA challenges and is based on the latest Zeta big data paradigm. We also present success strategies to ensure that Med-BDA works in practice, and we outline the major benefits of BDA applications to healthcare. Finally, we compare our work with other related literature reviews across twelve hallmark features to justify its novelty and importance. Taken together, these contributions are unique and present a clear roadmap for clinical administrators, practitioners, and professionals to successfully implement BDA initiatives in their organizations.


  • 1 https://neo4j.com
    2 http://www.hl7.org/implement/standards/fhir/
    3 A group of graduate students participated in this activity over a period of 3 months. For the sake of brevity, the details are outside the scope of this paper.
    4 To the best of our knowledge, this list is complete as of June 2020.
    5 A detailed discussion of the nine compared papers is outside the scope of this work; we invite the reader to go through these papers for more required information.



    Figures(13)  / Tables(5)



    • The most thorough systematic literature review of big data analytics applications in healthcare
    • Focuses on healthcare applications of NoSQL databases and the Apache Hadoop ecosystem
    • Proposes the first-ever Zeta architecture, called Med-BDA, for big healthcare data analytics
    • Med-BDA has the potential to solve all current limitations of big healthcare data analytics
    • Presents business strategies for successfully implementing Med-BDA in any clinical organization

