Disease Detection from Real Clinical Data Using Hadoop MapReduce: A Case Study at Al- Mukhtar Clinic, Tripoli

Authors

DOI:

https://doi.org/10.63318/waujpasv3i2_07

Keywords:

Hadoop MapReduce, Big Data Analytics, Machine Learning, Healthcare, Libya, Al-Mukhtar Clinic

Abstract

This paper investigates the application of Hadoop MapReduce for analyzing large-scale clinical
data in the context of cardiovascular risk prediction. Utilizing a dataset comprising over 500,000
real-world medical records collected from Al-Mukhtar Clinic in Tripoli, the research implements a
distributed analytical pipeline that integrates Hadoop MapReduce with the Spark tool, in addition
to machine learning algorithms such as K-means and K-Medoids clustering, hierarchical
clustering, and decision trees, was used to identify patterns and classify patients. . The proposed
system achieved a data processing efficiency of 92% and a predictive accuracy of 89% for
cardiovascular conditions. Additionally, the integration of advanced analytics led to a 15%
improvement in diagnostic accuracy over conventional methods. The architecture demonstrated
scalability, fault tolerance, and operational resilience, making it particularly suitable for healthcare
environments with limited computational resources. These findings highlight the potential of datadriven
methodologies to enhance clinical decision-making and public health outcomes in resourceconstrained
settings.

Downloads

Download data is not yet available.

Downloads

Published

2025-05-26

How to Cite

Abdosalam, M., & Abuhamoud, N. (2025). Disease Detection from Real Clinical Data Using Hadoop MapReduce: A Case Study at Al- Mukhtar Clinic, Tripoli. Wadi Alshatti University Journal of Pure and Applied Sciences, 3(2), 63-74. https://doi.org/10.63318/waujpasv3i2_07