Learning to Classify Tropical Disease Web Pages from Large Indonesian Web Documents


In the era of internet technology, many cases of tropical diseases like malaria, leprosy, and dengue fever are reported online. These online facts can be very useful to track the spread of the diseases. Studies in classifying tropical disease web pages from a large set of Indonesian web pages have not yet been recognized. In this paper, we built classifiers using Support Vector Machine (SVM), Naïve Bayesian, and K-Nearest Neighbors. We generated dictionaries of n-gram terms for both positive (tropical disease) and negative (non-tropical disease) classes and used the dictionaries to extract feature attributes of the pages. The experimental results show that SVM with polynomial kernel is the best classifier model when compared to the other models and methods. The F-measure and accuracy of the model are 95.52% and 99.59% respectively.

  • Abstract
  • Key Words
  • 1. Introduction
  • 2. Preliminaries
  • 3. Features Extraction
  • 4. Experimental Results
  • 5. Summaries
  • Acknowledgments
  • References
Topics: Diseases

Related Content

Customize your page view by dragging and repositioning the boxes below.

Related Journal Articles
Related eBook Content
Topic Collections

Sorry! You do not have access to this content. For assistance or to subscribe, please contact us:

  • TELEPHONE: 1-800-843-2763 (Toll-free in the USA)
  • EMAIL: asmedigitalcollection@asme.org
Sign In