Articles

Detection and Classification of Gastrointestinal Diseases by using Machine Learning: A Review

Currently, gastrointestinal diseases claim the lives of up to two million people worldwide. GI disease treatment can be challenging, time-consuming, and expensive.  One of the most recent advancements in medical imaging is the use of video endoscopy to diagnose gastrointestinal illnesses such stomach ulcers, bleeding, and polyps. Doctors require a lot of time to review all the images produced by medical video endoscopy since there are so many of them. This makes manual diagnosis difficult and has encouraged research into computer-aided approaches to diagnose all of the generated images quickly and accurately. The innovative aspect of the suggested methodology is the creation of a system for the diagnosis of digestive disorders. Machine learning techniques have the potential to significantly lower the cost of examination procedures while increasing the accuracy and speed of diagnosis. This paper describes a method for classifying GI illnesses using machine learning techniques.

Predicting Customer Satisfaction through Sentiment Analysis on Online Review

User-generated content, such as user reviews, posts, tags, ratings, and opinions on the internet, can be used as a business indicator if collected and appropriately analyzed. One of the examples is predicting customer satisfaction through implementing big data analytics on online reviews. In analyzing the user-generated content to predict customer satisfaction, the author implements machine learning approach using the Sentiment Analysis method. Five-fold cross-validation was performed to train the classification model. The training was performed with a combination of tokenization methods: term frequency-inverse document frequency (tf-idf) and bag-of-words; n-gram types: unigram, bigram, trigram, and combination of unigram, bigram, and trigram; and machine learning algorithms: linear support vector classification (LinearSVC) and multinomial naïve bayes (MultinomialNB). The result was then evaluated using classification performance metrics such as precision, recall, F1 measure, and AUC score.

The result shows that the tf-idf vectorizer performs similarly to the bag-of-words method. A similar result was also observed for machine learning algorithm selection. Both MultinomialNB and LinearSVC produce the same performance. Low-level n-grams (such as unigrams and bigrams) tended to have higher precision, recall, F1 measure, and AUC score than high-order n-grams (such as trigrams). The best results were achieved by combining unigrams, bigrams, and trigrams, resulting in an average performance score of 0.94 for all measurements. From the result and analysis, the author finds that predicting customer satisfaction using text and sentiment analysis methods on user-generated content is possible. The model’s performance in this experiment is decent, with high precision, recall, F1, and AUC score.