- Open Access
- Authors : Shashiprabha T. A. S , Liyanage I. M , Lakmal Rupasinghe , Chethana Liyanapathirana
- Paper ID : IJERTV9IS120223
- Volume & Issue : Volume 09, Issue 12 (December 2020)
- Published (First Online): 31-12-2020
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License: This work is licensed under a Creative Commons Attribution 4.0 International License
Student Feedback Analyzer for E-Learning Platforms
Shashiprabha T.A.S
Department of Software Engineering
Sri Lanka Institute of Information Technology Malabe, Sri Lanka
Liyanage I.M
Department of Software Engineering
Sri Lanka Institute of Information Technology Malabe, Sri Lanka
Lakmal Rupasinghe
Senior Lecturer: Department of Information Systems Engineering
Sri Lanka Institute of Information Technology Malabe, Sri Lanka
Chethana Liyanapathirana
Lecturer: Department of Information Systems Engineering Sri Lanka Institute of Information Technology
Malabe, Sri Lanka
Abstract:- Evaluation of feedback is important for identifying weaknesses, taking corrective action, and maintaining system efficiency. Unlike e-commerce systems, which use review mining to identify consumers' real expectations and improve product quality, most e-learning systems are not equipped with a proper user feedback evaluation framework. In the field of education, most higher-education institutions and universities collect qualitative and quantitative feedback, both manually and digitally, to improve students' learning progress and instructors' teaching, but domain-specific feedback evaluation for e-learning platforms is rare. Sentiment analysis is the most common form of qualitative feedback analysis, and sentiment analytics systems have become increasingly popular for learning about students' perspectives. Developing such systems from scratch is a difficult task, so many studies rely on non-domain-specific, commercial, general-purpose tools. Nevertheless, most existing sentiment analysis techniques operate only at an abstract level, broadly classifying sentiment as positive, neutral, or negative, and lack the ability to perform fine-grained analysis. In this study, we propose a supervised machine-learning sentiment analysis model with five distinct classes (1-5), labeled strongly negative, negative, neutral, positive, and strongly positive, in order to give instructors a faster and clearer understanding of student opinion. Most current machine-learning-based sentiment analysis models use SVM and Naive Bayes algorithms; in this paper we propose our own ensemble model that combines three algorithms to boost performance.
We also combine a supervised suggestion-mining model with the opinion-mining model described above.
To improve the learning material performance of e-learning platforms, this kind of reliable, domain-specific feedback evaluation appears to be very important. In this paper, we propose an API-based web service that obtains and evaluates feedback using machine learning and AI (Artificial Intelligence) techniques to determine the most suitable way to deliver learning materials from the student's perspective. The program achieves high accuracy in both tasks: sentiment extraction (84 percent) and suggestion analysis (97 percent). Since this method is implemented as a microservice, it can be used with e-learning systems of any kind.
Keywords- Machine Learning, Ensemble, Sentiment, Student Feedback, Accuracy, Suggestion mining, Domain specific
INTRODUCTION
E-learning is a booming sector that continues to grow rapidly. Udemy, Coursera, and Lynda are the industry's leading platforms. Statistics show that the global e-learning market is expected to grow at a CAGR (Compound Annual Growth Rate) of about 7.2 percent over the next decade to reach about $325 billion by 2025 [5]. Because it saves time, money, and resources, students now enjoy electronic learning through e-learning platforms. On the other hand, many new e-learning technologies will appear in the years ahead, which undoubtedly implies more competition. To win this contest, a platform owner needs to determine students' perceptions and the level they expect from learning materials and course content. Evaluating student feedback is the best way of determining the student perspective. Gathering student feedback for any e-learning platform is a comprehensive task: it enables platform owners to listen to their students, understand them, examine their feedback, and take appropriate action. While quantitative feedback is straightforward and reliable, qualitative feedback can provide a thorough interpretation of the students' real point of view.
Sentiment analysis – As users, we see a rating as a quantitative reflection of qualitative input. However, written text carries far more detail than a simple star rating. The most common method of analyzing qualitative reviews is sentiment analysis: the study of people's views about a particular object, person, text, and so on. These are the thoughts that reflect how they feel about a single entity, whether favorable or not. It is a mechanism by which the views expressed in a document are computationally classified and graded to identify whether they are positive, negative, or neutral.
Five polarity sentiment analysis – Past researchers have proposed groundbreaking techniques for evaluating a text's polarity. Traditionally they divide text into three levels of polarity, i.e. positive, negative, and neutral, where the positive class includes documents in which positive language has been used, the negative class includes documents in which the user has had a bad experience with a product, and the neutral class contains records that are neither positive nor negative.
In this research paper, we introduce e-learning domain-specific sentiment analysis with five polarities: strongly negative, negative, neutral, positive, and strongly positive. Binary class labels can be adequate for studying large-scale positive/negative opinion patterns in text data such as tweets, product reviews, or user feedback, but they have limitations. For example, for a positive review that contains a minor issue, such as "This python course is excellent. But tutorials are a little bit unorganized", five-polarity sentiment analysis can provide more reliable results than binary classification to an automated system that prioritizes addressing customer complaints.
Moreover, sentences with dual polarity, such as "The course was very disgusting ... but the instructor was great", can confuse binary sentiment classifiers and lead to inaccurate class predictions.
In our domain (e-learning), the instructor and course owner can get a very good idea of the quality of a course by using this type of five-polarity sentiment analysis. The points above provide enough motivation to tackle this issue.
"The course is very awesome" for example suggests a very powerful positive opinion, while "the course is good" implies a weak positive attitude. This encourages us to conduct research on analyzing university student feedback on five levels than two levels to getting the real aspect of students.
ML classification algorithms – Sentiment classification is meant to assess the general meaning of a written text, which may be a form of admiration or criticism. It can be done using machine learning algorithms such as Naive Bayes and Support Vector Machines. Most researchers have used classification algorithms such as SVM and multinomial Naive Bayes to produce reliable results in machine-learning-based sentiment analysis models. Here we use our own ensemble model of three algorithms: SVM, Naive Bayes, and logistic regression. Thus, the question addressed in this project is: which machine learning approach performs best, in terms of accuracy, on university student (e-learning) reviews?
BACKGROUND
This section reviews previous work related to text analysis, sentiment analysis, NLP, machine learning algorithms, ensemble models, the education domain, and data pre-processing techniques relevant to feedback analysis.
Sentiment analysis is an NLP methodology that can derive feelings from a text. It attempts to classify data into positive and negative polarities using supervised or unsupervised approaches. Sentiment mining techniques and tools are popular and widely used in consumer-business product review mining, and researchers frequently explore alternative approaches to analyzing subjective consumer feedback on e-commerce sites. S. Vanaja performed research based on aspect-level analysis of customer input to examine brand strategies from the consumer perspective [2]. Comparing two machine learning classification algorithms, she achieved higher precision with the Naive Bayes classifier than with the Support Vector Machine classifier; her method tags nouns, pronouns, verbs, and adjectives for each word in the customer reviews and extracts the adjectives using classification algorithms [2]. She used SentiWordNet to generate positive, negative, and neutral scores for each word [2].
Zarmeen Nasim et al. suggested a hybrid model, trained using TF-IDF and lexicon-based features, to evaluate the sentiments conveyed by students in their textual feedback [1]. The method presented there is restricted to computing the overall sentiment of the student reviews. They contrasted their domain-specific approach with other existing APIs at the evaluation stage, and five-polarity (fine-grained) sentiment analysis for greater accuracy is listed as their future work [1].
Emitza Guzman et al. suggested an ensemble framework for classifying app reviews and demonstrated that an ensemble model provides better results than single algorithms for feedback sentiment analysis [3].
Hassan Raza et al. proposed a method of sentiment analysis for scientific documents. Various machine learning classifiers, including NB, SVM, DT, LR, KNN, and RF, were used with various features to process the data and improve classification results. Classifier performance was measured using assessment metrics such as the F-score and accuracy score. The findings reveal that SVM performs better than the other classifiers, with Naive Bayes performing well after SVM. For the macro average, the SVM classifier is better when evaluating the F-score, while Random Forest is better for precision under the micro average. Three polarities (positive, negative, and neutral) were used [4].
Sanjay Bhargav et al. introduced an approach using machine learning algorithms, specifically the Naïve Bayes algorithm, and opinion mining techniques based on Natural Language Processing. They used only two polarities: positive and negative [6].
Jabeen Sultana et al. propose a deep learning model for sentiment analysis on educational data. In their paper they concentrated on identifying the best model in terms of accuracy and efficiency on the training data. Model performance was evaluated across a series of classifiers: SVM, MLP, Decision Tree, K-Star, Bayes Net, Simple Logistic, Multiclass Classification, and Random Forest; MLP and SVM were recognized as the best-performing models. Ten-fold cross-validation (CV) was carried out. The results suggest that, compared with the other classifiers in terms of classification precision, RMSE, sensitivity and specificity, and area under the ROC curve, both SVM and MLP deep learning methods generally delivered high performance [7].
METHODOLOGY
The proposed approach (review analysis) is outlined in this section. The diagram below (Figure 1) presents a high-level view of the proposed API for purposeful analysis of student input; each phase of the high-level diagram is described in what follows.
Figure 1- High Level Diagram of Proposed API
There are two EC2 instances, which serve as the servers on which the back end and front end run. Flask is the back-end API server, and Node serves the front end. When the student feedback analyzer calls the sentiment API, the Flask back end responds with the help of the machine learning model. The machine learning model is periodically improved through Jupyter notebooks in a machine learning studio, and gradual changes are stored in a generic database with versioning. When the web application or mobile client requests a sentiment, Flask responds to the client feedback analyzer using the latest machine learning model. Datasets from different sectors are checked.
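As a rough illustration of this request/response flow, the sketch below shows a minimal Flask endpoint that loads a serialized model and returns a 1-5 sentiment label for a posted review; the file names, route, and model object are illustrative assumptions rather than the exact implementation.

```python
# sentiment_api.py - illustrative sketch of the Flask back end (names assumed)
import pickle

from flask import Flask, request, jsonify

app = Flask(__name__)

# Assumed artifact: a scikit-learn Pipeline (vectorizer + classifier) serialized during training.
with open("sentiment_model.pkl", "rb") as f:
    model = pickle.load(f)

LABELS = {1: "strongly negative", 2: "negative", 3: "neutral",
          4: "positive", 5: "strongly positive"}

@app.route("/api/sentiment", methods=["POST"])
def predict_sentiment():
    review = request.get_json().get("review", "")
    score = int(model.predict([review])[0])   # the model outputs a 1-5 class label
    return jsonify({"score": score, "label": LABELS.get(score, "unknown")})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```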
First, the student review dataset is prepared; it is then used for sentiment analysis.
The methodology for building the ML model is shown in Figure 2 below. The suggested technique consists of three main steps, i.e. data pre-processing and normalization, feature engineering, and classification, which are discussed below:
Figure 2- ML Model Pipeline
PHASE 1: Data Pre-Processing and Normalization
Data Set – Our dataset consists of feedback from IT university students who have taken Coursera courses assigned to IT undergraduates. We first evaluated the data collection we used, choosing a dataset from the Coursera e-learning platform obtained from the Kaggle machine learning repository. The full collection contains about 20,000 samples. Each sample has the following fields: course id, score, review (written by a course user), sentiment (1-5 score), and suggestion (whether or not a suggestion is offered by the review). It was split into training and test sets (75%/25%).
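A minimal sketch of loading and splitting such a dataset with pandas and scikit-learn is shown below; the file name and column names are assumptions for illustration and may not match the exact Kaggle file.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# File and column names are assumed for illustration; the actual Kaggle file may differ.
df = pd.read_csv("coursera_reviews.csv")          # ~20,000 labeled student reviews
X_train, X_test, y_train, y_test = train_test_split(
    df["review"],            # free-text feedback
    df["sentiment"],         # 1-5 sentiment label
    test_size=0.25,          # 75% / 25% split as described above
    random_state=42,
    stratify=df["sentiment"],
)
```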
The raw data must be pre-processed to increase the consistency and efficiency of the classification process. Pre-processing covers the preparation steps that eliminate repetitive words, non-English characters, and punctuation, improving the quality and suitability of the data. It involves eliminating non-English letters, removing stop words, removing accented characters, expanding contractions, removing repeated characters, removing URLs, removing hashtags, handling negations, and handling emoticons.
Pre-processing is an intermediate step in text and emotion classification, and a wide range of methods are used to increase classification efficiency. It standardizes the document corpus, which helps create meaningful features and decreases the dimensionality and noise introduced by factors such as irrelevant symbols, special characters, and XML and HTML tags.
Stripping HTML tags
Our text often includes unnecessary elements such as HTML tags that add little meaning for sentiment analysis. Therefore, before extracting features, we make sure to remove them. The BeautifulSoup library provides the necessary functions and is an excellent fit for this job.
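A minimal sketch of this step using BeautifulSoup (the library named above):

```python
from bs4 import BeautifulSoup

def strip_html_tags(text: str) -> str:
    """Remove HTML markup and keep only the visible text."""
    return BeautifulSoup(text, "html.parser").get_text()

# strip_html_tags("<p>Great <b>course</b>!</p>")  ->  "Great course!"
```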
Removing accented characters
Our dataset contains English feedback, so characters in any other format, particularly accented characters, are converted and standardized into ASCII characters.
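One common way to do this with Python's standard library, shown here as an illustrative sketch:

```python
import unicodedata

def remove_accented_chars(text: str) -> str:
    """Transliterate accented characters to their closest ASCII equivalents."""
    normalized = unicodedata.normalize("NFKD", text)
    return normalized.encode("ascii", "ignore").decode("utf-8")

# remove_accented_chars("résumé café")  ->  "resume cafe"
```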
Expanding contractions
In English, contractions are simply shortened forms of words or syllables. Contractions pose a challenge in standard text because we have to handle special characters such as the apostrophe, and each contraction must be transformed back into its original, expanded form.
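A small illustrative sketch of this step; the contraction map here contains only a few entries and is a placeholder for a fuller list.

```python
import re

# Only a few illustrative entries; a real map would cover all common English contractions.
CONTRACTION_MAP = {
    "don't": "do not", "can't": "cannot", "it's": "it is",
    "i'm": "i am", "isn't": "is not", "won't": "will not",
}

def expand_contractions(text: str) -> str:
    """Replace known contractions with their expanded forms (case-insensitive)."""
    pattern = re.compile("|".join(re.escape(c) for c in CONTRACTION_MAP), re.IGNORECASE)
    return pattern.sub(lambda m: CONTRACTION_MAP[m.group(0).lower()], text)

# expand_contractions("I don't think it's hard")  ->  "I do not think it is hard"
```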
Special Characters Removal
A simple regex function is used to achieve this.
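For illustration, one such regex-based function might look like this:

```python
import re

def remove_special_characters(text: str) -> str:
    """Keep letters, digits and whitespace; drop every other symbol."""
    return re.sub(r"[^a-zA-Z0-9\s]", "", text)

# remove_special_characters("great course!!! 10/10")  ->  "great course 1010"
```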
Lemmatizing text
Word stems are the base forms of words; new words can be formed by adding prefixes and suffixes to a stem, a process called inflection. The reverse process of obtaining the base form of a word is known as stemming. Several stemmers, such as the Porter Stemmer and the Lancaster Stemmer, are included in the nltk package. Lemmatization is quite similar to stemming, in that we remove word affixes to obtain the base form of a word. In this case, however, the base form is known as the root word rather than the root stem. The difference is that the root word is always a lexicographically correct word, while the root stem may not be. We used lemmatization to obtain only lexicographically correct terms in our normalization process.
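The following sketch contrasts the two approaches using the nltk tools mentioned above (expected outputs shown in comments):

```python
import nltk
from nltk.stem import PorterStemmer, WordNetLemmatizer

nltk.download("wordnet", quiet=True)      # one-time resource downloads for the lemmatizer
nltk.download("omw-1.4", quiet=True)

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

print(stemmer.stem("studies"))                     # -> "studi"  (not a real word)
print(lemmatizer.lemmatize("studies", pos="v"))    # -> "study"  (lexicographically valid)
print(lemmatizer.lemmatize("better", pos="a"))     # -> "good"
```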
Stopwords Elimination
Words that have little or no significance, especially when constructing meaningful text features, are known as stopwords. They are typically the terms with the highest frequency in a text corpus when using simple term or word frequencies. Words such as "a", "an", and "the" are stopwords. There is no universal stopword list, but nltk provides a standard list of English stopwords.
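A minimal sketch using the nltk stopword list described above:

```python
import nltk
from nltk.corpus import stopwords

nltk.download("stopwords", quiet=True)             # one-time resource download
STOPWORDS = set(stopwords.words("english"))

def remove_stopwords(text: str) -> str:
    """Drop high-frequency function words that carry little sentiment signal."""
    # Note: negation words such as "not" are in the default list; since negations are
    # handled separately in this pipeline, they could be kept instead.
    return " ".join(w for w in text.split() if w.lower() not in STOPWORDS)

# remove_stopwords("the instructor was a bit slow in the first week")
#   -> "instructor bit slow first week"
```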
PHASE 2: Feature Engineering
We used pre-training formulas such as Term Frequency-Inverse Document Frequency (TF-IDF) and CountVectorizer to convert the review texts into numeric data. We chose these algorithms because they encode the value and presence of words in different forms, which lets us capture the importance of the presence of particular words. We used TF-IDF embedding to determine the numeric frequency matrix for each word t in each review text. The term frequency tf(t, d) measures the number of times the term t appears in document d, whose vocabulary size is V(d) = Σ_t n(t, d); if a word w does not occur in document d, its term frequency tf(w, d) is zero. The definition of term frequency is in principle the same as in CountVectorizer.
The inverse document frequency idf(t, D) for a document set D is the logarithm of the number of documents N divided by df(t, D), the number of documents d ∈ D that contain the term t. Common terms in D therefore receive a low score, while rare words receive a high score.
To summarize, the TF-IDF score w(t, d) of a word increases with its count in a document, but is counteracted if the word appears in too many documents.
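In conventional notation, and using raw counts as described above (practical implementations often add smoothing or normalization), this weighting is:

```latex
w(t,d) \;=\; \mathrm{tf}(t,d)\cdot \mathrm{idf}(t,D)
       \;=\; n(t,d)\cdot \log\frac{N}{\mathrm{df}(t,D)}
```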
Similarly, CountVectorizer produces a matrix of values in which each value reflects the count frequency of a word within a document (review). This matrix is a one-hot-style encoded description of the words found in the corpus: entry a_ij is the cumulative number of times the j-th word appears in the i-th document.
We did this separately for each of the two vectorization settings because we wanted to validate how words play a role in sentiment analysis under different conditions.
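A minimal sketch of the two feature-extraction settings using scikit-learn; the sample reviews and default parameters are illustrative:

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

reviews = [
    "this python course is excellent",
    "the tutorials are a little unorganized",
    "the instructor explains the concepts clearly",
]

# Bag-of-words counts (the CV setting): a_ij = count of word j in review i.
cv = CountVectorizer()
X_counts = cv.fit_transform(reviews)

# TF-IDF weights (the TF-IDF setting): the same matrix re-weighted by inverse document frequency.
tfidf = TfidfVectorizer()
X_tfidf = tfidf.fit_transform(reviews)

print(X_counts.shape, X_tfidf.shape)   # both are (3, vocabulary_size) sparse matrices
```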
PHASE 3: Model Training
We planned to test the following algorithms for accurate sentiment analysis using the numeric representation techniques above, and we used SVM, Naive Bayes, and logistic regression combined with a voting method to create our ensemble model.
SUPPORT VECTOR MACHINE
Support vector machines are probably among the most well-known and widely discussed machine learning algorithms. Developed in the 1990s, they remain a standard choice today and are a strong methodology for high performance with little tuning. An SVM is a discriminative classifier: given labeled training data (supervised learning), the algorithm produces the optimal hyperplane that categorizes new instances. Based on this training, the algorithm can make predictions on unseen data.
NAIVE BAYES CLASSIFIER
Naive Bayes is a probability-based classification algorithm commonly used by the scientific community. Naive Bayes classification relies entirely on the assumption that the presence of a specific attribute in a class is unrelated to the presence of any other feature. The Naive Bayes model is useful for large collections of data and, despite its simplicity, is considered competitive with much more advanced classification methods. Google currently uses it to classify email as spam or not spam, and some news organizations use this approach to sort news into categories such as technology, politics, entertainment, and sports.
RANDOM FOREST CLASSIFIER
Random Forest is a flexible machine learning method that handles both regression and classification tasks. It also performs dimensionality reduction, handles missing values and outliers, and carries out other essential data exploration steps very successfully. It is a kind of ensemble learning strategy that combines a set of weak models into a strong one. Each tree gives a classification for a new entity based on its attributes, and we say the tree "votes" for that class. The forest chooses the class with the most votes over all trees, taking contributions from multiple trees.
LOGISTIC REGRESSION CLASSIFIER
Logistic regression is a classification algorithm that assigns observations to a set of discrete classes. Email spam or not spam, web deception or not deception, and tumor malignant or benign are some examples of such labeling problems. Logistic regression transforms its output using the logistic sigmoid function, which returns a probability value; it is a machine learning algorithm used to address classification problems based on the concept of probability.
ENSEMBLE MODELS
Ensemble learning refers to methods that train multiple learners and integrate their results, treating them as a decision-making committee. The idea is that, with individual forecasts properly combined, the committee's decisions should be better on average than those of any single committee member. Many theoretical and empirical studies have found that ensemble models are usually more reliable than single models. The ensemble members may forecast real-valued quantities, class labels, posterior probabilities, rankings, clusters, or other quantities, and their judgments can be combined using a variety of approaches, including averaging, voting, and probabilistic methods. The bulk of ensemble learning approaches are generic and can be used across a wide range of frameworks and learning tasks.
Ensemble methods are strategies that create several models and then merge them to achieve better performance. Together, these methods typically provide more accurate solutions than a single model.
VOTING CLASSIFICATION
A voting classifier is a machine learning model that trains on a collection of several models and predicts an output (class) based on the class with the highest likelihood of being chosen. It aggregates the outcome of each constituent classifier and predicts the output class with the largest majority of votes. The concept is to build a single combined model that trains these base models and predicts output by the cumulative majority of votes for each output class, instead of building separate individual models and evaluating each one in isolation.
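As an illustration, an ensemble of this kind can be assembled with scikit-learn's VotingClassifier; the hyperparameters and voting scheme below are assumptions, since the exact settings are not specified in the paper.

```python
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.ensemble import VotingClassifier
from sklearn.svm import SVC
from sklearn.naive_bayes import MultinomialNB
from sklearn.linear_model import LogisticRegression

# Hard voting over the three base learners named in the paper (settings are illustrative).
ensemble = VotingClassifier(
    estimators=[
        ("svm", SVC(kernel="linear")),
        ("nb", MultinomialNB()),
        ("lr", LogisticRegression(max_iter=1000)),
    ],
    voting="hard",
)

model = Pipeline([
    ("tfidf", TfidfVectorizer()),   # or CountVectorizer() for the CV setting
    ("clf", ensemble),
])

# X_train: list of review strings, y_train: 1-5 sentiment labels
# model.fit(X_train, y_train)
# model.predict(["the course is good but the quizzes are too hard"])
```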
PHASE 4: Model Evaluation
The evaluation metrics provide a deeper view into a multi-class classifier's behavior. Accuracy (A_i) is used as one classification metric; however, accuracy alone is far less informative than precision and recall, which can differ in the number of correct decisions they reflect.
Precision (i) is the probability that a random text (d) listed under category (c_i) actually belongs to it. It reflects the classifier's ability to distinguish correct from incorrect documents for that category.
Precision measures the exactness of a classifier: higher precision means fewer false positives, while lower precision means more false positives. Precision is also in tension with recall; decreasing recall is a convenient way to improve precision.
Recall (i) is defined as the probability that, if a random document d_x belongs to category (c_i), this decision will be made. Recall measures the completeness, or sensitivity, of the classifier: higher recall means fewer false negatives, and lower recall means more false negatives. Improving recall can also reduce precision, because it becomes increasingly difficult to stay precise as the sample size expands.
F-measure Metric: precision and recall can be merged into a single weighted mean of precision and recall, a metric called the F-measure. The findings are analyzed in the final step of this work to identify the challenges, possible changes, and how the study can be expanded; a description of the progress achieved and the scope for future work is also given. In line with the previous articles, a comparative description of the work currently under way and the work planned is produced.
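In conventional one-vs-rest notation, where TP_i, FP_i, and FN_i are the true positives, false positives, and false negatives for class c_i, these metrics can be written as:

```latex
\mathrm{Precision}(c_i) = \frac{TP_i}{TP_i + FP_i}, \qquad
\mathrm{Recall}(c_i) = \frac{TP_i}{TP_i + FN_i}, \qquad
F_1(c_i) = \frac{2\,\mathrm{Precision}(c_i)\,\mathrm{Recall}(c_i)}
                {\mathrm{Precision}(c_i) + \mathrm{Recall}(c_i)}
```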
RESULTS & DISCUSSION
There are some rule-based sentiment analysis tools, such as VADER and TextBlob, that can be adapted for five-state sentiment analysis; their compound scores can be used for this purpose. However, they are not very accurate and are not domain specific. In the table below, we compare our application with the tools mentioned above.
TABLE I
SUMMARY OF ACCURACY SCORES OBTAINED FOR EXISTING TOOLS AND OUR APPROACH ON THE STUDENT FEEDBACK DATASET

Approach             Accuracy
TextBlob             37.30%
VADER                57.62%
Proposed Approach    80.84%
In most previous machine-learning-based sentiment analysis applications, researchers have used Naïve Bayes, Random Forest, and SVM algorithms to obtain the most accurate results. Here we use our own ensemble model combining SVM, Naïve Bayes, and logistic regression. In Table 2 below, we compare our ensemble model with other existing models.
TABLE 2
SUMMARY OF ACCURACY SCORES OBTAINED FOR DIFFERENT MODELS ON THE STUDENT FEEDBACK DATASET

                                CV                              TF-IDF
Model                           Acc.%   Prec.%  Rec.%   F1%     Acc.%   Prec.%  Rec.%   F1%
SVM                             78.35   74.33   78.35   75.36   80.59   74.49   80.59   80.59
Logistic Regression             79.97   75.64   79.97   77.07   79.02   70.25   79.02   69.97
Naïve Bayes (Multinomial)       80.14   75.30   80.14   76.42   78.93   62.29   78.93   69.63
Random Forest                   79.37   72.48   79.37   72.12   78.57   73.80   79.57   72.20
Proposed Ensemble Model         79.30   72.25   79.30   72.05   80.84   75.08   80.84   76.66
CONCLUSION & FUTURE WORK
We observed that our ensemble model gives better performance than the other single machine learning algorithms, and the proposed ensemble model also achieves its best accuracy with TF-IDF features rather than CV features. CV primarily reflects the number of words present in the feedback, while TF-IDF captures the significance of the words within a review. In this work we addressed the emoticon detection and negation handling challenges of sentiment analysis; future work will cover sarcasm detection and spam review detection. The paper outlined an ensemble method for sentiment analysis of student reviews. The proposed methodology involved machine learning approaches applied to five-polarity sentiment classification. We analyzed other sentiment analytics APIs and contrasted their results with our five-label polarity sentiment analysis, and found that an ensemble model using TF-IDF features on domain-specific e-learning student feedback achieved the best results.
ACKNOWLEDGEMENT
We hereby acknowledge that the research work submitted to the Sri Lanka Institute of Information Technology under the direction of our supervisor, Dr. Prabath Lakmal Rupasinghe, and our co-supervisor, Ms. Chethana Liyanapathirana, titled "Student Feedback Analyzing Assistant Using Micro Services Architecture", is a record of original work carried out by us. This research study is submitted in partial fulfillment of the Bachelor of Science Special (Honors) Degree in Information Technology curriculum. The findings contained in this article have not been submitted for the award of any degree or qualification to any other university or organization. Knowledge drawn from others' published or unpublished research has been cited in the article, and a list of references is given.
REFERENCES
[1] Z. Nasim, Q. Rajput and S. Haider, "Sentiment analysis of student feedback using machine learning and lexicon based approaches," 2017 International Conference on Research and Innovation in Information Systems (ICRIIS), Langkawi, 2017, pp. 1-6, doi: 10.1109/ICRIIS.2017.8002475.
[2] S. Vanaja and M. Belwal, "Aspect-Level Sentiment Analysis on E-Commerce Data," 2018 International Conference on Inventive Research in Computing Applications (ICIRCA), 2018, doi: 10.1109/ICIRCA.2018.8597286. [Accessed 19 February 2020].
[3] E. Guzman, M. El-Haliby and B. Bruegge, "Ensemble Methods for App Review Classification: An Approach for Software Evolution (N)," 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE), Lincoln, NE, 2015, pp. 771-776, doi: 10.1109/ASE.2015.88.
[4] H. Raza, M. Faizan, A. Hamza, A. Mushtaq and N. Akhtar, "Scientific Text Sentiment Analysis using Machine Learning Techniques," International Journal of Advanced Computer Science and Applications, vol. 10, no. 12, 2019.
[5] R. Markets, "Global E-Learning Market to Reach $325 billion by 2025 – Rapid Growth in Online Content & Digitization / Innovations in Wearable Technologies are Flourishing the E-learning Industry," GlobeNewswire News Room, 2020. [Online]. Available: https://www.globenewswire.com/news-release/2017/02/06/914187/0/en/Global-E-Learning-Market-to-Reach-325-billion-by-2025-Rapid-Growth-in-Online-Content-Digitization-Innovations-in-Wearable-Technologies-are-Flourishing-the-E-learning-Industry.html?utm_source=emojics&utm_campaign=How%20to%20Collect%20User%20Feedback%20for%20Your%20E-Learning%20Platform. [Accessed: 21 February 2020].
[6] N. Altrabsheh, M. Cocea and S. Fallahkhair, "Sentiment Analysis: Towards a Tool for Analysing Real-Time Students Feedback," 2014 IEEE 26th International Conference on Tools with Artificial Intelligence, 2014, doi: 10.1109/ICTAI.2014.70. [Accessed 21 February 2020].
[7] J. Sultana, N. Sultana, K. Yadav and F. Alfayez, "Prediction of Sentiment Analysis on Educational Data based on Deep Learning Approach," 2018 21st Saudi Computer Society National Computer Conference (NCC), 2018.