A Framework to process Text Data of Web Discussion Forums A Study of LisLinks

  • Mohit Garg Indira Gandhi National Tribal University, Amarkantak, India
  • Uma Kanjilal Indira Gandhi National Open University, New Delhi , India
Keywords: Text mining, Discussion forums, LIS Links, Data pre-processing

Abstract

Nowadays, people use the internet for both seeking and disseminating information in a collaborative way on various social media platforms like Quora, Yahoo Answers, LisLinks Forum, etc. This social interaction on different topics makes these platforms as a knowledge repository. Evaluation of these repositories can help to understand various trends. However, this evaluation is a challenging task because of unstructured data and the unavailability of application programming interfaces for the harvesting of a dataset. This study presented a framework to harvest and pre-processing of data available on LisLinks Forum. The proposed framework is implemented using statistical programming language R. The fourteen metadata elements were defined for the discussion forums. The framework automatically harvest and pre-process relevant data of posts.

Author Biographies

Mohit Garg, Indira Gandhi National Tribal University, Amarkantak, India
Mr. Mohit Garg is working as Assistant Librarian in Indira Gandhi National Tribal University, Amarkantak, MP, India. He is pursuing PhD from School of Social Science, Indira Gandhi National Open University under the guidance of Prof. Uma Kanjilal. His area of interest are Information Retrieval, Data Science, Quantitative analysis and Machine learning.
Uma Kanjilal, Indira Gandhi National Open University, New Delhi , India
Prof. Uma Kanjilal is Professor and Head in the Faculty of Library and Information Science in the Indira Gandhi National Open University (IGNOU) is also holding charge of Director of Centre for Online Education (COE) at IGNOU. She has more than 29 years of experience in the Open and Distance Learning System. Her specialization includes e-learning, multimedia courseware development, ICT applications in Libraries and Digital Libraries.
Published
2019-12-16
How to Cite
Garg, M., & Kanjilal, U. (2019). A Framework to process Text Data of Web Discussion Forums A Study of LisLinks. DESIDOC Journal of Library & Information Technology, 39(06), 315-321. https://doi.org/10.14429/djlit.39.06.15145