Journal of Scientometric Research, 2023, 12, 1, 44-53
DOI: 10.5530/jscires.12.1.008
Published: April 2023
Type: Research Article
Galal M. Bin Makhashen*, Hamdi A. Al-Jamimi
Research Institute, King Fahd University of Petroleum and Minerals, Dhahran, SAUDI ARABIA.
Abstract:
Highly cited articles capture the attention of significant contributors in the research community, who see them as an opportunity to improve their knowledge, as a source of ideas or solutions, and as a way to advance their research in general. Typically, these articles are authored by a large number of scientists with international collaboration. However, this may not be the only reason an article becomes highly cited; several other characteristics may make an article more attractive to researchers and readers. In other words, a few other characteristics help some articles appear more readily than others in search engines or grab readers' attention. In this study, we trained several machine-learning methods on a set of article and journal characteristics, including author count, title characteristics, abstract length, international collaboration, number of keywords, funding information, etc. We extracted 20 characteristics and developed multiple machine-learning models to automatically distinguish highly cited papers (HCPs) from regular papers. In experiments conducted with an ensemble machine-learning algorithm, 97% recognition accuracy was achieved. Other algorithms, including a deep learning method using LSTMs, also achieved high recognition accuracy. Such high performance could support a promising HCP auto-detection system in the future.
Keywords: Artificial Intelligence, Machine Learning, Highly Cited Paper Indicators, Digital Libraries, Bibliometric Analysis.
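As an illustration of the kind of article characteristics named in the abstract, the following minimal Python sketch derives a handful of them from a single bibliographic record. The `Paper` record, its field names, and the `extract_features` helper are hypothetical and introduced only for this example; the study's actual 20 characteristics and their exact definitions are not specified here.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Paper:
    """Hypothetical bibliographic record; field names are assumptions."""
    title: str
    abstract: str
    authors: List[str]
    countries: List[str]   # one affiliation country per author
    keywords: List[str]
    funded: bool


def extract_features(paper: Paper) -> Dict[str, int]:
    """Turn one record into a few numeric characteristics.

    Only a subset of the characteristics mentioned in the abstract is
    shown; the definitions below are illustrative assumptions.
    """
    return {
        "authors_count": len(paper.authors),
        "title_word_count": len(paper.title.split()),
        "title_has_colon": int(":" in paper.title),
        "abstract_word_count": len(paper.abstract.split()),
        # International collaboration: more than one distinct country.
        "international_collab": int(len(set(paper.countries)) > 1),
        "keywords_count": len(paper.keywords),
        "has_funding": int(paper.funded),
    }


if __name__ == "__main__":
    sample = Paper(
        title="Deep learning: a survey",
        abstract="one two three four",
        authors=["A", "B", "C"],
        countries=["SA", "US", "SA"],
        keywords=["ml", "ai"],
        funded=True,
    )
    print(extract_features(sample))
```

A feature dictionary of this shape could then be stacked into a matrix and passed to any classifier, ensemble or otherwise; the choice of model is independent of this extraction step.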