Friday, 15 July 2022

A Bibliometric Statistical Analysis of the Fuzzy Inference System - based Classifiers

Source: https://ieeexplore.ieee.org/abstract/document/9439531

Abstract:

Nowadays, under the pressure of numerous research publications, researchers increasingly turn to writing survey papers to track and understand a research topic they are interested in before conducting more in-depth research. At present, there are two types of survey papers: traditional review analysis and bibliometric statistical analysis. Compared with traditional review analysis, bibliometric statistical analysis is increasingly adopted because it analyzes various kinds of bibliometric information that can be quickly summarized to assess and predict the development of a research field. However, no research has relied on the bibliometric approach to explore fuzzy inference system (FIS)-based classifiers. More importantly, since the current open-ended bibliometric analysis approaches have different assessment focuses, choosing a suitable approach is difficult. Therefore, based on the extraction, integration, and expansion of previous bibliometric analysis theories, this research proposes a new systematic and time-saving bibliometric statistical analysis approach. It is worth noting that the proposed approach eliminates the need to read the internal content of all publications. The proposed analysis approach generates two core parts (Publication Information and TOP 20 SETs). Among them, the analyses of Author Keywords and TOP 20 SETs are unprecedented guiding features that assist researchers in exploring a research topic. This research then relies on the proposed approach to explore FIS-based classifiers. The assessments cover the bibliometric information of all related publications. In addition, these results may be worth considering to increase the citation rate of future publications.

Published in: IEEE Access (Volume: 9)

Page(s): 77811 - 77829

Date of Publication: 24 May 2021

Electronic ISSN: 2169-3536

INSPEC Accession Number: 20995089

DOI: 10.1109/ACCESS.2021.3082908

Exploring a research topic based on the proposed systematic and time-saving bibliometric statistical analysis approach. In this study, the explored topic is FIS-based cla...

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

Classification is a fundamental research task in the development of Data Mining and Artificial Intelligence techniques. Since mapping the features of a data sample to a set of category labels is the core task of classifiers, the Fuzzy Inference System (FIS) has gradually attracted researchers’ attention as a proven universal approximator [1]. Moreover, FIS-based classifiers offer greater transparency and interpretability compared with other well-known structures such as Neural Network-based classifiers [2]. Therefore, FIS-based classifiers have become an alternative structure for designing flexible classifiers, together with Bayesian classifiers, Decision Trees, Neural Network-based classifiers, and Support Vector Machines [3]. Nowadays, owing to these characteristics and the current strong performance of FIS, FIS-based classifiers are rapidly developing into one of the indispensable branches of high-performance classifiers.

Among the countless FIS-based classifiers, one vital branch is the Evolvable FIS-based classifiers. These classifiers can be implemented in a structural form based on Neuro-Fuzzy or Fuzzy Rule. Their core advantages include organizing and updating their structure and parameters online and in real time. Therefore, they are generally associated with data stream processing and with approximating a dynamically changing environment [4]. Popular Evolvable FIS-based classifiers include, but are not limited to, the Evolving Takagi Sugeno systems (eTS) [5], [6], eTS+ [7], PANFIS [8], and GENFIS [9]. Uncertainty is inherent in data stream classification, arising from noisy measurements, noisy data, and differences in experts’ knowledge. Whatever information technology or processing method is used, one reasonable solution is to utilize Type-2 FIS (T2FIS)-based classifiers. The branch of T2FIS-based classifiers is founded on Zadeh’s idea (Type-2 Fuzzy Inference Systems), in which fuzzy memberships are generated by a fuzzy-fuzzy set [10]. Notably, the Interval Type-2 Fuzzy Logic System is a popular topic in the development of traditional T2FISs because it reduces their complex representation. Therefore, some classifiers based on the Interval Type-2 Fuzzy Logic System have become prominent and widely employed [11]. For a long time, most of the above classifiers were developed based on TS-type FIS. AnYa-type FIS [12] was introduced as an alternative fundamental structure to the traditional FISs (Mamdani- and TS-type). Its advantage is that the antecedent part of the Fuzzy Rule is free of parameters, logical connectors, aggregation operators, and membership functions. It is especially worth noting that the AnYa-type fuzzy system uses new data analysis techniques: its antecedent part (IF) relates the analyzed information to Data Clouds [13].
Current research on AnYa FIS-based classifiers for data streams focuses on addressing high-dimensional, complex, or large-scale problems [14].

To date, many researchers are still developing and improving FIS-based systems for FIS-based classifier modeling to cope with increasingly complex classification problems. The number of FIS-based classifiers is therefore not limited to those mentioned above. Consequently, before undertaking more in-depth research, new researchers face severe challenges in determining research trends and modeling the entire research architecture of FIS-based classifiers. Fortunately, two powerful databases (Web of Science and SCOPUS) make it possible to overcome these difficulties. When using the databases, researchers who do not consider the particularity of the research topic tend to pay close attention to the number of citations and the citation rate. Highly cited publications can help researchers find potential research fields or high-quality papers, evaluate the most cited publications in related research, and successfully publish high-quality papers. Therefore, trend information is valuable and can speed up research progress. Although neither citations nor citation rates are strictly scientific criteria for evaluating publications, they can serve as practical guides and support for determining research topics. However, the databases often list too many related publications. We call this situation over-fitting. So many publications usually involve too many research fields, including some areas that we want to ignore. Therefore, linking appropriate Keywords to search in the databases has become a challenge. Suppose we already have a Keyword link to collect related publications. If the databases list a large number of results, then even if we are satisfied with them, we are still forced to add more rules to limit the results so that they can be analyzed and processed within the time allowed. Therefore, the trade-off between coverage and reviewability is also a challenge.
Under this pressure, survey papers on research topics are like treasures that help us advance research. However, writing a traditional survey paper takes a long time, and such a paper usually focuses in depth on a small part of one research topic. Therefore, exploring different survey methods to summarize and evaluate research topics is becoming a research problem in its own right. In the current situation, some studies have begun to pay attention to bibliographic information. The core idea is a statistical analysis of bibliometric information together with citation indexes, which can directly yield an evaluation of the quality of the articles examined. Therefore, bibliometric analysis can help researchers understand specific publications and research fields and further promote more in-depth research on their literature.

Following early studies (in fields such as social science [15], knowledge management [16], and fuzzy research [17]) that analyzed bibliometric information, more and more researchers have begun to focus on how to develop the analytical approach and use it to explore and evaluate the development of a research topic. Bibliometric statistical analysis is defined as the application of mathematical and statistical methods to analyze publications [18]. Hence, the analysis approach has been developed as a standard research tool for systematic analysis to measure scientific progress [19]. This definition means that the approach can help researchers recognize research trends and evaluate scientific manuscripts [20]. Among recent studies, reference [21] used bibliometric analysis to study m-learning publications in Web of Science (WoS). It provided readers with common systematic statistical information to deepen their understanding. In reference [22], another bibliometric method was used to emphasize critical themes and various research trends. In particular, it used questionnaires to identify Keywords for collecting related publications. It also showed that peer-reviewed high-impact journals and academic databases could provide helpful and reliable research information. Reference [23] relied on a mixed method, combining bibliometric analysis and traditional review analysis, to complete quantitative and qualitative analysis. Although the above studies provided various bibliographic analysis methods, the poor trade-off among the various kinds of bibliographic information and the lack of some critical bibliographic information are still apparent. Besides, because their bibliographic statistical analysis approaches have different focuses, it is difficult to determine which one is better than the others.
In particular, the study in [23] may raise the question of whether bibliometric statistical analysis lacks research value on its own, so that survey papers still have to rely on the traditional survey approach to make up for its shortcomings.

In this research, a new systematic and time-saving bibliometric statistical analysis approach is proposed by extracting, integrating, and expanding the previous bibliometric theories. The new approach ensures the coverage of bibliometric information used to explore a research topic and makes a trade-off among the various kinds of bibliometric information. Meanwhile, this research adopts the proposed approach to determine the popular development trend of FIS-based classifiers. The proposed approach also extracts and summarizes the most relevant research information and resources that have a significant impact on the research topic (FIS-based classifiers). Two well-known databases, WoS and SCOPUS, are used to extract all bibliometric information of publications on FIS-based classifiers. WoS is a structured database that indexes selected top publications, covering the most important scientific achievements. Although WoS is considered one of the largest and most trusted databases for literature search and analysis, SCOPUS journal reports seem more comprehensive than those of WoS [24]. Based on one extracted Keyword link, a total of 2,291 publications are collected from WoS and SCOPUS. The complete publication information is used to analyze and evaluate our research topic comprehensively. The information covers document types, research fields most relevant to FIS-based classifiers, distribution of countries and regions, journals, authors, and Author Keywords. In addition, TOP 20 SETs summarize the high-impact resources on FIS-based classifiers. Therefore, this analysis will serve researchers’ interests when outlining research trends and identifying future research. Meanwhile, this research will help researchers understand and determine the research field of FIS-based classifiers in an objective and credible manner.
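As a rough illustration of pooling publication records from two databases and tallying Author Keywords, the following Python sketch may help. It is not the procedure used in this research: the record fields (`doi`, `title`, `author_keywords`), the toy records, and the rule for detecting duplicates (DOI first, normalized title otherwise) are all assumptions made for the example, since the text does not state how overlapping records from WoS and SCOPUS are handled.

```python
from collections import Counter


def record_key(record):
    """Return a deduplication key: the DOI if present, else the normalized title.

    Assumption: this rule is illustrative only; the paper does not specify one.
    """
    doi = (record.get("doi") or "").strip().lower()
    if doi:
        return ("doi", doi)
    title = " ".join((record.get("title") or "").lower().split())
    return ("title", title)


def merge_records(*sources):
    """Merge record lists, keeping the first occurrence of each key."""
    merged = {}
    for source in sources:
        for record in source:
            merged.setdefault(record_key(record), record)
    return list(merged.values())


def keyword_counts(records):
    """Count how many records list each author keyword (case-insensitive)."""
    counts = Counter()
    for record in records:
        keywords = {
            kw.strip().lower()
            for kw in record.get("author_keywords", [])
            if kw.strip()
        }
        counts.update(keywords)
    return counts


# Hypothetical toy exports, not data from the paper.
wos = [
    {"doi": "10.1000/a", "title": "Paper A",
     "author_keywords": ["Fuzzy classifier", "Data stream"]},
    {"doi": "", "title": "Paper  B",
     "author_keywords": ["Fuzzy classifier"]},
]
scopus = [
    {"doi": "10.1000/A", "title": "Paper A",
     "author_keywords": ["fuzzy classifier"]},
    {"doi": None, "title": "paper b",
     "author_keywords": ["Classification"]},
    {"doi": "10.1000/c", "title": "Paper C",
     "author_keywords": ["Classification"]},
]

merged = merge_records(wos, scopus)
counts = keyword_counts(merged)
```

Keeping the first occurrence of each key means the earlier source wins when both databases index the same publication; a real pipeline might instead merge fields from both records.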

The remaining parts of this paper are arranged as follows: The “METHODOLOGY” section introduces the whole proposed bibliometric statistical analysis method, followed by the “LIMITATION” of this research. Then one merged “RESULTS & DISCUSSION” section (containing two sub-sections: Publication Information and TOP 20 SETs) provides intuitive data analysis results and comparisons. Finally, the “SUMMATION & FUTURES” section provides a summary of the analysis results and points out future work on the proposed approach.

SECTION II.

Methodology

To overcome the multiple shortcomings mentioned above [16], [21]–[23], [25], this section introduces the proposed systematic and time-saving bibliometric statistical analysis approach. The research objectives of proposing the new approach include: 1) ensure the analytical balance and coverage of all key types of bibliometric data; 2) ensure simplified analysis methods and good analytical standards to promote preliminary research and exploration; 3) ensure that the method can output good results and provide high-quality features of a research topic. The proposed approach is derived by extracting, integrating, and expanding the previous bibliometric techniques. Its core is a quantitative analysis of all relevant publications collected by a Keyword link searched in paper titles. These features make it possible to explore numerous publications without worrying about insufficient quality of the analysis results. Furthermore, compared with traditional review analysis, it has a high degree of flexibility in restricting and collecting publication resources, and it can point out research trends and extract research resources with significant impact. Meanwhile, compared with previous bibliometric statistical analysis, the proposed approach also makes the analysis process more standardized and controllable, thereby greatly improving the robustness of the bibliometric analysis. These characteristics ensure reliable analysis of research topic publications from different perspectives. Table 1 summarizes the comparison results between the proposed approach and the other latest bibliometric analysis methods. To evaluate its performance, this research utilizes the proposed approach to label the number, characteristics, and productivity of all publications related to “FIS-based classifiers”. At the same time, it explores target areas to further determine possible research trends.

TABLE 1 Comparison of Current Bibliometric Methods

The proposed approach includes two parts, namely Data Collection and Data Analysis. Data Collection describes the process of defining the Query and collecting publications. It also contains a direct evaluation method for assessing the effectiveness of the collected publications. Data Analysis introduces in detail the systematic and standard analysis methods used for bibliometric analysis. The entire research process is shown in Figure 1.

FIGURE 1.

The research flow of the proposed bibliometric approach.
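To make the Data Collection step more concrete, the following Python sketch shows one way a Keyword link could be applied to paper titles. It is only an illustration under stated assumptions: the Query terms, the AND-of-OR-groups query structure, and the toy records are hypothetical and are not the Query defined in this research.

```python
from collections import Counter


def matches_query(title, groups):
    """True if the title contains at least one term from every group (AND of ORs)."""
    normalized = " ".join(title.lower().split())
    return all(any(term in normalized for term in group) for group in groups)


def filter_by_title(records, groups):
    """Keep only the records whose title satisfies the query."""
    return [r for r in records if matches_query(r["title"], groups)]


# Hypothetical query: (fuzzy) AND (classifier OR classification).
QUERY = [("fuzzy",), ("classifier", "classification")]

# Hypothetical toy records, not data from the paper.
records = [
    {"title": "An Evolving Fuzzy Classifier for Data Streams", "year": 2019},
    {"title": "Fuzzy Control of a Robot Arm", "year": 2018},
    {"title": "Deep Networks for Image Classification", "year": 2020},
    {"title": "Interval Type-2 Fuzzy  Classification of Signals", "year": 2019},
]

selected = filter_by_title(records, QUERY)

# A first Data Analysis statistic: number of selected publications per year.
per_year = Counter(r["year"] for r in selected)
```

Substring matching on lowercased titles is deliberately simple; real database search syntax (wildcards, field tags, proximity operators) differs between WoS and SCOPUS and is not modeled here.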
