Page 850 - Proceedings Collega2023
P. 850

collected from the first time it was posted by KiniTV on facebook on 17 of March 2023 until 23 July 2023,
               consists of 1087 comments. To maintain ethical standards and respect user privacy, all data collected are
               publicly accessible and do not involve any unauthorized access to private information. The collected data
               undergo a rigorous preprocessing phase, which includes text cleaning, removal of duplicates, and
               anonymization of user information. Textual content is tokenized, and any personally identifiable
               information is redacted to ensure anonymity and compliance with ethical guidelines.

               Correlational Analysis

                       This approach employed to identify relationships and associations among the various themes
               that emerge from the ethnographic data. This statistical technique allows us to uncover patterns of co-
               occurrence, causality, and interdependence among different themes, providing insights into the
               underlying structure of political discourse. To run this complex analysis, I am using RStudio with several
               main packages such as iGraph and ggraph. By combining virtual ethnography with correlational analysis,
               I seek to transcend the limitations of traditional sentiment analysis methods and gain a deeper
               understanding of the key themes that shape online political discussions. This research methodology
               allows us to explore the multifaceted and dynamic nature of social contentious discourse, providing
               valuable insights for academia, and society as a whole.

               Analysis and Result

                       The analysis has been conducted in several different segments, including an overall comment, as
               well as categorically by ethnic groups such as the Malay, Chinese, Indian, and unknown ethnic
               backgrounds. The analysis consisted of two measures. The differences are in the number of words that
               appear or used in interactions have been categorized into two criteria: those used more than 5 times and
               those used more than 10 times, which split the measure into two part, the general themes and also
                             i
               primary theme.  For some groups, the number of comments are a bit low compare to the other.
                       For instance, the comments from Indian ethnic group in this matter is not many as Malay and
               Chinese, hence the then number of words that appears are also lowered in the analysis to ensure that
               the meaning and theme of the interactions are successfully translated.  From the words networks that
               appeared, we only select word combinations that are relevant and meaningful to the discussion. The
               analysis procedure is divided into three main steps. Firstly, it involves identifying and understanding the
               emerging themes. Secondly, it deems it important to consider uniformity and balance within the word
               combinations' interactions in order to understand them. At the third stage, the meaning is assessed
               based on the emerging themes, followed by discussions on the findings.
               The diagram in Figure 1 illustrates the general themes that occurred within the entire sample taken,
               where the correlation was at 3% or higher, and n≥5. From this interaction, we can clearly observe several
               words networks that appear to form the overall interaction themes. Figure 2 presented the primary
               themes at 3% or higher, and n≥10.

               Figure 1 Overall Comments n ≥ 5










               International Conference on Local Wisdom of the Malay Archipelago (COLLEGA 2023) Page - 837 -
   845   846   847   848   849   850   851   852   853   854   855