Characterization of Syrian refugees with work permit applications in Turkey: A data mining based methodology


Creative Commons License

Gencosman B. C., İNKAYA T.

EXPERT SYSTEMS WITH APPLICATIONS, vol. 180, 2021 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 180
  • Publication Date: 2021
  • DOI: 10.1016/j.eswa.2021.114846
  • Journal Name: EXPERT SYSTEMS WITH APPLICATIONS
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Computer & Applied Sciences, INSPEC, Metadex, Public Affairs Index, Civil Engineering Abstracts
  • Keywords: Data mining, Self-organizing maps, Decision tree, Association rule mining, Syrian refugees, CLUSTER VALIDITY, PERFORMANCE, MIGRATION, IMPACT
  • Affiliated with Bursa Uludağ University: Yes

Abstract

With technological advancements in data collection systems, data-driven approaches have become a necessity for understanding and managing socioeconomic systems. Motivated by this, we focus on the formal employment of Syrian refugees in Turkey and propose a data mining based methodology to understand their profiles. In this context, Syrian refugees with work permit applications between 2010 and 2018 are examined. The dataset includes demographic properties of the applicants and characteristics of their workplaces. The proposed methodology aims to extract the hidden, interesting, and useful characteristics of Syrian refugees with formal employment potential. The proposed approach integrates several data mining tasks, i.e. clustering, classification, and association rule mining, and it has four phases. In the first phase, data pre-processing and visualization operations are performed. In the second phase, the profiles of the Syrian refugee workers are determined using clustering; a self-organizing map and hierarchical clustering are implemented for this purpose. In the third phase, a decision tree is used to specify the distinguishing characteristics of the clusters. In the fourth phase, association rules are generated to reveal the interesting and frequent properties of each cluster. The results reveal the profiles of Syrian refugees with work permit applications. The findings of this study can serve as a basis for developing policies and strategies that facilitate the labor market integration of immigrants. The proposed methodology can also be used to analyze time-dependent patterns and immigration data for other countries.
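To make the fourth phase concrete, the following is a minimal sketch of how association rules might be scored within a single cluster using support and confidence. It is not the authors' implementation: the function name `mine_rules`, the thresholds, and the toy records with placeholder attribute values (`sector=A`, `education=X`, and so on) are all assumptions made for illustration, and the sketch is limited to rules with one item on each side.

```python
from collections import Counter
from itertools import combinations


def mine_rules(records, min_support=0.5, min_confidence=0.8):
    """Score one-to-one rules (antecedent -> consequent) in one cluster.

    support    = share of records containing both items
    confidence = share of records with the antecedent that also
                 contain the consequent
    Thresholds are illustrative assumptions, not values from the paper.
    """
    n = len(records)
    item_counts = Counter()
    pair_counts = Counter()
    for record in records:
        items = sorted(set(record))
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue  # the item pair is not frequent enough
        for antecedent, consequent in ((a, b), (b, a)):
            confidence = count / item_counts[antecedent]
            if confidence >= min_confidence:
                rules.append((antecedent, consequent, support, confidence))
    # Strongest rules first; ties broken alphabetically for stable output.
    return sorted(rules, key=lambda r: (-r[3], -r[2], r[0], r[1]))


# Hypothetical records for one cluster; the values are placeholders,
# not data from the study.
toy_cluster = [
    {"sector=A", "education=X", "age=young"},
    {"sector=A", "education=X"},
    {"sector=A", "education=Y"},
    {"sector=B", "education=X", "age=young"},
]

rules = mine_rules(toy_cluster)
for antecedent, consequent, support, confidence in rules:
    print(f"{antecedent} -> {consequent} "
          f"(support={support:.2f}, confidence={confidence:.2f})")
```

In a full pipeline, a routine of this kind would be run once per cluster produced by the second phase, so that the resulting rules describe the frequent properties of each profile separately.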