Metode Schema Matching berbasis Linguistic dan Constraint untuk Integrasi Database di Sekolah

Suwanto Raharjo, Ema Utami, Omar Muhammad Altoumi Alsyaibani

Abstract


Integrasi data kini telah menjadi kebutuhan bagi setiap organisasi. Organisasi yang tidak mengintegrasikan data antara satu divisi dengan divisi lainnya akan menimbulkan kemungkinan terjadinya redundansi dan data yang tidak konsisten. Proses integrasi data umumnya terkendala oleh penggunaan istilah dan penulisan nama tabel serta atribut yang berbeda. Dalam penelitian ini, metode Linguistik dikombinasikan dengan metode Constraint digunakan untuk menemukan kesamaan antara atribut database yang berbeda. Metode Bigram digunakan sebagai metode linguistik. Atribut yang diusulkan untuk dihapus oleh Bigram ditinjau kembali dari aspek Constraint. Dengan menggunakan metode ini, 8 tabel dan 60 atribut dapat direduksi dari tujuh database. Hasil penelitian menunjukkan bahwa metode ini mempunyai akurasi di atas 99% pada semua skenario. Di sisi lain, Precision terendah terdapat saat membandingkan atribut antara database Administrasi_siswa dan Kesiswaan, yaitu hanya mencapat 70%. Meskipun masih ada beberapa kesalahan yang dilakukan oleh Bigram sebagai metode Linguistik, kesalahan tersebut dapat ditutupi dengan menggabungkan metode tersebut dengan metode berbasis Constraint. Pengujian validitas hasil integrasi dilakukan dengan query menggunakan sintaks SQL langsung ke database dan menghasilkan hasil query yang benar.

Keywords


Bigram; Constraint-Based; Data Integration; Linguistic; Schema Matching

Full Text:

PDF

References


F. Hamidi, M. Meshkat, M. Rezaee, and M. Jafari, “Information technology in education,” Procedia Comput. Sci., vol. 3, pp. 369–373, 2011, doi: https://doi.org/10.1016/j.procs.2010.12.062.

U. Dayal and H.-Y. Hwang, “View Definition and Generalization for Database Integration in a Multidatabase System,” IEEE Trans. Softw. Eng., vol. SE-10, no. 6, pp. 628–645, 1984, doi: 10.1109/TSE.1984.5010292.

J. S. P. Fong and K. W. T. Yan, Information Systems Reengineering, Integration and Normalization: Heterogeneous Database Connectivity. Springer International Publishing, 2021.

A. E. Permanasari, H. P. Satyaprabha, A. Suwastono, and G. T. Mulyani, “Pengembangan Basis Data Sistem Informasi Manajemen Rumah Sakit Berbasis Linguistic-based Schema Matching,” J. Nas. Tek. Elektro dan Teknol. Inf., vol. 8, no. 2, pp. 101–106, 2019, doi: http://dx.doi.org/10.22146/jnteti.v8i2.498.

B. Villanyi, P. Martinek, and B. Szikora, “A novel framework for the composition of schema matchers,” in The 14th WSEAS Int’l Conf. on Computers, Latest Trends on Computers. Corfu Island, Greece, 2010, pp. 379–384, doi: https://dl.acm.org/doi/10.5555/1981573.1981641.

D. Jones, “On-Demand Information Delivery: Integration of Patron-Driven Acquisition into a Comprehensive Information Delivery System,” J. Libr. Adm., vol. 51, no. 7–8, pp. 764–776, 2011, doi: 10.1080/01930826.2011.601275.

C. Kavitha, G. S. Sadasivam, and S. N. Shenoy, “Ontology based semantic integration of heterogeneous databases,” Eur. J. Sci. Res., vol. 64, no. 1, pp. 115–122, 2011, [Online]. Available: https://www.researchgate.net/publication/290160598_Ontology_based_semantic_integration_of_heterogeneous_databases.

P. Martinek, “Schema matching methodologies and runtime solutions in SOA based enterprise application integration,” 2009, [Online]. Available: https://repozitorium.omikk.bme.hu/bitstream/handle/10890/869/ertekezes.pdf?sequence=1.

A. Algergawy, E. Schallehn, and G. Saake, “Combining effectiveness and efficiency for schema matching evaluation,” in International Workshop on Model-Based Software and Data Integration, 2008, pp. 19–30, doi: https://doi.org/10.1007/978-3-540-78999-4_4.

M. A. F. Rachman and G. A. P. Saptawati, “Database integration based on combination schema matching approach (case study: Multi-database of district health information system),” in 2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE), 2017, pp. 430–435, doi: 10.1109/ICITISEE.2017.8285544.

G. H. Martono and S. N. Azhari, “Review implementation of linguistic approach in schema matching,” Int. J. Adv. Intell. Informatics, vol. 3, no. 1, pp. 1–9, 2017, doi: https://doi.org/10.26555/ijain.v3i1.75.

A. A. Alwan, A. Nordin, M. Alzeber, and A. Z. Abualkishik, “A survey of schema matching research using database schemas and instances,” Int. J. Adv. Comput. Sci. Appl., vol. 8, no. 10, p. 2017, 2017, doi: 10.14569/IJACSA.2017.081014.

E. Sutanta, R. Wardoyo, K. Mustofa, and E. Winarko, “A Hybrid Model Schema Matching Using Constraint-Based and Instance-Based.,” Int. J. Electr. & Comput. Eng., vol. 6, no. 3, 2016, doi: http://doi.org/10.11591/ijece.v6i3.pp1048-1058.

M. Shrestha, T. X. Tran, B. Bhattarai, M. L. Pusey, and R. S. Aygun, “Schema matching and data integration with consistent naming on protein crystallization screens,” IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 17, no. 6, pp. 2074–2085, 2019, doi: https://doi.org/10.1109/TCBB.2019.2913368.

R. Hammad, A. C. Nurcahyo, A. Z. Amrullah, P. Irfan, and K. A. Latif, “Optimization of data integration using schema matching of linguistic-based and constraint-based in the university database,” Matrix J. Manaj. Teknol. dan Inform., vol. 11, no. 3, pp. 119–129, 2021, doi: https://doi.org/10.31940/matrix.v11i3.119-129.

S. Munir, F. Khan, and M. A. Riaz, “An instance based schema matching between opaque database schemas,” in 2014 4th International Conference on Engineering Technology and Technopreneuship (ICE2T), 2014, pp. 177–182, doi: https://doi.org/10.1109/ICE2T.2014.7006242.

H.-H. Do and E. Rahm, “COMA—a system for flexible combination of schema matching approaches,” in VLDB’02: Proceedings of the 28th International Conference on Very Large Databases, 2002, pp. 610–621, doi: https://doi.org/10.1016/B978-155860869-6/50060-3.

H. Zhao and S. Ram, “Combining schema and instance information for integrating heterogeneous data sources,” Data & Knowl. Eng., vol. 61, no. 2, pp. 281–303, 2007, doi: https://doi.org/10.1016/j.datak.2006.06.004.

P. A. Bernstein, J. Madhavan, and E. Rahm, “Generic schema matching, ten years later,” Proc. VLDB Endow., vol. 4, no. 11, pp. 695–701, 2011, doi: https://doi.org/10.14778/3402707.3402710.

A. P. Ambrosio, E. Métais, and J.-N. Meunier, “The linguistic level: Contribution for conceptual design, view integration, reuse and documentation,” Data & Knowl. Eng., vol. 21, no. 2, pp. 111–129, 1997, doi: https://doi.org/10.1016/S0169-023X(96)00028-6.

M. T. Ozsu and P. Valduriez, “Principles of Distributed Database System,” 2011, doi: https://doi.org/10.1007/978-3-030-26253-2.

N. Choi, I.-Y. Song, and H. Han, “A survey on ontology mapping,” ACM Sigmod Rec., vol. 35, no. 3, pp. 34–41, 2006, doi: https://doi.org/10.1145/1168092.1168097.

R. Ramakrishnan, J. Gehrke, and J. Gehrke, Database management systems, vol. 3. McGraw-Hill New York, 2003.

E. Rahm and P. A. Bernstein, “A survey of approaches to automatic schema matching,” VLDB J., vol. 10, no. 4, pp. 334–350, 2001, doi: https://doi.org/10.1007/s007780100057.

G. Y. Swara and Y. Pebriadi, “Rekayasa Perangakat Lunak Tiket Bioskop Berbasis Web,” J. TEKNOIF, vol. 4, no. 2, pp. 27–39, 2016, doi: https://doi.org/10.21063/jtif.2016.V4.2.27-39.




DOI: http://dx.doi.org/10.26418/jp.v8i2.55852

Refbacks

  • There are currently no refbacks.