Identifying Fraud in Automobile Insurance Using Naïve Bayes Classifier

Dadang Amir Hamzah, Annisa Sentya Hermawan, Shintya Jasmine Pertiwi, Syarifah Intan Nabilah

Abstract


In this article, the Naïve Bayes Classifier is employed to detect fraud in automobile insurance. The Naïve Bayes classier is a simple probabilistic method based on the Bayes theorem. The data used in this article is determined from databricks.com which consists of 40 attributes and 1000 entries. The target attribute that will be predicted consists of two categories,” yes" or "no", which inform whether there is a fraud or not. The Data is split into training and testing with suitable proportions. Based on training data, the Naïve Bayes Classifier is applied to the testing data and returns the predictions data. Then, the prediction data is compared with the actual data to see the performance of the method. The result shows that the Naïve Bayes Classifier gives a good result to predict the insurance fraud with 78% accuracy, 67% precision, 3% of recall,  and  6% of F1 score  for “Yes”

Keywords


Automobile Insurance; Fraud Detection; Naïve Bayes Classifier

Full Text:

PDF

References


A. Bodaghi and B. Teimourpour, “Automobile Insurance Fraud Detection Using Social Network Analysis,” 2018. doi: 10.1007/978-3-319-95810-1_2.

M. A. Caruana and L. Grech, “Automobile insurance fraud detection,” Commun Stat Case Stud Data Anal Appl, vol. 7, no. 4, 2021, doi: 10.1080/23737484.2021.1986169.

A. Ghorbani and S. Farzai, “Fraud Detection in Automobile Insurance using a Data Mining Based Approach,” International Journal of Mechatronics, Electrical and Computer Technology (IJMEC), vol. 8, no. 27, 2018.

X. Liu, J. B. Yang, D. L. Xu, K. Derrick, C. Stubbs, and M. Stockdale, “Automobile insurance fraud detection using the evidential reasoning approach and data-driven inferential modelling,” in IEEE International Conference on Fuzzy Systems, 2020, vol. 2020-July. doi: 10.1109/FUZZ48607.2020.9177589.

S. K. Majhi, S. Bhatachharya, R. Pradhan, and S. Biswal, “Fuzzy clustering using salp swarm algorithm for automobile insurance fraud detection,” in Journal of Intelligent and Fuzzy Systems, 2019, vol. 36, no. 3. doi: 10.3233/JIFS-169944.

Y. Wang and W. Xu, “Leveraging deep learning with LDA-based text analytics to detect automobile insurance fraud,” Decis Support Syst, vol. 105, 2018, doi: 10.1016/j.dss.2017.11.001.

R. W. Pratiwi, D. Yusuf, and S. Nugroho, “Prediksi Rating Film Menggunakan Metode Naïve Bayes,” Jurnal Teknik Elektro, vol. 8, no. 2, 2016.

R. James, G., Witten, D., Hastie, T., Tibshirani, An Introduction to Statistical Learning - with Applications in R | Gareth James | Springer. 2013.

D. Berrar, “Bayes’ theorem and naive bayes classifier,” in Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics, vol. 1–3, 2018. doi: 10.1016/B978-0-12-809633-8.20473-1.

W. McKinney and P. D. Team, “Pandas - Powerful Python Data Analysis Toolkit,” Pandas - Powerful Python Data Analysis Toolkit, 2015.




DOI: http://dx.doi.org/10.33021/jafrm.v1i2.3971

DOI (PDF): http://dx.doi.org/10.33021/jafrm.v1i2.3971.g1349

Refbacks

  • There are currently no refbacks.