Detección de fraude financiero mediante redes neuronales de clasificación en un caso real español

ELENA BADAL-VALERO; BELÉN GARCÍA-CÁRCELES

doi:10.25115/EAE.V34I3.3075

Detección de fraude financiero mediante redes neuronales de clasificación en un caso real español

ELENA BADAL-VALERO ¹
BELÉN GARCÍA-CÁRCELES ¹

1 Universitat de València

Universitat de València

Valencia, España

ROR https://ror.org/043nxc105

Journal:

Estudios de economía aplicada

ISSN: 1133-3197, 1697-5731

Year of publication: 2016

Issue Title: Datos, información y conocimiento en Economía

Volume: 34

Issue: 3

Pages: 693-710

Type: Article

DOI: 10.25115/EAE.V34I3.3075 DIALNET GOOGLE SCHOLAR Dialnet editor

More publications in: Estudios de economía aplicada

Sustainable development goals

Abstract

This paper explores the possibilities offered by statistical tools based on artificial neural networks for pattern recognition in expert work for money-laundering detection. The data is provided by the Spanish Police Department and comes from a case in which is actually working at. Account information is provided, where some accounting entries are identified as fraud. Hence it is possible to use this information to train a classification model. In this analysis, after briefly describing methodology used and fitting strategy, it is presented a model with a promising predictive capacity, even with strongly unbalanced training data set. After applying balancing technique to the training data (SMOTE) the result is remarkably improved which would indicate the viability of those models as tool for police experts planification, providing a way to reduce the use of expensive research resources.

€ View funding

Funding information

Las autoras agradecen el apoyo del Ministerio de Economía y Competitividad a través del proyecto CSO2013-43054-R.

Funders

Ministerio de Economía y Competitividad Spain
- CSO2013-43054-R

Bibliographic References

BECK, M.W. (2015). “NeuralNetTools: Visualization and Analysis Tools for Neural Networks”. Version 1.4.0. Disponible en: http://cran.r-project.org/web/packages/ NeuralNetTools/ [27/07/2016].
BÜCHLMANN, P., YU B. (2002). “Analyzing Bagging”. The Annals of Statistics. Vol. 30, No. 4, pp. 927-961.
CHAWLA, N. V., BOWYER, K. W., HALL, L. O., y KEGELMEYER, W. P. (2002). “Smote: Synthetic minority over-sampling technique”. Journal of Artificial Intelligence Research, pp. 321-357.
CARIDAD, J. M., & CEULAR, N. (2001). “Un análisis del mercado de la vivienda a través de redes neuronales artificiales”. Estudios de economía aplicada, (18), pp. 67-81.
CHAWLA, N. V., LAZAREVIC, A., HALL, L. O., and BOWYER, K. W. (2003b). “Smoteboost: Improving Prediction of the Minority Class in Boosting”. In Seventh European Conference on Principles and Practice of Knowledge Discovery in Databases, Vol.16, pp. 107-1 19, Dubrovnik, Croatia.
DUNCAN, L.T., and CRAN TEAM.(2016). Package ‘RCurl’. Disponible en: https://cran.r-project.org/web/packages/RCurl/index.html [27/07/2016].
DUTTA, S. (2013). Statistical Techniques for Forensic Accounting. Upper Saddle River (NJ): FT Press.
GOH, A.T. (1995). “Back-propagation neural networks for modelling complex systems”. Artificial Intelligence in Engineering, Vol.9, nº3, pp. 143-151.
HASTIE, T., TIBSHIRANI, R., y FRIEDMAN, J. (2008). The Elements of Statistical Learning. Data Mining, Inference, and Prediction (2nd ed.). Standfor: Springer. (pp. 392-396).
HEIDARINIA, N., HAROUNABADI, A., y SADEGHZADEH, M. (2014). “An intelligent Anti-Money Laundering Method for Detecting Risky Users in the Banking Systems”. International Journal of Computer Applications. No.22, pp. 35-39.
JAPKOWICZ, N. (2000), “The Class Imbalance Problem: Significance and Strategies”. In Proceedings of the 2000 International Conference on Artificial Intelligence (IC-AI'2000): Special Track on Inductive Learning, Las Vegas, Nevada.
KHAC, N. L., y KECHADI, M. (2010). “Application of Data Mining for Anti-Money Laundering Detection: A Case Study”. IEEE International Conference on Data Mining Workshops.
LIN-TAO, JI, N., y ZHANG, J.-L. (2008). “A RBF neural network model for anty-money laundering”. International Conference on Wavalet Analysis and Pattern Recognition, pp. 209-215.
MEIR, R., y RÄSTCH, G. (2003). “An introduction to boosting and leveraging”. Lecture Notes in Computer Science, pp 118-183.
MUÑIZ, P. y J. A. ÁLVAREZ (1997). “Comportamiento del Mercado: Hipótesis alternativas”. Revista de Bolsas y Mercados Españoles, Vol.60, pp 29-33.
NGAI, E., HU, Y., WONG, Y.,CHEN, Y., y SUN, X. (2011). “The application of data mining techniques in financial fraud detection: A classification frame work and an academic review of literature”. Decision Support Systems, Vol.50, nº3, pp. 559-569.
OLDEN, D. (2005). “Illuminating the "black box": a randomization approach for understanding variable contributions in artificial neural networks”. Ecological Modelling, nº 154, pp. 135-150.
OLMEDO, E., VELASCO, F., & VALDERAS, J. M. (2007). “Caracterización no lineal y predicción no paramétrica en el IBEX35”. Estudios de Economía Aplicada, 25(3).
PETRUCELLI, J. (2012). Detecting Fraud in Organizations: Techniques, Tools, and Resources. Washington DC: John Wiley & Sons, Inc.
R CORE TEAM (2015). R: A language and environment for statistical computing. R Foundation for Statistical Computing,Vienna, Austria, ISBN 3-900051-07-0, URL http://www.R-project.org/
RIPLEY. (1996). Pattern Recognition and Neural Networks. Cambridge University:Press.
SHMIELD, R. y AMES, M. (2013). “Next generation detection engine for fraud and compliance”. SAS Global Forum, pp. 1-6.
TORGO, L. (2010) Data Mining using R: learning with case studies, CRC Press (ISBN: 9781439810187).
UBERBACHER, E. C., & MURAL, R. J. (1991). “Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach”. Proceedings of the National Academy of Sciences, 88(24), pp.11261-11265.
U.S. CONGRESS, OFFICE OF TECHNOLOGY ASSESSMENT. (1995). “Information Technologies for Control of Money Laundering”. Washington, DC: U.S: U.S. Government Printing Office. pp. 55-72.
VENABLES , W.N. y RIPLEY, B. (2002). Modern Applied Statistics with S. 4th Edition. New York: Springer.
WICKHAM, H. (2015). stringr: Simple, Consistent Wrappers for Common String Operations. R package version 1.1.0. https://CRAN.R-project.org/package=stringr
WICKHAM, H, y CHANG, W. (2016). devtools: Tools to Make Developing R Packages Easier. R package version 1.12.0. https://CRAN.R-project.org/package=devtools.

Data source: Dialnet

Detección de fraude financiero mediante redes neuronales de clasificación en un caso real español

Universitat de València

Sustainable development goals

Abstract

Funding information

Funders

Bibliographic References