
The costs of poor data quality

Journal Name:

Publication Year:

DOI: 10.3926/jiem.2011.v4n2.p168-193
Abstract:

Purpose: Technological developments have led companies to store ever-increasing amounts of data. However, data quality maintenance is often neglected, and poor-quality business data constitute a significant cost factor for many companies. This paper argues that perfect data quality should not be the goal; instead, data quality should be improved only to a certain level. The paper focuses on how to identify that optimal data quality level.

Design/methodology/approach: The paper starts with a review of the data quality literature. On this basis, it proposes a definition of the optimal data maintenance effort and a classification of the costs inflicted by poor-quality data. These propositions are investigated through a case study.

Findings: The paper proposes (1) a definition of the optimal data maintenance effort and (2) a classification of the costs inflicted by poor-quality data. A case study illustrates the usefulness of these propositions.

Research limitations/implications: The paper provides definitions relating to the costs of poor-quality data and the data quality maintenance effort. Future research may build on these definitions, and more studies are needed to develop the paper's contributions further.

Practical implications: As the case study illustrates, the definitions provided by this paper can be used to determine the right data maintenance effort and the costs inflicted by poor-quality data. In many companies, such insights may lead to significant savings.

Originality/value: The paper clarifies what the costs of poor-quality data are and defines their relation to the data quality maintenance effort. This represents an original contribution of value to future research and practice.
Pages: 168-193

