Hagn, Michael and Heinrich, Bernd and Krapf, Thomas and Schiller, Alexander (2025) Handling imperfection: A taxonomy for machine learning on data with data quality defects. DECISION SUPPORT SYSTEMS, 196: 114493. ISSN 0167-9236, 1873-5797
Full text not available from this repository. (Request a copy)Abstract
In recent years, machine learning (ML) has become ubiquitous in sectors including transportation, security, health, and finance to analyze large amounts of data and support decision-making. However, real-world datasets used in ML often exhibit various data quality (DQ) defects that can significantly impair the performance and validity of ML models and thus also the decisions derived from them. Therefore, a plethora of methods across various research strands have been proposed to address DQ defects and mitigate their negative impact on MLbased data analysis and decision support. This has resulted in a fragmented research landscape, where comparisons and classifications of methods dealing with ML on data with DQ defects are very challenging for both researchers and practitioners. Thus, based on a structured design process, we develop and present a taxonomy for this research field. The taxonomy serves as a systematic framework to classify and organize existing research and methods according to relevant dimensions and facilitates future work in this area. Its reliability, understandability, completeness, and usefulness are supported by an evaluation with external researchers and practitioners. Finally, we identify current trends and research gaps and derive challenges and directions for future research.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | QUESTIONNAIRE; AGREEMENT; Taxonomy; Machine learning; Data quality; Data uncertainty |
| Subjects: | 000 Computer science, information & general works > 004 Computer science 300 Social sciences > 330 Economics |
| Divisions: | Business, Economics and Information Systems > Institut für Wirtschaftsinformatik > Lehrstuhl für Wirtschaftsinformatik II (Prof. Dr. Bernd Heinrich) Informatics and Data Science > Department Information Systems > Lehrstuhl für Wirtschaftsinformatik II (Prof. Dr. Bernd Heinrich) |
| Depositing User: | Dr. Gernot Deinzer |
| Date Deposited: | 06 May 2026 08:49 |
| Last Modified: | 06 May 2026 08:49 |
| URI: | https://pred.uni-regensburg.de/id/eprint/66167 |
Actions (login required)
![]() |
View Item |

