Restat, Valerie and Diestelkaemper, Indra and Klettke, Meike and Stoerl, Uta (2025) FONDUE-Fine-Tuned Optimization: Nurturing Data Usability & Efficiency. JOURNAL OF BIG DATA, 12 (1): 131. ISSN 2196-1115
Full text not available from this repository. (Request a copy)Abstract
To provide good results and decisions in data-driven systems, data quality must be ensured as a primary consideration. An important aspect of this is data cleaning. Although many different algorithms and tools already exist for data cleaning, an end-to-end data quality solution is still needed. In this paper, we present FONDUE, our vision of a well-founded end-to-end data quality optimizer. In contrast to many studies that consider data cleaning in the context of machine learning, our approach focuses on various scenarios, such as when preprocessing and downstream analysis are separated. As an adaptive and easily extendable framework, FONDUE operates similarly to proven methods of database query optimization. Analogously, it consists of the following parts: Rule-based optimization, where the appropriate data cleaning algorithms are selected based on use case constraints, optimizer hints in the form of best practices, and cost-based optimization, where the costs are measured in terms of data quality. Accordingly, the result is an optimized data cleaning pipeline. The choice of different optimization goals enables further flexibility, e.g. for environments with limited resources. As a first building block of FONDUE, we present CheDDaR, which is used to detect errors and measure data quality. Both are important tasks for improving data quality with FONDUE.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | QUERY OPTIMIZER; ALGORITHM; Data quality; Data cleaning; Optimization |
| Subjects: | 000 Computer science, information & general works > 004 Computer science |
| Divisions: | Informatics and Data Science > General computer science > Data Engineering (Prof. Dr.-Ing. Meike Klettke) |
| Depositing User: | Dr. Gernot Deinzer |
| Date Deposited: | 17 Jun 2026 08:56 |
| Last Modified: | 17 Jun 2026 08:56 |
| URI: | https://pred.uni-regensburg.de/id/eprint/66904 |
Actions (login required)
![]() |
View Item |

