Projecten per jaar
Uittreksel
Documentation and guidelines about the data requirements provide guidance in this process and enable to communicate what to expect from the data, but are mostly intended for humans only. To facilitate the harmonization process, we propose the usage of a specification file, describing the constraints to which the data should comply. Its syntax is human- and machine-readable, so it can be used to communicate expected data quality/conformity and to validate data automatically. The scope of the set of specifications can be specific to a dataset, researcher or research community, which allows bottom-up and top-down adoption. As an example, we apply the specifications to verify data mapped to the biodiversity information standard Darwin Core.
In this talk, we will present "whip", a proposed syntax and format to express data specifications. Whip allows to define column-based constraints for tabular (tidy) data with a number of rules. We will also demonstrate a software application (called "pywhip") to validate data sets using these specifications. We hope it will trigger a discussion on how to express data specifications and communicate data quality expectations.
Oorspronkelijke taal | Engels |
---|---|
Pagina's | 87 |
Aantal pagina’s | 1 |
Publicatiestatus | Gepubliceerd - 24-sep.-2018 |
Technologisch
- ICT
Vrije trefwoorden
- data quality
Projecten
- 1 Actief
-
LifeWatch (EVINBO)
Milotic, T. (Projectleider), Adriaens, T. (Medewerker), Aelterman, B. (Medewerker), Azijn, K. (Medewerker), Casaer, J. (Medewerker), Cools, N. (Medewerker), De Dobbelaer, T. (Medewerker), Desmet, P. (Medewerker), Fostier, C. (Medewerker), Goossens, C. (Medewerker), Huybrechts, P. (Medewerker), Noé, N. (Medewerker), Oldoni, D. (Medewerker), Pauwels, I. (Medewerker), Pollet, M. (Medewerker), Reyserhove, L. (Medewerker), Spanoghe, G. (Medewerker), Stienen, E. (Medewerker), Van Hoey, S. (Medewerker), Van den Bergh, E. (Medewerker), Verhelst, P. (Medewerker) & Wouters, J. (Medewerker)
1/01/12 → 31/12/29
Project: Evinbo
Onderzoeksoutput
- 1 Software/Code
-
pywhip: v0.3.2
Van Hoey, S., Noé, N. & Desmet, P., 27-aug.-2018, Instituut voor Natuur- en Bosonderzoek.Onderzoeksoutput: Andere bijdrage › Software/Code