2024
Client | BIFOLD / Lennart Behme
Role | illustrator
Project | Fainder is an index for distribution-aware dataset search.
Given a large collection of datasets, the task of dataset search is to identify the datasets relevant to a user query. Distribution-aware search is a subfield of data discovery whose goal is to find datasets that match not only user-specified keywords but also requirements on the statistical properties of the data.
For example, a user might be interested in datasets about medical studies but only want results in which at least 50% of the study participants are older than 60. This is a requirement on the distribution of the “age” property in a tabular dataset. With a simple approach that checks every dataset in turn, such searches would take a very long time. The role of Fainder is to make this search faster.
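The example query can be sketched as the kind of simple baseline that a dedicated index is meant to speed up. This is a minimal illustration and not Fainder's actual method: the function names, the in-memory dictionary layout, and the sample values are all assumptions made up for this sketch.

```python
from typing import Dict, List

# Hypothetical layout: dataset name -> column name -> list of numeric values.
Collection = Dict[str, Dict[str, List[float]]]


def fraction_above(values: List[float], threshold: float) -> float:
    """Return the share of values strictly greater than the threshold."""
    if not values:
        return 0.0
    return sum(1 for v in values if v > threshold) / len(values)


def naive_search(
    datasets: Collection, column: str, threshold: float, min_fraction: float
) -> List[str]:
    """Scan every dataset and keep those that satisfy the distribution requirement."""
    matches = []
    for name, table in datasets.items():
        values = table.get(column)
        if values is None:
            continue  # this dataset has no such column
        if fraction_above(values, threshold) >= min_fraction:
            matches.append(name)
    return matches


# Made-up sample data, for illustration only.
datasets: Collection = {
    "study_a": {"age": [65, 70, 58, 62, 81]},
    "study_b": {"age": [25, 34, 61, 45]},
    "study_c": {"height": [170, 182]},
}

# "At least 50% of the participants are older than 60."
result = naive_search(datasets, column="age", threshold=60, min_fraction=0.5)
print(result)
```

Because this baseline reads every value of every candidate dataset for each query, its cost grows with the total size of the collection, which is why the simple approach becomes slow.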
Concept | An eye and spreadsheets rendered in matching colours to signify the act of finding relevant datasets among many spreadsheets.