Dataset
A Dataset takes a database schema (tables and columns) and adds Fides privacy categorizations. This is a database-agnostic way to annotate privacy declarations.
1 2 3 4 5 |
|
-
The schema is represented as a set of "collections" (tables) that contain "fields" (columns). These can also be arbitrarily nested to handle document-type databases (e.g., NoSQL or S3).
-
At each level -- Dataset, collection, and field, you can assign one or more Data Categories. The Categories declared at each child level are additive.
You use your Datasets by adding them to Systems. A System can contain any number of Datasets, and a Dataset can be added to any number of Systems. When a dataset is referenced by a system, all applicable data categories set on the dataset are treated as part of the system. If a Dataset is not referenced by a System, a warning is surfaced denoting an orphan dataset exists.
Datasets cannot contain other Datasets.
Object Structure
fides_key constrained string
A string token of your own invention that uniquely identifies this Dataset. It's your responsibility to ensure that the value is unique across all of your Dataset objects. The value may only contain alphanumeric characters, underscores, and hyphens. ([A-Za-z0-9_.-]
).
name string
A UI-friendly label for the Dataset.
description string
A human-readable description of the Dataset.
organization_fides_key string default: default_organization
The fides key of the Organization to which this Dataset belongs.
meta object
An optional object that provides additional information about the Dataset. You can structure the object however you like. It can be a simple set of key: value
properties or a deeply nested hierarchy of objects. How you use the object is up to you: Fides ignores it.
data_categories [string]
Arrays of Data Category resources, identified by fides_key
, that apply to all collections in the Dataset.
collections [object]
An array of objects that describe the Dataset's collections.
collections.name string
A UI-friendly label for the collection.
collections.description string
A human-readable description of the collection.
collections.data_categories [string]
Arrays of Data Category resources, identified by fides_key
, that apply to all fields in the collection.
collections.fields [object]
An array of objects that describe the collection's fields.
collections.fields.name string
A UI-friendly label for the field.
collections.fields.description string
A human-readable description of the field.
collections.fields.data_categories [string]
Arrays of Data Categories, identified by fides_key
, that applies to this field.
collections.fields.fields [object]
An optional array of objects that describe hierarchical/nested fields (typically found in NoSQL databases)
Examples
Manifest File
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 |
|
API Payload
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
|