Base schema used by the DataLad dataset (de)serializer

At the moment, this schema is largely identical to the distribution schema. The aim is to keep the differences to a minimum.

The main additions are:

  • custom namespaces for Git SHA, git-annex key, git-annex remote, and DataLad dataset identifiers
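As a rough illustration of what such namespaces enable, the sketch below expands compact identifiers (CURIEs) against a prefix map. The prefix names and namespace IRIs are hypothetical placeholders, not the ones this schema declares, and `expand_curie` is an illustrative helper rather than part of any DataLad API.

```python
# Hypothetical prefix map: these names and IRIs are placeholders for
# illustration only, not the namespaces declared by the schema.
PREFIXES = {
    "gitsha": "https://example.org/ns/gitsha/",
    "annexkey": "https://example.org/ns/annexkey/",
    "annexremote": "https://example.org/ns/annexremote/",
    "dlds": "https://example.org/ns/datalad-dataset/",
}


def expand_curie(curie: str, prefixes: dict[str, str] = PREFIXES) -> str:
    """Expand 'prefix:reference' into a full IRI.

    Values whose prefix is not in the map (including full URIs such as
    'https://...') are returned unchanged.
    """
    prefix, sep, reference = curie.partition(":")
    if sep and prefix in prefixes:
        return prefixes[prefix] + reference
    return curie


print(expand_curie("gitsha:0123abc"))
print(expand_curie("https://example.org/x"))
```

With a dedicated prefix per identifier kind, a consumer can tell a Git SHA from a git-annex key by looking at the prefix alone, without parsing the reference part.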

Name: datalad-dataset-schema

Schema Diagram

erDiagram
Thing {
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Identifier {
    string notation
}
Characteristic {
    string description
    uriorcurie is_defined_by
    string name
    string title
    uriorcurie type
    uriorcurie range
    string value
}
Property {
    uriorcurie meta_type
    string description
    uriorcurie is_defined_by
    string name
    string title
    uriorcurie type
    uriorcurie range
    string value
}
QuantitativeProperty {
    uriorcurie unit
    uriorcurie meta_type
    string description
    uriorcurie is_defined_by
    string name
    string title
    uriorcurie type
    uriorcurie range
    string value
}
Activity {
    W3CISO8601 ended_at
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Agent {
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Entity {
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Influence {
    string influencer
}
AgentInfluence {
    string influencer
}
Attribution {
    string influencer
}
EntityInfluence {
    uriorcurie meta_type
    string influencer
}
Derivation {
    uriorcurie had_activity
    uriorcurie meta_type
    string influencer
}
Role {
    uriorcurie id
}
Person {
    string address
    EmailAddress email
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Organization {
    string address
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Distribution {
    uriList access_url
    NonNegativeInteger byte_size
    W3CISO8601 date_modified
    W3CISO8601 date_published
    uriList download_url
    uriorcurie format
    string media_type
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
Resource {
    W3CISO8601 date_modified
    W3CISO8601 date_published
    stringList keyword
    uri landing_page
    string version
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
LicenseDocument {
    string license_text
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
DistributionPart {
    string name
}
Checksum {
    uriorcurie algorithm
    HexBinary digest
}
DataService {
    string download_url_template
    uri endpoint_description
    uri endpoint_url
    W3CISO8601 date_modified
    W3CISO8601 date_published
    stringList keyword
    uri landing_page
    string version
    uriorcurie id
    uriorcurieList conforms_to
    string description
    uriorcurieList is_about
    uriorcurie meta_type
    string name
    uriorcurieList same_as
    string title
    uriorcurie type
}
QualifiedAccess {
}
Parameter {
    string description
    uriorcurie is_defined_by
    string name
    string title
    uriorcurie type
    uriorcurie range
    string value
}

Thing ||--}o Identifier : "identifier"
Thing ||--}o Property : "has_property"
Identifier ||--|o Thing : "schema_agency"
Activity ||--}o AgentInfluence : "qualified_association"
Activity ||--}o Thing : "relation"
Activity ||--}o Agent : "was_associated_with"
Activity ||--}o Activity : "was_informed_by"
Activity ||--}o Identifier : "identifier"
Activity ||--}o Property : "has_property"
Agent ||--}o Thing : "relation"
Agent ||--}o Identifier : "identifier"
Agent ||--}o Property : "has_property"
Entity ||--}o Attribution : "qualified_attribution"
Entity ||--}o Derivation : "qualified_derivation"
Entity ||--}o EntityInfluence : "qualified_relation"
Entity ||--}o Thing : "relation"
Entity ||--}o Agent : "was_attributed_to"
Entity ||--}o Entity : "was_derived_from"
Entity ||--}o Activity : "was_generated_by"
Entity ||--}o Identifier : "identifier"
Entity ||--}o Property : "has_property"
Influence ||--}| Role : "had_role"
AgentInfluence ||--|| Agent : "agent"
AgentInfluence ||--}| Role : "had_role"
Attribution ||--|| Agent : "agent"
Attribution ||--}| Role : "had_role"
EntityInfluence ||--}| Entity : "entity"
EntityInfluence ||--}| Role : "had_role"
Derivation ||--}| Entity : "entity"
Derivation ||--}| Role : "had_role"
Person ||--}o Organization : "affiliation"
Person ||--}o Thing : "relation"
Person ||--}o Identifier : "identifier"
Person ||--}o Property : "has_property"
Organization ||--}o Thing : "relation"
Organization ||--}o Identifier : "identifier"
Organization ||--}o Property : "has_property"
Distribution ||--}o DataService : "access_service"
Distribution ||--}o Checksum : "checksum"
Distribution ||--}o Distribution : "has_part"
Distribution ||--|o Resource : "is_distribution_of"
Distribution ||--|o LicenseDocument : "license"
Distribution ||--}o QualifiedAccess : "qualified_access"
Distribution ||--}o DistributionPart : "qualified_part"
Distribution ||--}o Attribution : "qualified_attribution"
Distribution ||--}o Derivation : "qualified_derivation"
Distribution ||--}o EntityInfluence : "qualified_relation"
Distribution ||--}o Thing : "relation"
Distribution ||--}o Agent : "was_attributed_to"
Distribution ||--}o Entity : "was_derived_from"
Distribution ||--}o Activity : "was_generated_by"
Distribution ||--}o Identifier : "identifier"
Distribution ||--}o Property : "has_property"
Resource ||--|o Agent : "contact_point"
Resource ||--|o Resource : "is_part_of"
Resource ||--|o Resource : "is_version_of"
Resource ||--}o Attribution : "qualified_attribution"
Resource ||--}o Derivation : "qualified_derivation"
Resource ||--}o EntityInfluence : "qualified_relation"
Resource ||--}o Thing : "relation"
Resource ||--}o Agent : "was_attributed_to"
Resource ||--}o Entity : "was_derived_from"
Resource ||--}o Activity : "was_generated_by"
Resource ||--}o Identifier : "identifier"
Resource ||--}o Property : "has_property"
LicenseDocument ||--}o Attribution : "qualified_attribution"
LicenseDocument ||--}o Derivation : "qualified_derivation"
LicenseDocument ||--}o EntityInfluence : "qualified_relation"
LicenseDocument ||--}o Thing : "relation"
LicenseDocument ||--}o Agent : "was_attributed_to"
LicenseDocument ||--}o Entity : "was_derived_from"
LicenseDocument ||--}o Activity : "was_generated_by"
LicenseDocument ||--}o Identifier : "identifier"
LicenseDocument ||--}o Property : "has_property"
DistributionPart ||--|o Entity : "entity"
DataService ||--}o Parameter : "has_parameter"
DataService ||--|o Agent : "contact_point"
DataService ||--|o Resource : "is_part_of"
DataService ||--|o Resource : "is_version_of"
DataService ||--}o Attribution : "qualified_attribution"
DataService ||--}o Derivation : "qualified_derivation"
DataService ||--}o EntityInfluence : "qualified_relation"
DataService ||--}o Thing : "relation"
DataService ||--}o Agent : "was_attributed_to"
DataService ||--}o Entity : "was_derived_from"
DataService ||--}o Activity : "was_generated_by"
DataService ||--}o Identifier : "identifier"
DataService ||--}o Property : "has_property"
QualifiedAccess ||--}o DataService : "access_service"
QualifiedAccess ||--}o Parameter : "has_parameter"


Class Description
Characteristic An inherent quality, function, disposition or process characteristic.
        Parameter A variable whose value changes the characteristics of a system or a function.
        Property An inherent quality, function, disposition or process characteristic that is being observed or measured.
                QuantitativeProperty An inherent quantitative property that is being observed or measured.
Checksum A Checksum is a value that allows the integrity of the contents of a file to be checked. Even small changes to the content of the file will change its checksum. This class allows the results of a variety of checksum and cryptographic message digest algorithms to be represented.
DistributionPart An association class for attaching additional information to a hasPart relationship.
Identifier Identifier.
Influence Capacity of an entity, activity, or agent to have an effect on the character, development, or behavior of another.
        AgentInfluence Capacity of an agent to have an effect on the character, development, or behavior of another Entity, Agent, or Activity.
                Attribution Attribution is the ascribing of an entity to an agent.
        EntityInfluence Capacity of an entity to have an effect on the character, development, or behavior of another.
                Derivation Transformation of an entity into another, an update of an entity resulting in a new one, or the construction of a new entity based on a pre-existing entity.
QualifiedAccess An association class for attaching additional information to an access_service relationship between a DCAT:Distribution and a DCAT:DataService.
Role A role is the function of a resource or agent with respect to another resource, in the context of resource attribution or resource relationships.
Thing The most basic item.
        Activity An activity is something that occurs over a period of time and acts upon or with entities; it may include consuming, processing, transforming, modifying, relocating, using, or generating entities.
        Agent Something that bears some form of responsibility for an activity taking place, for the existence of an entity, or for another agent's activity.
                Organization A social or legal institution such as a company, a society, or a university.
                Person Person agents are people.
        Entity A physical, digital, conceptual, or other kind of thing with some fixed aspects; entities may be real or imaginary.
                Distribution A specific representation of data, which may come in the form of a single file, or an archive or directory of many files, may be standalone or part of a dataset.
                LicenseDocument A legal document giving official permission to do something with a resource.
                Resource Resource published or curated by a single agent.
                        DataService A collection of operations that provides access to one or more distributions or data processing functions.
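
The indentation in the table above encodes inheritance: each class is a child of the nearest less-indented class above it. As a minimal sketch, that hierarchy can be transcribed into a parent map and queried for ancestors. The `PARENT` mapping, `ancestors` and `is_a` are illustrative names, not part of the schema tooling.

```python
# Parent of each class, transcribed from the indentation of the class table.
# Classes without an entry (e.g. Thing, Influence, Checksum) are roots.
PARENT = {
    "Parameter": "Characteristic",
    "Property": "Characteristic",
    "QuantitativeProperty": "Property",
    "AgentInfluence": "Influence",
    "Attribution": "AgentInfluence",
    "EntityInfluence": "Influence",
    "Derivation": "EntityInfluence",
    "Activity": "Thing",
    "Agent": "Thing",
    "Organization": "Agent",
    "Person": "Agent",
    "Entity": "Thing",
    "Distribution": "Entity",
    "LicenseDocument": "Entity",
    "Resource": "Entity",
    "DataService": "Resource",
}


def ancestors(cls: str) -> list[str]:
    """Return the chain of parent classes, nearest first."""
    chain = []
    while cls in PARENT:
        cls = PARENT[cls]
        chain.append(cls)
    return chain


def is_a(cls: str, other: str) -> bool:
    """True if cls is other or inherits from it."""
    return cls == other or other in ancestors(cls)


print(ancestors("DataService"))  # ['Resource', 'Entity', 'Thing']
```

This also explains why DataService carries every slot listed for Resource, Entity and Thing in the diagram: slots are inherited down the chain.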


Slot Description
access_service A data service that gives access to a distribution
access_url URL that gives access to the subject
address Physical address of the subject, such as a postal address, a bibliographic lo...
affiliation An organization that an agent is affiliated with
agent References an agent which influenced an entity
algorithm The algorithm or rules to follow to compute a score, an effective method expr...
byte_size The size of a distribution in bytes
checksum The checksum property provides a mechanism that can be used to verify that th...
conforms_to An established standard to which the subject conforms
contact_point Relevant contact information for the subject
date_modified Date on which the resource was (last) changed, updated or modified
date_published Date of formal issuance (publication) of the resource
description A free-text account of the thing
digest Lower case hexadecimal encoded checksum digest value produced using a specifi...
distribution An available distribution of a resource
download_url URL that gives direct access to the subject in the form of a downloadable fil...
download_url_template A URL template with placeholders enclosed in braces ({example})
email Email address associated with an entity
ended_at End is when an activity is deemed to have been ended by an entity, known as t...
endpoint_description A description of the services available via the end-points, including their o...
endpoint_url The root location or primary endpoint of a service (a Web-resolvable IRI)
entity References an entity which influenced an entity
format The file format of a distribution
had_activity The activity through which an influence (for example, a derivation) took place
had_role The function of an entity or agent with respect to another entity or resource
has_parameter Relation between a process or function and an information entity which modula...
has_part A related resource that is included either physically or logically in the des...
has_property Relation between a subject and a quality, capability or role that it bears
id Globally unique identifier of a metadata object
identifier An unambiguous reference to the subject within a given context
influencer Reference the resource (Entity, Agent, or Activity) whose influence is being ...
is_about A relation of an information artifact to a thing
is_defined_by The definition of the subject thing
is_distribution_of Inverse property of DCAT:distribution
is_part_of A related resource that is included either physically or logically in the des...
is_version_of A related resource of which the described resource is a version
keyword One or more keywords or tags describing the resource
landing_page A Web page that can be navigated to in a Web browser to gain access to a reso...
license A legal document under which the resource is made available
license_text A copy of the actual text of a license reference, file or snippet that is ass...
media_type The media type of a distribution as defined by IANA
meta_type Type designator of a metadata object for validation and schema structure hand...
name Name of a thing
notation String of characters such as "T58:5" or "30:4833" used to uniquely identify a...
qualified_access Link to a description of an access_service relationship with `DCAT:DataServi...
qualified_association Assignment of responsibility to an agent for an activity, indicating that the...
qualified_attribution Attribution is the ascribing of an entity to an agent
qualified_derivation A transformation of an entity into another, an update of an entity resulting ...
qualified_part Qualifies a hasPart relationship with another entity
qualified_relation Characterizes the relationship or role of an entity with respect to the subje...
range State that the values of a property are instances of a class
relation The subject has a relation to the object
same_as Property that determines that subject and object are equal
schema_agency Name of the agency that issued an identifier
title A summary description of a thing
type State that the subject is an instance of a class
unit A unit of measurement is a standardized quantity of a physical quality
value Value of a thing
version Version indicator (name or identifier) of a resource
was_associated_with An activity association is an assignment of responsibility to an agent for an...
was_attributed_to Attribution is the ascribing of an entity to an agent
was_derived_from Derivation is a transformation of an entity into another, an update of an ent...
was_generated_by Generation is the completion of production of a new entity by an activity
was_influenced_by The object had an effect on the character, development, or behavior of the su...
was_informed_by Communication is the exchange of an entity by two activities, one activity us...
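
The download_url_template slot above holds a URL template with placeholders enclosed in braces. The sketch below shows one way such a template could be expanded. The placeholder names, the example URL and the choice to percent-encode every substituted value are assumptions for illustration; the page does not specify how a consumer is expected to expand templates.

```python
import re
from urllib.parse import quote

# Matches a single {placeholder}; nested braces are not supported here.
PLACEHOLDER = re.compile(r"\{([^{}]+)\}")


def expand_template(template: str, values: dict[str, str]) -> str:
    """Replace each {name} in the template with its percent-encoded value."""

    def substitute(match: re.Match) -> str:
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value for placeholder {name!r}")
        # Assumption: values are encoded as a single path segment,
        # so "/" is escaped as well.
        return quote(values[name], safe="")

    return PLACEHOLDER.sub(substitute, template)


url = expand_template(
    "https://example.org/download/{dataset}/{path}",
    {"dataset": "ds001", "path": "sub-01/anat.nii.gz"},
)
print(url)  # https://example.org/download/ds001/sub-01%2Fanat.nii.gz
```

Failing loudly on a missing placeholder avoids handing out a URL that still contains literal braces. Whether "/" should be escaped depends on the service behind the template.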


Type Description
Boolean A binary (true or false) value
Curie a compact URI
Date a date (year, month and day) in an idealized calendar
DateOrDatetime Either a date or a datetime
Datetime The combination of a date and time
Decimal A real number with arbitrary precision that conforms to the xsd:decimal speci...
Double A real number that conforms to the xsd:double specification
EmailAddress RFC 5322 compliant email address
Float A real number that conforms to the xsd:float specification
HexBinary hex-encoded binary data
Integer An integer
Jsonpath A string encoding a JSON Path
Jsonpointer A string encoding a JSON Pointer
Ncname Prefix part of CURIE
Nodeidentifier A URI, CURIE or BNODE that represents a node in a model
NonNegativeInteger A non-negative integer
Objectidentifier A URI or CURIE that represents an object in the model
Sparqlpath A string encoding a SPARQL Property Path
String A character string
Time A time object represents a (local) time of day, independent of any particular...
Uri a complete URI
Uriorcurie a URI or a CURIE
W3CISO8601 W3C variant/subset of ISO 8601 for specifying date(times)
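
The types above constrain slot values. As a hedged sketch, the checks below approximate two of them in plain Python: a lowercase hexadecimal digest (as the digest slot describes for HexBinary values) and a non-negative integer (as used for byte_size). The exact patterns enforced by the schema are not shown on this page, so both checks are assumptions and the helper names are hypothetical.

```python
import hashlib
import re

# Assumed pattern: one or more pairs of lowercase hexadecimal digits.
HEX_PAIRS = re.compile(r"(?:[0-9a-f]{2})+")


def is_lowercase_hex(value: str) -> bool:
    """True if value consists only of complete lowercase hex byte pairs."""
    return HEX_PAIRS.fullmatch(value) is not None


def is_non_negative_integer(value: object) -> bool:
    """True for integers >= 0."""
    # bool is a subclass of int in Python, so it is excluded explicitly.
    return isinstance(value, int) and not isinstance(value, bool) and value >= 0


content = b"example content"
digest = hashlib.sha256(content).hexdigest()  # hexdigest() returns lowercase hex
print(is_lowercase_hex(digest), is_non_negative_integer(len(content)))  # True True
```

Checks like these are useful as a quick pre-flight before handing records to a full schema validator. They are not a substitute for one.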


Subset Description