The DCAT Application Profile for data portals in Switzerland DCAT-AP CH is an application profile of the DCAT vocabulary [[VOCAB-DCAT]]. It serves to describe data that are registered on Swiss data portals. Its target group are managers of open data portals in Switzerland and the providers of these data. DCAT-AP CH has been used by the Open Government Data Portal of the Swiss Federal Statistical Office since 2016.

It is important to be aware that data portals don't provide the data themselves but only metadata on the data. This serves the purpose of making the data discoverable.

The DCAT Application Profile for data portals in Switzerland has striven from the start to be compatible with the Application Profile for data portals in Europe [[DCAT-AP]]. Full conformance has not yet been reached, but the current specification takes responsibility in making the existing conformance problems visible, so that they can be resolved.

Introduction

About

The DCAT Application Profile for data portals in Switzerland (DCAT-AP CH) is an application profile of the DCAT vocabulary ([[VOCAB-DCAT]]). It serves to describe data that are registered on Swiss data portals.

DCAT [[VOCAB-DCAT]] is a semantic definition to describe data by the means of an RDF vocabulary. It allows for a decentralized approach regarding the publication of data by enforcing interoperability via the use of a common language to describe the data.

Since DCAT is only a language, it can be applied in many contexts. An application profile applies DCAT to one specific context. In the case of DCAT-AP CH the context is that of data portals in Switzerland.

This context alone would allow for a very flexible usage of DCAT in DCAT-AP CH, but Switzerland is also part of Europe and therefore the interoperability with the [[DATAPORTAL-EU]] has been a topic from the start. Europe has it's own application profile [[DCAT-AP]], that has established a role of a de facto standard within Europe. Swiss data is also harvested and represented by the [[DATAPORTAL-EU]]. The EU puts a lot of power behind further developing their application profile DCAT-AP. Therefore staying in line with the DCAT-AP is not only a goal in itself, but means that DCAT-AP CH will stay embedded in a bigger community of stakeholders.

Metadata

Neither DCAT-AP CH nor DCAT-AP describe data, only metadata. Metadata is by definition secondary information on the data: when and by whom were they published, which usage conditions apply, how often are they updated, whom to contact about them and where and how can they be accessed. See [[METADATA-DEF]].

DCAT vocabulary building blocks

In DCAT the vocabulary consists of classes and properties.

Classes used in DCAT-AP CH

Classes and properties are used to deliver the metadata in a structured way: The exact specification of these classes in the Swiss context is the topic of this specification as it is the topic of DCAT-AP to define the structure in the European context:

The three classes above are the classes that are further specified in DCAT-AP CH ( ).

Interoperability

DCAT-AP CH aims at making data discoverable on the national and international level by providing the necessary metadata in a form that can be understood by both humans and machines.

A precondition this for interoperability is that the DCAT language is used correctly. Therefore this specification aims at enabling data publishers to specify their data catalog in a way that conforms to both DCAT-AP CH and DCAT-AP. It also gives guidance to data portal managers on how the data catalogs should be received and processed in a DCAT-AP CH and DCAT-AP conformant way.

Provider requirements

A data catalog conforms to DCAT-AP CH if:

Receiver requirements

An application (data portal) conforms to DCAT-AP CH if:

Namespaces

Namespaces and prefixes used in normative parts of this recommendation are shown in the following table:

Prefix Namespace IRI Source
dcat http://www.w3.org/ns/dcat# [[VOCAB-DCAT]]
dct http://purl.org/dc/terms/ [[DCTERMS]]
foaf http://xmlns.com/foaf/0.1/ [[FOAF]]
schema http://schema.org/ [[schema-org]]
rdf http://www.w3.org/1999/02/22-rdf-syntax-ns# [[RDF-SYNTAX-GRAMMAR]]
rdfs http://www.w3.org/2000/01/rdf-schema# [[RDF-SCHEMA]]
vcard http://www.w3.org/2006/vcard/ns# [[VCARD-RDF]]
xsd http://www.w3.org/2001/XMLSchema# [[XMLSCHEMA11-2]]
adms http://www.w3.org/ns/adms# [[VOCAB-ADMS]]
skos http://www.w3.org/2004/02/skos/core# [[SKOS-REFERENCE]]
locn http://www.w3.org/ns/locn# [[LOCN]]

Multiple languages, URIs and content negotiation

Switzerland is a multilingual country. Therefore, supporting multiple languages is very important in DCAT-AP CH. Multiple languages are handled as follows.

Best practice for Resource URIs URIs should be language and format neutral: http://my-dataset-name-language-and-format-neutral. They are generally not translatable in the ways strings are because it is assumed that content negotiation is used to best match the users needs with the URI's availabilities.

The table lists multilingual properties of DCAT-AP CH and the translation strategies that apply to them:

Label RDF property Range Multilingual Support
Catalog title dct:title rdfs:Literal Language encoded string
Catalog description dct:description rdfs:Literal Language encoded string
Dataset title dct:title rdfs:Literal Language encoded string
Dataset description dct:description rdfs:Literal Language encoded string
Dataset keyword dcat:keyword rdfs:Literal Language encoded string
Catalog homepage foaf:homepage foaf:Document Content negotiation
Dataset landing Page dcat:landingPage foaf:Document Content negotiation
Catalog publisher dct:publisher foaf:Agent Content negotiation for the URI and language encoded string for the name
Dataset publisher dct:publisher foaf:Agent Content negotiation for the URI and language encoded string for the name

Application profile overview

Localization of DCAT-AP

DCAT-AP CH is a sub-profile of [[DCAT-AP]]. Its task is that of localizing DCAT-AP. This means it adds and overwrites the specification of DCAT-AP where Swiss data providers have different needs in describing their data, that cannot be met by DCAT-AP.

The overwrites fall into the categories listed below:

If a class or property of DCAT-AP is not mentioned in this specification it means:

Scope of DCAT-AP CH

DCAT-AP CH only specifies three classes of DCAT-AP:

DCAT-AP CH has three requirement levels:

The diagram shows the classes with their properties and requirement levels.

classDiagram Catalog --> Dataset : dcat:dataset Dataset --> Distribution : dcat:distribution class Catalog { Mandatory: - dct:title - dct:description - dct:issued - foaf:homepage - dct:publisher - dcat:dataset Conditional: - dct:modified Optional: - dcat:themeTaxonomy - dct:license - dct:rights - dct:language } class Dataset { Mandatory: - dct:title - dct:description - dct:publisher - dcat:contactPoint - dct:identifier Conditional: - dct:issued - dct:modified - dcat:theme - dcat:distribution - dcat:landingPage - dct:language Optional: - dcat:keyword - dct:spatial - dct:coverage - dct:temporal - dct:accrualPeriodicity - dct:relation - rdfs:seeAlso - schema:image } class Distribution { Mandatory: - dct:issued - dcat:accessURL - dct:rights Conditional: - dct:title - dct:description - dcat:byteSize - dcat:mediaType - dct:format - dct:language - dct:modified Optional: - dct:license - dct:identifier - dcat:downloadURL - dct:coverage - schema:image }

Vocabulary specification

Class Property Tables

For each class a property table has been added, that includes all properties of the class, that are specified in DCAT-AP CH:

Class and Property Definitions

Besides the property tables each class and property is also defined in detail. The fields used in the definitions have the following meaning:

Classes specified in DCAT-AP-CH

Class: Catalog

Label Catalog
Obligation Mandatory
RDF Class dcat:Catalog
Definition A data catalog is a curated collection of metadata corresponding to the datasets that are hosted on a data portal.
Usage The catalog has a hierarchical structure and contains references to all the metadata for the data that can be accessed via the portal (these entries have the class dcat:Dataset). The datasets themselves refer to all distributions of the actual data (class dcat:Distribution). In DCAT-AP there is an additional class, dcat:CatalogRecord, which contains metadata on the catalog. This class has not been used so far in the implementation of DCAT-AP CH on opendata.swiss.

The following table lists the properties used for describing a catalog.

Label Obl. MC RDF Property Range Conf.
title M 1* dct:title rdfs:Literal Y
description M 1* dct:description rdfs:Literal Y
release date M 1 dct:issued rdfs:Literal (typed as as xsd:date) Y
homepage M 1* foaf:homepage foaf:Document Y
publisher M 1 dct:publisher foaf:Agent N
dataset M n dcat:dataset dcat:Dataset Y
update/ modification date C 1 dct:modified rdfs:Literal (typed as as xsd:date) Y
themes O n dcat:themeTaxonomy skos:ConceptScheme N
license O 1 dct:license dct:LicenseDocument N
rights O 1 dct:rights dct:RightsStatement N
language O n dct:language dct:LinguisticSystem N

Property: title

Label title
Obligation Mandatory
Max card. 1*
RDF Property dct:title
Range rdfs:Literal as a [[ISO 639-1]] two letter code indicating the language
Definition Name given to the catalog (in the language indicated in the attribute)
Usage The title of the catalog in the indicated language. The title must be given in at least one of the four languages German, French, English, Italian. Further languages are optional.

Property: description

Label description
Obligation Mandatory
Max card. 1*
RDF Property dct:description
Range rdfs:Literal
Definition Free-text description of the catalog (in the language indicated in the attribute).
Usage The description of the catalog in the indicated language. The description must be given in at least one of the four languages German, French, English, and Italian. Further languages are optional.

Property: release date

Label release date
Obligation Mandatory
Max card. 1
RDF Property dct:issued
Range rdfs:Literal (typed as as xsd:date).
Definition Date of formal issuance (e.g. publication) of the catalog.
Usage The date when the catalog was published.

Property: homepage

Label homepage
Obligation Mandatory
Max card. 1
RDF Property foaf:homepage
Range foaf:Document
Definition Homepage of the catalog.
Usage For example https://opendata.swiss would be the homepage of the catalog exported to the [[DATAPORTAL-EU]] Data Portal.

Property: publisher

Label publisher
Obligation Mandatory
Max card. 1
RDF Property dct:publisher
Range foaf:Agent
Definition Entity responsible for making the catalog available.
Usage This property provides the organization that is responsible for the portal. If that organization has an entry in [[TERMDAT]], the corresponding identification will be entered. The property will be represented in the following way:

Property: update/ modification date

Label update/ modification date
Obligation Conditional
Max card. 1
RDF Property dct:modified
Range rdfs:Literal (typed as as xsd:date).
Definition Most recent date on which the catalog was changed, updated or modified
Usage This property MUST only be set if the catalog has been updated in its content or structure after it has first been issued. In this case the property MUST contain the date of the last update.

Property: themes

Label themes
Obligation Optional
Max card. n
RDF Property dcat:themeTaxonomy
Range

skos:ConceptScheme

Definition Simple Knowledge organization system (SKOS) used to classify the catalog's datasets
Usage

As a basis for classification might be used, for example, [[TERMDAT]], [[PLZO_CH]], or a growing terminology gathered by a portal itself. Terminologies determined by the eCH Standards might also be appropriate (for example eCH-0049, eCH-0070 and eCH-0145 for describing the services or tasks of the public administration.)

For exchange with other European portals, the upper level of the multilingual EU thesaurus [[EUROVOC]] is supported.

Property: dataset

Label dataset
Obligation Mandatory
Max card. n
RDF Property dcat:dataset
Range dcat:Dataset ()
Definition Dataset that is part of the catalog.
Usage Reference to the dataset that is contained in the catalog.

Property: license

Label license
Obligation Optional
Max card. 1
RDF Property dct:license
Range dct:LicenseDocument
Definition License document under which the catalog (and not the datasets) is made available.
Usage If the license of the catalog applies to all of its datasets and distributions, the attribute should be replicated on each distribution. Alternatively, the property rights can be used.

Property: rights

Label access restrictions
Obligation Optional
Max card. 1
RDF Property dct:rights
Range dct:RightsStatement
Definition Rights under which the catalog (and not the datasets) is made available, if the same rights apply to all the datasets and distributions, the attribute should be replicated on each distribution.
Usage Alternatively the property license can be used.

Property: language

Label language
Obligation Optional
Max card. n
RDF Property dct:language
Range dct:LinguisticSystem as ISO two letter code [[ISO 639-1]]
Definition Languages used in the textual metadata of the datasets referenced in the catalog, such as titles, descriptions, etc.
Usage The properties title and description of Catalogs, Datasets or Distribution can be provided in multiple languages. In that case, it makes sense to also list these languages on the Catalog level.

Class: Dataset

Label Dataset
Obligation Mandatory
RDF Class dcat:Dataset
Definition The metadata entry of this class describes a single thematically closed data collection, which is published by a responsible organization (person or institution).
Usage The dataset can be made available in different forms: as file in a text/csv format or as API Service. Each physical representation SHOULD be described by a dcat:Distribution. In general, each dataset will have at least one distribution.

Special cases:

  • The dataset will also list distributions if these are only available on an external website, so that the reference is set by the property dcat:landingPage of the dataset
  • In exceptional cases, there can also be datasets that do not yet have distributions. This can be the case if the publication of these datasets is planned for a future date.
  • The property dcat:theme helps to group datasets by topics. Using a controlled vocabulary in the context of a data portal helps with the discoverability of the datasets by topic. Since in DCAT-AP the dcat:theme is optional, it can be ignored on export.

The following table lists the properties used for describing a dataset.

Label Obl. MC RDF Property Range Conf.
title M 1* dct:title rdfs:Literal Y
description M 1* dct:description rdfs:Literal Y
publisher M n dct:publisher foaf:Agent N
contact point M n dcat:contactPoint vcard:Kind Y
identifier M 1 dct:identifier rdfs:Literal N
release date C 1 dct:issued rdfs:Literal (typed as as xsd:date) Y
update/modification date C 1 rdfs:Literal (typed as as xsd:date) xsd:date Y
theme/ category C n dcat:theme skos:Concept N
dataset distribution C n dcat:distribution dcat:Distribution Y
landing page C 1 dcat:landingPage foaf:Document Y
language C n dct:language dct:LinguisticSystem N
keyword/ tag O n dcat:keyword rdfs:Literal Y
spatial/ geographical coverage O n dct:spatial dct:Location N
coverage O n dct:coverage dct:LocationPeriodOrJurisdiction N
temporal coverage O n dct:temporal dct:PeriodOfTime Y
frequency O 1 dct:accrualPeriodicity dct:Frequency N
related resource O n dct:relation rdfs:Resource Y
see also O n rdfs:seeAlso rdfs:Resource N
image O 3 schema:image schema:url or schema:ImageObject Y

Property: title

Label title
Obligation Mandatory
Max card. 1*
RDF Property dct:title
Range rdfs:Literal
Definition Name given to the dataset.
Usage

The element dct:title can occur more than once, in order to express the title in several languages.

On the user interface of data portals the title is shown in the language selected by the user. If the title is not available in that language one of the available titles in another language is displayed.

Multilingual titles don't mean that the data of the dataset is available in those languages. But if the dataset has a physical representation in a language it MUST also come with a title in that language. This way a title can be given to the physical representation even if the physical representation does not come with a title itself ().

Property: description

Label description
Obligation Mandatory
Max card. 1*
RDF Property dct:description
Range rdfs:Literal
Definition Free-text description of the dataset
Usage The description can be provided in several languages. On data portals the description that is displayed to the user is then usually determined by content negotiation and depends on his language preference. ()

Property: publisher

Label publisher
Obligation Mandatory
Max card. n
RDF Property dct:publisher
Range foaf:Agent
Definition Entity responsible for making the dataset available.
Usage

This property names the organization responsible for the publication of the dataset. If there are more organizations responsible, the property can be repeated.

If a [[TERMDAT]] entry exists for the publisher, this identification will be entered additionally. In case the publisher only provides the TERMDAT identification, the portal software determines the corresponding publisher name via the TERMDAT database. If only the publisher name is provided the portal software adds the TERMDAT identification from the database, in case it can be found. Otherwise, the identification is left empty. If the publisher provides both the TERMDAT identification and the name, the portal software accepts both values without changing them.

Internally the property will be represented in the following way:

Property: contact point

Label contact point
Obligation Mandatory
Max card. n
RDF Property dcat:contactPoint
Range vcard:Kind
Definition Relevant contact information with regard to the dataset
Usage This property SHOULD contain an email address that is really monitored by the data publisher. If there are several contributors involved in the publication of the dataset, the property can be used multiple times.

Property: identifier

Label identifier
Obligation Mandatory
Max card. 1
RDF Property dct:identifier
Range rdfs:Literal
Definition Unique identifier of the dataset. The identifier may be used as part of the URI of the dataset, but having it represented explicitly is still useful.
Usage

It is necessary to uniquely mark each Dataset in order to be able to link it to similar datasets. The identifier of a dataset should therefore have the following structure:

ID_Datenset@ID_Datapublisher

The data provider selects a data provider identifier which, based on human discretion, is nationally unique.

Datasets will be linked via the element "rdfs:seeAlso" by specifying in each case the corresponding element dct:identifier>.

Using rdfs:seeAlso for linking to other datasets makes it necessary that a data portal implementing this automatically takes care of consistency if datasets get changed or removed:

  • If the identifier of a dataset is changed: all seeAlso properties of other datasets on the portal should be changed accordingly
  • If a dataset is deleted: its identifier should be removed of the seeAlso of other datasets.

Property: release date

Label release date
Obligation Conditional
Max card. 1
RDF Property dct:issued
Range rdfs:Literal (typed as as xsd:date).
Definition Date of formal issuance (publication) of the dataset. This property should be set using the first known date of issuance.
Usage This property contains the date of the first issuance of the dataset. If this date is not known, it can be set to the date of the first reference of the dataset in the corresponding catalog for instance opendata.swiss. If the dataset does not yet have any distributions, this property can be skipped. The property can also be set to a future date if the publication of dataset and distribution is already planned. It should then contain the date that is scheduled for the publication. In a data portal, the dataset will only be publicly displayed if it has an issued date and if that date is not in the future.

Property: update/ modification date

Label update/ modification date
Obligation Conditional
Max card. 1
RDF Property dct:modified
Range rdfs:Literal (typed as as xsd:date).
Definition Most recent date on which the dataset was changed, updated or modified. The value of this property indicates a change to the actual dataset, not a change to the catalog record. No value may indicate that the dataset has never changed after its initial publication, or that the date of the last modification is not known, or that the dataset is continuously updated
Usage This property MUST only be set if the distributions (the actual data) that the dataset describes have been updated after it has been issued. In this case the property MUST contain the date of the last update. That way a person or institution using the data for an analysis or application will know when to update the report or application on their side.

Property: theme/ category

Label theme/ category
Obligation Conditional
Max card. n
RDF Property dcat:theme
Range skos:Concept
Definition Main category of the dataset. A dataset can have multiple themes; themes are part of the knowledge organization system (KOS) used to classify the catalog's datasets.
Usage The content of the property must be a term of the skos:ConceptScheme of the data-portal, if that exists.

Property: dataset distribution

Label dataset distribution
Max card. n
Obligation Conditional
RDF Property dcat:distribution
Range dcat:Distribution ()
Definition Available distribution of the dataset
Usage This property contains the reference to the metadata of a distribution of the data. In exceptional cases, a dataset for which no distribution form exists (yet) can be described in the catalogue. In this case, the element dcat:distribution may be omitted.

Property: landing page

Label landing page
Obligation Conditional
Max card. 1
RDF Property dcat:landingPage
Range foaf:Document
Definition A webpage that can be accessed in a Web browser to gain access to the dataset, its distributions and/or additional information. If distributions are accessible only through a landing page (i.e. direct download URLs are not known), the landing page link should be duplicated as accessURL on the distributions.
Usage The property is needed if the dataset described by the metadata is only indirectly accessible via a webpage of the data publisher. In this case, the dataset must still have a distribution, but the url entered in the dcat:accessURL of the distribution will be the same as the url for the dcat:landingPage.

Property: language

Label language
Obligation Conditional
Max card. n
RDF Property dct:language
Range dct:LinguisticSystem
Definition Language of the dataset. If the dataset is available in multiple languages, use multiple values for this property. If each language is available separately, define an instance of dcat:Distribution for each language and describe the specific language of each distribution using dct:language.
Usage The property MUST be set if the distribution is not language-independent. In that case, the language property indicates for which of the languages German, French, English, Italian a distribution is offered by the dataset.

Property: keyword/ tag

Label keyword/ tag
Obligation Optional
Max card. n
RDF Property dcat:keyword
Range rdfs:Literal
Definition Keyword or tag describing the dataset
Usage As keywords terms from [[TERMDAT]] can be used. But also free tags can be used for this purpose.

It is good practice to mark the language of the keywords with the [[ISO 639-1]] language code such as "geodata"@en.

Property: spatial/ geographical coverage

Label spatial/ geographical coverage
Obligation Optional
Max card. n
RDF Property dct:spatial
Range dct:Location
Definition Spatial coverage of the dataset.
Usage

This property refers to a geographic region that is covered by the Dataset. Possible values for Swiss datasets might be the official canton abbreviations or community names as they are published by swisstopo ([[CADASTRE-PLZ-CH]]).

Another option is to provide point coordinates or a rectangular bounding box. It is not possible with dct:Location to provide a polygon as geographical coverage, therefore dct:coverage should be used for this purpose ()

Property: temporal coverage

Label temporal coverage
Obligation Optional
Max card. n
RDF Property dct:temporal
Range dct:PeriodOfTime
Definition Temporal period covered by the dataset.
Usage For temporal references this property is MUST be used in order to conform to DCAT-AP.

Property: coverage

Label coverage
Obligation Optional
Max card. n
RDF Property dct:coverage
Range dct:LocationPeriodOrJurisdiction
Definition Spatial (and temporal) characteristics of the object or resource. This is the key element for supporting spatial or temporal range searching on document-like objects that are spatially referenced or time-referenced.
Usage

The property should only be used for spatial coverage that has the shape of a polygon. (for example name="dc.coverage.polygon" scheme = "WGS84.DD" content = "9.547 46.737 9.550 46.739 9.556 46.740 9.554 46.743 9.548 46.742 9.547 46.737"; name= "dc.coverage.placeName content="Heidsee"). In this case there must a property dct:spatial added as well in order to stay conformant with DCAT-AP. The portal software will do that automatically.

Property: frequency

Label frequency
Obligation Optional
Max card. 1
RDF Property dct:accrualPeriodicity
Range dct:Frequency
Definition Time interval at which the dataset is updated
Usage This property is used to indicate a periodicity in the updates of the dataset, if it applies.

Property: related resource

Label related resource
Obligation Optional
Max card. n
RDF Property dct:relation
Range rdfs:Resource
Definition Related resource
Usage

With this property, a dataset can link to a related dataset.

Additional material such as an article, example or documentation helpful in the context of the dataset can also be linked in this way.

The element is also used to link to legal foundations that apply to the publication or usage of the dataset.

Property: see also

Label see also
Obligation Optional
Max card. n
RDF Property rdfs:seeAlso
Range rdfs:Resource
Definition Resource that might be useful to users interested in the dataset.
Usage

The element rdfs:seeAlso enables data publishers to add a reference, in the metadata of a dataset, to another dataset. This allows users to discover datasets related to one they have already found.

The element rdfs:seeAlso is added to the metadata of a dataset as necessary. In this way, related datasets (e.g. population data from 2013 and population data from 2014) can be simply connected with one another.

Internally, the dct:identifier property of the related dataset will be used.

Property: image

Label image
Obligation Optional
Max card. 1
RDF Property schema:image
Range schema:url or schema:ImageObject
Definition A thumbnail picture illustrating the content of the dataset.
Usage For distributions that consist of visual content (photographs, videos, maps, etc.) it makes sense to add a limited number of thumbnails to the metadata.

Class: Distribution

Label Distribution
Obligation Mandatory
RDF Class dcat:Distribution
Definition A metadata entry of this class describes a distribution of the data.
Usage

The distribution can contain all the data of the dataset or it can contain just a certain part of the data. For example: it can contain all data about the population in Switzerland or it can contain just on year of this data, say the year 2018. It can also contain the data in an alternative data format, for example a graphical presentation of the data for the years 1990 up to 2010.

DCAT-AP CH allows the representation of various relationships within dataset or between datasets: The most simple relationship is the aggregation of various physical representations of data into a dataset. One possible such dataset would be a time series where each distribution covers one year of the data and the dataset spans several years.

The following table lists of dcat:Distribution the properties that are specified by DCAT-AP CH.

Label Obl. MC RDF Property Range Conf.
release date M 1 dct:issued rdfs:Literal (typed as as xsd:date) Y
access URL M n dcat:accessURL rdfs:Resource Y
rights M 1 dct:rights dct:RightsStatement N
title C 1* dct:title rdfs:Literal N
description C 1* dct:description rdfs:Literal N
byte size C 1 dcat:byteSize rdfs:Literal (typed as as xsd:decimal) Y
media type C 1 dcat:mediaType dct:MediaType Y
format C 1 dct:format dct:MediaTypeOrExtent N
language C 1 dct:language dct:LinguisticSystem N
update/ modification date C 1 dct:modified rdfs:Literal (typed as as xsd:date) Y
license O 1 dct:license dct:LicenseDocument N
identifier O 1 dct:identifier rdfs:Literal Y
download URL O n dcat:downloadURL rdfs:Resource Y
coverage O n dct:coverage LocationPeriodOrJurisdiction N
image O 3 schema:image schema:url or schema:ImageObject Y

Property: release date

Label release date
Obligation Mandatory
Max card. 1
RDF Property dct:issued
Range rdfs:Literal (typed as as xsd:date)
Definition Date of formal issuance (publication) of the distribution
Usage The first time issuance of the distribution.

Property: access URL

Label access URL
Obligation Mandatory
Max card. 1
RDF Property dcat:accessURL
Range rdfs:Resource
Definition landing page, feed, SPARQL endpoint or other type of resource that gives access to the distribution of the dataset. Use accessURL, and not downloadURL, if the url does not point to a downloadable file or when you are not sure whether it does.
Usage

If the dataset is only accessible via a landing page, then the value entered in dcat:landingPage should be the same as the value in dcat:accessURL.

Since the field is mandatory in the DCAT-AP it has to be set even for downloadable distributions. In that case both dcat:accessURL and dcat:downloadURL should contain the same value.

There might be more than one way to access a distribution (HTTP, WebDAV, FTP) there might be several access urls for a distribution.

Property: rights

Label rights
Obligation Mandatory
Max card. 1
RDF Property dct:rights
Range dct:RightsStatement
Definition This property refers to a statement that specifies rights associated with the Distribution.
Usage

To specify the rights statement for a distribution on a Swiss open data portal, the following information is necessary:

  • Attribution: required/not required
  • Non-commercial use: allowed/not allowed
  • Commercial use: allowed/not allowed/only with approval

The element dct:rights should be formed with the vocabulary [[ODI-VOCAB]]:

The individual conditions are formulated as follows:

  • ReferenceRequired / ReferenceNotRequired
  • NonCommercialAllowed / NonCommercialNotAllowed
  • CommercialAllowed / CommercialNotAllowed / CommercialWithApprovalOnly

The fully-specified rights statement is then assembled from the three relevant conditions:

In case attribution is required, the text to be used must additionally be included:

The data owner must also be able to list the legal basis for the collection and publication of the data. For this, the element dct:relation may be included in the element dct:rights, with the following form.

Property: title

Label title
Obligation Conditional
Max card. 1*
RDF Property dct:title
Range rdfs:Literal
Definition Name given to the distribution.
Usage

The title MUST be given if the distribution contains only part of the data offered by the dataset. .

The title can be given in several languages. In multilingual data portals, the title in the language selected by a user will usually be shown as title for the distribution.

Property: description

Label description
Obligation Conditional
Max card. 1*
RDF Property dct:description
Range rdfs:Literal
Definition Freetext description of the distribution.
Usage

The description MUST be provided if the distribution contains only part of the data offered by the dataset. .

The description can be given in several languages. In multilingual data portals, the description in the language selected by a user will usually be shown as description for the distribution.

Property: byte size

Label byte size
Obligation Conditional
Max card. 1
RDF Property dcat:byteSize
Range rdfs:Literal (typed as as xsd:decimal).
Definition Size of a distribution in bytes; if the precise size is not known, an approximate size can be indicated.
Usage If the distribution is a file the size SHOULD be indicated if possible with this property.

Property: media type

Label media type
Obligation Conditional
Max card. 1
RDF Property dcat:mediaType
Range dct:MediaType The values MUST be in defined in [[IANA-MEDIA-TYPES]]
Definition The property media type is only necessary if the physical representation is a file, that is referred to by a . If the physical representation is a file but if it is only accessible by a and if therefore the download URL is not set, the property media type can be skipped.

The value of the media type MUST be a IANA MIME type, see [[IANA-MEDIA-TYPES]].

Usage This property MUST be provided if the Distribution has the property dcat:downloadURL set to a downloadable file.

Property: format

Label format
Obligation Conditional
Max card. 1
RDF Property dct:format
Range dct:MediaTypeOrExtent
Definition File format of the distribution.
Usage This property MUST be used for distributions that offer a download url that has file format not in [[IANA-MEDIA-TYPES]].

Property: language

Label language
Obligation Optional
Max card. n
RDF Property dct:language
Range rdfs:Literal as ISO two letter code ([[ISO 639-1]])
Definition Language of the distribution
Usage

The property MUST be set if the distribution is language-dependent, or if it is given in some of the languages German, French, Italian and English but not in all four languages.

If there is a distribution with a language property set, the dataset MUST also provide a title in that language ()

Property: update/modification date

Label update/ modification date
Obligation Conditional
Max card. 1
RDF Property dct:modified
Range rdfs:Literal (typed as as xsd:date).
Definition Most recent date on which the distribution was changed, up-dated or modified.
Usage This property MUST be provided if the content of the distribution has changed since it was first published in the catalog. If there have been several changes, the property MUST show the date of the last content modification. This property signals to applications that use the data that their application might need to be updated as well.

Property: license

Label license
Obligation Optional
Max card. 1
RDF Property dct:license
Range dct:LicenseDocument
Definition License or terms of use under which the distribution is made available.
Usage

In the element dct:license, one license can be specified. It is recommended to use a current, international standard license, e.g. the CC-0 Creative Commons Public Domain Dedication.

This differs from the element dct:rights, where any information necessary to specify the usage rights can be added.

Property: identifier

Label identifier
Obligation Optional
Max card. 1
RDF Property dct:identifier
Range rdfs:Literal
Definition The internal identifier of the distribution.
Usage Some publishers issue unique identifiers for their physical representations of the dataset, therefore the identifier property has been also added for distributions. The interest in this field is mostly for the data publisher to recognize and match distributions with the corresponding representations of the data in his internal system.

Property: download URL

Label download URL
Obligation Optional
Max card. n
RDF Property dcat:downloadURL
Range rdfs:Resource
Definition URL pointing to a file that contains the distribution of the dataset in a given format
Usage

Because the distribution might be accessible via various methods (HTTP, WebDAV, FTP), there MAY be more than one download url.

In order to ensure the conformance with DCAT-AP each download url must be doubled as access url.

Property: coverage

Label coverage
Obligation Optional
Max card. n
RDF Property dct:coverage
Range dct:LocationPeriodOrJurisdiction
Definition Spatial (and temporal) characteristics of the object or resource .
Usage

This is the key element for supporting spatial or temporal range searching on document-like objects that are spatially referenced or time-referenced.

The content of various distributions in a dataset can differ from one another (e.g. population data for different years). For this reason, the specific data in the distributions may differ from the general information in the metadata of the dataset, particularly in terms of time and location.

In this case, the element dct:coverage, which can contain both time and location information, is permitted in DCAT-AP CH-compliant metadata.

Property: image

Label image
Obligation Optional
Max card. 1
RDF Property schema:image
Range schema:url or schema:ImageObject
Definition thumbnail picture illustrating the content of the dataset.
Usage For distributions that consist of visual content (photographs, videos, maps, etc.) it makes sense to add a limited number of thumbnails to the metadata.

Conformance to DCAT-AP

Conformance Criteria

DCAT-AP CH and DCAT-AP are both application profiles for DCAT.

The [[DCAT-PROFILE-GUIDANCE]] states that application profiles may form hierarchies.

For interoperability reasons, DCAT-AP CH should aim to be a sub-profile of DCAT-AP. That way it can be assured that metadata structured as DCAT-AP CH can be efficiently harvested by [[DATAPORTAL-EU]] without losing important information on its metadata.

The following diagram captures the relationship between DCAT, DCAT-AP and DCAT-AP CH:

classDiagram DCAT --> DCAT_AP: is_profile_of DCAT_AP --> DCAT_AP_CH: is_subprofile_of DCAT --> DCAT_AP_CH: is_profile_of class DCAT { - defines classes - defines properties - base vocabulary } class DCAT_AP { - uses classes of DCAT - specifies requirement levels - specifies controlled vocabulary } class DCAT_AP_CH { - localizes DCAT-AP for Swiss context - can restrict more but not less than DCAT-AP }

DCAT-AP CH must conform to both DCAT and DCAT-AP. This conformance of DCAT-AP CH means in detail:

Learnings and Next Steps

Learnings

Next Steps