For providers, services and trainingMaterial, the corresponding Confluence pages (describing the data model) have a column that says whether or not the information is public (the Public column).
I've used this as the primary selector for deciding whether information should be shared.
The corresponding Confluence page for data sources (describing the data model) lacks a Public column. Therefore, it isn't clear what information (if any) is considered public.
Without understanding which fields are public, it is unclear (to me) which data may be shared.
My default position is to assume all data is private and exclude Data Sources from the dump altogether.
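The selection rule described above could be sketched roughly as follows. This is only an illustration: the in-memory representation of the data model (a mapping from field name to the value of the Public column) and all field names are assumptions, not the actual structure of the Confluence pages or of the dump tooling.

```python
from __future__ import annotations

from typing import Mapping, Optional

# Hypothetical view of a data model: field name -> value of the "Public"
# column ("Yes"/"No"), or None when the page has no such column at all.
DataModel = Mapping[str, Optional[str]]


def public_fields(model: DataModel) -> set[str]:
    """Return the fields that may be shared; unmarked fields count as private."""
    return {name for name, public in model.items() if public == "Yes"}


def filter_record(record: Mapping[str, object], model: DataModel) -> dict[str, object]:
    """Keep only the fields of a record that the data model marks as public."""
    allowed = public_fields(model)
    return {key: value for key, value in record.items() if key in allowed}


# Illustrative models only; these field names are made up.
provider_model = {"name": "Yes", "internalNotes": "No"}
data_source_model = {"name": None, "endpoint": None}  # page lacks a Public column

provider = {"name": "Example Provider", "internalNotes": "do not share"}
data_source = {"name": "Example Source", "endpoint": "https://example.org"}

print(filter_record(provider, provider_model))
print(filter_record(data_source, data_source_model))
```

With this default, a data model without a Public column yields an empty record, which matches the position of leaving Data Sources out of the dump until their public fields are defined.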
The Provider Profile and the Resource Profile (for services) contain fields (First Name, Last Name, Email, ...) under the collective group of "Public Contact". These details are marked as being public (the Public column has the value Yes).
From manual inspection, I can confirm that this information is publicly accessible: it is visible in the portal when viewing the details of a provider or a service.
However, some providers and some services/resources have a public contact that clearly identifies a person. In other words, this is personal data.
My understanding is that the privacy agreement (through which this information was collected) does not allow personal information to be shared.
Therefore, I would suggest we filter out Public Contact, to avoid inadvertently sharing personal information.
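A minimal sketch of such a filter is shown here. The key name `publicContacts` and the shape of the record are assumptions made for illustration; the real field name in the exported data may differ.

```python
from __future__ import annotations

from typing import Any, Mapping

# Hypothetical key under which the "Public Contact" group is stored.
PUBLIC_CONTACT_KEY = "publicContacts"


def strip_public_contact(record: Mapping[str, Any]) -> dict[str, Any]:
    """Return a copy of the record without the Public Contact group."""
    return {key: value for key, value in record.items() if key != PUBLIC_CONTACT_KEY}


# Made-up example record for a service.
service = {
    "name": "Example Service",
    "publicContacts": [
        {"firstName": "Jane", "lastName": "Doe", "email": "jane.doe@example.org"}
    ],
}

cleaned = strip_public_contact(service)
print(cleaned)
```

The function returns a new dictionary rather than modifying its input, so the original record stays intact for any processing that still needs the contact details internally.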
One important decision is to choose a license for the data dump.
From discussion with Sally, we thought that CC-BY (specifically, CC-BY 4.0) would be a reasonable choice.
Alexander Paul Millar (bddc6c8d) at 28 Mar 16:51
Rename container to common-container
Alexander Paul Millar (9411f8c7) at 26 Mar 11:05
Add initial version of trainingResources data model
... and 1 more commit
Alexander Paul Millar (adef6a7d) at 25 Mar 14:21
Introduce the Email type
Alexander Paul Millar (ee40ed3c) at 25 Mar 12:34
Add URL as a new type
Alexander Paul Millar (0310f1eb) at 19 Mar 23:29
Add information taken from EOSC Marketplace "About" pages
... and 1 more commit
Alexander Paul Millar (3c7b6dd4) at 15 Mar 10:59
Factor out common elements: structures and enumerations
... and 1 more commit
Alexander Paul Millar (4949b579) at 15 Mar 09:47
Factor out common container elements
Alexander Paul Millar (0013d123) at 15 Mar 09:27
Teach git to ignore input data and generated output
Alexander Paul Millar (a759ed51) at 12 Mar 23:42
Add git location to generated output
Alexander Paul Millar (3ea9febd) at 12 Mar 14:41
Initial commit