Dictionary of terminology

Publication Year: 08. 04. 2017    Date of last inspection: 08. 04. 2017

  Know an interesting study?

Fill-in our form!

AIP, Archival Information Package: the state of digital objects in the archive after storage in the preservation system.  

Authenticity: a digital object is authentic if it can be said to rely on three significant provable properties: that the object is what it purports to be, that it was created by whomsoever is purported to have been created by; and that it was created at the time when it is purported to have been created.

Data database: a relational database with basic data on originally received data files and their variations, including the identification number, type of data file and basic thematic characteristics (data file or other connected materials).

Data description: the description of the data file with data in accordance with DDI (position in the data file and the markings of variables and units), together with a detailed description of the content of variables (conceptual meaning of the variable, set of values with codes) that define the thematic and conceptual level of data file that is under digital preservation and part of DDI codebook file.

Data Documentation Initiative (DDI): is a project of the social science community to establish an international standard and methodology for describing the content, presentation, transport, and preservation of metadata about datasets in the social and behavioral sciences. (http://www.ddialliance.org/).

Data producer: is an individual who is named on a deposit agreement as having sufficient responsibility to grant particular rights to the Archive on behalf of a data collection. The depositor may be the investigator, creator or the copyright owner of a data collection, but does not have to be. In some terminology, the term depositor is used in a similar meaning.

Data users: those persons, or client systems, that interact with OAIS services to find and acquire preserved information of interest. A special class of Consumers is the Designated Community. The Designated Community is the set of Consumers who should be able to understand the preserved information.

Data: constitute primary sources that underpin scientific research and enable derivation of theoretical or applied findings. More precise definitions of data vary according to context. Quantitative data may refer to just the matrices of numbers or words that comprise a data file but may also cover other information (metadata) held within a statistical package data file, such as variable labels, code labels, and missing value definitions. Qualitative data might include interview transcripts as well as audio and video recordings (analog or digital).

DataCite: DataCite is a leading global non-profit organization that provides persistent identifiers (DOIs) for research data (https://www.datacite.org/).

Data file description: data file description of quantitative data includes at least information on the number of variables and units in the datafile and their corresponding data file format: part of the DDI codebook file.

Datafile: a digital object that is the core unit of preservation and distribution in the ADP. The data are symbolic representations of reality, acquired with the data collection process.

DataVerse: is an open source web application to share, preserve, cite, explore, and analyze research data (http://dataverse.org/). 

DDI codebook file: digital object, formatted according to the DDI2.1 DTD standard in the XML language that includes all descriptive data, data on provenance and other metadata, which are important for understanding and reproducing data files. It includes the marking of identifiers and metadata of digital objects that form the collection of data materials (http://www.ddialliance.org/specification).  

Designated Community: an identified group of potential Consumers who should be able to understand a particular set of information. The Designated Community may be composed of multiple user communities. A Designated Community is defined by the Archive and this definition may change over time.

Digital data curation: Data curation is the selection, preservation, maintenance, and archiving of digital assets and it establishes, maintains and adds value to data for present and future use.

Digital Object: An object composed of a set of bit sequences.

Digital preservation: is a series of managed activities necessary to ensure enduring access to authentic versions of the content of digital materials for as long as necessary.

DIP, Dissemination Information Package: the state of digital objects as available to final users in different periods.   

DSA: Data Seal of Approval (http://datasealofapproval.org/)

Fedora: is the flexible, modular, open source repository platform with native linked data support.

Independently Understandable: A characteristic of information that is sufficiently complete to allow it to be interpreted, understood and used by the Designated Community without having to resort to special resources not widely available, including named individuals.

Information Object: A Data Object together with its Representation Information.

Information Package: A logical container composed of optional Content Information and optional associated Preservation Description Information. Associated with this Information Package is Packaging Information used to delimit and identify the Content Information and Package Description information used to facilitate searches for the Content Information.

Integrity: refers to its completeness and to a continued state of un-alteration of a digital object.

Jira: software used to note down the state and changes of individual studies as well as other administrative tasks.

Long-Term Preservation: The act of maintaining information, Independently Understandable by a Designated Community, and with evidence supporting its Authenticity, over the Long Term.

Metadata: information that describes significant aspects (e.g. content, context and structure of information) of a resource.

Nesstar: Nesstar WebView is a web-based system for the dissemination of data (http://www.nesstar.com/).   

Normalisation at ingest: file format conversion by the repository when data are submitted.

OAIS: The Open Archival Information System (OAIS) Reference Model is a conceptual framework for an archival system dedicated to preserving and maintaining access to digital information. It addresses a full range of archival preservation functions including ingest, archival storage, data management, access, and dissemination. It is not a metadata standard but rather it outlines a taxonomy that defines the information types deemed necessary for the understanding of digital content over an indefinite period of time (https://public.ccsds.org/pubs/650x0m2.pdf). 

OpenAire: The FP7 project OpenAIRE aimed to support the implementation of the EC and ERC Open Access policies. Its successor OpenAIREplus was aimed at linking the aggregated research publications to the accompanying research and project information, datasets and author information. Open access to scientific peer reviewed publications has evolved from a pilot project with limited scope in FP7 to an underlying principle in the Horizon 2020 funding scheme, obligatory for all H2020 funded projects. The goal is to make as much European funded research output as possible available to all, via the OpenAIRE portal.

Other materials, connected with the research data: data files of textual and other materials, which are connected with the research data, such as questionnaire, external code etc.

Other materials: textual and other data file documents that are important for the additional understanding of the content and context of the study, such as preliminary reports, publications etc. They are usually not part of ADP's digital preservation but are accessible through a link on the primary point of access.

Preservation strategy: a digital preservation strategy is a particular technical approach to the preservation of digital materials. This document contains the strategy and policy of the Archive.

Reliability: relies on having trusted and dependable contents of a digital object.

Semantic Information: The Representation Information that further describes the meaning beyond that provided by the Structure Information.

Significant properties: are those elements of a digital object which need to be preserved in order for it to be used by the designated user community. They will almost always include information content and a level of functionality, but might also include formatting and look and feel.

SIP, Submission Information Package: the state of digital objects in the archive after ingest. 

Structure Information: The Representation Information that imparts meaning about how other information is organized. For example, it maps bit streams to common computer types such as characters, numbers, and pixels and aggregations of those types such as character strings and arrays.

Study database: a relational database with basic metadata on the level of study that includes the unique study code, title of the study, data provider identification and the date of signing the license agreement.

Study: is the Archive’s basic content-tracking concept used for a data collection during the stages of acquisition, ingest, preservation and dissemination. There is normally a one-to-one relationship between a study and a data collection. Similarly, there is often a one-to-one relationship between a deposit and a study, but a deposit may, depending on circumstances, be divided into more than one study. Equally, more than one deposit from the same depositor may be combined into a single study, similar to accumulating archival fonds.

Succession Plan: The plan of how and when the management, ownership and/or control of the OAIS holdings will be transferred to a subsequent OAIS in order to ensure the continued effective preservation of those holdings.

Transformation: A Digital Migration in which there is an alteration to the Content Information or PDI of an Archival Information Package. For example, changing ASCII codes to UNICODE in a text document being preserved is a Transformation.

Usability: relies on the ability of a digital object to be located, retrieved, presented and interpreted.


  How to cite this page?

Social Science Data Archives. YEAR. Dictionary of terminology. Accessed: http://adp.fdv.uni-lj.si/eng/usposobi/slovar/ (DD. month year).

18. June 2020 |

Konferenca: Raziskovalni podatki in Evropski oblak odprte znanosti

11. March 2020 | Velika dvorana Ministrstva za izobraževanje, znanost in šport, Masarykova 16, Ljubljana

Delavnica: Ravnanje z raziskovalnimi podatki

UNI-FDVCESSDA coretrust_logo RDA_Node
ADP is part of the Social Sciences Research Institute of the Faculty of Social Sciences. The Slovenian Research Agency provides funding of the ADP within the infrastructure program "Network of Research and Infrastructural Centres" The ADP is a member of the umbrella organization of the European Social Science Data Archives CESSDA ERIC. © ADP (ISSN 2385-9415) | 1997 - 2017 | arhiv.podatkov@fdv.uni-lj.si