Page 109 - Big data - Concept and application for telecommunications
1 Scope
This Recommendation specifies the functional requirements for data provenance in a big data ecosystem as
defined in [ITU-T Y.3600]. It introduces data provenance in general and data provenance in a big data
ecosystem in particular, and provides a conceptual model, operations, logical components, and functional
requirements for big data provenance. The functional requirements provided in this Recommendation are
derived from use cases.
2 References
The following ITU-T Recommendations and other references contain provisions which, through reference in
this text, constitute provisions of this Recommendation. At the time of publication, the editions indicated
were valid. All Recommendations and other references are subject to revision; readers are therefore
encouraged to investigate the possibility of applying the most recent edition of the Recommendations and
other references listed below. A list of the currently valid ITU-T Recommendations is regularly published.
[ITU-T Y.3600] Recommendation ITU-T Y.3600 (2015), Big data – Cloud computing based requirements
and capabilities.
3 Definitions
3.1 Terms defined elsewhere
This Recommendation uses the following terms defined elsewhere:
3.1.1 big data [ITU-T Y.3600]: A paradigm for enabling the collection, storage, management, analysis and
visualization, potentially under real-time constraints, of extensive datasets with heterogeneous
characteristics.
NOTE – Examples of dataset characteristics include high-volume, high-velocity, high-variety, etc.
3.1.2 provenance [b-ITU-T X.1255]: Information pertaining to any source of information including the
party or parties involved in generating it, introducing it and/or vouching for it.
3.2 Terms defined in this Recommendation
This Recommendation defines the following term:
3.2.1 big data provenance: Information that records the historical path of data according to the data
lifecycle operations in a big data ecosystem.
NOTE 1 – Data lifecycle operations include data generation, transmission, storage, use, and deletion.
NOTE 2 – Data provenance information provides details about the source of data, such as the person
responsible for the provision of data, functions applied to data, and information about the computing
environment for data processing (e.g., operating system, description of the hardware, locale settings and
time zone).
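As an informal illustration of the definition in clause 3.2.1, the following sketch shows one way the information listed in NOTE 1 and NOTE 2 could be held as a record. It is not part of this Recommendation: the names ProvenanceEntry, ProvenanceRecord, describe_environment and every field name are hypothetical, chosen only to mirror the wording of the two notes.

```python
import locale
import platform
import time
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum
from typing import Dict, List, Optional


class LifecycleOperation(Enum):
    """Data lifecycle operations listed in NOTE 1 of clause 3.2.1."""
    GENERATION = "generation"
    TRANSMISSION = "transmission"
    STORAGE = "storage"
    USE = "use"
    DELETION = "deletion"


def describe_environment() -> Dict[str, Optional[str]]:
    """Collect the environment details named in NOTE 2 (illustrative only)."""
    return {
        "operating_system": platform.system(),
        "hardware": platform.machine(),
        "locale": locale.getlocale()[0],  # may be None if no locale is set
        "time_zone": time.tzname[0],
    }


@dataclass(frozen=True)
class ProvenanceEntry:
    """One step in the historical path of a piece of data (hypothetical layout)."""
    operation: LifecycleOperation
    responsible_party: str   # person or role responsible for the data
    function_applied: str    # function applied to the data at this step
    environment: Dict[str, Optional[str]]
    recorded_at: datetime


@dataclass
class ProvenanceRecord:
    """Ordered list of entries for one data item, identified by data_id."""
    data_id: str
    entries: List[ProvenanceEntry] = field(default_factory=list)

    def record(self, operation: LifecycleOperation,
               responsible_party: str, function_applied: str) -> ProvenanceEntry:
        """Append a new entry stamped with the current UTC time."""
        entry = ProvenanceEntry(
            operation=operation,
            responsible_party=responsible_party,
            function_applied=function_applied,
            environment=describe_environment(),
            recorded_at=datetime.now(timezone.utc),
        )
        self.entries.append(entry)
        return entry

    def historical_path(self) -> List[LifecycleOperation]:
        """Return the lifecycle operations in the order they were recorded."""
        return [entry.operation for entry in self.entries]
```

A record built this way grows by one entry per lifecycle operation, so that historical_path() returns the operations in the order they occurred. A real big data ecosystem would presumably also need persistent storage and integrity protection for such records, which this sketch leaves out.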
4 Abbreviations and acronyms
This Recommendation uses the following abbreviations and acronyms:
BD Big Data
BDC Big Data Service Customer
BDSP Big Data Service Provider
DB Data Broker
DP Data Provider