Analytics
Analytics covers the analysis of data in its broadest sense, including its collection, measurement, analysis, visualization and interpretation.
API (Application Programming Interface)
An API is an interface that allows computer programs to automatically interact under specified, documented conditions, without human intervention.
Artificial Intelligence (AI)
Artificial intelligence (AI) is the simulation of human intelligence processes by machines, especially computer systems.
Big data
Big data covers the technologies, methods and processes used to collect and analyze the enormous volumes of data now produced by organizations.
Box Plot
A box plot is a standardized, graphical way of summarizing the distribution of a set of data groups and visualizing them for further analysis.
Business Glossary
A business glossary defines and organizes the different business terms used to describe data within an organization.
Business intelligence
Business intelligence uses technology to collect and analyze data, providing deeper insight into operations, and supporting better decision-making.
Chief Data Officer (CDO)
The Chief Data Officer (CDO) is the senior executive responsible for how an organization utilizes and governs data to drive maximum business value.
Cloud Computing
Cloud computing is the on-demand delivery of IT resources and access to remotely stored data via the internet which scales to match user needs.
Data Lifecycle
Data plays an essential role in organizational success. It fuels strategic and operational decisions. It drives innovation and revenue generation for ...
Dashboard as a Service (DaaS)
Dashboard as a Service (Daas) is a cloud-based solution that enables the easy creation of online dashboards without requiring deep technical skills.
Data Accessibility
Data accessibility refers to how easy it is for an organization’s data to be discovered, accessed, understood and utilized effectively by all.
Data Analyst
A data analyst collects, processes and analyzes an organization's data, ensuring data is used as a decision-making tool.
Data architect
The data architect designs and manages an organization's data architecture, which collects, stores, processes, analyzes and shares all data.
Data Architecture
A data architecture provides a framework for collecting, storing, sharing, processing, analyzing and reusing data, thus turning it into value.
Data as a product
Data as a Product (DaaP) is an approach that creates standalone products built on data, enabling it to be used by internal and external users. A data ...
Data Asset
A data asset is any digital object or entity made up of data. It could be a dataset, document, visualization or data service.
Data Asset Inventory (DAI)
A data asset inventory (DAI) is a structured catalog that finds, lists and details all of an organization’s data, aiding compliance and data security....
Data Asset Management
Data asset management is the end-to-end process of organizing, managing, and optimizing data assets to generate business value.
Data automation
Data automation refers to the practice of processing data through automated software. Essentially, the tasks of collecting, exploring, cleaning or man...
Data catalog
A data catalog is an inventory of all data within an organization. This enables internal and external users to easily find and access information.
Data cleansing
Data cleansing (or data cleaning) is the process of identifying and fixing incorrect, incomplete, duplicate, unneeded, or invalid data in a data set.
Data contract
A data contract is a formal agreement that defines how data is structured, formatted, and communicated between different components of a data system.
Data culture
A data culture is where everyone in an organization relies on, and uses, data within their working lives for decision making, planning and operations.
Data custodian
Data custodians are directly responsible for the technical safekeeping, security, accessibility, and proper maintenance of an organization’s data asse...
Data dashboards
Data dashboards are management tools that collect and display metrics, such as key performance indicators, to monitor and improve business activities.
Data Dictionary
A data dictionary provides detailed technical specifications about data elements, structures and attributes within a specific data source.
Data discovery
Data discovery involves finding and classifying data from multiple sources and making it available for all to improve decision making and performance....
Data ethics
Data ethics covers the ethical and moral obligations of collecting, sharing, and using data, focused on ensuring that data is used fairly, for good.
Data Exchange
A data exchange enables the secure, controlled, self-service sharing of data between different organizations, allowing potential data monetization.
Data exploration
Data exploration is the first step in data analysis, where data visualization and statistical techniques are used to better understand the nature of d...
Data governance
Data governance covers how you identify, organize, handle, manage, and use data collected in your organization, reducing risk and enabling agility.
Data Governance Officer
A data governance officer is responsible for leading and coordinating data governance activities on a day-to-day basis. Data governance covers how an ...
Data innovation
Data innovation is the use of data and analytics to create new added-value products, solutions, processes, organizational methods and markets.
Data integration
Data integration covers bringing together data from multiple different sources, making it more actionable and useful to those who access it.
Data Intelligence
Faced with vast data volumes, data intelligence is crucial to identifying the most relevant information.
Data interoperability
Data interoperability is vital in order to foster collaboration, decision-making, and information sharing. Today’s world is characterized not just by ...
Data join
Data join involves combining multiple datasets into one, increasing the relevance of data and enabling deeper analysis.
Data Lake
A data lake is a large-scale, centralized repository which stores and processes structured, semistructured, and unstructured data in its raw format.
Data Lifecycle
Data plays an essential role in organizational success. It fuels strategic and operational decisions. It drives innovation and revenue generation for ...
Data lineage
Data lineage (or data traceability) provides full visibility of the data lifecycle inside and outside the organization, including any changes made.
Data literacy
Data literacy is the ability to read, understand, work with, analyze and communicate with data, turning it into meaningful, relevant information.
Data Management Platform (DMP)
A complete definition of Data Management Platforms (DMPs), their benefits, and their central role in modern data management.
Data Mapping
Data mapping is the process of matching data fields from one database or information source, to related fields in a destination source or system.
Data Marketplace
A data marketplace is a centralized space where datasets and data assets can be accessed, exchanged and shared between organizations and individuals.
Data Mart
A subset of a data warehouse, a data mart is a way of storing data focused on a particular office, department, line of business area or subject.
Data Mesh
Data mesh is a decentralized, federated approach to data management that enables data sharing and data democratization across the organization.
Data mining
Data mining is the analysis of huge volumes of data to find hidden patterns, anomalies, or correlations, predicting future trends and opportunities.
Data monetization
Data monetization is the process of using company data to drive quantifiable internal or external economic benefits.
Data Normalization
Data normalization ensures that data from different sources is organized and structured in a uniform, consistent and logical way, removing anomalies.
Data Pipeline
A data pipeline covers the steps involved in processing, optimizing and preparing raw data from disparate sources, so it can be used by the business
Data portal
The following article provides a comprehensive definition of data portals. In an ever-evolving digital world, understanding the role and capabilities ...
Data Preparation
Data preparation (or pre-processing) validates, cleans, consolidates and enriches the raw data collected by an organization.
Data producer
A data producer is the root source of data. It can be a person manually entering data, an automated service, or a device/machine that gathers data.
Data product
A data product is a product built around data, enabling users to complete a specific task or objective using that underlying data.
Data product marketplace
A data product marketplace is a standardized, centralized collaborative platform that promotes and enables the consumption of data products and other ...
Data quality
Data quality is a measure of the condition of data, based on areas including accuracy, completeness, timeliness, consistency and reliability.
Data repository
A data repository is a secure, accessible data storage space containing specific data partitioned and made available for analysis or reporting.
Data Reuse
Data reuse is when information is reused for purposes other than the one it was initially collected for. This creates new value from the data.
Data room
A data room is a secure place that an organization uses to share confidential documents with selected third parties.
Data science
Data science is the practice of extracting and applying valuable information and insights from large volumes of structured and unstructured data.
Data Service Level Agreement (SLA)
A data service level agreement (SLA) sets out guaranteed performance levels around quality, reliability and availability for data assets.
Data silo
A data silo is a collection of data created by one department or system that is inaccessible to the wider organization.
Data sourcing
Data sourcing is the systematic, methodological process of collecting raw data from inside and outside an organization.
Data Space
A data space brings together relevant data infrastructures from different partners and shares them in an interoperable, secure and standardized way.
Data steward
A data steward helps implement a good data governance strategy across their organization to guarantee the quality, usability, and security of data.
Data storage
Data storage is the retention of digital information on recording media, so that it can be accessed by computers or other devices for future use.
Data storytelling
Data storytelling involves making information easily understandable and compelling by using storytelling techniques to provide context.
Data Streaming
What is data streaming? Why is it essential for real-time analysis and decision-making?
Data Transformation
Data transformation involves transitioning data from an external source to an internal information system.
Data Virtualization
Data virtualization brings together data from multiple, disparate sources across the organization in real-time, in a single, virtual location.
Data Warehouse
A data warehouse brings together data from multiple sources into a single, centralized, large repository for storage, analysis and reporting.
Data-as-a-Service / Data Service
Data-as-a-Service (or DaaS) is a business model where organizations offer customers and partners access to their data, normally through subscription.
Data-driven
In a data-driven organization employees across all departments rely on insights from data to support strategic and operational decision making.
Data-driven
In a data-driven organization employees across all departments rely on insights from data to support strategic and operational decision making.
Database Administrator
The Database Administrator ensures that the thousands of pieces of information stored in an organization's databases are reliable, high-quality, secur...
Database management systems (DBMS)
A database management system (DBMS) is software that allows data to be stored, retrieved, sorted, deleted, modified or used.
Dataset
A dataset is a collection of related data points, providing data in an understandable form to be shared and reused internally and externally.
Dataset schema
A dataset schema is a blueprint that outlines how particular data, such as in a database, is structured, configured and organized.
Digital Asset Management
Digital Asset Management (DAM) enables the efficient and secure storage, retrieval and management of digital files and assets.
Digital Twin
A digital twin is a virtual replica of a physical element which is capable of reproducing its behavior and lifecycle.
Ecosystem data marketplace
An ecosystem data marketplace seamlessly and securely shares data between partners within an ecosystem through an intuitive self-service experience.
Embedded Analytics
The goal of embedded analytics is to make it easy for everyone to access and utilize data directly where they need it.
Environmental, Social and Governance (ESG)
Environmental, Social and Governance (ESG) data provides information on an organization’s impact on society, the environment, and its transparency.
Extract, Transform, Load (ETL)
Extract, Transform, Load (ETL) is the data integration process used to combine data from multiple sources into a single, centralized repository.
GDPR – General Data Protection Regul...
The General Data Protection Regulation (GDPR) is legislation designed to protect and control the use of personal data in the EU and other countries.
Geographic Information System (GIS)
A Geographic Information System (GIS) is an information system that gathers, manages and analyzes spatial and geographic data.
Open government
Open government increases transparency, integrity, accountability, and stakeholder participation in government, benefiting citizens and public bodies.
Internal data marketplace
An internal data marketplace (IDM) effectively and securely shares data through self-service with all employees across the entire organization.
Internal Data Portal
An internal data portal shares data inside an organization or with a limited range of selected partners.
Internet of Things (IoT)
The Internet of Things (IoT) is a network of physical sensors that monitor variables and connect and exchange this data through the internet.
Linked Open Data (LOD)
Linked Open Data (LOD) refers to information that is both accessible to everyone and structured in a way that machines can interpret. Linked Open Data...
LLM Mesh
A Large Language Model (LLM) Mesh is an integrated ecosystem of multiple LLMs that enables AI to be scaled successfully across the organization.
Master Data Management (MDM)
Master Data Management is the process of creating and managing a uniform set of data records to categorize transactional data across a business.
Metadata
Metadata is data that gives a description of other data. This provides context to make it more easily understandable and usable.
Mobility as a Service (MaaS)
The Mobility as a Service (MaaS) concept covers data-driven digital services that aim to improve mobility through user-focused platforms.
Modern Data Stack
A modern data stack (MDS) is a collection of cloud-based components and tools used to collect, store, process, analyze, visualize, and share data.
No-code
No-code solutions enable non-specialists without programming skills to quickly and easily develop and publish full software applications.
NoSQL Databases
NoSQL databases are a type of database management system that is designed to be capable of processing large volumes of changing unstructured data.
Open data
Open data is data which is shared, openly accessible and exploitable for any purpose by everyone (including companies, citizens, media, or consumers)....
Open data portal
An open data portal is an online portal used by both public and private organizations to share data externally with their audiences.
Open source
Open source software (OSS) is software and source code released under a license that grants users the rights to use, modify, and distribute it.
Personal Data
Personal data (or personally identifiable information (PII)) is information that relates to an identified or identifiable individual.
Product Data Management (PDM)
Product data management (PDM) is the central system used to securely capture and manage engineering data and process information during product develo...
Public sector data
Public data refers to all information made freely available by government bodies or local collectivities.
Reference Data
Reference data is static, or semi-static, data used to classify or categorize other data, ensuring consistency and standardization in data management.
Relational Database
A relational database stores and structures data by organizing it in rows, columns and tables, based on defined relationships between data points.
Self-Service Business Intelligence
Self-service business intelligence (BI) solutions enable a wider audience to successfully benefit from business intelligence tools. Explore what self-...
Self-service data
Self-service data enables everyone within an organization or ecosystem to independently access, query and gain insights from data.
Self-service data platform (SDP)
A self-service data platform (SDP) enables the independent discovery, access, and analysis of data sets and the creation and management of data produc...
Smart City
A smart city is an area that uses technology and data to improve the citizen experience, increase efficiency, innovate, and meet its wider objectives.
Smart grids
Smart grids aim to optimize the production, distribution and consumption of electricity through connected technology, including networks and sensors.
Smart Parking
Smart parking uses digital technologies to optimize vehicle parking, saving time, reducing pollution and improving the driving experience.
Snowflake schema
A snowflake schema is a type of database/data warehouse schema used to store data through a multi-dimensional structure.
Sovereign AI
Sovereign AI covers an individual nation’s ability to control, create and deploy AI models using its own infrastructure, data, workforce and networks....
Sovereign Cloud
A sovereign cloud is a cloud environment that is physically located within one country, facilitating compliance with local laws.
Standardized Data
Standardized data is data from different sources that has been transformed into a consistent, standards-based format, allowing meaningful comparisons.
Star schema
A star schema is the simplest type of database/data warehouse schema used to store data, with the model’s design resembling a star shape.
Statistical Data and Metadata eXchange (SD...
SDMX (Statistical Data and Metadata eXchange) is a standard developed by the statistics community to manage and automate the process of exchanging and...
Structured and Unstructured Data
Structured and unstructured data are terms to describe the format and models of specific data, and impact how data is collected, stored and analyzed.