Acronym study
Terms 31–60 of 136 DP-900 acronyms and key terms. Each entry includes a plain-English definition and a link to the full 800-word glossary page with exam context and practice questions.
Term 31
Cosmos DB API is a collection of interfaces that lets applications interact with Azure Cosmos DB, a globally distributed NoSQL database, using different data models and query languages.
Term 32
Cross-Region Replication is the automated copying of data from a storage bucket in one geographic region to a bucket in a different geographic region for disaster recovery, compliance, or lower-latency access.
Term 33
A dashboard is a visual display of key metrics and data points that helps IT professionals monitor, analyze, and manage systems or processes in real time.
Term 34
Data is raw, unprocessed information, like numbers, words, or measurements, that can be stored, processed, and analyzed by computers.
Term 35
A data catalog is a centralized inventory of data assets that helps people find, understand, and trust the data they need for analytics or business decisions.
Term 36
A Data Center Interconnect is a network connection that links two or more separate data centers together so they can share data, resources, and services as if they were a single facility.
Term 37
Data classification is the process of organizing data into categories based on its sensitivity, value, and criticality to an organization, so that appropriate security controls can be applied.
Term 38
Data governance is the overall process of managing the availability, usability, integrity, and security of data used in an organization, based on internal standards and policies.
Term 39
Data ingestion is the process of moving data from various sources into a storage system where it can be accessed, analyzed, and used.
Term 40
A data lake is a centralized storage repository that holds vast amounts of raw data in its native format until it is needed for analysis.
Term 41
Data Lake Storage Gen2 is a cloud-based storage service that combines a scalable data lake with enterprise-grade file system capabilities for big data analytics.
Term 42
A data lakehouse is a modern data architecture that combines the flexibility of a data lake with the reliability and performance of a data warehouse on a single platform.
Term 43
A Data Lifecycle Manager is a system or set of policies that automates the movement, protection, retention, and deletion of data from creation to disposal, ensuring compliance and efficient storage usage.
Term 44
Data lineage is the process of tracking the origin, movement, and transformation of data as it flows through various systems and steps in a data pipeline.
Term 45
A data model is a blueprint that defines how data is organized, stored, and accessed in a database or data system.
Term 46
Data retention is the practice of keeping data for a specific period to meet legal, business, or compliance needs, and then securely disposing of it.
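As a rough illustration of a retention rule, the sketch below flags records that are older than a fixed retention period. The 365-day period, the record layout, and the function name are all hypothetical; real retention policies are usually enforced by the storage service itself and also cover secure disposal.

```python
from datetime import date, timedelta

# Hypothetical policy: keep records for 365 days, then dispose of them.
RETENTION_DAYS = 365

def is_expired(created, today, retention_days=RETENTION_DAYS):
    """Return True when a record has outlived the retention period."""
    return today - created > timedelta(days=retention_days)

# Made-up records for illustration only.
records = [
    {"id": "a", "created": date(2023, 1, 10)},
    {"id": "b", "created": date(2024, 1, 15)},
]

today = date(2024, 6, 1)  # fixed date so the example is reproducible
to_dispose = [r["id"] for r in records if is_expired(r["created"], today)]
to_keep = [r["id"] for r in records if not is_expired(r["created"], today)]
```

With these sample dates, record "a" falls outside the retention window and record "b" is still inside it.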
Term 47
Data security is the practice of protecting digital information from unauthorized access, corruption, or theft throughout its lifecycle.
Term 48
Data transformation is the process of converting data from one format, structure, or value into another to make it usable for analysis, storage, or reporting.
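A minimal sketch of a transformation step, assuming made-up order records where every field arrives as text. The field names and the target format are hypothetical; the point is only that types, units, and date formats are converted so the data becomes usable for analysis.

```python
from datetime import datetime

# Hypothetical raw records as they might arrive from a source system:
# every value is a string, and dates use a US-style format.
raw_rows = [
    {"order_id": "1001", "amount": "19.99", "ordered": "03/15/2024"},
    {"order_id": "1002", "amount": "5.00", "ordered": "03/16/2024"},
]

def transform(row):
    """Convert types and formats so the row is ready for analysis."""
    return {
        "order_id": int(row["order_id"]),
        # Store money as whole cents to avoid floating-point drift later.
        "amount_cents": round(float(row["amount"]) * 100),
        # Normalize the date to ISO 8601 (YYYY-MM-DD).
        "ordered": datetime.strptime(row["ordered"], "%m/%d/%Y").date().isoformat(),
    }

clean_rows = [transform(r) for r in raw_rows]
```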
Term 49
Data visualization is the practice of translating data and information into visual context, such as charts and graphs, to make complex data easier to understand and use for decision-making.
Term 50
A data warehouse is a central repository that stores large amounts of structured data from multiple sources, optimized for querying and analysis rather than day-to-day transactions.
Term 51
Dataproc is a managed Google Cloud service for running Apache Spark and Apache Hadoop clusters, allowing you to process large datasets quickly and economically.
Term 52
A dataset is a collection of related data, usually in a structured format, that can be used for analysis, training models, or reporting in Azure data services.
Term 53
A datastream is a continuous, ordered flow of data that is generated and transmitted from a source to a destination for real-time processing or analysis.
Term 54
DAX (DynamoDB Accelerator) is a fully managed, highly available, in-memory cache for Amazon DynamoDB that can reduce read latency for cached items from milliseconds to microseconds.
Term 55
A dedicated SQL pool is a provisioned data warehousing resource in Azure Synapse Analytics that provides a managed, scalable environment for running large-scale analytical queries using Transact-SQL.
Term 56
Denormalization is a database design strategy that adds redundant data to one or more tables to reduce the number of joins needed in queries, improving read performance at the cost of extra storage and more complex writes.
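The trade-off can be sketched with an in-memory SQLite database. The table and column names below are hypothetical: the normalized design needs a join to show each order with its customer name, while the denormalized table stores the name redundantly on every order row and can be read without a join.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace');
INSERT INTO orders VALUES (10, 1, 25.0), (11, 2, 40.0), (12, 1, 15.0);

-- Denormalized copy: the customer name is duplicated onto each order row.
CREATE TABLE orders_denorm AS
SELECT o.id, o.customer_id, c.name AS customer_name, o.total
FROM orders o JOIN customers c ON c.id = o.customer_id;
""")

# Normalized read: requires a join across two tables.
normalized = conn.execute(
    "SELECT o.id, c.name, o.total FROM orders o "
    "JOIN customers c ON c.id = o.customer_id ORDER BY o.id"
).fetchall()

# Denormalized read: a single-table query returns the same rows.
denormalized = conn.execute(
    "SELECT id, customer_name, total FROM orders_denorm ORDER BY id"
).fetchall()
```

Both queries return the same rows here. The cost shows up on writes: renaming a customer would mean updating `customers` and every matching row in `orders_denorm`.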
Term 57
Amazon DocumentDB is a fully managed, MongoDB-compatible document database service that stores, queries, and indexes JSON-like data for scalable applications.
Term 58
EBS encryption is a security feature that automatically encrypts data stored on Amazon Elastic Block Store volumes, protecting it at rest and in transit between the volume and the attached EC2 instance.
Term 59
An EBS snapshot is a point-in-time backup of an Amazon Elastic Block Store volume that you can use for recovery, cloning, or migration.
Term 60
Amazon Elastic Block Store (EBS) volume types are different categories of block-level storage volumes optimized for specific performance, cost, and use case requirements in the AWS cloud.