DP-201: Designing an Azure Data Solution

Notes

Design Azure data storage solutions (40-45%)

Recommend an Azure Data solution based on requirements

  • choose the correct data storage solution to meet the technical and business requirements
  • choose the partition distribution type

Design non-relational cloud data stores

  • design data distribution and partitions
  • design for scale, including multi-region, latency, and throughput
  • design a solution that uses Cosmos DB, Data Lake Storage Gen2, or Blob storage
  • select the appropriate Cosmos DB API
  • design a disaster recovery strategy
  • design for high availability

Design relational cloud data stores

  • design data distribution and partitions
  • design for scale, including multi-region, latency, and throughput
  • design a solution that uses SQL Database and SQL Data Warehouse
  • design a disaster recovery strategy
  • design for high availability

Design data processing solutions (25-30%)

Design batch processing solutions

  • design batch processing solutions by using Data Factory and Azure Databricks
  • identify the optimal data ingestion method for a batch processing solution
  • identify where processing should take place, such as at the source, at the destination, or in transit

Design real-time processing solutions

  • design for real-time processing by using Stream Analytics and Azure Databricks
  • design and provision compute resources

Design for data security and compliance (25-30%)

Design security for source data access

  • plan for secure endpoints
  • choose the appropriate authentication mechanism, such as access keys, shared access signatures (SAS), and Azure Active Directory (Azure AD)

Design security for data policies and standards

  • design for data encryption for data at rest and in transit
  • design for data auditing and data masking
  • design for data privacy and data classification
  • design a data retention policy
  • plan an archiving strategy
  • plan to purge data based on business requirements

Terminology

  • Azure Data Storage Solution
  • Non-relational Cloud Data Stores
  • Relational Cloud Data Stores
  • Batch Processing Solutions
    • Data Factory
    • Azure Databricks
  • Real-time Processing Solutions
    • Stream Analytics
    • Azure Databricks
  • Secure Endpoints
  • Shared Access Signatures (SAS)
  • Data Encryption
    • Data at rest and in Transit
  • Data Auditing
  • Data Masking
  • Data Privacy
  • Data Classification
  • Data Retention Policy
  • Archiving Strategy
  • Data Purging

Blogs

Training

Videos

References