AZ-305 Learning Portal

Concept — What & Why

Azure Non-Relational Storage Services

Azure Cosmos DBA globally distributed NoSQL database with multi-API support (Core SQL, MongoDB, Cassandra, Table, Gremlin). Provides single-digit millisecond latency globally, 99.999% SLA, configurable consistency levels, and built-in multi-region writes. Supports change feed for real-time data processing.Partition KeyThe property used to distribute data across logical partitions in Cosmos DB. Must be high-cardinality (many unique values), evenly distributed, and aligned with primary access patterns. A poor partition key causes hot partitions and request throttling. Cannot be changed after container creation.Cosmos DB Consistency LevelsTrade-offs between latency and data consistency: Strong (linearizable, slowest), Bounded Staleness (within version/time bounds), Session (default, consistent within a session), Consistent Prefix (ordered), Eventual (fastest, weakest). Session consistency is recommended for most applications.Azure Blob StorageObject storage for unstructured data (files, images, videos, logs). Three access tiers: Hot (frequent access), Cool (infrequent, 30+ day minimum), Archive (rare access, 90+ day minimum, requires rehydration). Lifecycle policies automate tier transitions based on last-modified date.Azure Data Lake Storage Gen2Blob Storage with hierarchical namespace (HNS) enabled. Provides POSIX-compliant ACLs, directory-level operations, and is integrated with analytics engines (Spark, Synapse, HDInsight). Required for big data analytics pipelines.

Service Selection Matrix

Data Type	Use Case	Recommended Service
JSON documents	User profiles, product catalogs	Cosmos DB Core SQL or MongoDB API
Key-value	Session state, IoT telemetry	Cosmos DB Table API or Azure Table Storage
Graph	Social networks, knowledge graphs	Cosmos DB Gremlin API
Files, images, videos	Media, backups, archives	Azure Blob Storage
Big data analytics	Spark, Hadoop, Synapse	Azure Data Lake Storage Gen2
Full-text search	E-commerce discovery	Azure AI Search

Blob Storage Redundancy Options

Option	Copies	Protection	Use Case
LRS	3 (same DC)	Hardware failure	Dev/test
ZRS	3 (across zones)	Zone failure	HA within region
GRS	6 (3 primary + 3 secondary)	Regional failure	DR, secondary read-only after failover
RA-GRS	6 + readable secondary	Regional failure	Geo-distributed reads without failover
GZRS / RA-GZRS	Zone + geo	Zone + regional	Maximum resilience

Deep Dive — How It Works

Cosmos DB Design Patterns

Partition Key Selection Criteria

A good partition key must satisfy ALL of the following:

High cardinality — Many unique values (user ID, order ID) prevents hot partitions
Even distribution — Requests spread evenly across logical partitions
Access pattern alignment — Your primary query filter should be the partition key
Immutability — Cannot be changed; choose a property that won't be updated

Anti-patterns: Using region (only 3–5 values), status (only a few states), or timestamp (hot partition on the latest time bucket) creates severe throttling.

Throughput Models Comparison

Model	Pricing	Best For
Provisioned (Manual)	Fixed RU/s, minimum charge	Predictable, steady workloads
Serverless	Per-RU consumed, no minimum	Bursty, intermittent, dev/test
Autoscale	Scales between 10%–100% of max RU/s	Variable with spikes, production

Decision rule: If traffic is zero for significant periods → Serverless. If traffic spikes unpredictably but there's always baseline traffic → Autoscale. If traffic is stable and predictable → Provisioned.

Blob Storage Access Tier Strategy

Lifecycle policy example: Blob last modified >30 days → move to Cool. Last modified >90 days → move to Archive. Last modified >7 years → delete.

Archive tier limitations:

Rehydration takes 1–15 hours (Standard) or up to 1 hour (High Priority)
Early deletion penalty if deleted before 90 days
Not suitable for data accessed more than 3 times per year

Cosmos DB Consistency Level Selection

Consistency	Latency	When to Choose
Strong	Highest	Financial transactions, global inventory
Bounded Staleness	High	Gaming leaderboards, social feeds
Session	Low (default)	Most applications — per-session consistency
Consistent Prefix	Low	Order-sensitive event streams
Eventual	Lowest	Click tracking, telemetry, non-critical data

Hands-On Lab

Hands-On: Create Cosmos DB Container with Optimal Partition Key

Step 1: Create Cosmos DB Account

Navigate to Azure Cosmos DB > Create
Select API: Core (SQL) for JSON with SQL queries
Configure:
- Account name: Globally unique
- Consistency level: Session (default)
- Geo-redundancy: Enable for HA
- Multi-region writes: Enable for active-active
Review and create

Step 2: Create Database and Container

Open Cosmos DB account > Data Explorer > New Container
Database ID: Create or select
Container ID: Enter meaningful name (e.g., user-profiles)
Partition key: Enter /userId (high-cardinality property)
Throughput: Autoscale with max 4,000 RU/s (recommended)
Enable TTL if items should auto-expire (e.g., session data)

Step 3: Configure Blob Storage Lifecycle Management

Open Storage Account > Lifecycle management

Click Add a rule > Code view and enter:

{
  "rules": [{
    "name": "tier-and-delete",
    "type": "Lifecycle",
    "definition": {
      "actions": {
        "baseBlob": {
          "tierToCool": {"daysAfterModificationGreaterThan": 30},
          "tierToArchive": {"daysAfterModificationGreaterThan": 90},
          "delete": {"daysAfterModificationGreaterThan": 2555}
        }
      },
      "filters": {"blobTypes": ["blockBlob"]}
    }
  }]
}

Save and apply — transitions occur automatically

Step 4: Query Cosmos DB with SQL API

Open Data Explorer > select container > New SQL Query

Example queries:

-- Efficient: filters on partition key
SELECT * FROM c WHERE c.userId = 'user-123'

-- Cross-partition: avoid in hot paths
SELECT * FROM c WHERE c.region = 'US'

Click Execute Query — note RU consumption for each query
Cross-partition queries consume significantly more RUs

Exam Angle — What AZ-305 Tests

AZ-305 Exam Focus

AZ-305 tests your ability to select the right non-relational service, consistency level, partition key strategy, and blob tier for a given scenario. Partition key selection and the serverless vs. provisioned vs. autoscale decision are the highest-frequency topics.

Exam Trap

Wrong Partition Key Selection: Any property with low cardinality (region, status, tier) causes hot partitions — a few logical partitions receive all requests while others sit idle, causing throttling. Always select high-cardinality properties (user ID, order ID, device ID) as partition keys.

Exam Trap

Cosmos DB Serverless Always Cheaper: Serverless has no minimum cost — but at predictable, continuous workloads with high RU/s consumption, Provisioned or Autoscale is cheaper. Serverless excels for bursty or intermittent workloads with significant idle periods.

Exam Trap

Archive Tier as Primary Storage: Archive tier requires rehydration (1–15 hours) before data is accessible. It is not suitable as primary storage for data accessed more than a few times per year. Use Archive only for compliance archiving or rarely accessed backup data.

Exam Trap

GRS vs. RA-GRS for Geo-Distributed Reads: GRS replicates to a secondary region, but the secondary is only readable after a failover. RA-GRS provides a readable secondary endpoint before failover occurs. Use RA-GRS when applications need to read from secondary regions without failover.

Exam Tip

Session Consistency Is the Sweet Spot: For most applications, Session consistency provides the right balance — each client session sees its own writes immediately (read-your-writes within a session). Strong consistency adds significant latency. Eventual consistency can cause surprising data discrepancies. Default to Session unless there's a specific reason to change it.

Must Memorize

TTL for Auto-Expiry: Use Cosmos DB's Time-to-Live (TTL) setting on a container to automatically delete items after a specified duration. This is distinct from Azure Blob lifecycle policies. TTL is the answer for IoT sensor data, session tokens, or any data with a natural expiration period.

Question — click to flip

Q: What makes a good Cosmos DB partition key?

Question — click to flip

Q: When should you use Cosmos DB Serverless vs. Autoscale throughput?

Question — click to flip

Q: What is the difference between GRS and RA-GRS for Blob Storage?

Question — click to flip

Q: What is Cosmos DB TTL and when should you use it?

Question — click to flip

Q: Which Cosmos DB consistency level is recommended for most applications?

Question — click to flip

Q: When should you use Azure Data Lake Storage Gen2 instead of regular Blob Storage?

2.2 — Design Data Storage Solutions for Semi-Structured and Unstructured Data

Azure Non-Relational Storage Services

Service Selection Matrix

Blob Storage Redundancy Options

Cosmos DB Design Patterns

Partition Key Selection Criteria

Throughput Models Comparison

Blob Storage Access Tier Strategy

Cosmos DB Consistency Level Selection

Hands-On: Create Cosmos DB Container with Optimal Partition Key

Step 1: Create Cosmos DB Account

Step 2: Create Database and Container

Step 3: Configure Blob Storage Lifecycle Management

Step 4: Query Cosmos DB with SQL API

AZ-305 Exam Focus