Data Platforms in Azure, Part 5: Choosing the Right Service

Over the last four parts of this series, we’ve gone deep into Azure’s major data platforms:

Cosmos DB (Part 1) — globally distributed NoSQL.
Azure Data Lake (Part 2) — scalable storage for analytics.
Azure Database for MySQL (Part 3) — managed open-source relational.
Azure Database for PostgreSQL (Part 4) — managed Postgres, with Hyperscale.

Now let’s put it all together. Which service do you use, and when?

The Decision Framework

Here’s the big question: What kind of workload are you running?

Transactional (OLTP): Lots of small reads and writes (e.g., ecommerce, SaaS apps).
Analytical (OLAP): Large queries scanning millions of rows, often for BI or ML.
Globally distributed / low-latency: Users all over the world need fast access.
Unstructured / semi-structured data: JSON, logs, telemetry, files.
Hybrid / mixed: Some workloads blend these needs.

Service by Service Breakdown

🔹 Azure SQL Database

Best for: Traditional apps written for SQL Server.
Pros: Full T-SQL, strong relational features, easy migration from on-prem SQL.
Cons: Proprietary; less flexible for JSON/document workloads.
Example: A payroll system or CRM built on SQL Server.

🔹 Azure Cosmos DB

Best for: Low-latency, globally distributed workloads.
Pros: Multi-region writes, five APIs (SQL, Mongo, Cassandra, Gremlin, Table), sub-10ms latency.
Cons: RU/s pricing model can be confusing; not ideal for complex joins.
Example: A gaming leaderboard with players across continents.

🔹 Azure Data Lake Storage (Gen2)

Best for: Big data and analytics pipelines.
Pros: Schema-on-read, hierarchical namespace, works seamlessly with Synapse, Databricks, Spark.
Cons: Not for OLTP; querying raw files can be slower unless optimized.
Example: Storing raw IoT sensor data, then analyzing with Synapse.

🔹 Azure Database for MySQL

Best for: Web and mobile apps already using MySQL.
Pros: Open source compatibility, managed backups and HA, cost-effective for moderate workloads.
Cons: Limited to MySQL feature set, fewer extensions than Postgres.
Example: A WordPress site or LAMP stack SaaS app.

🔹 Azure Database for PostgreSQL

Best for: Apps needing Postgres features, extensions, or scale-out.
Pros: JSONB, PostGIS, Hyperscale (Citus) for sharding, strong standards compliance.
Cons: Slightly steeper learning curve; Hyperscale adds complexity.
Example: A geospatial app using PostGIS for mapping and routing.

Quick Comparison Table

Service	Type	Strengths	Weaknesses	Example Use Case
SQL Database	Relational (PaaS)	T-SQL, strong transactional support, easy migration from SQL Server	Proprietary, less JSON support	ERP or CRM app
Cosmos DB	NoSQL	Global distribution, multi-API, low latency	RU pricing, limited joins	Real-time gaming, IoT
Data Lake	Storage + Analytics	Schema-on-read, cheap, scales to PBs, works with Synapse	Not OLTP, slower queries without optimization	Raw log storage, ML training
MySQL (Flexible Server)	Relational (OSS)	Familiar, cost-effective, managed service	Limited extensions, single node	LAMP apps, WordPress
PostgreSQL (Flexible + Hyperscale)	Relational (OSS)	Rich data types, extensions, horizontal scale with Citus	Complexity at scale, learning curve	Geospatial apps, SaaS scale-out

Migration Tips

From SQL Server: Use Data Migration Assistant → Azure SQL Database or Managed Instance.
From MongoDB or Cassandra: Cosmos DB’s Mongo and Cassandra APIs are often drop-in replacements.
From on-prem MySQL/Postgres: Use Azure Database Migration Service for minimal downtime.
For analytics: Land raw data in Data Lake, then layer Synapse/Databricks on top.

Putting It Together

A real-world example:

Imagine you’re running a youth esports platform (matches, rosters, stats, video replays).

Match stats (fast lookups, global players): Cosmos DB (SQL API).
Video replays and logs: Data Lake Storage Gen2.
Tournament scheduling and billing: Azure SQL Database.
Content management system (WordPress site): MySQL Flexible Server.
Advanced analytics (skill tracking, ML models): Data Lake + Synapse.

No one service fits all — the trick is to match the workload to the right Azure data platform.

Wrap-Up

Azure’s data platform ecosystem is broad, but not redundant. Each service is optimized for a different type of workload.

Choose Cosmos DB for global NoSQL.
Choose Data Lake for analytics storage.
Choose MySQL/Postgres when you want open source relational.
Choose SQL Database if you’re in the Microsoft SQL Server world.

With these five parts, you’ve now got a beginner-to-advanced view of how Azure’s main data services fit together.

In the future, I’ll extend this series with hands-on labs (e.g., connecting Power BI to Data Lake, or building a Hyperscale Postgres cluster) so you can practice what we’ve covered.

References

Azure SQL Database overview
Azure Cosmos DB overview
Data Lake Storage Gen2
Azure Database for MySQL
Azure Database for PostgreSQL

Discover more from SQLYARD

Subscribe to get the latest posts sent to your email.

Data Platforms in Azure, Part 5: Choosing the Right Service

The Decision Framework

Service by Service Breakdown

🔹 Azure SQL Database

🔹 Azure Cosmos DB

🔹 Azure Data Lake Storage (Gen2)

🔹 Azure Database for MySQL

🔹 Azure Database for PostgreSQL

Quick Comparison Table

Migration Tips

Putting It Together

Wrap-Up

Like this:

Related

Discover more from SQLYARD

Leave a ReplyCancel reply

Sign up to receive email updates, fresh news and more!

The Decision Framework

Service by Service Breakdown

🔹 Azure SQL Database

🔹 Azure Cosmos DB

🔹 Azure Data Lake Storage (Gen2)

🔹 Azure Database for MySQL

🔹 Azure Database for PostgreSQL

Quick Comparison Table

Migration Tips

Putting It Together

Wrap-Up

Share this:

Like this:

Related

Discover more from SQLYARD

Related Posts

Leave a ReplyCancel reply

Discover more from SQLYARD