Which Azure service provides a managed Apache Spark environment for big data analytics and machine learning workloads?
HDInsight provides managed Apache Spark (and other open-source frameworks) for big data analytics.
Why this answer
Azure HDInsight is correct because it is a managed, open-source analytics service that provisions Apache Spark clusters for big data processing and machine learning workloads. It also supports Hadoop, Hive, and other frameworks, so users can run distributed analytics and ML pipelines without installing or maintaining the cluster software themselves.
Exam trap
Candidates often mistake Azure Synapse Analytics, which also includes Spark, for the primary managed Spark service. HDInsight is the dedicated, open-source-focused offering for Apache Spark clusters, without Synapse's SQL-centric integration.
How to eliminate wrong answers
Option A is wrong because Azure Stream Analytics is a real-time event processing engine for streaming data (e.g., IoT telemetry), not a managed Apache Spark environment.
Option C is wrong because Azure Data Factory is a cloud-based ETL and data integration service for orchestrating data movement and transformation, not a Spark-based analytics platform.
Option D is wrong because Azure Synapse Analytics is a unified analytics platform that integrates SQL, Spark, and Pipelines, but it is not exclusively a managed Apache Spark environment; HDInsight is the dedicated service for open-source Spark clusters.