We recommend using Azure Native.
Azure v6.21.0 published on Friday, Mar 7, 2025 by Pulumi
Spark on Azure HDInsight
An example Pulumi component that deploys a Spark cluster on Azure HDInsight.
Running the App
- Create a new stack: - pulumi stack init dev
- Login to Azure CLI (you will be prompted to do this during deployment if you forget this step): - az login
- Specify the Azure location and subscription to use: - pulumi config set azure:location WestUS pulumi config set azure:subscriptionId <YOUR_SUBSCRIPTION_ID>
- Define Spark username and password (make it complex enough to satisfy Azure policy): - pulumi config set username <value> pulumi config set --secret password <value>
- Run - pulumi upto preview and deploy changes:- $ pulumi up Previewing changes: ... Performing changes: ... info: 5 changes performed: + 5 resources created Update duration: 15m6s
- Check the deployed Spark endpoint: - $ pulumi stack output endpoint https://myspark1234abcd.azurehdinsight.net/ # For instance, Jupyter notebooks are available at https://myspark1234abcd.azurehdinsight.net/jupyter/ # Follow https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-load-data-run-query to test it out