Enable Delta Lake for AWS EMR
To enable Delta Lake support for AWS EMR, set EMR_SPARK_DELTA_LAKE_ENABLE to true.
- SSH to the instance where Privacera is installed.
- Navigate to the custom-vars plugin directory:
- Open the penv.sh file and update the following properties:
- After updating the configuration, continue with the setup steps, starting from Create EMR Cluster, to launch the EMR cluster.
- For information on how to use and validate Delta Lake in AWS EMR, see the Using Delta Lake with AWS EMR guide.
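The property update in the steps above can be scripted. The following is only a sketch: the `set_prop` helper is hypothetical and not part of Privacera, and it assumes that penv.sh stores properties as plain `KEY=VALUE` lines (optionally prefixed with `export`) and that the script runs from the directory containing penv.sh.

```shell
#!/usr/bin/env bash

# Hypothetical helper: set KEY=VALUE in a shell-style properties file.
# Assumption: properties are stored as KEY=VALUE lines, optionally
# prefixed with "export". Values containing "|" are not supported.
set_prop() {
  local file="$1" key="$2" value="$3"
  if grep -qE "^[[:space:]]*(export[[:space:]]+)?${key}=" "$file"; then
    # Replace the existing assignment, keeping any "export" prefix.
    # A backup of the original file is written to "$file.bak".
    sed -i.bak -E "s|^([[:space:]]*(export[[:space:]]+)?)${key}=.*|\1${key}=${value}|" "$file"
  else
    # Property not present yet: append it.
    printf '%s=%s\n' "$key" "$value" >> "$file"
  fi
}

# Apply the change only if penv.sh exists in the current directory.
if [ -f penv.sh ]; then
  set_prop penv.sh EMR_SPARK_DELTA_LAKE_ENABLE true
  # Show the resulting line so the change can be checked by eye.
  grep -n 'EMR_SPARK_DELTA_LAKE_ENABLE' penv.sh
fi
```

Replacing an existing line rather than always appending keeps the file free of duplicate assignments if the script is run more than once.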
To enable Delta Lake support for AWS EMR, set SPARK_DELTA_LAKE_ENABLE to enable-spark-deltalake.
- To enable Delta Lake support for EMR Spark, update the BootstrapActions configuration in the EMR template as shown below. Then, create a new EMR cluster with this template: privacera-emr-bootstrap-actions-delta-lake
- Add the following Delta classification to the EMR template to enable Delta support:
- After updating the configuration, continue with the setup steps, starting from Create EMR Cluster, to launch the EMR cluster.
- For information on how to use and validate Delta Lake in AWS EMR, see the Using Delta Lake with AWS EMR guide.
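For orientation, the sketch below writes a Delta classification to a JSON file. The classification name `delta-defaults` and the property `delta.enabled` are the ones AWS documents for Amazon EMR 6.9.0 and later. That the Privacera EMR template expects exactly these values is an assumption, and the file name `delta-classification.json` is hypothetical.

```shell
#!/usr/bin/env bash

# Hypothetical output file name; not part of the Privacera template.
CONFIG_FILE="${CONFIG_FILE:-delta-classification.json}"

# Classification as documented by AWS for EMR 6.9.0 and later.
# Assumption: the Privacera EMR template uses the same values.
# In a CloudFormation template, the inner key is named
# "ConfigurationProperties" instead of "Properties".
cat > "$CONFIG_FILE" <<'EOF'
[
  {
    "Classification": "delta-defaults",
    "Properties": {
      "delta.enabled": "true"
    }
  }
]
EOF

echo "Wrote $CONFIG_FILE"
```

The JSON uses the shape accepted by the AWS CLI `--configurations` option. When the classification goes into a template instead, copy only the object inside the array and adjust the key name as noted in the comment.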