Databricks Spark SQL Data Source in Privacera#
Prerequisites#
Have the following details ready to enter into the data source definition in Privacera:
-
A username and password in the target system that has read/write permission.
-
The name of the JDBC driver you need.
-
A JDBC connection string to communicate with the target data source.
Add Databricks Spark SQL Data Source in Privacera#
To add Databricks Spark SQL data source in Privacera Platform:
-
Navigate to: Settings > Data Source Registration.
-
Optionally click Add System or modify an existing data source.
-
Enter a useful name for this data source and a useful description.
-
Click Save.
-
Locate the new data source system name and from the wrench icon on the right, select Add Data Source.
-
In the Add Data Source dialog, on the Choose tab, select Databricks Spark SQL.
-
On the Configure tab:
-
Enter a required Application Name of your choice.
-
Enter a required Application Code of your choice. This is an identifier for your own use.
-
If you have prepared a properties file in JSON format, click Import Properties and load the file.
-
Scroll to find the following properties and enter the values you prepared:
-
jdbc.username
Enter the Email ID used to login to the Databricks account console.
-
jdbc.password
On Databricks account console:
a. Navigate to Settings -> User Settings -> Access Tokens.
b. Click Generate New Token.
c. Use the Token as password.
-
jdbc.url
On Databricks account console:
a. Click Compute and select the Cluster.
b. Navigate to Advance Options and click JDBC/ODBC tab.
c. Copy the URL from the JBDC URL section and update as shown in the following example:
Original URL: jdbc:spark://<yourHostname>:443/default;transportMode=http;ssl=1;httpPath=sql/protocolv1/o/6824215520793722/0406-064613-sweet542;**AuthMech=3;UID=token;PWD=<personal-access-token>** New URL: jdbc:hive2://<yourHostname>:443/default;transportMode=http;ssl=true;httpPath=sql/protocolv1/o/6824215520793722/0406-064613-sweet542;
-
-
Accept the default values for all other properties or modify them if needed.
-
At the bottom left, to verify the properties, click Test Connection.
Note
Your Databricks cluster should be up and running before clicking Test Connection.
-
At the bottom right, click Next to save the data source or Back to return to the Choose tab.