Using Anaconda with Cloudera CDH
There are different methods of using Anaconda Scale on a cluster with Cloudera CDH:
- The freely available Anaconda parcel for Cloudera CDH.
- Custom Anaconda parcels for Cloudera CDH
- A dynamic, managed version of Anaconda on all of the nodes using Anaconda Scale
The freely available Anaconda parcel is based on Python 2.7 and includes the default conda packages that are available in the free Anaconda distribution.
Anaconda Enterprise users can also leverage Anaconda Repository to create and distribute their own custom Anaconda parcels for Cloudera Manager.
If you need more dynamic functionality than the Anaconda parcels offer, Anaconda Scale lets you dynamically install and manage multiple conda environments–such as Python 2, Python 3, and R environments–and packages across a cluster.
For more information about installing the Anaconda parcel on a CDH cluster using Cloudera Manager, see the Anaconda parcel documentation.
To transition from the Anaconda parcel for CDH to the dynamic, managed version of Anaconda Scale, follow the instructions below to uninstall the Anaconda parcel on a CDH cluster and then transition to a centrally managed version of Anaconda.
If the Anaconda parcel is installed on the CDH cluster, uninstall the parcel:
- From the Cloudera Manager Admin Console, in the top navigation bar, click the Parcels indicator.
- To the right of the Anaconda parcel listing, click the Deactivate button.
- When prompted, click OK to deactivate the Anaconda parcel and restart Spark and related services.
- Click the arrow to the right of the Anaconda parcel listing and select Remove From Hosts.
- In the confirmation dialog box, confirm removal of the Anaconda parcel from the cluster nodes.
For more information about managing Cloudera parcels, see the Cloudera documentation.
Once you’ve uninstalled the Anaconda parcel, see the Anaconda Scale installation instructions for more information about installing a centrally managed version of Anaconda.