Hadoop and Spark overview

Overview


Apache Hadoop is a collection of open source cluster computing tools that supports popular applications for data science at scale, such as Spark.

You can interact with tools in the Hadoop ecosystem from your Domino executors by configuring your Domino environment with the necessary software dependencies and credentials. Detailed guides to configuring your environment are available for the following distributions of Hadoop tools:

 

 

Additional Hadoop capabilities


Domino also supports running Spark on a Domino executor in local mode, querying Hive tables with JDBC, and authenticating to clusters with Kerberos. See the following guides for more information.

Was this article helpful?
0 out of 0 found this helpful