Sooner or later, you will find the need to grow beyond individual jobs in Hadoop and create more complicated workflows. Apache Oozie is a workflow scheduler system that is designed to manage jobs in Hadoop.
These steps show you how to get a simple Oozie installation set up in your single node cluster. For these instructions, I used:
- Oozie 4.0.1
- Hadoop 2.4.1
- Hive 0.13.1
- CentOS 6.5