Friday, March 8, 2019

Oozie - Workflow


An Oozie workflow is a sequence of actions arranged as a directed acyclic graph (DAG). It combines action nodes and control nodes. An action is normally a Hadoop job (Pig, Hive, MapReduce, etc.), but in some scenarios it is a non-Hadoop job such as a shell script or an email notification.


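An action does not start until the previous action completes. The start and end control nodes mark the beginning and end of the workflow. The fork and join control nodes allow actions to execute in parallel: a fork splits one path into several concurrent paths, and the matching join waits for all of them to finish. The decision control node works like a switch-case statement. Oozie workflows can also be parameterized. The parameters come from a configuration file called the parameter file (typically job.properties) and are referenced in the workflow as ${name}.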
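To make the control nodes above concrete, here is a minimal sketch of a workflow definition. It is only an illustration, not a tested job: the node names (fork-node, task-a, task-b, and so on), the parameters (jobTracker, nameNode, notifyByEmail, emailTo) and the use of two simple shell actions are all assumptions made for this example, and the parameter values would have to be supplied by the parameter file.

```xml
<!-- Hypothetical example: all names and parameters below are illustrative. -->
<workflow-app name="sample-wf" xmlns="uri:oozie:workflow:0.5">

    <!-- start: entry point of the workflow -->
    <start to="fork-node"/>

    <!-- fork: run task-a and task-b in parallel -->
    <fork name="fork-node">
        <path start="task-a"/>
        <path start="task-b"/>
    </fork>

    <!-- a non-Hadoop action: a shell command -->
    <action name="task-a">
        <shell xmlns="uri:oozie:shell-action:0.2">
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <exec>echo</exec>
            <argument>task A done</argument>
        </shell>
        <ok to="join-node"/>
        <error to="fail"/>
    </action>

    <action name="task-b">
        <shell xmlns="uri:oozie:shell-action:0.2">
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <exec>echo</exec>
            <argument>task B done</argument>
        </shell>
        <ok to="join-node"/>
        <error to="fail"/>
    </action>

    <!-- join: wait until both parallel paths have completed -->
    <join name="join-node" to="notify-decision"/>

    <!-- decision: switch-case style branching on a parameter -->
    <decision name="notify-decision">
        <switch>
            <case to="send-email">${wf:conf('notifyByEmail') eq 'true'}</case>
            <default to="end"/>
        </switch>
    </decision>

    <!-- another non-Hadoop action: an email notification -->
    <action name="send-email">
        <email xmlns="uri:oozie:email-action:0.1">
            <to>${emailTo}</to>
            <subject>Workflow ${wf:id()} finished</subject>
            <body>Both parallel tasks completed.</body>
        </email>
        <ok to="end"/>
        <error to="fail"/>
    </action>

    <!-- kill: stop the workflow with an error message -->
    <kill name="fail">
        <message>Workflow failed at node ${wf:lastErrorNode()}</message>
    </kill>

    <!-- end: successful completion -->
    <end name="end"/>
</workflow-app>
```

In this sketch every action has an ok and an error transition, so the graph always moves forward to the join, the end node, or the kill node and never loops back, which is what keeps it acyclic.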

We will create a sample Oozie workflow and look at how Oozie works in our next blog.



