Oozie
workflow is a sequence of actions that are in the form of
DAG.(Directed acyclic graph).it is a combination of actions and
control nodes that are arranged in the form directed acyclic
graph.Normally the action is a hadoop job like Pig,Hive,MR job etc)
but there are some scenarios where the jobs are not Hadoop one like
(shell script, email notification etc).
An
action does not start until the previous action completes.The start
and end controls point out the start and end of the workflow.The fork
and join control nodes allow execution in parallel.The decision
control node is like a switch case statement.oozie workflows can also
be parametrized .These parameters come from a configuration file
called parameter file.
We
will create a sample and oozie workflow . We will try to check the
working of the oozie in our next blog.
No comments:
Post a Comment