Oozie Sqoop action
There are few ways to build Oozie Sqooping action.
Read moreThere are few ways to build Oozie Sqooping action.
Read moreHCatalog enables Pig to read and write directly to Hive metastore. Pig dynamically determines structure of the table allowing easier data manipulation. Here’s how to make Pig work with HCatalog and how to run such jobs through Oozie.
Read moreAlthough HBase is mostly used for lookups, sometimes there comes a need to perform bulk reads and writes. Doing that through Pig is very convenient. Here’s how to establish Pig-HBase communication.
Read moreSqoop can be used to import data from the relational database into HBase. Although exporting data from HBase is not natively supported you can still manage it by putting Hive and HCatalog between HBase and Sqoop. Here’s how to do both importing and exporting with Oozie in Kerberised environment.
Read moreHive gives a nice option to manipulate the data stored in HBase. Not only it provides the SQL capabilities but also can be easily incorporated into the workflow processing.
Read moreZeppelin is a notebook based framework for data analysis and data visualizations.
Although building it from the source is quite straightforward I had a few configuration issues that made the process longer.
Few days ago I attended Data works summit in Munich. Used to be called Hadoop summit but apparently Hadoop itself moved a bit into the background, giving space to other cool technologies, emerging at very fast pace.
Read more