Hive – Data Side of Life

Spark and Hive

Alice
Tags: Hive, Spark

Spark lets us use SQL for data processing. Through JDBC and ODBC we can connect to almost any database, and we can read structured data formats such as Avro, Parquet and ORC. We can also connect to Hive and use all the structures we have there. In Spark 2.0 the separate entry points for SQL (SQLContext) and Hive (HiveContext) were replaced by a single object, SparkSession. SparkSession allows you to read from and write to Hive, and to use HiveQL and Hive UDFs.
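As a minimal sketch of what this looks like in PySpark (assuming Spark 2.0 or later with Hive configured on the cluster), the snippet below builds a SparkSession with Hive support, runs a HiveQL query and writes the result back as a Hive table. The table names `default.events` and `default.events_sample`, the application name and the helper functions are hypothetical and only serve as an illustration.

```python
def build_hive_session(app_name="spark-hive-example"):
    """Create a SparkSession with Hive support enabled (Spark 2.0+)."""
    # Imported lazily so the query helper below works without PySpark installed.
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder
        .appName(app_name)        # hypothetical application name
        .enableHiveSupport()      # gives access to the Hive metastore, HiveQL and Hive UDFs
        .getOrCreate()
    )


def top_rows_query(table, limit=10):
    """Build a simple HiveQL query for the given table name."""
    if limit < 1:
        raise ValueError("limit must be positive")
    return f"SELECT * FROM {table} LIMIT {limit}"


def main():
    spark = build_hive_session()
    # `default.events` is a hypothetical Hive table used for illustration.
    df = spark.sql(top_rows_query("default.events"))
    df.show()
    # Write the result back to Hive as a managed table (hypothetical name).
    df.write.mode("overwrite").saveAsTable("default.events_sample")
    spark.stop()
```

Calling `main()` needs a working Spark installation with access to the Hive metastore; `top_rows_query` is plain Python and can be tried on its own.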

Alice
Tags: HBase, Hive, Impala, Oozie, Parquet, Pig, Sqoop

Over the last few years of working with Hadoop I spent a lot of time dealing with issues and finding tricks to solve them. That involved a lot of searching, reading and, mostly, a trial-and-error approach. That’s why I decided to share some of the solutions I found and tried.

Alice
Tags: HBase, Hive, Oozie

Hive offers a convenient way to manipulate data stored in HBase. Not only does it provide SQL capabilities, it can also be easily incorporated into workflow processing.
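To illustrate the idea, here is a small sketch that generates the HiveQL statement for an external Hive table backed by an existing HBase table, using Hive's HBase storage handler. The helper function, the table names and the column family `info` are assumptions made up for this example; the resulting statement would be run in Hive (for instance from a workflow action), not in Python.

```python
def hbase_table_ddl(hive_table, hbase_table, key_column, column_mapping):
    """Build a CREATE EXTERNAL TABLE statement for a Hive table over HBase.

    column_mapping maps a Hive column name to a tuple of
    (Hive type, 'family:qualifier' in HBase).
    """
    # The HBase row key is exposed as the first Hive column.
    columns = [f"{key_column} STRING"]
    mapping = [":key"]
    for name, (hive_type, hbase_column) in column_mapping.items():
        columns.append(f"{name} {hive_type}")
        mapping.append(hbase_column)

    column_list = ", ".join(columns)
    # The mapping string must not contain spaces between entries.
    mapping_list = ",".join(mapping)
    return (
        f"CREATE EXTERNAL TABLE {hive_table} ({column_list})\n"
        "STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'\n"
        f"WITH SERDEPROPERTIES ('hbase.columns.mapping' = '{mapping_list}')\n"
        f"TBLPROPERTIES ('hbase.table.name' = '{hbase_table}')"
    )


# Hypothetical example: an HBase table `users` with column family `info`.
ddl = hbase_table_ddl(
    "users_hive",
    "users",
    "rowkey",
    {"name": ("STRING", "info:name"), "age": ("INT", "info:age")},
)
print(ddl)
```

Once such a table exists, the HBase data can be queried and modified with regular HiveQL statements.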

Tag: Hive

Spark and Hive

Hadoop troubleshooting & tricks

Hive table to manipulate HBase data