Wednesday 18 January 2017

How to configure Toad for Hadoop Hive HDFS




When you work with large tables in Hive, run a comparison job between Hive and source tables, or simply want to see Hive table output in a structured way, the terminal window is not the right choice. People who have worked with Oracle or SQL Server are used to a wide range of enterprise-level tools that make almost any database job a breeze.


I will show you how to configure and use Toad with Hive. You can also connect Toad to HDFS to browse the file system, or connect it to Oozie or Spark.


Download the Toad for Hadoop software from here


After you install it, go to the ecosystem section and click ‘Add new Hadoop eco system’. You can choose the Cloudera or the Hortonworks/Apache Hadoop version. Then go to the SQL tab under the ecosystem and configure Hive as follows.



As shown in the screenshot above, enter the host details of the UNIX box where Hive is located and the Hive port number. You should find the port in hive-site.xml.
If your Hive setup is configured with SSL and LDAP, enter those details as shown in the second red box above.
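
If you would rather not search hive-site.xml by hand, here is a minimal Python sketch that reads the port out of it. It assumes the port Toad needs is the HiveServer2 one, stored under the property hive.server2.thrift.port (default 10000). The function name and the sample XML are made up for illustration, so paste in the contents of your own hive-site.xml.

```python
import xml.etree.ElementTree as ET

# Assumption: HiveServer2 falls back to port 10000 when the property is not set.
DEFAULT_HS2_PORT = 10000


def hive_port_from_site_xml(xml_text, prop="hive.server2.thrift.port"):
    """Return the port configured in hive-site.xml text, or the default if absent."""
    root = ET.fromstring(xml_text)
    for node in root.iter("property"):
        if (node.findtext("name") or "").strip() == prop:
            return int((node.findtext("value") or "").strip())
    return DEFAULT_HS2_PORT


# Inline sample so the sketch runs anywhere; the value is illustrative only.
SAMPLE = """<configuration>
  <property>
    <name>hive.server2.thrift.port</name>
    <value>10015</value>
  </property>
</configuration>"""

print(hive_port_from_site_xml(SAMPLE))
```

To use it on a real file, read the file into a string first and pass that string to the function.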

Now let us configure HDFS in Toad.

Navigate to the HDFS tab under the ecosystem and enter the details as below. In my case it is a single-node cluster, so I have to pass localhost.localdomain. Otherwise, you have to pass your name node host and port details.
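
If you are not sure of the name node host and port, they are usually recorded in the fs.defaultFS property of core-site.xml as a URI such as hdfs://host:port. The sketch below splits such a URI into the two values. It is only an illustration: the function name is hypothetical, and the fallback port 8020 is an assumption (a common name node port) that you should confirm against your own cluster and against what the Toad HDFS tab expects.

```python
from urllib.parse import urlparse

# Assumption: 8020 is a common name node port; verify it for your cluster.
ASSUMED_NAMENODE_PORT = 8020


def namenode_host_port(default_fs, fallback_port=ASSUMED_NAMENODE_PORT):
    """Split an fs.defaultFS URI such as hdfs://host:port into (host, port)."""
    parsed = urlparse(default_fs)
    if parsed.scheme != "hdfs" or not parsed.hostname:
        raise ValueError(f"not an hdfs:// URI: {default_fs!r}")
    # Use the fallback only when the URI carries no explicit port.
    return parsed.hostname, parsed.port or fallback_port


print(namenode_host_port("hdfs://localhost.localdomain:8020"))
```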


I hope this helps with your Toad setup. If you have any questions, leave a comment below.
 



1 comment:

  1. If the above configuration does not work, you may have to add the line below to the hosts file, located at C:\Windows\System32\drivers\etc\hosts on Windows.

    127.0.0.1 localhost.localdomain localhost

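
To check whether the hosts file already contains such a mapping, a small Python sketch like the one below can help. It works on the text of a hosts file, so you can paste the contents in or read the file yourself. The function name and the sample text are hypothetical and only serve the illustration.

```python
def hosts_maps(hosts_text, name, ip="127.0.0.1"):
    """Return True if hosts-file text maps `name` to `ip` on a non-comment line."""
    for raw in hosts_text.splitlines():
        # Drop anything after '#', then split into the address and its names.
        fields = raw.split("#", 1)[0].split()
        if len(fields) >= 2 and fields[0] == ip and name in fields[1:]:
            return True
    return False


# Illustrative hosts-file content, not taken from a real machine.
SAMPLE_HOSTS = """# sample hosts file
127.0.0.1 localhost.localdomain localhost
"""

print(hosts_maps(SAMPLE_HOSTS, "localhost.localdomain"))
```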