Showing posts from November, 2015
Organizations dealing with large volumes of data are moving at a rapid pace, and so is the volume of their data. In the past 10 years, they have shown more inclination towards non-relational databases than relational databases. With the advancement of NoSQL databases (Mo…
Tableau is an interactive data visualization product focused on business intelligence, and it supports integration with Hive, Apache Drill, and other big data storage or data processing engines. The main agenda of this post is to set up T…
In the previous post, we discussed installing Apache Drill in embedded mode and executing SQL queries against various data sources. The main agenda of this post is to set up Apache Drill in pseudo-distributed mode (Apache Drill on a single node) an…
A distributed SQL engine designed for data-intensive distributed applications and interactive analysis of large-scale datasets. It follows a schema-free JSON document model similar to MongoDB and has a pluggable architecture that enables connectivity to…
SQuirreL SQL Client is a graphical SQL client written in Java that allows us to view the structure of a JDBC-compliant database, browse the data in tables, issue SQL commands, etc. Follow the steps below to configure SQuirreL for a DB2 database. Download S…
Tableau supports connections to data stored in Cloudera distributions using Hive (HiveServer1) and the data source's Hive ODBC driver. The main agenda of this post is to set up Tableau 9.0 Desktop version to connect to Hadoop Hive (Cloudera distrib…
Apache Hive: It is a data warehouse infrastructure built on top of HDFS and MapReduce (HDFS provides distributed storage and MapReduce is the execution model) and facilitates querying and managing large datasets residing in distributed storage. It pro…
In the previous post, we discussed the pros and cons of Lock (java.util.concurrent.locks) over synchronized methods and statements and concluded that the Lock interface provides a high degree of programmatic control and is sometimes more efficient too. The main agenda of thi…