Showing posts from October, 2018
Kafka is publish subscribe messaging system which are most commonly used in asynchronous work flow. Homebrew is a software package management system that simplifies the installation of software on Apple's macOS operating system. Using brew Kafka…
Extracts are saved subsets of data that we can use to improve performance - we can reduce the total amount of data by using filters and configuring other limits. How to create Extract in Tableau(Desktop version) :- 1. Right click on Data source l…
In previous post we discussed about Data visualisation with CSV data source and Tableau : Export a Image chart with report generated from CSV data - here we visualised Representative and their TotalSells. In this post we will reuse same worksheet an…
Tableau is an interactive data visualisation products focused on business intelligence. It provides support for integrating various data source like CSV, databases(local and cloud), various big data storage like Apache drill, Hive, etc. It query data…
Setup Kibana with Elastic search Elastic Stack (Elasticsearch, Logstash and Kibana) : Setup Kibana with Elasticsearch 6.0 (Document management using DevConsole tool) Elasticsearch Inverted index(Analysis): How to create inverted index and ho…
Elasticsearch is a distributed search and analytics engine which runs on Apache Lucene (The indexing and search library for high performance, full text search engine). In previous posts we used Kafka Source Connectors (FileSourceConnector in standa…
In previous post we did setup Kafka connect cluster in docker (Landoop fast-data-dev) and created FileStreamConnector in standalone mode and in distributed mode for transferring file content to Kafka topic. In this post we will use existing Docker…
In previous post we did setup Kafka connect cluster in docker(Landoop fast-data-dev) and created FileStreamConnector in standalone mode for transferring file content to Kafka topic. In this post we will use existing Docker Kafka connect setup to t…
Kafka Connect (or Connect API) is a framework to import/export data from/to other systems and it internally uses the Producer and Consumer API. The Connect API defines the programming interface which is implemented to build a concrete connector which …