Interview Questions on Pentaho

Today I will share a set of interview questions on Pentaho. The questions are below. How will you implement SCD Type 0, SCD Type 1, and SCD Type 2 in Pentaho? What is the difference between arguments and variables? What components are available in a transformation? Name at least 10. Have you ever implemented a plugin for Pentaho in Java? If I want to run 10 jobs in parallel through a shell script, how should I do it? What is a factless fact table? Give practical examples. What are conformed dimensions and degenerate dimensions…
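To make the SCD question concrete, here is a minimal sketch of what the three types mean, written in plain Python rather than with Pentaho steps. The `DimRow` class, the `customer_id` and `city` columns, and the `apply_scd*` function names are all hypothetical and chosen only for illustration; in Pentaho you would build the same behaviour from transformation steps instead of writing code like this.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class DimRow:
    customer_id: int                  # natural (business) key, hypothetical
    city: str                         # the attribute whose changes we track
    valid_from: date
    valid_to: Optional[date] = None   # None marks the current version


def current_row(dim: List[DimRow], customer_id: int) -> Optional[DimRow]:
    """Return the current (open-ended) row for a key, or None if absent."""
    for row in dim:
        if row.customer_id == customer_id and row.valid_to is None:
            return row
    return None


def apply_scd0(dim: List[DimRow], customer_id: int, city: str, today: date) -> None:
    """Type 0: insert unseen keys, never change an existing row."""
    if current_row(dim, customer_id) is None:
        dim.append(DimRow(customer_id, city, today))


def apply_scd1(dim: List[DimRow], customer_id: int, city: str, today: date) -> None:
    """Type 1: overwrite the attribute in place, so history is lost."""
    row = current_row(dim, customer_id)
    if row is None:
        dim.append(DimRow(customer_id, city, today))
    else:
        row.city = city


def apply_scd2(dim: List[DimRow], customer_id: int, city: str, today: date) -> None:
    """Type 2: close the current row and add a new version, keeping history."""
    row = current_row(dim, customer_id)
    if row is not None and row.city == city:
        return                        # nothing changed, keep the current version
    if row is not None:
        row.valid_to = today          # close the old version
    dim.append(DimRow(customer_id, city, today))
```

With the same two updates for one customer (first "Pune", later "Delhi"), Type 0 keeps one row with "Pune", Type 1 keeps one row with "Delhi", and Type 2 keeps two rows, the older one closed with a `valid_to` date.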

Difference between Pentaho 7.0 and Pentaho 8.0 Community Edition

Today I will discuss the differences between Pentaho 7.0 and Pentaho 8.0 Community Edition at ground level. The major difference between Pentaho 7.0 and Pentaho 8.0 is the AEL feature. AEL stands for Adaptive Execution Layer. Its objective is to let you develop visually once and execute anywhere. In other words, you develop the code on your desktop and run it; if it works fine, you point the application to a Spark resource and run it there. Now, when I say point your application to a Spark resource, that doesn’t mean you…

DataCleaner Plugin in Pentaho

Today I will discuss how to use the DataCleaner plugin in Pentaho 7.0.0. First, go to Tools -> Marketplace and search for “Datacleaner”. See the screenshot below. As you can see, it shows as installed because I have already installed it on my local machine. Once you install this plugin, go to <PATH>/data-integration/plugins/, where you will see the folder “kettle6-profiling-datacleaner”. Now you need to download DataCleaner from the URL below. https://datacleaner.org/get_datacleaner_ce I downloaded the latest version, DataCleaner 5.1.5. Once the download completes, unzip the folder. So, it…

Use of Marks in Tableau

In this post I will discuss the “Marks” section in Tableau. See the screenshot for how it looks in Tableau. Here we continue from the last post, where we discussed symbol maps and the issues involved with them. See the screenshot of the population data shown as a symbol map. Whenever we talk about BI tools, we say “data speaks”. Here, the dots do not give the user any friendly information. Therefore, we will use the Marks section in this map. Drag the Population measure to Label.…

Unknown Values in Symbol Map in Tableau

Today I will discuss a particular issue I faced in Tableau. When I clicked on symbol map in the Show Me tab, the map did not come up in the workspace area, and I got a message at the bottom right: 57 unknown. See the image below. Before I explain how I corrected this, I would like to tell you about the source data. I am using population data of India (random numbers), which has the fields city name, state, and population. I will share the population…

Change data type of Dimensions in Tableau Worksheet

Today I will discuss how to change the data type of a dimension in Tableau. See the image below with population data (city- and state-wise). The file has city name, state, and population. Open Tableau -> click Excel -> choose the file (population-data.xlsx) -> click Open. The data is now imported into Tableau. See the image below. The next step is to click Sheet1 (bottom left). Once you open Sheet1, see the Dimensions and Measures in the left pane, as below. Here, if you look closely, city name…

JSON Input and JSON Output Component in Pentaho

Today I will discuss the JSON Input and JSON Output components in Pentaho. In the first scenario, I will load a text file into JSON Output. The image below shows this. Here I am using order_Details.txt as the input file, which has data related to orders. In the JSON Output component, a few things should be kept in mind. Json bloc name: the name of the bloc; it should match the input file. So, in my case I have kept “ORDER” as the Json bloc…

Microsoft Access Input Component in Pentaho

Today I will discuss the Microsoft Access Input component in Pentaho. Many times we as developers come across a scenario where we need to write SQL to join multiple tables in an Access DB. So we tend to use either the Table Input or the SQL component, creating a database connection for the Access DB. I tried this multiple times, but it didn’t work for me. The second way to handle multi-table joins in an Access DB is to write the SQL query in Access…

Logging in Pentaho Jobs

Today I will discuss logging in Pentaho jobs, which helps the production support team analyze and identify issues in less time. First, to create logs for the ETL jobs, right-click on the job, go to Edit, and open the third tab (logging settings). Check the “Specify log file” check box. If you want to append to the log file, check the “Append logfile” box. Then comes “Name of the logfile”: specify the variables and place their values in…