Interview Questions on Pentaho

Today, I will share a set of interview questions on Pentaho. The questions are below. How would you implement SCD Type 0, SCD Type 1, and SCD Type 2 in Pentaho? What is the difference between arguments and variables? What components are available in a transformation? Name at least 10. Have you ever implemented a plugin for Pentaho using Java? If I want to run 10 jobs in parallel through a shell script, how should I do it? What is a factless fact table? Give practical examples. What are conformed dimensions and degenerate dimensions…
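For the question about running 10 jobs in parallel, here is a minimal sketch of one possible approach. It is written in Python rather than shell, so it is a swapped-in technique, not the answer the question literally asks for. It assumes Kitchen (the PDI command-line job runner) is installed at a hypothetical path, and the job file names are made up for illustration. The `runner` parameter is an illustrative hook so the logic can be tried without a Pentaho install.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor

# Assumed install location; adjust to wherever data-integration lives.
KITCHEN = "/opt/pentaho/data-integration/kitchen.sh"


def build_command(job_file, kitchen=KITCHEN):
    """Build the Kitchen command line for a single .kjb job file."""
    return [kitchen, f"-file={job_file}", "-level=Basic"]


def run_job(job_file, runner=subprocess.run):
    """Run one job and return (job_file, exit code)."""
    result = runner(build_command(job_file))
    return job_file, result.returncode


def run_jobs_in_parallel(job_files, max_workers=10, runner=subprocess.run):
    """Start up to max_workers Kitchen processes at the same time.

    Returns a dict mapping each job file to its exit code.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(lambda f: run_job(f, runner), job_files))
```

Calling `run_jobs_in_parallel(["/jobs/job_1.kjb", "/jobs/job_2.kjb"])` would launch one Kitchen process per file and wait for all of them. A non-zero exit code in the returned dict marks a job that did not finish cleanly.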

Difference between Pentaho 7.0 and Pentaho 8.0 Community Edition

Today, I will discuss the difference between Pentaho 7.0 and Pentaho 8.0 Community Edition at ground level. The major difference between Pentaho 7.0 and Pentaho 8.0 is the AEL feature. AEL stands for Adaptive Execution Layer. The objective of AEL is to let you develop visually once and execute anywhere. In other words, you develop the code on your desktop and run it; if it works fine, you point the application to a Spark resource and run it there. Now, when I say point your application to a Spark resource, that doesn’t mean you…

DataCleaner Plugin in Pentaho

Today, I will discuss how to use the DataCleaner plugin in Pentaho 7.0.0. First, go to Tools -> Marketplace and search for “DataCleaner”. See the screenshot below. As you can see, it shows as installed because I have already installed it on my local machine. Once you install this plugin, go to <PATH>/data-integration/plugins/, where you will see the folder “kettle6-profiling-datacleaner”. Now you need to download DataCleaner from the URL below. https://datacleaner.org/get_datacleaner_ce I downloaded the latest version, DataCleaner 5.1.5. Once the download is complete, unzip the folder. So, it…

JSON Input and JSON Output Component in Pentaho

Today, I will discuss the JSON Input and JSON Output components in Pentaho. In the first scenario, I will load a text file into JSON Output. Below is the image for the same. Here, I am using order_Details.txt as the input file, which holds data related to orders. In the JSON Output component, a few things should be kept in mind. Json bloc name: the name of the bloc; it should match the input file. So, in my case I have kept “ORDER” as the Json bloc…
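To picture what the bloc name does, here is a minimal Python sketch that mimics the general shape of the result: the input rows are wrapped in an array under a key named after the bloc. This is an illustration only, not the code Pentaho runs. The column names (order_id, product, quantity), the comma delimiter, and the helper name `rows_to_json_bloc` are all assumptions, since the real layout of order_Details.txt is not shown here.

```python
import csv
import io
import json


def rows_to_json_bloc(text, bloc_name="ORDER", delimiter=","):
    """Wrap delimited text rows in a JSON object keyed by the bloc name."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    # Every value stays a string here; no type conversion is attempted.
    return json.dumps({bloc_name: list(reader)}, indent=2)


# Hypothetical sample data standing in for order_Details.txt.
sample = "order_id,product,quantity\n1001,Keyboard,2\n1002,Mouse,5\n"
print(rows_to_json_bloc(sample))
```

Each input row becomes one object in the "ORDER" array, with the header names as keys.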

Microsoft Access Input Component in Pentaho

Today, I will discuss the Microsoft Access Input component in Pentaho. As developers, we often come across scenarios where we need to write SQL to join multiple tables in an Access database. So we tend to use either the Table Input or the SQL component, creating a database connection for the Access database. I tried this multiple times, but it didn’t work for me. The second way to handle multi-table joins in an Access database is to write the SQL query in Access…

Logging in Pentaho Jobs

Today, I will discuss logging in Pentaho jobs, which helps the production support team analyze and identify issues in less time. First, to create logs for the ETL jobs, right-click the job, go to Edit, and open the third tab (Logging settings). Check the “Specify log file” box. If you want to append to the log file, check the “Append logfile” box. Then comes “Name of the logfile”: specify the variables and place their values in…

Internal.Job.Filename.Directory and Internal.Transformation.Filename.Directory in Pentaho

We will discuss two built-in Pentaho variables that most developers are either unaware of or rarely use. These variables are Internal.Job.Filename.Directory and Internal.Transformation.Filename.Directory. I designed a job that has further sub-jobs. See the screenshot for the same. Here, you can see there is a parent job which has sub-jobs (start-job.kjb and job 2). The parent job is at location C:\AS2\, as seen in the image below. Now, right-click start-job and go to Edit; as is clearly seen in…
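The idea behind these variables can be sketched with a small Python stand-in for `${...}` substitution. This is not Pentaho's own resolver. It assumes the sub-job sits in the same folder as the parent job, and that the variable holds a plain folder path written with forward slashes; the value Pentaho actually sets may be formatted differently. The helper name `resolve` is made up for illustration.

```python
import re

# Matches ${Some.Variable.Name} placeholders.
VAR_PATTERN = re.compile(r"\$\{([^}]+)\}")


def resolve(path_template, variables):
    """Replace each ${name} with its value; unknown names are left as-is."""
    def substitute(match):
        return variables.get(match.group(1), match.group(0))
    return VAR_PATTERN.sub(substitute, path_template)


# Assumed value: the folder that holds the parent job.
variables = {"Internal.Job.Filename.Directory": "C:/AS2"}
print(resolve("${Internal.Job.Filename.Directory}/start-job.kjb", variables))
```

Because the sub-job path is expressed relative to the parent job's folder, moving the whole folder to another location or server does not require editing the path in each job entry.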