Java Version for All Pentaho Versions

Today i will share the data for Java Version against Pentaho version. See the list below.It will help you in order to install correct Java version for Pentaho. PDI Version Java Version Pentaho 8.2| JDK 1.8 Pentaho 8.1| JDK 1.8 Pentaho 8.0| JDK 1.8 Pentaho 7.1| JDK 1.8 Pentaho 7.0| JDK 1.8 Pentaho 6.1| JDK 1.7 Pentaho 6.0| JDK 1.7 Pentaho 5.4| JDK 1.7 Pentaho 5.3| JDK 1.7 Pentaho 5.2| JDK 1.7 Pentaho 5.1| JDK 1.7                             …

CASE Statement in Pentaho

Today, I will discuss about how to create ETL(Pentaho) Equivalent of case statement in Database. In Pentaho, we have component “Number Range”. See the ETL Design for the same. In this, I have considered csv file input ,Number Range,Write to log components. In CSV file input, FEB-SAL.csv file which we have considered. See the below SS for the same. See the below SS for the Number Range Component. Number Range works on x1 less than equal to(<=) x greater than(<) x2 logic. Here, if you see, 1000 to 5000 ,…

AWS configuration files in Pentaho 8.2

Today, I will discuss about the configurations files which are newly introduced in Pentaho version >8. First of all, if you see in Pentaho version greater than 8, S3 file input and S3 file output components, Access Key and Secret Key variables are removed. See the below SS. Below are the steps to be performed in windows Machine. 1. create folder inside C:/Users/user/.aws 2. In order to create .aws folder , give folder name as .aws. then .aws folder gets created. 3. Inside .aws folder, create files config and credentials…

INTERVIEW QUESTIONS IN PENTAHO SET-3

Today, I will share few more set of questions that can be asked in Pentaho DI Interview. 1. How you will merge the rows in one row based with delimiter as separator in Pentaho 2. How to split the row into multiple rows having delimiter in it using Pentaho DI. 3. Scenario Based Question We have three csv files with Data as below File1.txt A B C 1 2 3 1 3 1 1 1 3 File2.txt A B C 1 2 1 2 1 1 1 3 1 File3.txt…

Pentaho Data Integration Enterprise Edition Installation

Today, I will discuss about the installation process of Pentaho Data Integration Enterprise Edition(30 Days Trial Version) available on Pentaho Site. As the community version of Pentaho gives a lot of opportunity to work in a lot of domains, Pentaho Enterprise version also have a lot of beneficiary apart from the usual support stuff. We will evaluate Pentaho versions in another article. In this article we will try and install the Enterprise versions in Windows and *Nix environment. First of all, we need to go to the main site of…

MERGE ROWS(DIFF) IN PENTAHO

Today, i will discuss about the component “Merge Rows(Diff)” in Pentaho. See the below transformation for the same. Before we start discussion on Merge Rows, always remember to sort the data on common columns from both streams before you add Merge row component. Otherwise results will not be correct. Here , i considered two CSV files which has data of employees. See the below images for the same. Here MAR-SAL.csv is mapped to Reference rows origin field in Merge rows component and FEB-SAL.csv is mapped to Compare rows origin  field in Merge…

HOW TO UPGRADE/UPDATE THE PENTAHO PLUGINS

Today, i will discuss about  how to upgrade/update the Pentaho plugins. Pentaho is the Best ETL tool in my understanding and its open source version is very effective in a way that many  organizations uses the open source version in production environment as well. It has several plugins to connect to various third-party services. Some time you need to upgrade those plugins without changing the Pentaho version. There might be multiple reasons for that but primarily it is due to some bug or lack of feature in current plugin. HOW TO UPGRADE/UPDATE…

DERIVED VARIABLES IN PENTAHO

Today, i will discuss about the Derived variables in Pentaho. The term “Derived Variables” means variable name or variable value derived from a particular variable or its value. In order to demonstrate that i have designed an ETL job for the same. See the below image for the same. In this  transformation, i used get system info, where i defined one variable “date_value” with value as system date (fixed). After this component, i added Java script component where i  created new more variables and value is  derived from date_value variable.…

INTERVIEW QUESTIONS IN PENTAHO SET-2

Today, i will discuss about the interview Questions on Pentaho. In my previous blog, I have share 15 Interview question. You can go through those as well using below link. interview Questions on Pentaho Below are the new set of Interview Questions on Pentaho. 1. What is Carte in Pentaho. In what all scenarios , we can avail Carte services. 2. How can we dynamically split value into multiple Variables. 3. What is AEL (Adaptive execution Layer) in Pentaho. 4. How can we change the User interface of Pentaho BI…

HOW TO GET FILENAMES PRESENT INSIDE THE ZIP FILE

Today, i will discuss about “How to get filenames present inside the zip file without unzipping it”. Here, “Get file names” component will be used.  See the below image for the same. In Get File Names component, the value of file directory is set to zip://C:/data-integration/SALARIES.zip and wildcard is “.*“. Here the catch is “zip:” part . Click on show filenames.It will give you all file names which are present inside the zip file. See the below image for the same. Run the  transformation  and see the Pentaho logs. Write to…