MERGE ROWS(DIFF) IN PENTAHO

Today, i will discuss about the component “Merge Rows(Diff)” in Pentaho. See the below transformation for the same. Before we start discussion on Merge Rows, always remember to sort the data on common columns from both streams before you add Merge row component. Otherwise results will not be correct. Here , i considered two CSV files which has data of employees. See the below images for the same. Here MAR-SAL.csv is mapped to Reference rows origin field in Merge rows component and FEB-SAL.csv is mapped to Compare rows origin  field in Merge…

HOW TO UPGRADE/UPDATE THE PENTAHO PLUGINS

Today, i will discuss about  how to upgrade/update the Pentaho plugins. Pentaho is the Best ETL tool in my understanding and its open source version is very effective in a way that many  organizations uses the open source version in production environment as well. It has several plugins to connect to various third-party services. Some time you need to upgrade those plugins without changing the Pentaho version. There might be multiple reasons for that but primarily it is due to some bug or lack of feature in current plugin. HOW TO UPGRADE/UPDATE…

DERIVED VARIABLES IN PENTAHO

Today, i will discuss about the Derived variables in Pentaho. The term “Derived Variables” means variable name or variable value derived from a particular variable or its value. In order to demonstrate that i have designed an ETL job for the same. See the below image for the same. In this  transformation, i used get system info, where i defined one variable “date_value” with value as system date (fixed). After this component, i added Java script component where i  created new more variables and value is  derived from date_value variable.…