[My story with openGauss] Kettle connects to openGauss 5.0.0 database

Preface

Kettle introduction (source Baidu Encyclopedia)

Kettle was originally an open source ETL tool, its full name is KDE Extraction, Transportation, Transformation and Loading Environment. In 2006, Pentaho acquired the Kettle project. Matt Casters, the original initiator of the Kettle project, joined the Pentaho team and became the data integration architect of the Pentaho suite [1]; since then, Kettle has become the main component of the enterprise-level data integration and business intelligence suite Pentaho Part, Kettle was also renamed to Pentaho Data Integration [1-2]. Pentaho was acquired by Hitachi Data Systems in 2015. (Hitachi Data Systems changed its name to Hitachi Vantara [4] in 2017)

Pentaho Data Integration is developed in Java and supports cross-platform operation. Its features include: support 100% coding-free, drag-and-drop development of ETL data pipelines; it can be connected to data sources including traditional databases, files, big data platforms, interfaces, and streaming data; Support ETL data pipeline to join machine learning algorithm.

Pentaho Data Integration is divided into a commercial version and an open source version. The cumulative downloads of the open source version as of January 2021 reached 8.36 million, of which 19% came from China. In China, most people are still accustomed to calling the open source version of Pentaho Data Integration Kettle.

1. Installation of openGuass 5.0.0

For the installation of openGauss, please refer to the link below

1. openGauss 5.0.0 installation

“Installing the single-machine openGauss 5.0.0 Enterprise Edition database based on openEuler 22.03LTS”

2. OpenGauss 3.1.0 installation

“Teach you how to install openGauss 3.1.0”: https://www.modb.pro/db/545504

2. Kettle installation

2.1, Download

Download pdi-ce-9.4.0.0-343.zip

Community Edition download link: https://www.hitachivantara.com/en-us/products/pentaho-platform/data-integration-analytics/pentaho-community-edition.html

f107596d -cef0-400f-b92e-64b480ef743f.png

  • Official document address: https://www.hitachivantara.com/en-us/pdf/implementation-guide/three-steps-to-install-pentaho-data-integration-ce.pdf

2.2. Install kettle

  • Operating system: centos7.9

2.2.1 Prerequisites

  • Requires 64-bit Java SE version jave SE 11 to 18 is fine

  • For Linux/Ubuntu installations, libwebkitgtk-1.0-0 also needs to be installed for PDI to function properly.

Install some dependencies

yum -y install epel-release<br>yum -y install webkitgtk<br>yum -y install redhat-lsb<br>
  • Download and install xulrunner: https://www.modb.pro/doc/112883

mkdir /root/xulrunner<br>tar jxvf xulrunner-1.9.2.28pre.en-US.linux-x86_64.tar.bz2<br>cd xulrunner<br>./xulrunner -register-global<br>

2.2.2 Unzip the installation package

  • Create a directory and unzip

mkdir /root/kettle<br>unzip pdi-ce-9.4.0.0-343.zip -d /root/kettle<br>
  • View Files

3cce6ad5 -84ea-4adc-a87b-cda37cd942e0.png

  • start kettle
    Execute under the graphical interface

cd /root/kettle/data-integration<br> ./kitchen.sh<br>./spoon.sh &<br>
  • Boot process
    2b0fb184 -195b-4dfa-ba45-f585b11adaa3.png

  • boot complete
    0f6c4dff -7408-4852-8a76-a8fe5947d4c5.png

Third, configure the connection

3.1 Download jar package

OpenGauss jar package download address
c48dd65b -2e10-4de5-83c9-5f63df364ce2.png
Unzip openGauss-5.0.0-JDBC.tar.gz to data-integration/lib

3.2 Add configuration

Add the following to data-integration/simple-jndi/jdbc.properties

##openGauss<br>postgres/type=javax.sql.DataSource<br>postgres/driver=org.opengauss.Driver<br>postgres/url=jdbc:opengauss://192.168.10.10:15400/postgres<br>postgres/user=qmttest<br>postgres/password=qoqoqooq<br>

3.3 Configure in Graphics

  • click connect
    2bce7310 -c171-4dcd-8ce1-463a1f07809d.png

  • Click on Repository Manager
    ddf9505b -1b2d-4245-aef5-5e4151680d32.png

  • Click Add

  • Click on Database Repository

  • Fill in the display name

  • Click Create

3d96d6ec -6135-4bc3-93dc-3dd801939423.png

  • Fill in the following content

f262ae49 -a1f8-4fa0-b7ff-4c4a67ef6cd2.png

  • click test
    598b8579 -39ba-4d6e-935a-4552077ed443.png

  • connection succeeded

  • Then click OK and save

3.4 Connect to openGauss database

  • Click connect and select the name you just created

64d735c7 -7720-4430-ab9a-017cbafe6bdd.png
Enter the default user password: admin/admin

This article is shared from the WeChat public account – openGauss (openGauss).
If there is any infringement, please contact [email protected] to delete it.
This article participates in the “OSC Source Creation Program”, and you are welcome to join and share if you are reading it.