Flume custom Sink collects txt data to Greenplum

1 FlumeInstallation Deployment < /strong> 1.1 Installation address (1) Flume official website address: Welcome to Apache Flume – Apache Flume (2) Document viewing address: Flume 1.11.0 User Guide – Apache Flume (3) Download address: http://archive.apache.org/dist/flume/ 1.2 Installation and Deployment (1) Upload apache-flume-1.9.0-bin.tar.gz to the /app/local directory of IDC 53 (2) Unzip apache-flume-1.9.0-bin.tar.gz to the /app/local […]

GreenPlum – cluster construction and installation

Article directory Cluster introduction Installation environment preparation Modify hostname Modify the /etc/sysconfig/network files of each node in turn Modify the system kernel/etc/sysctl.conf file Modify the number of processes Modify the /etc/selinux/config file Copy master node configuration to child nodes Create gpadmin user (all nodes) Turn off firewall Install Greenplum DB Install Greenplum DB on the […]

Install greenplum database on a single node in linux

Table of Contents 1. Installation package preparation 2. Turn off the firewall 3. Install greenplum dependency packages 4. Add configuration 5. Create a new greenplum user 6. Install greenplum installation package 7. Set environment variables 8. Permission interoperability 9. Modify the database configuration file 10. Initialize the database 11. Configure remote connection files 12. Description […]

Greenplum utility-gpcheckperf

Note: This article is translated from https://docs.vmware.com/en/VMware-Greenplum/7/greenplum-database/utility_guide-ref-gpcheckperf.html Grammar gpcheckperf -d <test_directory> [-d <test_directory> …] {<!– –>-f <hostfile_gpcheckperf> | – h <hostname> [-h hostname …]} [-r ds] [-B <block_size>] [-S <file_size>] [-D] [-v|-V] gpcheckperf -d <temp_directory> {<!– –>-f <hostfile_gpchecknet> | – h <hostname> [-h< hostname> …]} [ -r n|N|M [–duration <time>] [–netperf] ] [-D] [-v | […]

Greenplum utility-gpfdist

Note: This article is translated from https://docs.vmware.com/en/VMware-Greenplum/7/greenplum-database/utility_guide-ref-gpfdist.html Provide data files to or write data files from a Greenplum Database segment. Grammar gpfdist [-d <directory>] [-p <http_port>] [-P <last_http_port>] [-l <log_file>] [-t <timeout>] [-S] [-w <time>] [-v | -V] [-s] [-m <max_length>] [–ssl <certificate_path> [–sslclean <wait_time>] ] [–compress] [–multi_thread <num_threads>] [-c <config.yml>] gpfdist -? | –help […]

Greenplum import and export-COPY command

Note: This article is translated from https://docs.vmware.com/en/VMware-Greenplum/7/greenplum-database/ref_guide-sql_commands-COPY.html The COPY command in Greenplum is used to copy data between tables and files. You can write data from a file to a table or write data from a table to a file. Grammar COPY <table_name> [(<column_name> [, …])] FROM {<!– –>'<filename>’ | PROGRAM ‘<command>’ | STDIN} [ […]

Greenplum Utilities-gpinitsystem

Note: This article is translated from https://docs.vmware.com/en/VMware-Greenplum/7/greenplum-database/utility_guide-ref-gpinitsystem.html The gpinitsystem tool initializes the Greenplum Database system using the configuration parameters specified in the gpinitsystem_config file. Grammar gpinitsystem -c <cluster_configuration_file> [-h <hostfile_gpinitsystem>] [-B <parallel_processes>] [-p <postgresql_conf_param_file>] [-s <standby_coordinator_host> [-P <standby_coordinator_port>] [-S <standby_coordinator_datadir> | –standby_datadir=<standby_coordinator_datadir>]] [-m <number> | –max_connections=number>] [-b <size> | –shared_buffers=<size>] [-n <locale> | –locale=<locale>] [–lc-collate=<locale>] […]

Greenplum Database Optimizer – Join Type

Join type syntax support The from statement allows JOIN expressions and lists of table names to separate joined_table from table_ref, It may seem silly to separate joined_table from table_ref, but there is method in SQL’s madness: if you don’t do it this way you get reduce-reduce conflicts, because it’s not clear to the parser generator […]