In this article i will go thru the steps of disabling the Data Collector in Vertica.
Data Collector is on by default. So we will make some changes to that as i am running this in my AWS Vertica Cluster and since is running on on-demand EC2 schema i am really not interested in improving the performance of my database using historical system activities and counters.
The Data Collector can create extra overhead and i dont what this.
What is the Data Collector, in case you dont know ?
Is the utility that collects and retains database monitoring information.
Data Collector retains history of important system activities and records essential performance and resource utilization counters. You can use information the Data Collector retains in the following ways:
As a reference for what actions users have taken
To locate performance bottlenecks
To identify potential improvements to Vertica configuration
Lets start and see what are the steps to disable Data Collector.
See if the Data Collector is enabled.
we can see that we have it enabled.
To Disable the Data Collector we need to alter the value of the EnableDataCollector value to 0(zero).
The location to the where the Data Collector files are stored is usually in the path of your Database Catalog.
Here is the content of my DataCollector.
it contains a bunch of .log files and .sql files.
Lets see how much space Data Collector is using
this will depend a lot on the Data Collector policy you have in place.
To Clear the Data Collector data we need to use the clear_data_collector() function.
this function will clear all the data from the disk and memory.
Lets see how much space Data Collector is using after we run clear_data_collector() function.
Now that we have disabled Data Collector no data will generated and no overhead will be applied on your database.
Note:
Data Collector is very useful so we can analyze our Database overall performance so be careful before going ahead with the clean up.