gp_check_functions

The gp_check_functions module implements views that identify missing and orphaned relation files. The module also exposes a user-defined function that you can use to move orphaned files.

The gp_check_functions module is a SynxDB extension.

Installing and Registering the Module

The gp_check_functions module is installed when you install SynxDB. Before you can use the views defined in the module, you must register the gp_check_functions extension in each database in which you want to use the views: o

CREATE EXTENSION gp_check_functions;

Refer to Installing Additional Supplied Modules for more information.

Checking for Missing and Orphaned Data Files

SynxDB considers a relation data file that is present in the catalog, but not on disk, to be missing. Conversely, when SynxDB encounters an unexpected data file on disk that is not referenced in any relation, it considers that file to be orphaned.

SynxDB provides the following views to help identify if missing or orphaned files exist in the current database:

Consider it a best practice to check for these conditions prior to expanding the cluster or before offline maintenance.

By default, the views in this module are available to PUBLIC.

gp_check_orphaned_files

The gp_check_orphaned_files view scans the default and user-defined tablespaces for orphaned data files. SynxDB considers normal data files, files with an underscore (_) in the name, and extended numbered files (files that contain a .<N> in the name) in this check. gp_check_orphaned_files gathers results from the SynxDB master and all segments.

ColumnDescription
gp_segment_idThe SynxDB segment identifier.
tablespaceThe identifier of the tablespace in which the orphaned file resides.
filenameThe file name of the orphaned data file.
filepathThe file system path of the orphaned data file, relative to the data directory of the master or segment.

Caution Use this view as one of many data points to identify orphaned data files. Do not delete files based solely on results from querying this view.

gp_check_missing_files

The gp_check_missing_files view scans heap and append-optimized, column-oriented tables for missing data files. SynxDB considers only normal data files (files that do not contain a . or an _ in the name) in this check. gp_check_missing_files gathers results from the SynxDB master and all segments.

ColumnDescription
gp_segment_idThe SynxDB segment identifier.
tablespaceThe identifier of the tablespace in which the table resides.
relnameThe name of the table that has a missing data file(s).
filenameThe file name of the missing data file.

gp_check_missing_files_ext

The gp_check_missing_files_ext view scans only append-optimized, column-oriented tables for missing extended data files. SynxDB considers both normal data files and extended numbered files (files that contain a .<N> in the name) in this check. Files that contain an _ in the name, and .fsm, .vm, and other supporting files, are not considered. gp_check_missing_files_ext gathers results from the SynxDB segments only.

ColumnDescription
gp_segment_idThe SynxDB segment identifier.
tablespaceThe identifier of the tablespace in which the table resides.
relnameThe name of the table that has a missing extended data file(s).
filenameThe file name of the missing extended data file.

Moving Orphaned Data Files

The gp_move_orphaned_files() user-defined function (UDF) moves orphaned files found by the gp_check_orphaned_files view into a file system location that you specify.

The function signature is: gp_move_orphaned_files( <target_directory> TEXT ).

<target_directory> must exist on all segment hosts before you move the files, and the specified directory must be accessible by the gpadmin user. If you specify a relative path for <target_directory>, it is considered relative to the data directory of the master or segment.

SynxDB renames each moved data file to one that reflects the original location of the file in the data directory. The file name format differs depending on the tablespace in which the orphaned file resides:

TablespaceRenamed File Format
defaultseg<num>_base_<database-oid>_<relfilenode>
globalseg<num>_global_<relfilenode>
user-definedseg<num>_pg_tblspc_<tablespace-oid>_<gpdb-version>_<database-oid>_<relfilenode>

For example, if a file named 12345 in the default tablespace is orphaned on primary segment 2,

SELECT * FROM gp_move_orphaned_files('/home/gpadmin/orphaned');

moves and renames the file as follows:

Original LocationNew Location and File Name
<data_directory>/base/13700/12345/home/gpadmin/orphaned/seg2_base_13700_12345

gp_move_orphaned_files() returns both the original and the new file system locations for each file that it moves, and also provides an indication of the success or failure of the move operation.

Once you move the orphaned files, you may choose to remove them or to back them up.

Examples

Check for missing and orphaned non-extended files:

SELECT * FROM gp_check_missing_files;
SELECT * FROM gp_check_orphaned_files;

Check for missing extended data files for append-optimized, column-oriented tables:

SELECT * FROM gp_check_missing_files_ext;

Move orphaned files to the /home/gpadmin/orphaned directory:

SELECT * FROM gp_move_orphaned_files('/home/gpadmin/orphaned');