See Document
1. The clusters in a dataset are always represented in a tree structure. More complex clusters are composed of smaller clusters. The clusters located at leaf nodes cannot be further divided, and we call them unit clusters (UCs).
2. The UC’s samples follow an isotropic normal distribution, which means that the samples in each dimension of the UC follow a normal distribution.
All parameters:
Specify the number of clusters to add on this layer:
Control sample size:
Control Standard Deviation:
Control Overlap:
Control Angle:
All parameters:
Examples:
All parameters:
Examples:
Examples:
Examples:
Parameters | Description | Value | Example | Default |
---|---|---|---|---|
Public parameters | ||||
-d | Dimension | Positive integer | -d=2 | 2 |
-o | Output | FilePath | -o=/home/work/ -o=E:/work/ -o=/home/coordiante.txt | Current directory |
-rg |
Regenerate according to the original parameters in a parameter configure file | FilePath | -rg=/home/myConfig.txt | None |
-rp |
Reproduce according to a parameter configure file | FilePath | -rp=/home/myConfig.txt | None |
Private parameters | ||||
-t |
Three generation models: node = add UCs using relative positioning nodeFix = add UCs using absolute positioning bezier = add BCs | node/nodeFix/bezier | -t=node | Required if "-rg/rp" is not specified |
Parameters for -t=node | ||||
-nodeNum | The number of UCs to add to the current layer | Positive integer | -nodeNum=5 | 1 |
-ss | The sample size for each cluster to add to the current layer | Positive integer | -ss=500 | 300×(1+SD×random(0,1)) |
-sd | The standard deviation of normal distribution | Positive number | -sd=1 | 1+10×random(0,1) |
-ref | The reference UC for the new cluster. All UCs are numbered sequentially from 0-n. Each BC consists of multiple (default 200) UCs. | A positive integer less than the number of added UCs. | -ref=1 | round(random(0,1)×UC_Num) |
-overlap | The overlap between the new cluster and the reference cluster. It is also the largest overlap between the new cluster and other clusters. | Number | -overlap=0 | [0.7×random(0,1), -1×random(0,1)] |
-angle | The vector of the new cluster’s counterclockwise rotation angle, relative to the reference UC in each dimension | Number vector. The first dimension does not need to be rotated and is denoted as 0 | -angle=0,30 | 360×random(0,1) |
-label | Specify a label for the new cluster | Integer | -label=1 | Increment |
-cross | Whether clusters with conflicting parameters are shown (default is not displayed) | 0/1 | -cross=0 | 0 |
Parameters for -t=nodeFix | ||||
-ss | Same as -ss in -t=node | |||
-sd | Same as -sd in -t=node | |||
-label | Same as -label in -t=node | |||
-coordinate | The centre coordinates of the new cluster | Number vector | -coordinate=2,3 | Required |
Parameters for -t=bezier | ||||
-bezierNum | The number of BCs to add to the current layer | Positive integer | -nodeNum=5 | 1 |
-ss | Same as -ss in -t=node | 300×(1+SD×random(0,1)) | ||
-rss | The ratio of the ending sample size to the starting sample size | Positive numbers represent increases and negative numbers represent decreases. See publication for details. | -rss=10 | [10×random(0,1), -10×random(0,1)] |
-sd | Same as -sd in -t=node | 2.2-2.1×random(0,1) | ||
-rsd | The ratio of the ending SD to the starting SD | Positive numbers represent increases and negative numbers represent decreases. See publication for details. | -rsd=-2 | [5×random(0,1), -5×random(0,1)] |
-control | The control point of the Bezier curve | Number vector. The coordinate values of each control point are separated by commas | -control=2,3,12,13,16,-3 | 30×max(1,SD)×random(0,1) |
-label | Same as -label in -t=node | |||
-offset | The translation of the new BC in each dimension | Number vector | -offset=2,3 | None |