Type Parameters:
K - Type of a key in upstream data.
V - Type of a value in upstream data.
C - Type of a partition context.

public class DecisionTreeDataBuilder<K,V,C extends Serializable> extends Object implements PartitionDataBuilder<K,V,C,DecisionTreeData>

Partition data builder that makes DecisionTreeData.

Constructor and Description |
---|
DecisionTreeDataBuilder(Preprocessor<K,V> preprocessor, boolean buildIdx) Constructs a new instance of decision tree data builder. |
Modifier and Type | Method and Description |
---|---|
DecisionTreeData | build(LearningEnvironment envBuilder, Iterator<UpstreamEntry<K,V>> upstreamData, long upstreamDataSize, C ctx) Builds new partition data from partition upstream data and a partition context. |
Methods inherited from class Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface PartitionDataBuilder: andThen, build
public DecisionTreeDataBuilder(Preprocessor<K,V> preprocessor, boolean buildIdx)

Constructs a new instance of decision tree data builder.

Parameters:
preprocessor - Extractor of features and labels from upstream data.
buildIdx - Whether to build the index.

public DecisionTreeData build(LearningEnvironment envBuilder, Iterator<UpstreamEntry<K,V>> upstreamData, long upstreamDataSize, C ctx)

Builds new partition data from partition upstream data and a partition context.
Important: there is no guarantee that at most one UpstreamEntry exists for a given key. An UpstreamEntry should be thought of as a container that holds all data from upstream but omits the uniqueness constraint. The constraint is omitted so that upstream data transformers in DatasetBuilder can replicate entries, which can be useful, for example, for bootstrapping.

Specified by:
build in interface PartitionDataBuilder<K,V,C extends Serializable,DecisionTreeData>
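The replication of entries mentioned in the note can be illustrated with a minimal, self-contained sketch. It does not use the GridGain API: `Entry` is a hypothetical stand-in for UpstreamEntry, and `bootstrap` is a hypothetical transformer that samples with replacement, so the resulting list may contain the same key several times.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Sketch only: Entry and bootstrap are hypothetical stand-ins, not GridGain classes. */
final class BootstrapSketch {
    /** Hypothetical stand-in for UpstreamEntry: a key-value pair with no uniqueness constraint. */
    static final class Entry<K, V> {
        final K key;
        final V val;

        Entry(K key, V val) {
            this.key = key;
            this.val = val;
        }
    }

    /** Draws sampleSize entries with replacement, so a key may appear more than once. */
    static <K, V> List<Entry<K, V>> bootstrap(List<Entry<K, V>> upstream, int sampleSize, long seed) {
        Random rnd = new Random(seed);
        List<Entry<K, V>> sample = new ArrayList<>(sampleSize);

        for (int i = 0; i < sampleSize; i++)
            sample.add(upstream.get(rnd.nextInt(upstream.size())));

        return sample;
    }

    public static void main(String[] args) {
        List<Entry<Integer, double[]>> upstream = new ArrayList<>();
        upstream.add(new Entry<Integer, double[]>(1, new double[] {0.0, 1.0}));
        upstream.add(new Entry<Integer, double[]>(2, new double[] {1.0, 0.0}));
        upstream.add(new Entry<Integer, double[]>(3, new double[] {1.0, 1.0}));

        // Ten draws from three entries must repeat at least one key.
        for (Entry<Integer, double[]> e : bootstrap(upstream, 10, 42L))
            System.out.println(e.key);
    }
}
```

Because such a sample can repeat keys, code that consumes upstream entries should not collect them into a map keyed by K.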
Parameters:
envBuilder - Learning environment.
upstreamData - Partition upstream data.
upstreamDataSize - Partition upstream data size.
ctx - Partition context.
Returns:
Partition data.
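As a rough illustration of the contract described for build, the following sketch consumes the upstream iterator row by row, uses upstreamDataSize to allocate storage, and stores rows by position rather than by key, so repeated keys are preserved. Every type here is a hypothetical stand-in (`Entry` for UpstreamEntry, `Extractor` for Preprocessor, `Data` for DecisionTreeData), and the convention that the label is the last element of the extracted row is an assumption made for this example; the actual GridGain implementation may differ.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

/** Sketch only: every type below is a hypothetical stand-in, not a GridGain class. */
final class BuildSketch {
    /** Stand-in for UpstreamEntry. */
    static final class Entry<K, V> {
        final K key;
        final V val;

        Entry(K key, V val) {
            this.key = key;
            this.val = val;
        }
    }

    /** Stand-in for Preprocessor: returns a row whose last element is the label (assumed layout). */
    interface Extractor<K, V> {
        double[] apply(K key, V val);
    }

    /** Stand-in for DecisionTreeData: feature rows plus one label per row. */
    static final class Data {
        final double[][] features;
        final double[] labels;

        Data(double[][] features, double[] labels) {
            this.features = features;
            this.labels = labels;
        }
    }

    /** Builds partition data from an upstream iterator of known size. */
    static <K, V> Data build(Extractor<K, V> extractor, Iterator<Entry<K, V>> upstreamData, long upstreamDataSize) {
        int size = Math.toIntExact(upstreamDataSize);
        double[][] features = new double[size][];
        double[] labels = new double[size];

        int row = 0;
        while (upstreamData.hasNext()) {
            Entry<K, V> e = upstreamData.next();
            double[] vec = extractor.apply(e.key, e.val);

            // Rows are stored by position, not by key, so duplicate keys are kept.
            features[row] = Arrays.copyOf(vec, vec.length - 1);
            labels[row] = vec[vec.length - 1];
            row++;
        }

        return new Data(features, labels);
    }

    public static void main(String[] args) {
        List<Entry<Integer, double[]>> upstream = Arrays.asList(
            new Entry<Integer, double[]>(1, new double[] {0.5, 1.5, 0.0}),
            new Entry<Integer, double[]>(1, new double[] {0.5, 1.5, 0.0}), // Same key, replicated upstream.
            new Entry<Integer, double[]>(2, new double[] {2.5, 3.5, 1.0}));

        Data data = build((k, v) -> v, upstream.iterator(), upstream.size());

        System.out.println(data.labels.length); // prints 3
    }
}
```

Both entries with key 1 end up as separate rows, which is the behavior the note on the omitted uniqueness constraint asks callers to expect.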
GridGain In-Memory Computing Platform : ver. 8.9.14 Release Date : November 5 2024