K - Type of a key in upstream data.
V - Type of a value in upstream data.

public class BootstrappedDatasetBuilder<K,V> extends Object implements PartitionDataBuilder<K,V,EmptyContext,BootstrappedDatasetPartition>
BootstrappedVector containing each vector from the original sample with counters of repetitions for each subsample. As a heuristic, this implementation uses a Poisson distribution to generate the counter values.

Constructor and Description |
---|
BootstrappedDatasetBuilder(IgniteBiFunction<K,V,Vector> featureExtractor, IgniteBiFunction<K,V,Double> lbExtractor, int samplesCnt, double subsampleSize) Creates an instance of BootstrappedDatasetBuilder. |
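
The class description mentions repetition counters drawn from a Poisson distribution. The sketch below illustrates that idea with the JDK only; it is not the Ignite implementation. The class name PoissonBootstrapSketch, the helper names, and the choice of subsampleSize as the Poisson mean are assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.Random;

// Illustrative sketch only: names and the choice of mean are assumptions,
// not taken from the Ignite sources.
class PoissonBootstrapSketch {
    /** Draws one Poisson(mean) value with Knuth's multiplication method (suitable for small means). */
    static int samplePoisson(double mean, Random rnd) {
        double limit = Math.exp(-mean);
        double product = rnd.nextDouble();
        int count = 0;

        // Multiply uniforms until the product drops to exp(-mean) or below.
        while (product > limit) {
            count++;
            product *= rnd.nextDouble();
        }

        return count;
    }

    /** Returns counters[row][sample]: how often a row is repeated in each subsample. */
    static int[][] repetitionCounters(int rows, int samplesCnt, double subsampleSize, long seed) {
        Random rnd = new Random(seed);
        int[][] counters = new int[rows][samplesCnt];

        // Assumption: subsampleSize is used as the Poisson mean.
        for (int[] rowCounters : counters)
            Arrays.setAll(rowCounters, i -> samplePoisson(subsampleSize, rnd));

        return counters;
    }

    public static void main(String[] args) {
        // Five rows, three subsamples, mean of one repetition per row.
        for (int[] row : repetitionCounters(5, 3, 1.0, 42L))
            System.out.println(Arrays.toString(row));
    }
}
```

A counter of 0 means the row is left out of that subsample, and a counter of 2 means it appears twice. This approximates sampling with replacement without copying any vectors.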
Modifier and Type | Method and Description |
---|---|
BootstrappedDatasetPartition | build(Iterator<UpstreamEntry<K,V>> upstreamData, long upstreamDataSize, EmptyContext ctx) Builds a new partition data from a partition upstream data and partition context. |
Methods inherited from class Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface PartitionDataBuilder: andThen
public BootstrappedDatasetBuilder(IgniteBiFunction<K,V,Vector> featureExtractor, IgniteBiFunction<K,V,Double> lbExtractor, int samplesCnt, double subsampleSize)

featureExtractor - Feature extractor.
lbExtractor - Label extractor.
samplesCnt - Samples count.
subsampleSize - Subsample size.

public BootstrappedDatasetPartition build(Iterator<UpstreamEntry<K,V>> upstreamData, long upstreamDataSize, EmptyContext ctx)
Builds a new partition data from a partition upstream data and partition context.

Specified by: build in interface PartitionDataBuilder<K,V,EmptyContext,BootstrappedDatasetPartition>

upstreamData - Partition upstream data.
upstreamDataSize - Partition upstream data size.
ctx - Partition context.

Returns: Partition data.
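
To make the shape of build more concrete, here is a minimal JDK-only sketch of the flow the signature suggests: walk the upstream iterator once, apply the two extractors, and attach one repetition counter per subsample. Everything in it is a hypothetical stand-in rather than the Ignite API. Map.Entry stands in for UpstreamEntry, BiFunction for IgniteBiFunction, double[] for Vector, Row for BootstrappedVector, and counterSrc for whatever generates the counters.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiFunction;
import java.util.function.IntSupplier;

// Illustrative sketch only: all types here are stand-ins, not Ignite classes.
class BuildSketch {
    /** Hypothetical stand-in for BootstrappedVector: features, label, repetition counters. */
    static final class Row {
        final double[] features;
        final double label;
        final int[] counters;

        Row(double[] features, double label, int[] counters) {
            this.features = features;
            this.label = label;
            this.counters = counters;
        }
    }

    /** Walks the upstream iterator once and turns every entry into a Row. */
    static <K, V> List<Row> build(Iterator<Map.Entry<K, V>> upstreamData,
        long upstreamDataSize,
        BiFunction<K, V, double[]> featureExtractor,
        BiFunction<K, V, Double> lbExtractor,
        int samplesCnt,
        IntSupplier counterSrc) {
        // The known upstream size lets the result be allocated up front.
        List<Row> rows = new ArrayList<>(Math.toIntExact(upstreamDataSize));

        while (upstreamData.hasNext()) {
            Map.Entry<K, V> e = upstreamData.next();

            // One repetition counter per subsample for this row.
            int[] counters = new int[samplesCnt];
            for (int i = 0; i < samplesCnt; i++)
                counters[i] = counterSrc.getAsInt();

            rows.add(new Row(
                featureExtractor.apply(e.getKey(), e.getValue()),
                lbExtractor.apply(e.getKey(), e.getValue()),
                counters));
        }

        return rows;
    }

    public static void main(String[] args) {
        // Each value holds two features followed by the label.
        Map<Integer, double[]> upstream = new LinkedHashMap<>();
        upstream.put(1, new double[] {1.0, 2.0, 0.0});
        upstream.put(2, new double[] {3.0, 4.0, 1.0});

        List<Row> rows = build(
            upstream.entrySet().iterator(),
            upstream.size(),
            (k, v) -> Arrays.copyOf(v, v.length - 1),
            (k, v) -> v[v.length - 1],
            3,
            () -> 1); // constant counters keep the demo deterministic

        for (Row r : rows)
            System.out.println(Arrays.toString(r.features) + " label=" + r.label
                + " counters=" + Arrays.toString(r.counters));
    }
}
```

Passing the counter source as a parameter keeps the sketch deterministic in tests; a Poisson sampler could be supplied in its place.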
Ignite Database and Caching Platform : ver. 2.7.2 Release Date : February 6 2019