ml.dmlc.xgboost4j.scala.spark

XGBoostClassifier

class XGBoostClassifier extends ProbabilisticClassifier[Vector, XGBoostClassifier, XGBoostClassificationModel] with XGBoostClassifierParams with DefaultParamsWritable

Linear Supertypes
DefaultParamsWritable, MLWritable, XGBoostClassifierParams, NonParamVariables, HasContribPredictionCol, HasLeafPredictionCol, ParamMapFuncs, HasNumClass, HasBaseMarginCol, HasWeightCol, BoosterParams, LearningTaskParams, GeneralParams, ProbabilisticClassifier[Vector, XGBoostClassifier, XGBoostClassificationModel], ProbabilisticClassifierParams, HasThresholds, HasProbabilityCol, Classifier[Vector, XGBoostClassifier, XGBoostClassificationModel], ClassifierParams, HasRawPredictionCol, Predictor[Vector, XGBoostClassifier, XGBoostClassificationModel], PredictorParams, HasPredictionCol, HasFeaturesCol, HasLabelCol, Estimator[XGBoostClassificationModel], PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any

Instance Constructors

  1. new XGBoostClassifier(xgboostParams: Map[String, Any])
  2. new XGBoostClassifier(uid: String)
  3. new XGBoostClassifier()
  4. new XGBoostClassifier(uid: String, xgboostParams: Map[String, Any])
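
The constructors above can be combined with the setters listed under Value Members. The following is a minimal sketch, not a definitive recipe: it assumes the Map constructor accepts native XGBoost parameter names in snake_case (such as max_depth), and the column names "features" and "label" are illustrative assumptions about the input DataFrame.

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

object ClassifierSetup {
  // Assumption: keys use native XGBoost (snake_case) parameter names.
  val xgbParams: Map[String, Any] = Map(
    "eta" -> 0.1,
    "max_depth" -> 4,
    "objective" -> "binary:logistic",
    "num_round" -> 50,
    "num_workers" -> 2
  )

  // Map-based constructor; column names are hypothetical.
  val classifier: XGBoostClassifier = new XGBoostClassifier(xgbParams)
    .setFeaturesCol("features")
    .setLabelCol("label")

  // The same configuration expressed with the no-arg constructor and setters.
  val viaSetters: XGBoostClassifier = new XGBoostClassifier()
    .setEta(0.1)
    .setMaxDepth(4)
    .setObjective("binary:logistic")
    .setNumRound(50)
    .setNumWorkers(2)
}
```

Either instance can then be passed a Dataset through fit, which returns an XGBoostClassificationModel.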

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def $[T](param: Param[T]): T

    Attributes
    protected
    Definition Classes
    Params
  4. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  5. def MLlib2XGBoostParams: Map[String, Any]

    Definition Classes
    ParamMapFuncs
  6. def XGBoostToMLlibParams(xgboostParams: Map[String, Any]): Unit

    Definition Classes
    ParamMapFuncs
  7. final val alpha: DoubleParam

    L1 regularization term on weights. Increasing this value makes the model more conservative. [default=0]

    Definition Classes
    BoosterParams
  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. final val baseMarginCol: Param[String]

    Param for the initial prediction (a.k.a. base margin) column name.

    Definition Classes
    HasBaseMarginCol
  10. final val baseScore: DoubleParam

    The initial prediction score of all instances (global bias). [default=0.5]

    Definition Classes
    LearningTaskParams
  11. final val cacheTrainingSet: BooleanParam

    Whether to cache the training data.

    Definition Classes
    LearningTaskParams
  12. final val checkpointInterval: IntParam

    Param to set the checkpoint interval (>= 1) or disable checkpointing (-1). E.g. 10 means that the trained model is checkpointed every 10 iterations. Note: checkpoint_path must also be set if the checkpoint interval is greater than 0.

    Definition Classes
    GeneralParams
  13. final val checkpointPath: Param[String]

    The HDFS folder used to load and save checkpoint boosters. default: empty string

    Definition Classes
    GeneralParams
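
Because a positive checkpoint interval requires a checkpoint path, the two parameters are normally set together. A minimal sketch using the setters listed on this page; the HDFS path is a hypothetical placeholder, not a recommended location.

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

object CheckpointSetup {
  val classifier: XGBoostClassifier = new XGBoostClassifier()
    .setNumRound(100)
    // Hypothetical folder; any writable HDFS location would do.
    .setCheckpointPath("hdfs:///tmp/xgb-checkpoints")
    // Checkpoint the booster every 10 iterations.
    .setCheckpointInterval(10)
}
```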
  14. final def clear(param: Param[_]): XGBoostClassifier.this.type

    Definition Classes
    Params
  15. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  16. final val colsampleBylevel: DoubleParam

    Subsample ratio of columns for each split, in each level. [default=1] range: (0,1]

    Definition Classes
    BoosterParams
  17. final val colsampleBytree: DoubleParam

    Subsample ratio of columns when constructing each tree. [default=1] range: (0,1]

    Definition Classes
    BoosterParams
  18. final val contribPredictionCol: Param[String]

    Param for the contribution prediction column name.

    Definition Classes
    HasContribPredictionCol
  19. def copy(extra: ParamMap): XGBoostClassifier

    Definition Classes
    XGBoostClassifier → Predictor → Estimator → PipelineStage → Params
  20. def copyValues[T <: Params](to: T, extra: ParamMap): T

    Attributes
    protected
    Definition Classes
    Params
  21. final val customEval: CustomEvalParam

    Customized evaluation function provided by the user. default: null

    Definition Classes
    GeneralParams
  22. final val customObj: CustomObjParam

    Customized objective function provided by the user. default: null

    Definition Classes
    GeneralParams
  23. final def defaultCopy[T <: Params](extra: ParamMap): T

    Attributes
    protected
    Definition Classes
    Params
  24. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  25. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  26. final val eta: DoubleParam

    Step size shrinkage used in updates to prevent overfitting. After each boosting step, we can directly get the weights of new features, and eta shrinks the feature weights to make the boosting process more conservative. [default=0.3] range: [0,1]

    Definition Classes
    BoosterParams
  27. final val evalMetric: Param[String]

    Evaluation metrics for validation data. A default metric is assigned according to the objective (rmse for regression, error for classification, mean average precision for ranking). options: rmse, mae, logloss, error, merror, mlogloss, auc, aucpr, ndcg, map, gamma-deviance

    Definition Classes
    LearningTaskParams
  28. var evalSetsMap: Map[String, DataFrame]

    Attributes
    protected
    Definition Classes
    NonParamVariables
  29. def explainParam(param: Param[_]): String

    Definition Classes
    Params
  30. def explainParams(): String

    Definition Classes
    Params
  31. def extractLabeledPoints(dataset: Dataset[_], numClasses: Int): RDD[org.apache.spark.ml.feature.LabeledPoint]

    Attributes
    protected
    Definition Classes
    Classifier
  32. def extractLabeledPoints(dataset: Dataset[_]): RDD[org.apache.spark.ml.feature.LabeledPoint]

    Attributes
    protected
    Definition Classes
    Predictor
  33. final def extractParamMap(): ParamMap

    Definition Classes
    Params
  34. final def extractParamMap(extra: ParamMap): ParamMap

    Definition Classes
    Params
  35. final val featuresCol: Param[String]

    Definition Classes
    HasFeaturesCol
  36. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  37. def fit(dataset: Dataset[_]): XGBoostClassificationModel

    Definition Classes
    Predictor → Estimator
  38. def fit(dataset: Dataset[_], paramMaps: Array[ParamMap]): Seq[XGBoostClassificationModel]

    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" )
  39. def fit(dataset: Dataset[_], paramMap: ParamMap): XGBoostClassificationModel

    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" )
  40. def fit(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): XGBoostClassificationModel

    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" ) @varargs()
  41. final val gamma: DoubleParam

    Minimum loss reduction required to make a further partition on a leaf node of the tree. The larger the value, the more conservative the algorithm. [default=0] range: [0, Double.MaxValue]

    Definition Classes
    BoosterParams
  42. final def get[T](param: Param[T]): Option[T]

    Definition Classes
    Params
  43. final def getAlpha: Double

    Definition Classes
    BoosterParams
  44. final def getBaseMarginCol: String


    Definition Classes
    HasBaseMarginCol
  45. final def getBaseScore: Double

    Definition Classes
    LearningTaskParams
  46. final def getCheckpointInterval: Int

    Definition Classes
    GeneralParams
  47. final def getCheckpointPath: String

    Definition Classes
    GeneralParams
  48. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  49. final def getColsampleBylevel: Double

    Definition Classes
    BoosterParams
  50. final def getColsampleBytree: Double

    Definition Classes
    BoosterParams
  51. final def getContribPredictionCol: String


    Definition Classes
    HasContribPredictionCol
  52. final def getDefault[T](param: Param[T]): Option[T]

    Definition Classes
    Params
  53. final def getEta: Double

    Definition Classes
    BoosterParams
  54. final def getEvalMetric: String

    Definition Classes
    LearningTaskParams
  55. def getEvalSets(params: Map[String, Any]): Map[String, DataFrame]

    Definition Classes
    NonParamVariables
  56. final def getFeaturesCol: String

    Definition Classes
    HasFeaturesCol
  57. final def getGamma: Double

    Definition Classes
    BoosterParams
  58. final def getGrowPolicy: String

    Definition Classes
    BoosterParams
  59. final def getInteractionConstraints: String

    Definition Classes
    BoosterParams
  60. final def getLabelCol: String

    Definition Classes
    HasLabelCol
  61. final def getLambda: Double

    Definition Classes
    BoosterParams
  62. final def getLambdaBias: Double

    Definition Classes
    BoosterParams
  63. final def getLeafPredictionCol: String


    Definition Classes
    HasLeafPredictionCol
  64. final def getMaxBins: Int

    Definition Classes
    BoosterParams
  65. final def getMaxDeltaStep: Double

    Definition Classes
    BoosterParams
  66. final def getMaxDepth: Int

    Definition Classes
    BoosterParams
  67. final def getMaxLeaves: Int

    Definition Classes
    BoosterParams
  68. final def getMaximizeEvaluationMetrics: Boolean

    Definition Classes
    LearningTaskParams
  69. final def getMinChildWeight: Double

    Definition Classes
    BoosterParams
  70. final def getMissing: Float

    Definition Classes
    GeneralParams
  71. final def getMonotoneConstraints: String

    Definition Classes
    BoosterParams
  72. final def getNormalizeType: String

    Definition Classes
    BoosterParams
  73. final def getNthread: Int

    Definition Classes
    GeneralParams
  74. final def getNumClass: Int


    Definition Classes
    HasNumClass
  75. def getNumClasses(dataset: Dataset[_], maxNumClasses: Int): Int

    Attributes
    protected
    Definition Classes
    Classifier
  76. final def getNumEarlyStoppingRounds: Int

    Definition Classes
    LearningTaskParams
  77. final def getNumRound: Int

    Definition Classes
    GeneralParams
  78. final def getNumWorkers: Int

    Definition Classes
    GeneralParams
  79. final def getObjective: String

    Definition Classes
    LearningTaskParams
  80. final def getObjectiveType: String

    Definition Classes
    LearningTaskParams
  81. final def getOrDefault[T](param: Param[T]): T

    Definition Classes
    Params
  82. def getParam(paramName: String): Param[Any]

    Definition Classes
    Params
  83. final def getPredictionCol: String

    Definition Classes
    HasPredictionCol
  84. final def getProbabilityCol: String

    Definition Classes
    HasProbabilityCol
  85. final def getRateDrop: Double

    Definition Classes
    BoosterParams
  86. final def getRawPredictionCol: String

    Definition Classes
    HasRawPredictionCol
  87. final def getSampleType: String

    Definition Classes
    BoosterParams
  88. final def getScalePosWeight: Double

    Definition Classes
    BoosterParams
  89. final def getSeed: Long

    Definition Classes
    GeneralParams
  90. final def getSilent: Int

    Definition Classes
    GeneralParams
  91. final def getSketchEps: Double

    Definition Classes
    BoosterParams
  92. final def getSkipDrop: Double

    Definition Classes
    BoosterParams
  93. final def getSubsample: Double

    Definition Classes
    BoosterParams
  94. def getThresholds: Array[Double]

    Definition Classes
    HasThresholds
  95. final def getTimeoutRequestWorkers: Long

    Definition Classes
    GeneralParams
  96. final def getTrainTestRatio: Double

    Definition Classes
    LearningTaskParams
  97. final def getTreeLimit: Int

    Definition Classes
    BoosterParams
  98. final def getTreeMethod: String

    Definition Classes
    BoosterParams
  99. final def getUseExternalMemory: Boolean

    Definition Classes
    GeneralParams
  100. final def getVerbosity: Int

    Definition Classes
    GeneralParams
  101. final def getWeightCol: String

    Definition Classes
    HasWeightCol
  102. final val growPolicy: Param[String]

    Growth policy for the fast histogram algorithm.

    Definition Classes
    BoosterParams
  103. final def hasDefault[T](param: Param[T]): Boolean

    Definition Classes
    Params
  104. def hasParam(paramName: String): Boolean

    Definition Classes
    Params
  105. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  106. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  107. def initializeLogIfNecessary(isInterpreter: Boolean): Unit

    Attributes
    protected
    Definition Classes
    Logging
  108. final val interactionConstraints: Param[String]

    Definition Classes
    BoosterParams
  109. final def isDefined(param: Param[_]): Boolean

    Definition Classes
    Params
  110. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  111. final def isSet(param: Param[_]): Boolean

    Definition Classes
    Params
  112. def isTraceEnabled(): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  113. final val labelCol: Param[String]

    Definition Classes
    HasLabelCol
  114. final val lambda: DoubleParam

    L2 regularization term on weights. Increasing this value makes the model more conservative. [default=1]

    Definition Classes
    BoosterParams
  115. final val lambdaBias: DoubleParam

    Parameter of the linear booster: L2 regularization term on bias. [default=0] (There is no L1 regularization on bias because it is not important.)

    Definition Classes
    BoosterParams
  116. final val leafPredictionCol: Param[String]

    Param for the leaf prediction column name.

    Definition Classes
    HasLeafPredictionCol
  117. def log: Logger

    Attributes
    protected
    Definition Classes
    Logging
  118. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  119. def logDebug(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  120. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  121. def logError(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  122. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  123. def logInfo(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  124. def logName: String

    Attributes
    protected
    Definition Classes
    Logging
  125. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  126. def logTrace(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  127. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  128. def logWarning(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  129. final val maxBins: IntParam

    Maximum number of bins in the histogram.

    Definition Classes
    BoosterParams
  130. final val maxDeltaStep: DoubleParam

    Maximum delta step allowed for each tree's weight estimation. A value of 0 means there is no constraint. A positive value can help make the update step more conservative. This parameter is usually not needed, but it might help in logistic regression when the classes are extremely imbalanced; setting it to a value of 1-10 might help control the update. [default=0] range: [0, Double.MaxValue]

    Definition Classes
    BoosterParams
  131. final val maxDepth: IntParam

    Maximum depth of a tree. Increasing this value makes the model more complex and more likely to overfit. [default=6] range: [1, Int.MaxValue]

    Definition Classes
    BoosterParams
  132. final val maxLeaves: IntParam

    Maximum number of nodes to be added. Only relevant when grow_policy=lossguide is set.

    Definition Classes
    BoosterParams
  133. final val maximizeEvaluationMetrics: BooleanParam

    Definition Classes
    LearningTaskParams
  134. final val minChildWeight: DoubleParam

    Minimum sum of instance weight (hessian) needed in a child. If the tree partition step results in a leaf node with a sum of instance weight less than min_child_weight, the building process gives up further partitioning. In linear regression mode, this simply corresponds to the minimum number of instances needed in each node. The larger the value, the more conservative the algorithm. [default=1] range: [0, Double.MaxValue]

    Definition Classes
    BoosterParams
  135. final val missing: FloatParam

    The value treated as missing. default: Float.NaN

    Definition Classes
    GeneralParams
  136. final val monotoneConstraints: Param[String]

    Definition Classes
    BoosterParams
  137. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  138. final val normalizeType: Param[String]

    Parameter of the Dart booster. Type of normalization algorithm. options: {'tree', 'forest'}. [default="tree"]

    Definition Classes
    BoosterParams
  139. final def notify(): Unit

    Definition Classes
    AnyRef
  140. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  141. final val nthread: IntParam

    Number of threads used per worker. default: 1

    Definition Classes
    GeneralParams
  142. final val numClass: IntParam

    Number of classes.

    Definition Classes
    HasNumClass
  143. final val numEarlyStoppingRounds: IntParam

    If non-zero, training is stopped after the specified number of consecutive increases in any evaluation metric.

    Definition Classes
    LearningTaskParams
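
To make the stopping rule concrete, here is a deliberately simplified sketch of the wording above: stop once the metric has increased in each of the last N rounds. The object and function names are hypothetical, and this is not XGBoost4J-Spark's internal implementation, which may track the best score differently.

```scala
object EarlyStoppingSketch {
  /** True when the metric rose in each of the last `numEarlyStoppingRounds` rounds.
    * A value of 0 disables the check, mirroring the "if non-zero" wording. */
  def shouldStop(metricHistory: Seq[Double], numEarlyStoppingRounds: Int): Boolean =
    numEarlyStoppingRounds > 0 &&
      metricHistory.length > numEarlyStoppingRounds &&
      metricHistory
        .takeRight(numEarlyStoppingRounds + 1) // last N transitions need N + 1 values
        .sliding(2)
        .forall(pair => pair(1) > pair(0))     // every transition is an increase
}
```

For a metric where lower is better (such as error), two consecutive increases with numEarlyStoppingRounds = 2 would trigger the stop in this sketch.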
  144. final val numRound: IntParam

    The number of rounds for boosting.

    Definition Classes
    GeneralParams
  145. final val numWorkers: IntParam

    Number of workers used to train the XGBoost model. default: 1

    Definition Classes
    GeneralParams
  146. final val objective: Param[String]

    Specify the learning task and the corresponding learning objective. options: reg:squarederror, reg:logistic, binary:logistic, binary:logitraw, count:poisson, multi:softmax, multi:softprob, rank:pairwise, reg:gamma. default: reg:squarederror

    Definition Classes
    LearningTaskParams
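
Since the default objective is reg:squarederror, a classifier will usually want the objective set explicitly. A minimal sketch for a three-class problem using setters listed on this page; the class count and the choice of mlogloss as the evaluation metric are illustrative assumptions.

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

object MultiClassSetup {
  val classifier: XGBoostClassifier = new XGBoostClassifier()
    .setObjective("multi:softprob") // one of the objective options listed above
    .setNumClass(3)                 // assumed number of label classes
    .setEvalMetric("mlogloss")      // one of the evalMetric options listed above
}
```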
  147. final val objectiveType: Param[String]

    The learning objective type of the specified custom objective and eval. The corresponding type is assigned if a custom objective is defined. options: regression, classification. default: null

    Definition Classes
    LearningTaskParams
  148. lazy val params: Array[Param[_]]

    Definition Classes
    Params
  149. final val predictionCol: Param[String]

    Definition Classes
    HasPredictionCol
  150. final val probabilityCol: Param[String]

    Definition Classes
    HasProbabilityCol
  151. final val rateDrop: DoubleParam

    Parameter of the Dart booster. Dropout rate. [default=0.0] range: [0.0, 1.0]

    Definition Classes
    BoosterParams
  152. final val rawPredictionCol: Param[String]

    Definition Classes
    HasRawPredictionCol
  153. final val sampleType: Param[String]

    Parameter of the Dart booster. Type of sampling algorithm. "uniform": dropped trees are selected uniformly. "weighted": dropped trees are selected in proportion to weight. [default="uniform"]

    Definition Classes
    BoosterParams
  154. def save(path: String): Unit

    Definition Classes
    MLWritable
    Annotations
    @Since( "1.6.0" ) @throws( ... )
  155. final val scalePosWeight: DoubleParam

    Controls the balance of positive and negative weights; useful for unbalanced classes. A typical value to consider: sum(negative cases) / sum(positive cases). [default=1]

    Definition Classes
    BoosterParams
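
The suggested ratio can be computed directly from the labels. A small sketch, assuming binary labels encoded as 1.0 (positive) and 0.0 (negative); the helper name is hypothetical and not part of the library.

```scala
object ScalePosWeight {
  /** sum(negative cases) / sum(positive cases) for 0.0/1.0-encoded labels. */
  def suggest(labels: Seq[Double]): Double = {
    val positives = labels.count(_ == 1.0)
    val negatives = labels.count(_ == 0.0)
    require(positives > 0, "need at least one positive case")
    negatives.toDouble / positives
  }
}
```

The result could then be passed to setScalePosWeight; on a large Dataset the counts would more likely come from an aggregation than from a local Seq.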
  156. final val seed: LongParam

    Random seed for the C++ part of XGBoost and train/test splitting.

    Definition Classes
    GeneralParams
  157. final def set(paramPair: ParamPair[_]): XGBoostClassifier.this.type

    Attributes
    protected
    Definition Classes
    Params
  158. final def set(param: String, value: Any): XGBoostClassifier.this.type

    Attributes
    protected
    Definition Classes
    Params
  159. final def set[T](param: Param[T], value: T): XGBoostClassifier.this.type

    Definition Classes
    Params
  160. def setAlpha(value: Double): XGBoostClassifier.this.type

  161. def setBaseMarginCol(value: String): XGBoostClassifier.this.type

  162. def setBaseScore(value: Double): XGBoostClassifier.this.type

  163. def setCheckpointInterval(value: Int): XGBoostClassifier.this.type

  164. def setCheckpointPath(value: String): XGBoostClassifier.this.type

  165. def setColsampleBylevel(value: Double): XGBoostClassifier.this.type

  166. def setColsampleBytree(value: Double): XGBoostClassifier.this.type

  167. def setCustomEval(value: EvalTrait): XGBoostClassifier.this.type

  168. def setCustomObj(value: ObjectiveTrait): XGBoostClassifier.this.type

  169. final def setDefault(paramPairs: ParamPair[_]*): XGBoostClassifier.this.type

    Attributes
    protected
    Definition Classes
    Params
  170. final def setDefault[T](param: Param[T], value: T): XGBoostClassifier.this.type

    Attributes
    protected
    Definition Classes
    Params
  171. def setEta(value: Double): XGBoostClassifier.this.type

  172. def setEvalMetric(value: String): XGBoostClassifier.this.type

  173. def setEvalSets(evalSets: Map[String, DataFrame]): XGBoostClassifier.this.type

    Definition Classes
    NonParamVariables
  174. def setFeaturesCol(value: String): XGBoostClassifier

    Definition Classes
    Predictor
  175. def setGamma(value: Double): XGBoostClassifier.this.type

  176. def setGrowPolicy(value: String): XGBoostClassifier.this.type

  177. def setLabelCol(value: String): XGBoostClassifier

    Definition Classes
    Predictor
  178. def setLambda(value: Double): XGBoostClassifier.this.type

  179. def setLambdaBias(value: Double): XGBoostClassifier.this.type

  180. def setMaxBins(value: Int): XGBoostClassifier.this.type

  181. def setMaxDeltaStep(value: Double): XGBoostClassifier.this.type

  182. def setMaxDepth(value: Int): XGBoostClassifier.this.type

  183. def setMaxLeaves(value: Int): XGBoostClassifier.this.type

  184. def setMaximizeEvaluationMetrics(value: Boolean): XGBoostClassifier.this.type

  185. def setMinChildWeight(value: Double): XGBoostClassifier.this.type

  186. def setMissing(value: Float): XGBoostClassifier.this.type

  187. def setNormalizeType(value: String): XGBoostClassifier.this.type

  188. def setNthread(value: Int): XGBoostClassifier.this.type

  189. def setNumClass(value: Int): XGBoostClassifier.this.type

  190. def setNumEarlyStoppingRounds(value: Int): XGBoostClassifier.this.type

  191. def setNumRound(value: Int): XGBoostClassifier.this.type

  192. def setNumWorkers(value: Int): XGBoostClassifier.this.type

  193. def setObjective(value: String): XGBoostClassifier.this.type

  194. def setObjectiveType(value: String): XGBoostClassifier.this.type

  195. def setPredictionCol(value: String): XGBoostClassifier

    Definition Classes
    Predictor
  196. def setProbabilityCol(value: String): XGBoostClassifier

    Definition Classes
    ProbabilisticClassifier
  197. def setRateDrop(value: Double): XGBoostClassifier.this.type

  198. def setRawPredictionCol(value: String): XGBoostClassifier

    Definition Classes
    Classifier
  199. def setSampleType(value: String): XGBoostClassifier.this.type

  200. def setScalePosWeight(value: Double): XGBoostClassifier.this.type

  201. def setSeed(value: Long): XGBoostClassifier.this.type

  202. def setSilent(value: Int): XGBoostClassifier.this.type

  203. def setSketchEps(value: Double): XGBoostClassifier.this.type

  204. def setSkipDrop(value: Double): XGBoostClassifier.this.type

  205. def setSubsample(value: Double): XGBoostClassifier.this.type

  206. def setThresholds(value: Array[Double]): XGBoostClassifier

    Definition Classes
    ProbabilisticClassifier
  207. def setTimeoutRequestWorkers(value: Long): XGBoostClassifier.this.type

  208. def setTrainTestRatio(value: Double): XGBoostClassifier.this.type

  209. def setTreeMethod(value: String): XGBoostClassifier.this.type

  210. def setUseExternalMemory(value: Boolean): XGBoostClassifier.this.type

  211. def setWeightCol(value: String): XGBoostClassifier.this.type

  212. final val silent: IntParam

    Deprecated. Please use verbosity instead. 0 means printing running messages, 1 means silent mode. default: 0

    Definition Classes
    GeneralParams
  213. final val sketchEps: DoubleParam

    Only used for the approximate greedy algorithm. This roughly translates into O(1 / sketch_eps) bins. Compared to directly selecting the number of bins, this comes with a theoretical guarantee on sketch accuracy. [default=0.03] range: (0, 1)

    Definition Classes
    BoosterParams
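
As a rough worked example of the O(1 / sketch_eps) relationship, the default of 0.03 corresponds to on the order of 33 bins. The helper below is hypothetical and only evaluates 1 / sketch_eps; the actual bin count chosen by XGBoost may differ by a constant factor.

```scala
object SketchBins {
  /** Order-of-magnitude bin count implied by sketch_eps, i.e. round(1 / sketch_eps). */
  def approxBins(sketchEps: Double): Long = {
    require(sketchEps > 0.0 && sketchEps < 1.0, "sketch_eps must lie in (0, 1)")
    math.round(1.0 / sketchEps)
  }
}
```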
  214. final val skipDrop: DoubleParam

    Parameter of the Dart booster. Probability of skipping dropout. If a dropout is skipped, new trees are added in the same manner as gbtree. [default=0.0] range: [0.0, 1.0]

    Definition Classes
    BoosterParams
  215. final val subsample: DoubleParam


    Subsample ratio of the training instances. Setting it to 0.5 means that XGBoost randomly samples half of the data instances to grow trees, which helps prevent overfitting. [default=1] range: (0, 1]
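
Row subsampling can be pictured with the following sketch. It is an illustration only: `SubsampleSketch` and `sampleRows` are hypothetical names, and keeping each row independently with probability `subsample` is an assumption about the sampling scheme rather than a description of XGBoost's internals.

```scala
import scala.util.Random

// Hypothetical illustration of row subsampling; not XGBoost code.
object SubsampleSketch {
  /** Indices of the rows kept for growing one tree. */
  def sampleRows(numRows: Int, subsample: Double, seed: Long): Vector[Int] = {
    // The documented range is (0, 1].
    require(subsample > 0.0 && subsample <= 1.0, s"subsample must be in (0, 1], got $subsample")
    val rng = new Random(seed)
    // nextDouble() is in [0, 1), so subsample = 1.0 keeps every row.
    (0 until numRows).filter(_ => rng.nextDouble() < subsample).toVector
  }
}
```

With a ratio of 0.5, about half of the rows are kept on average; the exact count varies with the seed.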

    Definition Classes
    BoosterParams
  216. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  217. final val thresholds: DoubleArrayParam

    Definition Classes
    HasThresholds
  218. final val timeoutRequestWorkers: LongParam


    The maximum time to wait for the job that requests new workers. default: 30 minutes

    Definition Classes
    GeneralParams
  219. def toString(): String

    Definition Classes
    Identifiable → AnyRef → Any
  220. final val trackerConf: TrackerConfParam


    Rabit tracker configurations. The parameter must be provided as an instance of the TrackerConf class, which has the following definition:

    case class TrackerConf(workerConnectionTimeout: Duration, trainingTimeout: Duration, trackerImpl: String)

    See below for detailed explanations.

    • trackerImpl: Select the implementation of Rabit tracker. default: "python"

    Choose between "python" and "scala". The former uses the Java wrapper of the Python Rabit tracker (in dmlc_core) and does not support timeout settings. The "scala" version removes the Python components and fully supports timeout settings.

    • workerConnectionTimeout: the maximum wait time for all workers to connect to the tracker. default: 0 milliseconds (no timeout)

    Because Spark executes operations lazily, the timeout value should account for the time spent on data loading and pre-processing. Alternatively, you may force Spark to perform the data transformation before calling XGBoost.train(), so that this timeout truly reflects the connection delay. Set a reasonable timeout value to prevent model training/testing from hanging indefinitely, possibly due to network issues. Note that a zero timeout value means waiting indefinitely (equivalent to Duration.Inf). Ignored if the tracker implementation is "python".
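
The timeout rules described above can be summarised in a short sketch. To stay self-contained it uses a local stand-in, `TrackerConfSketch`, that mirrors the case class definition quoted in this entry; the stand-in, the `TrackerConfExample` object, and the `effectiveConnectionTimeout` helper are hypothetical and are not part of the library.

```scala
import scala.concurrent.duration._

// Local stand-in mirroring the documented TrackerConf definition; not the library class.
final case class TrackerConfSketch(
    workerConnectionTimeout: Duration,
    trainingTimeout: Duration,
    trackerImpl: String)

// Hypothetical helper that encodes the documented interpretation of the timeout.
object TrackerConfExample {
  def effectiveConnectionTimeout(conf: TrackerConfSketch): Duration =
    if (conf.trackerImpl == "python") {
      // The "python" tracker ignores timeout settings, so workers are awaited indefinitely.
      Duration.Inf
    } else {
      conf.workerConnectionTimeout match {
        // A zero timeout means waiting indefinitely (equivalent to Duration.Inf).
        case f: FiniteDuration if f.length == 0L => Duration.Inf
        case other => other
      }
    }
}
```

For example, `TrackerConfSketch(5.minutes, 0.millis, "scala")` yields an effective connection timeout of five minutes, while the same value with "python" yields `Duration.Inf`.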

    Definition Classes
    GeneralParams
  221. def train(dataset: Dataset[_]): XGBoostClassificationModel

    Attributes
    protected
    Definition Classes
    XGBoostClassifier → Predictor
  222. final val trainTestRatio: DoubleParam


    Fraction of training points to use for testing.

    Definition Classes
    LearningTaskParams
  223. def transformSchema(schema: StructType): StructType

    Definition Classes
    Predictor → PipelineStage
  224. def transformSchema(schema: StructType, logging: Boolean): StructType

    Attributes
    protected
    Definition Classes
    PipelineStage
    Annotations
    @DeveloperApi()
  225. final val treeLimit: IntParam

    Definition Classes
    BoosterParams
  226. final val treeMethod: Param[String]


    The tree construction algorithm used in XGBoost. options: {'auto', 'exact', 'approx'} [default='auto']

    Definition Classes
    BoosterParams
  227. val uid: String

    Definition Classes
    XGBoostClassifier → Identifiable
  228. final val useExternalMemory: BooleanParam


    whether to use external memory as cache. default: false

    Definition Classes
    GeneralParams
  229. def validateAndTransformSchema(schema: StructType, fitting: Boolean, featuresDataType: DataType): StructType

    Attributes
    protected
    Definition Classes
    ProbabilisticClassifierParams → ClassifierParams → PredictorParams
  230. final val verbosity: IntParam


    Verbosity of printing messages. Valid values are 0 (silent), 1 (warning), 2 (info), 3 (debug). default: 1
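
The valid levels can be captured in a small lookup, shown here as a sketch. `VerbositySketch` and `describe` are hypothetical names used only for illustration; the level names and the default come from the description above.

```scala
// Hypothetical lookup of the documented verbosity levels; not part of the XGBoost API.
object VerbositySketch {
  private val levels: Map[Int, String] =
    Map(0 -> "silent", 1 -> "warning", 2 -> "info", 3 -> "debug")

  /** Documented default level. */
  val Default: Int = 1

  /** Name of a verbosity level, or None if the value is outside 0 to 3. */
  def describe(level: Int): Option[String] = levels.get(level)
}
```

A caller could use `describe` to reject out-of-range values before passing them to a setter.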

    Definition Classes
    GeneralParams
  231. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  232. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  233. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  234. final val weightCol: Param[String]

    Definition Classes
    HasWeightCol
  235. def write: MLWriter

    Definition Classes
    DefaultParamsWritable → MLWritable
