xgboost
|
Meta information about dataset, always sit in memory. More...
#include <data.h>
Public Member Functions | |
MetaInfo ()=default | |
default constructor More... | |
MetaInfo (MetaInfo &&that)=default | |
MetaInfo & | operator= (MetaInfo &&that)=default |
MetaInfo & | operator= (MetaInfo const &that) |
void | Validate (int32_t device) const |
Validate all metainfo. More... | |
MetaInfo | Slice (common::Span< int32_t const > ridxs) const |
bst_float | GetWeight (size_t i) const |
Get weight of each instances. More... | |
const std::vector< size_t > & | LabelAbsSort () const |
get sorted indexes (argsort) of labels by absolute value (used by cox loss) More... | |
void | Clear () |
clear all the information More... | |
void | LoadBinary (dmlc::Stream *fi) |
Load the Meta info from binary stream. More... | |
void | SaveBinary (dmlc::Stream *fo) const |
Save the Meta info to binary stream. More... | |
void | SetInfo (const char *key, const void *dptr, DataType dtype, size_t num) |
Set information in the meta info. More... | |
void | SetInfo (const char *key, std::string const &interface_str) |
Set information in the meta info with array interface. More... | |
Public Attributes | |
uint64_t | num_row_ {0} |
number of rows in the data More... | |
uint64_t | num_col_ {0} |
number of columns in the data More... | |
uint64_t | num_nonzero_ {0} |
number of nonzero entries in the data More... | |
HostDeviceVector< bst_float > | labels_ |
label of each instance More... | |
std::vector< bst_group_t > | group_ptr_ |
the index of begin and end of a group needed when the learning task is ranking. More... | |
HostDeviceVector< bst_float > | weights_ |
weights of each instance, optional More... | |
HostDeviceVector< bst_float > | base_margin_ |
initialized margins, if specified, xgboost will start from this init margin can be used to specify initial prediction to boost from. More... | |
HostDeviceVector< bst_float > | labels_lower_bound_ |
lower bound of the label, to be used for survival analysis (censored regression) More... | |
HostDeviceVector< bst_float > | labels_upper_bound_ |
upper bound of the label, to be used for survival analysis (censored regression) More... | |
Static Public Attributes | |
static constexpr uint64_t | kNumField = 9 |
number of data fields in MetaInfo More... | |
Meta information about dataset, always sit in memory.
|
default |
default constructor
|
default |
void xgboost::MetaInfo::Clear | ( | ) |
clear all the information
|
inline |
Get weight of each instances.
i | Instance index. |
|
inline |
get sorted indexes (argsort) of labels by absolute value (used by cox loss)
void xgboost::MetaInfo::LoadBinary | ( | dmlc::Stream * | fi | ) |
Load the Meta info from binary stream.
fi | The input stream |
void xgboost::MetaInfo::SaveBinary | ( | dmlc::Stream * | fo | ) | const |
Save the Meta info to binary stream.
fo | The output stream. |
void xgboost::MetaInfo::SetInfo | ( | const char * | key, |
const void * | dptr, | ||
DataType | dtype, | ||
size_t | num | ||
) |
Set information in the meta info.
key | The key of the information. |
dptr | The data pointer of the source array. |
dtype | The type of the source data. |
num | Number of elements in the source array. |
void xgboost::MetaInfo::SetInfo | ( | const char * | key, |
std::string const & | interface_str | ||
) |
Set information in the meta info with array interface.
key | The key of the information. |
interface_str | String representation of json format array interface. [ column_0, column_1, ... column_n ] Right now only 1 column is permitted. |
MetaInfo xgboost::MetaInfo::Slice | ( | common::Span< int32_t const > | ridxs | ) | const |
void xgboost::MetaInfo::Validate | ( | int32_t | device | ) | const |
Validate all metainfo.
HostDeviceVector<bst_float> xgboost::MetaInfo::base_margin_ |
initialized margins, if specified, xgboost will start from this init margin can be used to specify initial prediction to boost from.
std::vector<bst_group_t> xgboost::MetaInfo::group_ptr_ |
the index of begin and end of a group needed when the learning task is ranking.
|
static |
number of data fields in MetaInfo
HostDeviceVector<bst_float> xgboost::MetaInfo::labels_ |
label of each instance
HostDeviceVector<bst_float> xgboost::MetaInfo::labels_lower_bound_ |
lower bound of the label, to be used for survival analysis (censored regression)
HostDeviceVector<bst_float> xgboost::MetaInfo::labels_upper_bound_ |
upper bound of the label, to be used for survival analysis (censored regression)
uint64_t xgboost::MetaInfo::num_col_ {0} |
number of columns in the data
uint64_t xgboost::MetaInfo::num_nonzero_ {0} |
number of nonzero entries in the data
uint64_t xgboost::MetaInfo::num_row_ {0} |
number of rows in the data
HostDeviceVector<bst_float> xgboost::MetaInfo::weights_ |
weights of each instance, optional