The number of samples that fall into the node at the training stage. It is used to resolve difficult cases: when the variable for the primary split is missing and the variables for all the surrogate splits are missing too,
the sample is directed to the left if left->sample_count > right->sample_count and to the right otherwise.
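
The fallback rule described above can be sketched as follows. This is a minimal illustration, not the library's implementation: the `Node` struct and the `fallback_child` helper are hypothetical names introduced here, and only the `sample_count`, `left`, and `right` fields mirror the text.

```cpp
#include <cassert>

// Hypothetical, simplified stand-in for a decision tree node.
// Only the fields needed for the fallback rule are modelled.
struct Node {
    int sample_count;  // number of training samples that reached this node
    Node* left;        // left child, or nullptr for a leaf
    Node* right;       // right child, or nullptr for a leaf
};

// Chooses the child for a sample whose primary split variable and all
// surrogate split variables are missing: the child that received more
// training samples wins, and a tie goes to the right child.
inline const Node* fallback_child(const Node& node) {
    assert(node.left != nullptr && node.right != nullptr);
    return node.left->sample_count > node.right->sample_count ? node.left
                                                              : node.right;
}
```

With children holding 70 and 30 training samples, `fallback_child` returns the left child. With equal counts it returns the right child, because the comparison is strict.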