Skip to content

Variable importance - optional metrics #38

@JanBenisek

Description

@JanBenisek

We compute variable importance by calculating Pearson's correlation between scores and target encoded variables:

importance_by_variable = {
utils.clean_predictor_name(predictor): stats.pearsonr(
data[predictor],
y_pred
)[0]
for predictor in self.predictors
}

It'd be nice to choose different correlation (like Kendall)? Pearson assumes normality, but doesn't always hold for the variables considered.

https://datascience.stackexchange.com/questions/64260/pearson-vs-spearman-vs-kendall

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestgood first issueGood for newcomersquestionFurther information is requested

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions