Bradley-Terry Logistic Ranking

Discover what you value through pairwise comparisons

Name   Min   Max   Bins
Comparisons: 0
Confidence
Stability
Model Reliability

Which would you prefer?

Option A

Option B

CI Level: 95%

Linear Weights (one weight per facet, assumes uniform bin importance)
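
As a rough illustration of what "one weight per facet" could mean, the sketch below fits a linear Bradley-Terry logistic model to pairwise choices. It is not taken from this tool: the function names, the learning rate, and the plain gradient-ascent fit are all assumptions made for the example, and each option is assumed to be a tuple of numeric facet values.

```python
import math

def preference_probability(weights, option_a, option_b):
    """P(A is preferred over B) under a linear Bradley-Terry logistic model."""
    # Score difference: sum over facets of weight * (A's value - B's value).
    diff = sum(w * (a - b) for w, a, b in zip(weights, option_a, option_b))
    return 1.0 / (1.0 + math.exp(-diff))

def fit_linear_weights(comparisons, n_facets, lr=0.1, epochs=500):
    """Fit one weight per facet by gradient ascent on the log-likelihood.

    comparisons: list of (winner, loser) pairs of facet-value tuples.
    lr and epochs are hypothetical defaults chosen for this sketch.
    """
    weights = [0.0] * n_facets
    for _ in range(epochs):
        grad = [0.0] * n_facets
        for winner, loser in comparisons:
            p = preference_probability(weights, winner, loser)
            for k in range(n_facets):
                # Gradient of log P(winner beats loser) with respect to weight k.
                grad[k] += (1.0 - p) * (winner[k] - loser[k])
        weights = [w + lr * g / len(comparisons) for w, g in zip(weights, grad)]
    return weights

# Hypothetical data: a single facet, where the larger value usually wins.
comparisons = [((1.0,), (0.0,)), ((0.9,), (0.2,)), ((0.1,), (0.4,))]
weights = fit_linear_weights(comparisons, n_facets=1)
```

With a single weight per facet, moving from one bin to the next changes the score by the same amount everywhere in the range, which is the "uniform bin importance" assumption named above.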


Per-Bin Weights (thermometer encoding) Bins:
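
A minimal sketch of thermometer encoding, assuming each facet's range from Min to Max is split into equal-width bins. The function names and the equal-width binning are assumptions for illustration; the tool's actual binning rule is not stated here, and some conventions drop the first bin as a baseline.

```python
def thermometer_encode(value, lo, hi, n_bins):
    """Encode value as a cumulative 0/1 vector: entry k is 1 if value reaches bin k."""
    width = (hi - lo) / n_bins
    # Index of the bin containing the value, clamped to [0, n_bins - 1].
    idx = min(max(int((value - lo) / width), 0), n_bins - 1)
    return [1 if k <= idx else 0 for k in range(n_bins)]

def per_bin_score(bin_weights, encoded):
    """Score of one facet: the sum of the weights of every bin that is switched on."""
    return sum(w * bit for w, bit in zip(bin_weights, encoded))

# A facet ranging from 0 to 100 with 4 bins; the value 60 falls in the third bin.
encoded = thermometer_encode(60, 0, 100, 4)
score = per_bin_score([0.5, 1.0, 0.25, 2.0], encoded)
```

Because each bin carries its own weight, the step from one bin to the next can matter more in some parts of the range than in others, unlike the linear model.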

Compare Evaluators

Import CSVs from multiple evaluators to find where their learned models differ.

Click to add evaluator CSV files

Each file should have an "evaluator" column.
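
The sketch below shows one way such files could be grouped by evaluator and how disagreement between fitted models might be summarized. Only the "evaluator" column comes from the text above; the other column (`choice`), the facet names, and both function names are hypothetical.

```python
import csv
import io
from collections import defaultdict

def group_rows_by_evaluator(csv_texts):
    """Collect rows from several CSV documents, keyed by the "evaluator" column."""
    grouped = defaultdict(list)
    for text in csv_texts:
        reader = csv.DictReader(io.StringIO(text))
        if reader.fieldnames is None or "evaluator" not in reader.fieldnames:
            raise ValueError('CSV is missing the "evaluator" column')
        for row in reader:
            grouped[row["evaluator"]].append(row)
    return dict(grouped)

def weight_spread(weights_by_evaluator):
    """For each facet, the gap between the largest and smallest weight across evaluators."""
    facets = set()
    for weights in weights_by_evaluator.values():
        facets.update(weights)
    spread = {}
    for facet in sorted(facets):
        # A facet missing from one evaluator's model is treated as weight 0.0 (an assumption).
        values = [w.get(facet, 0.0) for w in weights_by_evaluator.values()]
        spread[facet] = max(values) - min(values)
    return spread

# Hypothetical inputs: two small CSV files and two already-fitted weight sets.
files = ["evaluator,choice\nalice,A\nalice,B\n", "evaluator,choice\nbob,A\n"]
rows = group_rows_by_evaluator(files)
spread = weight_spread({
    "alice": {"price": 1.0, "speed": 0.2},
    "bob": {"price": -0.5, "speed": 0.3},
})
```

Facets with the largest spread are the ones where the evaluators' learned models differ most.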