Situation
The prediction target is a continuous numerical value such as star rating or price movement (a regression problem). In order to use Jaxon, and to bucket outcomes, it is necessary to reframe the prediction as a classification problem.
Remedy
Model the outcomes as a discrete probability distribution. In the spirit of calculus, a continuous data stream (i.e. the possible numerical outcomes) can be placed into arbitrarily granular buckets. Consider reframing a 1-10 star review into 3 classes:
This allows a choice of granularity (how wide is each class “bucket”), as well as informed decisions about where to focus the model. For ranges that require granular discernment, narrower buckets can be used in combination with wider buckets for less interesting ranges; alternatively, buckets can be sized by area (there’s that calculus again!) in order to arrive at evenly-sized classes. Long tails and rare extreme events can be captured by bounding the first and last bucket on only one side (e.g. x > 12 in the above illustration).
Tradeoffs
Pros:
Cons:
Troubleshooting: what if the bucketing strategy is poor?
A sharp peak in the adjusted distribution:
A flat curve in the distribution:
Reframing classification to classification
Reframing can be applied to a priori classification problems as well. Consider navigating a class hierarchy and deciding at which granularity to frame a problem. For example, if classifying recipes into cuisines, is a lower-granularity class of Asian cuisine sufficient for the problem, or is it a better idea to instead use several higher-granularity classes such as Chinese, Japanese, Korean, and Thai? This process may continue (Schezuan, Dim Sum, Sushi, Yakitori, etc.).
For more on class hierarchies and tradeoffs, see the Cascade solution.