Skip to main content

Creating Models

📄️ Domain Optimization for Classification

Classification using simClassify+ has extra parameters and features for optimizing results based upon a domain attribute. The domain attribute can be any REAL column and could represent money, effort, size, and so on. Typically, this would be used in fraud detection with the domain attribute being the amount of money in a transaction. In this example, this directs the metric learner to optimize not just for fraud transactions caught, but fraud money caught.

📄️ Thresholding

Thresholding, a feature used in simClassify and simClassify+ predictors, acts as a limit to split resulting confidence values into a true or false category for binomial predictions. Thresholds only apply to confidence levels for the positive class of a prediction (i.e. the class with the lower number of predictions). If the confidence value for the positive class exceeds the threshold value, the result is considered true and the positive class is predicted. If the confidence value is less than the threshold, the result is false and the negative class is predicted.