simRecommend Data Type Specifications

Data Type Specifications tell the model what form the data in each column takes, so that it knows how to compare values properly. Unlike other models, simRecommend requires the class column to be set to "CLASS_ITEM_SET". This column must be duplicated in the data set, with the duplicate column's type set to "ITEM_SET". The item set tells the model each individual's historical preferences. More detail on the format of the item set will come in later sections.
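
The duplication requirement described above can be sketched in Python as follows. This is only an illustration: the column names (id, purchases, purchases_history), the sample rows, and the column_types mapping are hypothetical, and the way data types are actually declared for simRecommend may differ.

```python
import csv
import io


def duplicate_column(rows, source, copy_name):
    """Return new rows with the `source` column copied verbatim into `copy_name`."""
    return [{**row, copy_name: row[source]} for row in rows]


# Hypothetical sample data; the second column holds an item set per individual.
raw = "id,purchases\nu1,apple:2;pear:1\nu2,pear:3\n"
rows = list(csv.DictReader(io.StringIO(raw)))

# Copy the class column so that it can also be declared as a plain ITEM_SET.
rows = duplicate_column(rows, "purchases", "purchases_history")

# Assumed mapping of column name to data type, for illustration only.
column_types = {
    "id": "ID",
    "purchases": "CLASS_ITEM_SET",
    "purchases_history": "ITEM_SET",
}
```

The copy is made before any other processing so that both columns hold identical values in every row.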

ID: A mandatory field which uniquely identifies each object.
CLASS_ITEM_SET: A mandatory field which specifies the field to be classified. Item set format. A series of values with weights. (Formatted as item1:weight1;item2:weight2;item3:weight3)
ITEM_SET: Duplicate of the column of type CLASS_ITEM_SET.
REAL: Numerical values.
NOMINAL: Values that do not bear a quantitative relationship with each other (i.e., strings and numbers which represent non-numerical information).
MULTI_PLAIN: Multiple NOMINAL values separated by spaces. Non-language specific.
MULTI_ENGLISH: Multiple NOMINAL values separated by spaces. The text is English language.
MULTI_SPANISH: Multiple NOMINAL values separated by spaces. The text is Spanish language.
MULTI_JAPANESE: Multiple NOMINAL values separated by spaces. The text is Japanese language.
IGNORE: The column shall be ignored by the program.
NULL_INDICATOR: This column type identifies the presence or absence of non-numerical data, assigning different weights to any cell with data versus those without data.
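
As a minimal sketch of the item set format used by CLASS_ITEM_SET and ITEM_SET (item1:weight1;item2:weight2;item3:weight3), the Python helpers below read and write such a cell. The function names parse_item_set and format_item_set are hypothetical, and the sketch assumes that weights are numeric and that surrounding whitespace can be ignored; the source does not state either rule.

```python
def parse_item_set(cell):
    """Parse 'item1:weight1;item2:weight2' into a dict of item -> float weight."""
    items = {}
    for pair in cell.split(";"):
        pair = pair.strip()
        if not pair:
            continue  # tolerate a trailing ';' or an empty cell
        # Split on the last ':' so the weight is always the final field.
        name, sep, weight = pair.rpartition(":")
        if not sep or not name:
            raise ValueError(f"expected item:weight, got {pair!r}")
        items[name.strip()] = float(weight)  # assumption: weights are numeric
    return items


def format_item_set(items):
    """Serialize a dict of item -> weight back into the item set format."""
    return ";".join(f"{name}:{weight:g}" for name, weight in items.items())
```

For example, parse_item_set("apple:2;pear:0.5") yields a dictionary mapping apple to 2.0 and pear to 0.5, and format_item_set turns that dictionary back into "apple:2;pear:0.5".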