If you are doing a linear model you want to do something to limit the effect of a horse being beat double digit lengths, such as limit the beaten length parameter to a predetermined value or you might use 10 minus beaten lengths and bottom out at 0.
On the theory that a horse that runs last in a 6 horse field hasn't really done anything better than a horse that runs last in a 12 horse field you might limit the finish position (or use 6 minus fin position as above).
The speed and class numbers should probably be in relation to what would be expected at that level.
I would be leery of the information that is only available for winners. That is probably not going to fit in a nice linear manner with the others.
|