|
|
12-28-2010, 12:34 PM
|
#1
|
Registered User
Join Date: Feb 2008
Posts: 1,591
|
Using Chi Square Statistic To Produce A Handicapping Method
If there is sufficient interest, I will along with any PA member(s) will go through the process of generating a handicapping method based on the Chi Square statistic.
I (We) will do it in a such a manner which will be non-technical, easy to understand, using only basic arithmetic, and will allow for anyone to participate.
The end result I believe will be a profitable system.
We will need someone with a large database to provide data for the factors that we will test, and include in our model.
If there is interest I will proceed.
Let me know.
Mike
|
|
|
12-28-2010, 12:40 PM
|
#2
|
Join Date: Mar 2001
Location: Reno, NV
Posts: 16,915
|
I can tell you that I am certainly interested to see this.
|
|
|
12-28-2010, 12:45 PM
|
#3
|
Registered User
Join Date: Dec 2010
Posts: 9
|
Please allow me to partake and supply any information that i may help obtain.
|
|
|
12-28-2010, 12:48 PM
|
#4
|
Registered User
Join Date: Aug 2005
Posts: 153
|
Absolutely count me in.
|
|
|
12-28-2010, 12:48 PM
|
#5
|
The Voice of Reason!
Join Date: Mar 2001
Location: Canandaigua, New york
Posts: 112,887
|
I will help anyway I can.
__________________
Who does the Racing Form Detective like in this one?
|
|
|
12-28-2010, 01:33 PM
|
#6
|
Registered User
Join Date: Feb 2008
Posts: 1,591
|
Let's start with some definitions
The Chi Square Statistic
Types of Data:
There are basically two types of random variables and they yield two types of data: numerical and categorical.
Categorical variables yield data in particular categories.
Responses to such questions as "What is your hair color?" or Do you own a
home?" are categorical because they yield data such as "biology" or "yes/no."
Numerical variables yield data in numerical form.
Responses to such questions as "How tall are you?" or "What is your
age?" are numerical.
Numerical data can be either discrete or continuous.
The table below may help you see the differences between these two variables.
Data Type Question Type Possible Responses
Categorical What is your sex? male or female
Numerical Discrete- How many jackets do you own? four or five
Numerical Continuous - How old are you? 42 years
Discrete data arise from a counting process, while continuous data
arise from a measuring process.
The Chi Square statistic compares the counts of categorical responses between two (or more) independent groups.
Chi square tests can only be used on actual numbers and not on percentages, proportions, means, etc.
Since I want all interested to understand the process, I will answer questions. No question is too simple or foolish... be free to ask and open. However, I will not answer any theoretical questions nor will I attempt to be rigorous. I will make up terms in the process, which may not be mathematically correct, but I will do so when necessary to clarify a point.
Mike
Next, I will use the "standard" dice example to explain the process.
|
|
|
12-28-2010, 02:41 PM
|
#7
|
Senior Member
Join Date: Mar 2001
Location: Oklahoma City
Posts: 1,718
|
I expect that this will be an interesting thread/topic.
Mike,
Regarding discrete vs. continuous numerical data. Are you saying that uncertainty in discrete data arises from measurement uncertainty whereas uncertainty in continuous data is caused by measurement imprecision? Somebody answering the question "how are you?"with 42 as the answer really means "42.50 plus or minus 0.50."
__________________
London calling, yeah, I was there, too
An' you know what they said? Well, some of it was true!
(Strummer/Jones)
|
|
|
12-28-2010, 02:51 PM
|
#8
|
Racing Form Detective
Join Date: Jul 2007
Location: Lincoln, Ne but my heart is at Santa Anita
Posts: 16,316
|
I will be interested this too. I looked into this back in the 70s and gave it up because at the time the compiling of the data was not feasible. I did a regression instead. Good luck
__________________
Some day in the not too distant future, horse players will betting on computer generated races over the net. Race tracks will become casinos and shopping centers. And some crooner will be belting out "there used to be a race track here".
|
|
|
12-28-2010, 03:04 PM
|
#9
|
EXCEL with SUPERFECTAS
Join Date: Mar 2004
Posts: 10,206
|
This may sound whimsical, to those who have worked in, or, have studied statistical functions (?) (which I sadly have not), but, what is the syntax for a "Chi Square" test? And, what is the reasoning behind using this function(?)? What does one hope to achieve as a result of the test?
|
|
|
12-28-2010, 03:13 PM
|
#10
|
Registered User
Join Date: Feb 2008
Posts: 1,591
|
Quote:
Originally Posted by ceejay
I expect that this will be an interesting thread/topic.
Mike,
Regarding discrete vs. continuous numerical data. Are you saying that uncertainty in discrete data arises from measurement uncertainty whereas uncertainty in continuous data is caused by measurement imprecision? Somebody answering the question "how are you?"with 42 as the answer really means "42.50 plus or minus 0.50."
|
There is no need to go into measurement error(s) or uncertainty of measurements. The point is that we will use statistical data that can be counted or tallied.
For example:
How many horses won or lost their next race after finishing second in their previous race. We will use a Chi Square statistic to determine significance (or lack of significance) for that factor, and then compute a weight for that factor. We are jumping ahead, but I do want you to understand.
Mike
|
|
|
12-28-2010, 03:22 PM
|
#11
|
Registered User
Join Date: Dec 2001
Posts: 6,128
|
I've done this sort of thing many times, never came up with a profitable system though. Will be interested to see what you come up with...
|
|
|
12-28-2010, 03:23 PM
|
#12
|
Registered User
Join Date: Feb 2008
Posts: 1,591
|
Quote:
Originally Posted by raybo
This may sound whimsical, to those who have worked in, or, have studied statistical functions (?) (which I sadly have not), but, what is the syntax for a "Chi Square" test? And, what is the reasoning behind using this function(?)? What does one hope to achieve as a result of the test?
|
I believe the "reason behind using the function" and what we can achieve will be become clear after the dice example and after we test a factor. As I said I will not go into the theoretical aspects of the distribution. The symbol is unimportant.
The information we will use will be in table form.
Mike
|
|
|
12-28-2010, 03:30 PM
|
#13
|
Registered User
Join Date: Feb 2008
Posts: 1,591
|
Quote:
Originally Posted by GameTheory
I've done this sort of thing many times, never came up with a profitable system though. Will be interested to see what you come up with...
|
It will depend on the factors we select and test. And we will weight them. The same can be said about any approach. As you probably know we will not be using speed/pace ratings.
I believe we can make approach work during the testing phase by eliminating certain categories of races.
Mike
P.S. Join in we'll all have a hand in choosing the factors.
Last edited by TrifectaMike; 12-28-2010 at 03:32 PM.
|
|
|
12-28-2010, 03:42 PM
|
#14
|
Racing Form Detective
Join Date: Jul 2007
Location: Lincoln, Ne but my heart is at Santa Anita
Posts: 16,316
|
The one thing I will say about this project. Chi-Square is known for producing some very strange relationships. Be very careful that you don't get a bunch of Gobbly Goop.
__________________
Some day in the not too distant future, horse players will betting on computer generated races over the net. Race tracks will become casinos and shopping centers. And some crooner will be belting out "there used to be a race track here".
|
|
|
12-28-2010, 03:48 PM
|
#15
|
Registered User
Join Date: Feb 2008
Posts: 1,591
|
Quote:
Originally Posted by Robert Goren
The one thing I will say about this project. Chi-Square is known for producing some very strange relationships. Be very careful that you don't get a bunch of Gobbly Goop.
|
Maybe I don't understand your post, but Chi-Square doesn't produce any relationships.
Mike
|
|
|
|
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
|
|