Logarithmic scoring rule is proper

From Calculus
Revision as of 16:52, 10 September 2017 by Vipul (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Statement

The logarithmic scoring rule is a proper scoring rule. Explicitly:

Consider a random variable X that can take n distinct values 1,2,,n. Suppose we estimate probabilities p1,p2,,pn for these values respectively (with pi[0,1], i=1npi=1). The logarithmic scoring rule works as follows: for every instance of the random variable X, we assign score equal to the negative of the logarithm of the corresponding probability pi. Explicitly, if the instances are X1,X2,,Xm, the total score is:

j=1mln(pXj)

The claim is that, if the actual probabilities are q1,q2,,qn, then the assignment that minimizes the expected value of the score is p1=q1,p2=q2,,pn=qn.

Related facts

Proof

Reduction to one random instance

Under the assumption that the instances are independent of each other, it suffices to show the result for one instance.

Reduction to the case that all probabilities are strictly between zero and one

We now show that if any particular qi=0, the corresponding pi must equal 0, and if any qi=1, the corresponding pi must equal 1.

Fill this in later

Proof for one instance and where all the actual probabilities are nonzero and less than one

The expected value for one instance is:

i=1nqiln(pi)

In words, we weight each score by the probability that that score is attained.

We are constrained to lie on the codimension one hyperplane given by i=1npi=1. We can therefore use the idea of Lagrange multipliers to find the optima. The gradient vector of the expected value function is the vector with coordinates:

(q1p1,q2p2,,qnpn)

The normal vector to the hyperplane is given as the gradient vector of the function i=1npi, and is the vector:

(1,1,,1)

By the theory of Lagrange multipliers, we have that at any local extreme value, there exists a value λ such that:

qipi=λ(1)

for all i. In other words:

qi=(λ)pi

for all i. Adding up, we get:

i=1nqi=(λ)i=1npi

We have that i=1nqi=1 as well (these are the actual probabilities, so they add up to 1), so we get:

1=(λ)(1)

so λ=1. Plugging back, we get that the only point that could potentially be a point of local extremum satisfies qi=pi for all i.

We can now verify that this is indeed a point of local minimum. Fill this in later

We can also verify that the absolute minimum does not occur at the boundary: if qi0 but we set pi=0, then our expected score is , because there's a nonzero probability of paying an infinite cost, namely, in the case that the random variable takes the value i.