Example: Decide between 4 modes of transportation to work:
Ingredients of Decision Problem: No data case.
In the example we might use the following table for
:
| C | B | T | H | |
| R | 3 | 8 | 5 | 25 |
| S | 5 | 0 | 2 | 25 |
Notice that if it rains I will be glad if I drove. If it is sunny I will be glad if I rode my bike. In any case staying at home is expensive.
In general we study this problem by comparing various functions of
. In this
problem a function of
has only two values, one for rain and one for sun and
we can plot any such function as a point in the plane. We do so to indicate the geometry
of the problem before stating the general theory.
Statistical problems have another ingredient, the data. We observe
a random variable taking values in say
.
We may make our decision
depend on
. A
decision rule is a function
from
to
.
We will want
to be small for all
. Since
is random we quantify this by averaging over
and compare procedures
in terms of the risk function
To compare two procedures we must compare two functions of
and
pick ``the smaller one''. But typically the two functions will cross each
other and there won't be a unique `smaller one'.
Example: In estimation theory to estimate a real parameter
we used
,
Example: Transport problem has no data
so the only possible (non-randomized) decisions are the
four possible actions
. For
and
the worst case is rain.
For the other two actions Rain and Sun are equivalent. We have the
following table:
| C | B | T | H | |
| R | 3 | 8 | 5 | 25 |
| S | 5 | 0 | 2 | 25 |
| Maximum | 5 | 8 | 5 | 25 |
Smallest maximum: take car, or transit.
Minimax action: take car or public transit.
Now imagine: toss coin with probability
of getting Heads, take my car if Heads, otherwise take transit.
Long run average daily loss would be
when it rains and
when it is Sunny. Call this procedure
; add it to
graph for each value of
. Varying
from 0 to 1 gives a straight line running from
to
. The two losses are equal when
. For smaller
worst case risk is for sun; for larger
worst case risk is for rain.
Added to graph: loss functions for each
,
(straight line) and set of
pairs for which
-- worst case risk for
when
.
In general we might consider using a 4 sided coin where we
took action
with probability
,
with
probability
and so on. The loss function of such
a procedure is a convex combination of the losses of the four basic
procedures making the set of risks achievable with the aid of randomization
look like the following:
Graph shows many points in the picture correspond to bad decision procedures. Rain or not taking my car to work has a lower loss than staying home; the decision to stay home is inadmissible.
Definition: A decision rule
is inadmissible if there is
a rule
such that
Admissible procedures have risks on lower left of graphs, i.e., lines connecting B to T and T to C are the admissible procedures.
There is a connection between Bayes procedures and admissible procedures. A prior distribution
in our example problem is specified by two probabilities,
and
which
add up to 1. If
is the risk function for some procedure then the Bayes
risk is
Proof: If
is
Bayes for
but not admissible there is a
such that
Notice: theorem actually
requires the extra hypotheses: positive density, and
risk functions of
and
continuous.
![]() |