While the expected-utility model includes a straightforward decision rule, it does not include a method for calculating pivot probabilities. Indeed this calculation appears to be anything but straightforward. Several approaches to this calculation from the literature are presented and analyzed here. These approaches all become more computationally complex as the number of election alternatives increases. We introduce a novel approach that does not increase in complexity with the number alternatives.
Before discussing methods for calculating pivot probabilities, it is
useful to examine the definition of pivot probability in more detail.
As already mentioned, a voter's pivot probability for any two
candidates is the probability that he or she will be decisive in
making or breaking an -place tie between those two
candidates. If we assume that for each possible election outcome we
can determine the probability associated with the occurrence of that
outcome, the pivot probability is the sum of the probabilities
associated with each of the possible election outcomes that involve an
-place tie between those two candidates.
Figure: A three-dimensional view of the barycentric coordinate system
The election outcome space for a three-candidate election can be
represented visually as a barycentric coordinate system -- an
equilateral triangle on the three-dimensional plane , where
,
, and
represent the
proportions of votes for candidates 1, 2, and 3 respectively.
Each of the triangle corners represents one candidate. The closer an
outcome point is to a particular corner, the more votes the
alternative represented by that corner received. Thus, an outcome
point in a corner of the triangle represents a ``shut-out'' in which
one alternative received all the votes, while an outcome point in the
geometrical center of the triangle represents a three-way tie. As
shown in Figure
, the line segments that bisect the
triangle represent two-way winning ties. This model can be extended
arbitrarily. For an election with four alternatives the barycentric
triangle becomes a solid tetrahedron. An election with N
alternatives can be represented by a simplex polyhedron of N - 1
dimensions.
Because finding the candidate that maximizes the expected-gain equations (Equation 1) depends on the ratio of the pivot probabilities rather than the actual probabilities themselves, we do not need to find a method for calculating the absolute probability of reaching each point in the outcome space; rather, it is sufficient to find a method of calculating relative probabilities. Thus the methods we examine here do not necessarily result in the probabilities over the entire outcome space summing to 1.
When comparing probability calculation methods and analyzing the
suitable of each for use in the expected-utility model of voting, it
is helpful to be able to visualize the behavior of the
expected-utility model when different probability calculation methods
are used. Hoffman [55] includes contour plots in his
paper to help illustrate this behavior. To allow visual comparison of
our new method with Hoffman's we wrote a computer program to generate
contour plots such as the one shown in Figure . The
curves on these plots represent predicted outcome-proportions for a
voter's first choice,
, in a three-candidate election.
Predicted outcome-proportions for a voter's second choice,
,
are shown along the x-axis.
, a voter's utility rating for
is shown along the y-axis. Given a predicted outcome at
point
, a voter should vote for for
if and only if the point
lies above the curve for
outcome
; otherwise the voter should vote for
.
Consider, for example a predicted outcome of P = (.15, .4, .45).
Voters who rate
should vote for
; voters
who rate
should vote for
. The plots
shown in this chapter all assume normalized ratings such that
.
We now consider several methods for calculating pivot probabilities.
One approach to calculating pivot probabilities involves making a prediction about the likely outcome of the election, plotting that outcome as a point on a barycentric coordinate system, and computing the relative distances between that outcome point and the outcome lines for each of the two-way ties. Black [14] used this method in a model of a three-candidate plurality election in which each voter was assumed to be able to make a reasonable prediction about the election outcome based on the results of previous elections, opinion polls, or other data. In batched DSV we can determine the voters' sincere strategies from their utility information, and aggregate these strategies to determine a first-round predicted outcome. Subsequent rounds can use the results of the previous round as a predicted outcome. In ballot-by-ballot DSV, the interim tally of ballots counted so far can serve as the predicted outcome point.
Figure: Distances used to calculate Black's probability estimates
Black assumes that the pivot probability for any pair of candidates is
proportional to 1-d, where d is the Euclidean distance between the
line segment representing an
-place tie
between those candidates and the predicted outcome point A, as shown
in Figure
.
For
this distance
is equivalent to the
length of the perpendicular line segment
from the
predicted outcome point to the first-place tie line
.
The distance calculation
for can be expressed as:
where and
.
The other pivot probabilities can be calculated in a similar manner.
We developed and experimented with a variation on Black's method involves calculating the distance d between the predicted outcome and a given point on a two way tie line. Using this technique, the pivot probability for any two candidates can be found by summing the differences 1-d for every point on the two-way tie line for those candidates. This method makes more intuitive sense than Black's method if one considers what the pivot probabilities really represent.
Figure: Contour plot for 3-candidate plurality election using our
simple distance pivot probabilities
One problem with both Black's method and our variation is that
they assume that the probability of a tie gets linearly larger the
farther away a predicted outcome point is from a two-way tie line.
This uniform probability distribution, while simple, does not seem
consistent with empirical evidence. For example, as
Figure illustrates, these methods rarely select
the strategy of voting for a second-choice candidate unless the
second-choice candidate has a utility rating at least 80% as high as
the first-choice candidate. In addition, these methods do not take
into consideration the certainty of the predicted outcome point.
Hoffman [55] offers another approach that allows for the
modeling of voting schemes other than simple plurality, and assumes a
Gaussian distribution of outcomes rather than a uniform distribution.
Although Hoffman introduces his approach by describing a
three-candidate election, the approach may be applied to elections
involving any number of candidates and is most easily understood by
considering a two-candidate election first. A two-candidate election
with B voters can be modeled as a line segment. This line segment
can be divided into B smaller segments, each one vote wide, as shown
in Figure . Each of the smaller line segments represents
a possible election outcome. If there are an odd number of voters,
there will be a line segment T at the center of the line that
represents a tie. If we randomly sample the voters, we can predict an
election outcome and represent it somewhere along our line at segment
P.
Figure: Gaussian geometry for a 2-candidate election
Given P, we are interested in determining the probability that the
election outcome will be a tie. As Figure illustrates,
we can construct a normal distribution curve around the center point of
P, given by the formula
where D is the distance from each point to (the center of) P, and
is a measure of the uncertainty of the prediction. Hoffman does
not specify how
should be calculated, but the ``standard error
of the estimate'' seems to be an appropriate measure of uncertainty.
As we shall see,
is an important factor and can affect the
outcome of a DSV election dramatically. When using the standard error
to calculate
in a ballot-by-ballot DSV system, we multiply it by
the finite-population correction factor yielding:
where and
represent the proportion of votes for each
candidate thus far, B represents the total voter population, b
represents the number of votes counted thus far, and
represents the area under the standard normal curve for confidence
level
. Thus as the election progresses, b will
increase and
will decrease. The fact that uncertainty
decreases as the election progresses is consistent with the fact that
the amount of information increases.
The above method of calculating uncertainty is useful for ballot-by-ballot DSV, in which the predicted outcome involves a sample that increases in size until it equals the entire population; however, in batch DSV, the predicted outcome point is based on polling the entire population -- thus our prediction derives no uncertainty from sampling error. In batch DSV the uncertainty of the prediction is based on not knowing how many voters will find that their optimal strategies are not their sincere strategies.
The simplest way to account for uncertainty in batch DSV would be to
select an arbitrary value for . For example,
has the property that as long as a
is predicted to receive at
least 5% of the vote, voters will not vote for
unless
is at least 10% as large as
(on a 10 point scale, a voter
with
must have
in order to vote for
). We used this approach in our simulations, described in
Chapter
. We also tried conducting batch DSV
simulations with the number of rounds equal to the number of voters
and adjusting
according to Equation
, letting b
equal the number of rounds run so far. However this approach
generally resulted in the same election outcome as was produced using
a constant
equal to the value produced by Equation
when b = B.
Another approach to calculating would be to develop a formula
based on some aggregation of the utilities submitted by the
voters -- perhaps taking into account the percentage of voters for
whom it might never be optimal to voter for
. Such a
calculation is also likely to be somewhat arbitrary, however. Still
another approach would be to assign random values to
within a
reasonable range (say .001 - .5). Finally, voters could select
their own uncertainty values based on their attitudes toward risk,
formulated perhaps as a function of the ``maturity'' of the election:
number of rounds, position in the voting sequence, etc.
Regardless of what method is used to calculate , the
probability that the election outcome will involve a first-place tie
between candidates 1 and 2 can be computed as follows:
where U = 1/B (the length of a one-vote wide line segment). That is, the probability of a tie is proportional to the area under the segment of the normal curve that lies above the tie segment T.
Figure: Hoffman's geometry for a 3-candidate election
Hoffman illustrates his approach for a three-candidate election. As
shown in Figure , Hoffman specifies the region
as the portion of the outcome triangle in which candidate i
loses to candidate j by one vote (or less in systems that allow
fractional votes). He defines the pivot probability as ``the
probability that the election result
lies in
the region
.'' Thus
can be expressed as:
where D is the distance from the predicted outcome point to a point in
the region . Because
is only one vote wide,
can be approximated by using Simpson's rule or another numerical
integration technique to integrate over the outcome points for which
. Irrespective of the number of candidates
N, first-place ties can occur only when
. Values of
and
greater than 1/2 would be
impossible because
; and values of
and
less than 1/N would indicate that at least one other
candidate has a higher proportion of the vote. Values of
and
between 1/N and 1/3 may or may not indicate first-place
ties, depending on the outcome proportions for the other candidates.
However, when
,
and
must be in a first-place tie. Thus, for a three-candidate election,
can be approximated by:
where x represents the proportion of votes for candidates i and
j at a given outcome point. Note that when N > 2, the standard
error of the estimate becomes quite difficult to calculate. We
use the approximation
to calculate the standard error of the estimate for N > 2.
A four-candidate election is modeled as a solid tetrahedron. However,
finding the outcome region in which first-place ties occur is not as
easy in four dimensions as it is in two and three. As discussed
above, first-place ties may occur in regions where , depending on the proportion of votes received by each
of the other candidates. With four candidates, the outcome region in
which candidates i and j tie for first place is a kite-shaped
section of the plane. The probability of an outcome occurring in this
region can be approximated by:
where x represents the proportion of votes for candidates i and j, and y represents the proportion of votes for candidate k at a given outcome point.
Hoffman's approach is appealing because it does not assume a uniform
probability distribution and because it takes into account the
uncertainty associated with the predicted outcome. Indeed, this
uncertainty proves to be a significant factor in calculating a voter's
optimal strategy. Figure shows the contours we
have calculated for
values of .02 and .05. The figure
illustrates that the more certain the prediction, the more willing
voters should be to vote for
rather than
. For
example, given a predicted outcome P of (.15, .4, .45) the optimal
strategy when
is to vote for
if
.
However under the same circumstances but with
, it is
optimal to vote for
if
.
Figure: Contour plot for 3-candidate plurality election, Hoffman's
method pivot probabilities
A drawback to Hoffman's approach is that the pivot probability calculations increase in complexity with the number of candidates. For elections involving many candidates, these calculations can be quite difficult and time consuming. The reason for the increasing complexity is that Hoffman's method requires the projected outcome for all candidates to be considered in every pivot probability calculation. However, we have discovered that the only projected outcomes crucial to the pivot probability calculation are the outcomes for the candidate predicted to win and for the two candidates whose pivot probability is being computed. Furthermore, a simplifying assumption allows us to calculate pivot probabilities for N candidate elections using the predicted outcomes for just two candidates.
Figure: Predicted outcomes for candidates i and j
To calculate using our 2-D Gaussian method, we being by
plotting the predicted outcomes for i and j along an outcome line
and constructing normal distribution curves around each outcome point,
as shown in Figure
. In ballot-by-ballot DSV it is
important that the
values used to construct the normal curves
decrease as the election progresses. We can simplify our
calculation while still ensuring that
decreases by
substituting an average value for the numerator in the standard error
approximation presented in Equation
:
Thus, as a ballot-by-ballot DSV election progresses, the normal curves will become increasingly narrower. However all the curves calculated for a given ballot will have the same width.
The shaded area in Figure , where the curves
intersect, represents the probability that the final outcome will
involve a tie between i and j. But not all ties are winning ties.
If our election involved only three candidates we could declare the
pivot probability to be the section of the intersection area
representing outcomes where i and j each receive between 1/3 and
1/2 of the votes. However, elections involving four or more
candidates cannot be represented this simply. To solve this problem we plot
the outcome point for the predicted winner on our outcome line and
construct a normal curve around it. As illustrated in
Figure
, the pivot probability is the intersection of
all three curves.
Figure: Geometry for calculating pivot probability using 2-D
Gaussian method
Note, that when either i or j is the predicted winner, only two
curves need be plotted. Also note, that because the curves all use
the same value of , the intersection of three curves is equal
to the intersection of the left-most and right-most curve. Thus, to
compute
we need only determine the area of the intersection
of the curve for the predicted winner and either i or j, whichever
is predicted to receive fewer votes.
To compute the area of the intersection we must first determine the point at which the two normal curves intersect. The point of intersection I can be calculated:
where is the predicted outcome for the predicted winner and
is the predicted outcome for either i or j, whichever
is predicted to receive fewer votes. The pivot probability
is
then calculated:
where . Because there is no region in which ties
are guaranteed to occur, we assume the curve is infinite in both
directions. However, only half the area need be computed, as the
curve is symmetrical about the intersection point.
The above integral can be approximated using Simpson's rule or another
numerical integration technique. However, as decreases,
Simpson's rule requires increasingly more steps to converge. A more
efficient approach is to use a table of areas under the standard
normal curve or a computer function that returns the values in such a
table. In this case a change of variable must be performed to remove
from the exponent. The pivot probability is then calculated:
In our simulations we use the erf function from the C math library.
However, the erf function is not intended for use with the large upper
bound values that occur when gets very small. When our
computations using the erf function return 0, we approximate the area
as
where .
Figure: Contour plot for 3-candidate plurality election, 2-D Gaussian
method pivot probabilities
Our 2-D Gaussian method results in contour plots (shown in
Figure quite similar to those produced using
Hoffman's method. The most obvious difference between the results of
these two methods can be seen in the effects of varying
. An
value approximately 0.4 times smaller than the one used in
Hoffman's method must be used in the 2-D Gaussian method to produce
similar results.
Another difference can be observed by comparing the upper right-hand
corners of Figures and
. When a
voter's second-choice is predicted to win by a large margin the 2-D
Gaussian method produces pivot probabilities that will result in a
vote for the voter's first choice, regardless of
. Under the
same circumstances Hoffman's method produces pivot probabilities that
may result in a vote for the voter's second choice, depending on
. In this situation, the voter's first choice has practically
no chance of winning and the voter's second choice has practically no
chance of loosing. Therefore, the voter cannot influence the outcome
with his or her vote, and thus, we believe, should vote sincerely.
Hoffman's method and the 2-D Gaussian method indicate different
strategies in this situation due to the fact that for a
three-candidate race Hoffman's method indicates a higher probability
of a winning tie between third-place and first-place candidates than
between third-place and second-place candidates, while our method
indicates that the probability of a winning tie is the same for these
two pairs of candidates. Because the number of voters who must change
their votes to achieve a winning tie in each of these two situations
is identical, we believe our method is a more accurate model of pivot
probabilities involving third-place candidates.