next up previous
Next: Tie-Breaking Rules Up: Rational Strategy Formulation Previous: Expected-Utility Model

Pivot Probability Calculations

 

While the expected-utility model includes a straightforward decision rule, it does not include a method for calculating pivot probabilities. Indeed this calculation appears to be anything but straightforward. Several approaches to this calculation from the literature are presented and analyzed here. These approaches all become more computationally complex as the number of election alternatives increases. We introduce a novel approach that does not increase in complexity with the number alternatives.

Before discussing methods for calculating pivot probabilities, it is useful to examine the definition of pivot probability in more detail. As already mentioned, a voter's pivot probability for any two candidates is the probability that he or she will be decisive in making or breaking an tex2html_wrap_inline239 -place tie between those two candidates. If we assume that for each possible election outcome we can determine the probability associated with the occurrence of that outcome, the pivot probability is the sum of the probabilities associated with each of the possible election outcomes that involve an tex2html_wrap_inline239 -place tie between those two candidates.

   figure144
Figure: A three-dimensional view of the barycentric coordinate system

The election outcome space for a three-candidate election can be represented visually as a barycentric coordinate system -- an equilateral triangle on the three-dimensional plane tex2html_wrap_inline521 , where tex2html_wrap_inline523 , tex2html_wrap_inline525 , and tex2html_wrap_inline527 represent the proportions of votes for candidates 1, 2, and 3 respectively. Each of the triangle corners represents one candidate. The closer an outcome point is to a particular corner, the more votes the alternative represented by that corner received. Thus, an outcome point in a corner of the triangle represents a ``shut-out'' in which one alternative received all the votes, while an outcome point in the geometrical center of the triangle represents a three-way tie. As shown in Figure gif, the line segments that bisect the triangle represent two-way winning ties. This model can be extended arbitrarily. For an election with four alternatives the barycentric triangle becomes a solid tetrahedron. An election with N alternatives can be represented by a simplex polyhedron of N - 1 dimensions.

Because finding the candidate that maximizes the expected-gain equations (Equation 1) depends on the ratio of the pivot probabilities rather than the actual probabilities themselves, we do not need to find a method for calculating the absolute probability of reaching each point in the outcome space; rather, it is sufficient to find a method of calculating relative probabilities. Thus the methods we examine here do not necessarily result in the probabilities over the entire outcome space summing to 1.

   figure158
Figure: Example contour plot

When comparing probability calculation methods and analyzing the suitable of each for use in the expected-utility model of voting, it is helpful to be able to visualize the behavior of the expected-utility model when different probability calculation methods are used. Hoffman [55] includes contour plots in his paper to help illustrate this behavior. To allow visual comparison of our new method with Hoffman's we wrote a computer program to generate contour plots such as the one shown in Figure gif. The curves on these plots represent predicted outcome-proportions for a voter's first choice, tex2html_wrap_inline481 , in a three-candidate election. Predicted outcome-proportions for a voter's second choice, tex2html_wrap_inline483 , are shown along the x-axis. tex2html_wrap_inline545 , a voter's utility rating for tex2html_wrap_inline483 is shown along the y-axis. Given a predicted outcome at point tex2html_wrap_inline551 , a voter should vote for for tex2html_wrap_inline483 if and only if the point tex2html_wrap_inline555 lies above the curve for outcome tex2html_wrap_inline523 ; otherwise the voter should vote for tex2html_wrap_inline481 . Consider, for example a predicted outcome of P = (.15, .4, .45). Voters who rate tex2html_wrap_inline563 should vote for tex2html_wrap_inline483 ; voters who rate tex2html_wrap_inline567 should vote for tex2html_wrap_inline481 . The plots shown in this chapter all assume normalized ratings such that tex2html_wrap_inline571 .

We now consider several methods for calculating pivot probabilities.

Simple Distance Methods

One approach to calculating pivot probabilities involves making a prediction about the likely outcome of the election, plotting that outcome as a point on a barycentric coordinate system, and computing the relative distances between that outcome point and the outcome lines for each of the two-way ties. Black [14] used this method in a model of a three-candidate plurality election in which each voter was assumed to be able to make a reasonable prediction about the election outcome based on the results of previous elections, opinion polls, or other data. In batched DSV we can determine the voters' sincere strategies from their utility information, and aggregate these strategies to determine a first-round predicted outcome. Subsequent rounds can use the results of the previous round as a predicted outcome. In ballot-by-ballot DSV, the interim tally of ballots counted so far can serve as the predicted outcome point.

   figure188
Figure: Distances used to calculate Black's probability estimates

Black assumes that the pivot probability for any pair of candidates is proportional to 1-d, where d is the Euclidean distance between the line segment tex2html_wrap_inline583 representing an tex2html_wrap_inline239 -place tie between those candidates and the predicted outcome point A, as shown in Figure gif.gif For tex2html_wrap_inline601 this distance tex2html_wrap_inline609 is equivalent to the length of the perpendicular line segment tex2html_wrap_inline611 from the predicted outcome point to the first-place tie line tex2html_wrap_inline583 .

The distance calculation for tex2html_wrap_inline609 can be expressed as:

displaymath577

where tex2html_wrap_inline617 and tex2html_wrap_inline619 . The other pivot probabilities can be calculated in a similar manner.

We developed and experimented with a variation on Black's method involves calculating the distance d between the predicted outcome and a given point on a two way tie line. Using this technique, the pivot probability for any two candidates can be found by summing the differences 1-d for every point on the two-way tie line for those candidates. This method makes more intuitive sense than Black's method if one considers what the pivot probabilities really represent.

   figure217
Figure: Contour plot for 3-candidate plurality election using our simple distance pivot probabilities

One problem with both Black's method and our variation is that they assume that the probability of a tie gets linearly larger the farther away a predicted outcome point is from a two-way tie line. This uniform probability distribution, while simple, does not seem consistent with empirical evidence. For example, as Figure gif illustrates, these methods rarely select the strategy of voting for a second-choice candidate unless the second-choice candidate has a utility rating at least 80% as high as the first-choice candidate. In addition, these methods do not take into consideration the certainty of the predicted outcome point.

Hoffman's Method

Hoffman [55] offers another approach that allows for the modeling of voting schemes other than simple plurality, and assumes a Gaussian distribution of outcomes rather than a uniform distribution. Although Hoffman introduces his approach by describing a three-candidate election, the approach may be applied to elections involving any number of candidates and is most easily understood by considering a two-candidate election first. A two-candidate election with B voters can be modeled as a line segment. This line segment can be divided into B smaller segments, each one vote wide, as shown in Figure gif. Each of the smaller line segments represents a possible election outcome. If there are an odd number of voters, there will be a line segment T at the center of the line that represents a tie. If we randomly sample the voters, we can predict an election outcome and represent it somewhere along our line at segment P.

   figure228
Figure: Gaussian geometry for a 2-candidate election

Given P, we are interested in determining the probability that the election outcome will be a tie. As Figure gif illustrates, we can construct a normal distribution curve around the center point of P, given by the formula

equation236

where D is the distance from each point to (the center of) P, and tex2html_wrap_inline653 is a measure of the uncertainty of the prediction. Hoffman does not specify how tex2html_wrap_inline653 should be calculated, but the ``standard error of the estimate'' seems to be an appropriate measure of uncertainty. As we shall see, tex2html_wrap_inline653 is an important factor and can affect the outcome of a DSV election dramatically. When using the standard error to calculate tex2html_wrap_inline653 in a ballot-by-ballot DSV system, we multiply it by the finite-population correction factor yielding:

  equation242

where tex2html_wrap_inline523 and tex2html_wrap_inline525 represent the proportion of votes for each candidate thus far, B represents the total voter population, b represents the number of votes counted thus far, and tex2html_wrap_inline669 represents the area under the standard normal curve for confidence level tex2html_wrap_inline671 . Thus as the election progresses, b will increase and tex2html_wrap_inline653 will decrease. The fact that uncertainty decreases as the election progresses is consistent with the fact that the amount of information increases.

The above method of calculating uncertainty is useful for ballot-by-ballot DSV, in which the predicted outcome involves a sample that increases in size until it equals the entire population; however, in batch DSV, the predicted outcome point is based on polling the entire population -- thus our prediction derives no uncertainty from sampling error. In batch DSV the uncertainty of the prediction is based on not knowing how many voters will find that their optimal strategies are not their sincere strategies.

The simplest way to account for uncertainty in batch DSV would be to select an arbitrary value for tex2html_wrap_inline653 . For example, tex2html_wrap_inline679 has the property that as long as a tex2html_wrap_inline481 is predicted to receive at least 5% of the vote, voters will not vote for tex2html_wrap_inline483 unless tex2html_wrap_inline545 is at least 10% as large as tex2html_wrap_inline687 (on a 10 point scale, a voter with tex2html_wrap_inline689 must have tex2html_wrap_inline691 in order to vote for tex2html_wrap_inline483 ). We used this approach in our simulations, described in Chapter gif. We also tried conducting batch DSV simulations with the number of rounds equal to the number of voters and adjusting tex2html_wrap_inline653 according to Equation gif, letting b equal the number of rounds run so far. However this approach generally resulted in the same election outcome as was produced using a constant tex2html_wrap_inline653 equal to the value produced by Equation gif when b = B.

Another approach to calculating tex2html_wrap_inline653 would be to develop a formula based on some aggregation of the utilities submitted by the voters -- perhaps taking into account the percentage of voters for whom it might never be optimal to voter for tex2html_wrap_inline483 . Such a calculation is also likely to be somewhat arbitrary, however. Still another approach would be to assign random values to tex2html_wrap_inline653 within a reasonable range (say .001 - .5). Finally, voters could select their own uncertainty values based on their attitudes toward risk, formulated perhaps as a function of the ``maturity'' of the election: number of rounds, position in the voting sequence, etc.

Regardless of what method is used to calculate tex2html_wrap_inline653 , the probability that the election outcome will involve a first-place tie between candidates 1 and 2 can be computed as follows:

displaymath629

where U = 1/B (the length of a one-vote wide line segment). That is, the probability of a tie is proportional to the area under the segment of the normal curve that lies above the tie segment T.

   figure281
Figure: Hoffman's geometry for a 3-candidate election

Hoffman illustrates his approach for a three-candidate election. As shown in Figure gif, Hoffman specifies the region tex2html_wrap_inline721 as the portion of the outcome triangle in which candidate i loses to candidate j by one vote (or less in systems that allow fractional votes). He defines the pivot probability as ``the probability that the election result tex2html_wrap_inline727 lies in the region tex2html_wrap_inline721 .'' Thus tex2html_wrap_inline465 can be expressed as:

displaymath630

where D is the distance from the predicted outcome point to a point in the region tex2html_wrap_inline721 . Because tex2html_wrap_inline721 is only one vote wide, tex2html_wrap_inline465 can be approximated by using Simpson's rule or another numerical integration technique to integrate over the outcome points for which tex2html_wrap_inline739 . Irrespective of the number of candidates N, first-place ties can occur only when tex2html_wrap_inline743 . Values of tex2html_wrap_inline745 and tex2html_wrap_inline747 greater than 1/2 would be impossible because tex2html_wrap_inline751 ; and values of tex2html_wrap_inline745 and tex2html_wrap_inline747 less than 1/N would indicate that at least one other candidate has a higher proportion of the vote. Values of tex2html_wrap_inline745 and tex2html_wrap_inline747 between 1/N and 1/3 may or may not indicate first-place ties, depending on the outcome proportions for the other candidates. However, when tex2html_wrap_inline767 , tex2html_wrap_inline745 and tex2html_wrap_inline747 must be in a first-place tie. Thus, for a three-candidate election, tex2html_wrap_inline465 can be approximated by:

displaymath631

where x represents the proportion of votes for candidates i and j at a given outcome point. Note that when N > 2, the standard error of the estimate becomes quite difficult to calculate. We use the approximationgif

  equation324

to calculate the standard error of the estimate for N > 2.

A four-candidate election is modeled as a solid tetrahedron. However, finding the outcome region in which first-place ties occur is not as easy in four dimensions as it is in two and three. As discussed above, first-place ties may occur in regions where tex2html_wrap_inline785 , depending on the proportion of votes received by each of the other candidates. With four candidates, the outcome region in which candidates i and j tie for first place is a kite-shaped section of the plane. The probability of an outcome occurring in this region can be approximated by:

displaymath632

where x represents the proportion of votes for candidates i and j, and y represents the proportion of votes for candidate k at a given outcome point.

Hoffman's approach is appealing because it does not assume a uniform probability distribution and because it takes into account the uncertainty associated with the predicted outcome. Indeed, this uncertainty proves to be a significant factor in calculating a voter's optimal strategy. Figure gif shows the contours we have calculated for tex2html_wrap_inline653 values of .02 and .05. The figure illustrates that the more certain the prediction, the more willing voters should be to vote for tex2html_wrap_inline483 rather than tex2html_wrap_inline481 . For example, given a predicted outcome P of (.15, .4, .45) the optimal strategy when tex2html_wrap_inline815 is to vote for tex2html_wrap_inline817 if tex2html_wrap_inline819 . However under the same circumstances but with tex2html_wrap_inline821 , it is optimal to vote for tex2html_wrap_inline483 if tex2html_wrap_inline825 .

   figure357
Figure: Contour plot for 3-candidate plurality election, Hoffman's method pivot probabilities

2-D Gaussian Method

A drawback to Hoffman's approach is that the pivot probability calculations increase in complexity with the number of candidates. For elections involving many candidates, these calculations can be quite difficult and time consuming. The reason for the increasing complexity is that Hoffman's method requires the projected outcome for all candidates to be considered in every pivot probability calculation. However, we have discovered that the only projected outcomes crucial to the pivot probability calculation are the outcomes for the candidate predicted to win and for the two candidates whose pivot probability is being computed. Furthermore, a simplifying assumption allows us to calculate pivot probabilities for N candidate elections using the predicted outcomes for just two candidates.

   figure365
Figure: Predicted outcomes for candidates i and j

To calculate tex2html_wrap_inline465 using our 2-D Gaussian method, we being by plotting the predicted outcomes for i and j along an outcome line and constructing normal distribution curves around each outcome point, as shown in Figure gif. In ballot-by-ballot DSV it is important that the tex2html_wrap_inline653 values used to construct the normal curves decrease as the election progresses. We can simplify our tex2html_wrap_inline653 calculation while still ensuring that tex2html_wrap_inline653 decreases by substituting an average value for the numerator in the standard error approximation presented in Equation gif:

displaymath833

Thus, as a ballot-by-ballot DSV election progresses, the normal curves will become increasingly narrower. However all the curves calculated for a given ballot will have the same width.

The shaded area in Figure gif, where the curves intersect, represents the probability that the final outcome will involve a tie between i and j. But not all ties are winning ties. If our election involved only three candidates we could declare the pivot probability to be the section of the intersection area representing outcomes where i and j each receive between 1/3 and 1/2 of the votes. However, elections involving four or more candidates cannot be represented this simply. To solve this problem we plot the outcome point for the predicted winner on our outcome line and construct a normal curve around it. As illustrated in Figure gif, the pivot probability is the intersection of all three curves.

   figure389
Figure: Geometry for calculating pivot probability using 2-D Gaussian method

Note, that when either i or j is the predicted winner, only two curves need be plotted. Also note, that because the curves all use the same value of tex2html_wrap_inline653 , the intersection of three curves is equal to the intersection of the left-most and right-most curve. Thus, to compute tex2html_wrap_inline465 we need only determine the area of the intersection of the curve for the predicted winner and either i or j, whichever is predicted to receive fewer votes.

To compute the area of the intersection we must first determine the point at which the two normal curves intersect. The point of intersection I can be calculated:

displaymath834

where tex2html_wrap_inline887 is the predicted outcome for the predicted winner and tex2html_wrap_inline889 is the predicted outcome for either i or j, whichever is predicted to receive fewer votes. The pivot probability tex2html_wrap_inline465 is then calculated:

displaymath835

where tex2html_wrap_inline897 . Because there is no region in which ties are guaranteed to occur, we assume the curve is infinite in both directions. However, only half the area need be computed, as the curve is symmetrical about the intersection point.

The above integral can be approximated using Simpson's rule or another numerical integration technique. However, as tex2html_wrap_inline653 decreases, Simpson's rule requires increasingly more steps to converge. A more efficient approach is to use a table of areas under the standard normal curve or a computer function that returns the values in such a table. In this case a change of variable must be performed to remove tex2html_wrap_inline653 from the exponent. The pivot probability is then calculated:

displaymath836

In our simulations we use the erf function from the C math library. However, the erf function is not intended for use with the large upper bound values that occur when tex2html_wrap_inline653 gets very small. When our computations using the erf function return 0, we approximate the area as

displaymath837

where tex2html_wrap_inline905 .

   figure440
Figure: Contour plot for 3-candidate plurality election, 2-D Gaussian method pivot probabilities

Our 2-D Gaussian method results in contour plots (shown in Figure gif quite similar to those produced using Hoffman's method. The most obvious difference between the results of these two methods can be seen in the effects of varying tex2html_wrap_inline653 . An tex2html_wrap_inline653 value approximately 0.4 times smaller than the one used in Hoffman's method must be used in the 2-D Gaussian method to produce similar results.

Another difference can be observed by comparing the upper right-hand corners of Figures gif and gif. When a voter's second-choice is predicted to win by a large margin the 2-D Gaussian method produces pivot probabilities that will result in a vote for the voter's first choice, regardless of tex2html_wrap_inline545 . Under the same circumstances Hoffman's method produces pivot probabilities that may result in a vote for the voter's second choice, depending on tex2html_wrap_inline545 . In this situation, the voter's first choice has practically no chance of winning and the voter's second choice has practically no chance of loosing. Therefore, the voter cannot influence the outcome with his or her vote, and thus, we believe, should vote sincerely. Hoffman's method and the 2-D Gaussian method indicate different strategies in this situation due to the fact that for a three-candidate race Hoffman's method indicates a higher probability of a winning tie between third-place and first-place candidates than between third-place and second-place candidates, while our method indicates that the probability of a winning tie is the same for these two pairs of candidates. Because the number of voters who must change their votes to achieve a winning tie in each of these two situations is identical, we believe our method is a more accurate model of pivot probabilities involving third-place candidates.


next up previous
Next: Tie-Breaking Rules Up: Rational Strategy Formulation Previous: Expected-Utility Model

lorrie@acm.org