What Influences the News Coverage that Candidates
Receive on Issues?
A look at Education Coverage in the 2004 Presidential
Election
Results
Introduction | Background | Expectations | Data | Results | Conclusions | Appendix
To analyze this data, I ran several regression models. As mentioned above, regression analysis seeks to explain variation in the dependent variable using one or more independent variables. Further, predicted values allow one to examine the way one specific independent variable affects the value of the dependent variable. My regression models seek to explain the nature of the variation of candidate coverage on education, based on different variables. Tables six, seven and eight present the results of my regression models. Tables nine, ten and eleven present predicted levels of coverage.
I initially ran separate models using each of the three primaries for which
we had data. However, the Iowa and New Hampshire primary variables did not show
a significant difference from the base model that included the simple passage
of time. Therefore, I included only the results from the model that included
the South Carolina primary variables, which did present some interesting comparisons.
Model One
My first model, which I call the base model, used the variables of whether a
candidate issued a press release, the number of real candidates in the race,
the days of the week, whether there was a debate, the day before, day of and
day after the State of the Union, and the time variable beginning with one.
I also added a condition that restricted this model to the time during which
the candidates were in the race. This presents a more realistic picture of how
candidate behavior influences coverage, since candidates essentially stop agitating
after they drop out of the race. Table six presents the results of my regression
model and table nine presents the predicted levels of coverage.
Seven of the variables I included were significant for one or more candidates.
I expected that candidates would not be able to affect their coverage on education
by issuing press releases, and this expectation held for five of the six candidates.
However, Dean was able to affect his coverage by issuing a press release. The
coefficient for the effect of his press release on coverage was 0.53 and was
significant at the p<.05 level. This means that as he issued more press releases,
he gained more coverage. For example, if Dean did not release a press release
on a particular day, he would receive on average 0.59 stories. However, if he
issued one press release, 1.12 stories published mentioned him and education.
I understand this variation due to Dean’s status for most of our time
period as front-runner, which caused him to have the greatest number of stories
published about him. Consequently, it was easier for him to affect coverage
than the other candidates, since many in the news media focused on him.
For Clark, Kerry and Lieberman the number of candidates in the race had a significant
effect on their coverage. For each man, as the number of candidates decreased,
their coverage increased. However, since Lieberman dropped out second, this
translated from an average of 0.08 stories published when all six candidates
were in the race to 0.38 after Gephardt dropped out. Clearly, this boost did
not prove effective enough to keep him in the race. Clark received a similar
boost to that of Lieberman. With all six candidates, 0.04 stories mentioned
him and education, while with five 0.35 did and with four 0.67 stories mentioned
Clark and education. While Clark and Lieberman received a similar change based
on the number of candidates in the race, Clark ended with more stories about
him since he was in the race longer. For Kerry the results are quite impressive.
Kerry jumped from a low of 0.37 stories published per day with all six candidates
(note that this is much higher than either Clark or Lieberman) to a high of
1.37 articles published per day. With each successive candidate leaving the
race, Kerry received a 0.2 increase in the stories published about him per day.
This regression coefficient was significant at the p<.01 level. These results
matched my hypothesis that coverage would increase as the number of candidates
decreased.
Concerning the days of the week, Monday and Thursday were significant. Dean
and Kerry both received a sizeable increase in coverage on Mondays. Both received
approximately 0.6 more stories on a Monday than any other day, but Dean had
more average stories in general. This means that, on any given Monday, Dean
received 0.96 stories and Kerry received 0.80 stories whereas on days other
than Monday 0.33 stories mentioned Dean while 0.15 stories mentioned Kerry.
Thursday was significant for Dean, Edwards and Kerry. They received, on average,
slightly less than 0.60 more stories on a Thursday than other days of the week.
For Dean, this translated into a jump from 0.33 stories on days other than Thursday
to 0.86 on Thursday. Edwards moved from 0.24 stories on days other than Thursday
to 0.76 stories on Thursday. The stories mentioning Kerry increased from 0.15
on days other than Thursday to 0.76 on Thursday. Interesting, Kerry and Edwards,
the two last candidates in the race, still received fewer stories than Dean
did. As mentioned above, I believe this is due to the fact that Dean was the
front-runner for the majority of the time we collected data. These findings
near my hypothesis that the beginning and ending days of the week would be important.
However, I failed to consider the Congressional workweek from Tuesday to Thursday.
This could explain the fact that Dean and Kerry received more coverage on Monday
(the day before the beginning of the Congressional workweek), following the
indexing hypothesis and the idea of “beats” for reporters. I suspect
that the three candidates received more coverage Thursday because it is the
last day of the Congressional workweek. Of course, other explanations could
exist for these variations.
The State of the Union variables proved significant for two candidates. For
Lieberman, coverage increased on the day of the State of the Union address.
He received 1.15 stories on the day of the address, while on any other average
day he received only 0.26 stories. An explanation for this increase in coverage
is difficult to find without more in-depth study of his coverage. The day after
the State of the Union address proved significant for Kerry. The day after the
address, Kerry received 3.63 stories, considerably more than his average of
0.48 stories on any day other than the State of the Union address. This increase
in coverage could be explained by the fact that Kerry had just won the Iowa
and New Hampshire primaries and began to unseat Dean as the front-runner.
The passage of time proved a significant variable for four of the six candidates.
Dean received the greatest boost, moving from 0.16 stories forty days after
the beginning of our study to 1.12 stories 147 days (January 25, 2004, two days
before the New Hampshire primary) in the study. Edwards received a similar increase
in coverage, moving from 0.19 stories per day to 0.62. Stories mentioning Gephardt
increased from 0.06 to 0.26 from the fortieth day to the last day on which he
contested the race (January 20, 2004, the 142nd day of our tabulation). Kerry
received a larger increase, from 0.22 stories per day to 0.76. It is interesting
to note that Dean increased the most of any candidate, but Kerry began with
slightly more stories than Dean or Edwards.
Model Two
My second model differs from the first only in that I removed the condition
and ran the model for the entire race, and added whether the candidate was in
the race as an independent variable. Many of the results are similar, so I will
focus only on the differences. One important difference is that I was able to
run a regression that modeled all coverage concerning education during the entire
race. Table seven presents the results of the regression and table ten presents
the predicted levels of coverage.
Three important variables were not significant in both models. In the model
concerning the entire race, the number of candidates is no longer significant
for Lieberman. I believe this is because measuring the effects of the independent
variables over the course of the entire race adds a complete month that Lieberman
did not contest. However, measuring over the entire race changes the day after
the State of the Union address from insignificant to significant for Lieberman.
In the model for the entire race, 0.83 stories were predicted to be published
on the day after the State of the Union address, while only 0.10 stories were
predicted on any other day. The candidate’s participation variable proved
significant for Clark at the p<.01 level. When he was in the race, 0.36 stories
mentioned about him and education, while zero stories appeared after he exited
the race. This follows my hypothesis that candidates would receive less coverage
after they exited the race.
For Dean, considering the entire race decreases the predicted number of stories
published in each of his significant variables, while shifting the regression
coefficients in various directions. Interestingly, Edwards’s regression
coefficients do not change, while his predicted values increase slightly. This
change is quite unexpected, since removing the condition that Edwards be in
the race should not have any affect on the results because he was in the race
every day we collected data. Yet, as we lagged the coverage a day behind our
other values, perhaps the coverage on Thursday, March 4, 2004 affected these
variables (particularly since Thursday is significant for Edwards). The regression
coefficients do not vary in any consistent pattern for Clark. The values for
Gephardt and Lieberman decrease, probably due to the fact that by considering
the entire race, we add over a month in which those candidates were not in the
race.
For the cumulative regression I ran on all candidates, three variables proved
significant, the number of candidates in the race, Monday and Friday. The regression
coefficient for the number of candidates in the race was -1.02, and was significant
at the p<.01 level. As I hypothesized, the fewer candidates in the race,
the more stories published. In fact, when there were six candidates in the race,
the predicted level of stories mentioning education or No Child Left Behind
was 5.91. When three candidates remained, 8.96 stories published mentioned one
or the other and when only one candidate remained, an impressive 10.99 stories
appeared with education or No Child Left Behind. From table two, we see that
the frequency distribution of stories indicates that, on 22% of the days that
we studied, ten or more stories appeared mentioning education or No Child Left
Behind. Thus, while these results are surprising, they are not implausible.
The other significant variables were Monday and Friday. Monday was significant
at the p<.001 level with a regression coefficient of 9.75. This translates
into a difference of 5.10 stories published on days other than Monday to 14.85
on a Monday. This large increase could be due to the media starting the week
with more stories than the rest of the week. Also, this finding was not surprising,
since Monday was significant for several of the individual candidates. Friday
was significant too, but not for any of the individual candidates. Friday, like
Monday, was significant at the p<.001 level with a coefficient of 6.36. This
means that on days other than Friday, 5.63 stories mentioned education or No
Child Left Behind, while on Fridays nearly twelve stories did. Note that the
negative variable for both Monday and Friday predict similar levels of coverage.
I believe that Friday produced more stories as the media issued more stories
to begin the weekend.
Model Three
My third model is also similar to the first. I used the same main independent
variables and the same condition that the regression run only the dates on which
the candidate participated in the race. However, I removed the variable that
captured all the days on which we collected data and replaced it with the South
Carolina primary variables. The first variable captured the days remaining until
the primary and the second captured the days after the primary. As with the
previous analysis, I will focus primarily on the differences between this model
and the first. Table eight presents the regression model results and table eleven
presents the predicted levels of coverage.
The most important change is that the independent variable representing number
of candidates in the race became significant Dean and Edwards and all candidates
combined. Since the variable is not part of the Gephardt model, this effectively
means that the variable is significant for every candidate. For Clark, the variable
became more significant and increased in absolute value. This translated to
an increase in the predicted levels of coverage of 0.15 stories per day. Since
this variable was not significant for Dean or Edwards in the first model, comparisons
are not possible. Dean ranged from a low of 0.04 stories per day with six candidates
to a high of 2.22 stories when only three candidates remained. With those three
candidates, Edwards actually received fewer stories than Dean: 2.01. However,
Edwards began the race of six candidates with 0.07 stories per day and ended
with 2.66 when only he and Kerry remained. Lieberman’s predicted levels
of coverage changed from 0.38 with five candidates and 0.08 with six in the
first model to zero articles, regardless of the number of candidates in the
race. Kerry received the largest increase of any candidate, moving from zero
stories with six candidates to a high of 4.75. This is markedly more stories
than predicted by the base model that included all the days on which we collected
data. Recall, in that model, Kerry ranged from a low of 0.37 to a high of 1.37.
The change in these results, particularly the increase in Kerry’s coverage
(since we know that four or more articles published mentioned Kerry on only
six days), was surprising. Thus, I ran a correlation between the number of candidates
and South Carolina primary variables. These results can be found in table twelve.
From this test, we see that the days until the South Carolina primary and the
number of candidates in the race correlate 62% of the time. The days after the
South Carolina primary independent variable and the number of candidates in
the race correlate at the 94% level. This correlation could be because five
candidates remained in the race after the South Carolina primary and candidates
dropped out successively after February 3, 2004. Similar correlation tests run
on the days until the Iowa and New Hampshire primaries prove that the South
Carolina primary variables were the optimum choice because the days after Iowa
correlated with the number of candidates at the 99% level and days after New
Hampshire correlated at 98%. This too, makes sense because the six candidates
dropped out after Iowa, and five after New Hampshire. The high level of correlation
between the number of candidates and the days after South Carolina variables
could skew the results in this model and result in unreliable measures.
Concerning the days of the week variables, using the South Carolina primary
variables increased regression coefficients slightly on Monday for Dean, Kerry
and all candidates combined. The variable change also increased the coefficient
for Edwards and Kerry on Thursday. This translated into an increase of less
than 0.50 stories per day in each category. Interestingly, Tuesday and Wednesday
became significant at the p<.05 level for Kerry in the third model. The regression
coefficients were approximately 0.5 for both days. This is near the regression
coefficients for both Monday and Thursday, although Monday’s coefficient
is the largest at 0.68. These coefficients indicate a predicted level of coverage
of approximately one story per day. This finding is interesting, since we know
that Kerry actually received coverage on only about 27% of the days on which
we collected data, while this finding implies that he would have a story approximately
57% of days (every Monday, Tuesday, Wednesday and Thursday). This discrepancy
could be due to factors not included in this model, or the imprecision of predicted
values. The regression for all candidates, however, the inclusion of the South
Carolina variables produced almost no change, though it was in the same direction
as the changes for individual candidates, meaning more predicted stories.
The dummy variable reflecting debates became significant for Kerry in this model.
The regression coefficient was 0.42, which meant that 0.91 stories the day of
a debate mentioned Kerry and education, while on any other day 0.49 stories
mentioned him. I believe this is because most of the debates were held in 2004,
when Kerry’s candidacy gained momentum. However, an explanation as to
why the debate variable was significant for only Kerry, and not the other candidates,
is difficult to find without more study.
Concerning the State of the Union address, the regression coefficient for Lieberman
did not change greatly but this translated into interesting predicted values.
Instead of 1.15 on the day of the address and 0.26 on other days, both values
moved to zero. Since no stories mentioned Lieberman for 90% of the days on which
we collected data, this second set of results seems more plausible. However,
since the President mentioned education in the State of the Union address, it
is probable that at least one story mentioned education and Lieberman on the
day of the State of the Union address. The day after the State of the Union
address, which was significant for Kerry, did not change in a specific pattern.
The South Carolina primary variables were significant for several candidates.
The days until the primary variable was significant for Dean and Gephardt. When
there were 116 days until the primary, Dean had 0.14 predicted stories, while
Gephardt had 0.09. Closer to the primary, at fifteen days (which happens to
be the day on which the Iowa caucuses were held), Dean moved to 0.82 stories
and Gephardt to 0.34. The variable capturing days after the South Carolina primary
was significant at the p<.001 level for Dean, Edwards and Kerry. This is
not surprising since Clark dropped out of the race only eight days after the
primary, Gephardt exited the race before the primary and Lieberman the day of
the contest. The regression coefficients for Edwards and Kerry were about -0.10
and for Dean -0.23. This means that as more days pass after the South Carolina
primary, fewer stories mentioned these three candidates. One day after the primary,
0.82 stories are predicted to mention Dean, 0.61 mention Edwards and 0.72 mention
Kerry. This finding is curious, since Edwards won the South Carolina primary,
with Kerry finishing second, and Dean coming in fifth behind Sharpton and Clark.
Further, Dean did not do markedly better in any of the other primaries that
were held on February 2. But, given the fact that more stories mentioned Dean
than any other candidate, this is not wholly unexpected. Even more curious is
the predicted level of coverage nine days after the South Carolina primary (February
15). For Dean, Edwards and Kerry, the predicted level of coverage is zero articles.
One could speculate that this is because February 15 was a Sunday, yet the same
level of coverage is predicted for all days after about the seventh. Essentially,
the predicted levels of coverage for all days after February 9, 2004 are zero.
Since the regression coefficient is negative (and about 0.15) for all three
candidates, it logically follows that their levels of coverage, since they begin
at less than one one day after the primary, eventually their coverage will reach
zero and stay there.
Introduction | Background | Expectations | Data | Results | Conclusions | Appendix
By: Beth Daniel '04
© Davidson College, 2004, Department of Political Science,
Davidson College, Davidson, NC 28035
Send comments, questions, and suggestions to Patrick
Sellers
Created: 4/27/2004. Last updated: 5/2/2004.