Appendix B

Home Research Topics Methodological Research Survey Methods International Survey Methods

Report

February 25, 2016

Evaluating a New Proposal for Detecting Data Falsification in Surveys

Appendix B

An additional analysis stemming from the Kuriakose & Robbins piece is a scatter plot showing the number of questions (x-axis) against percentage of near duplicates (y-axis). This raised the question — if more questions in a survey cause a lower near duplicate rate, shouldn’t that be apparent in the scatter plot? One of the problems with the graph is that several important survey characteristics are confounded when run that way in a simple bivariate analysis.

To demonstrate this point, we conducted an additional empirical analysis. Using 367 Pew Research Center international surveys, we ran a regression predicting the percentage of near duplicates using the following covariates: the number of questions, the sample size, and the percentage of questions with five or more response options.

As the results show, two survey characteristics are significant predictors of the percentage of near duplicates. The number of questions turned out to be not significantly predictive, however, the sample size and the proportion of questions with five or more response categories both have a strong association with the percentage of near duplicates. The larger the sample size, the higher the percentage of near duplicates. The greater the proportion of questions with 5+ response options, the lower the percentage of near duplicates. This adds to the evidence in our paper that a uniform threshold of 85% is wrong-headed because survey characteristics affect the likelihood of there being a high match rate. All of the 367 datasets in this analysis are publically available on our website and the R code for this regression is available upon request.

Works Cited

← Prev Page

1 2 3 4

Topics

International Survey Methods

Most Popular

1615 L St. NW, Suite 800
Washington, DC 20036
USA
(+1) 202-419-4300 | Main
(+1) 202-857-8562 | Fax
(+1) 202-419-4372 | Media Inquiries

Research Topics

ABOUT PEW RESEARCH CENTER Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping the world. It conducts public opinion polling, demographic research, media content analysis and other empirical social science research. Pew Research Center does not take policy positions. It is a subsidiary of The Pew Charitable Trusts.

About

Terms & Conditions

Cookie Settings

Reprints, Permissions & Use Policy

Feedback

Careers

Topics

Regions & Countries

Formats

Evaluating a New Proposal for Detecting Data Falsification in Surveys

Appendix B

Most Popular

Table of Contents

Table of Contents

Research Topics

Follow Us

Evaluating a New Proposal for Detecting Data Falsification in Surveys

Appendix B

Table of Contents

Table of Contents

Sign up for The Briefing

Related

Use our updated Global Indicators Database to explore survey findings from around the world

How the political typology groups compare

How Pew Research Center has dealt with the challenges of international polling during the pandemic

Bhutanese in the U.S. Fact Sheet

The coronavirus pandemic’s impact on our polling

Most Popular

Table of Contents

Table of Contents

Research Topics

Follow Us