Reproducibility as part of code quality control
In this post, we discuss reproducibility as a part of Pew Research Center’s code review process.
A behind-the-scenes blog about research methods at Pew Research Center.
For our latest findings, visit pewresearch.org.
In this post, we discuss reproducibility as a part of Pew Research Center’s code review process.
Several posts on this blog have examined unsupervised methods of natural language processing. These algorithms and models can help…
(Related post: Validating 2020 voters in Pew Research Center’s survey data)
Topic models can produce clusters of words that characterize written documents. But how do we figure out what those clusters mean, exactly?
Topic models can scan documents, examine words and phrases within them, and “learn” groups of words that characterize those documents.
There are a variety of tools that can help researchers analyze large volumes of written material. In this post, I’ll examine two of these…
Regression models are a cornerstone of modern social science. Yet social scientists can run into a lot of situations where regression…
At Pew Research Center, we regularly use APIs to collect information for the studies we produce. Web APIs provide a means of communication between websites and users, structured by rules.
Pew Research Center releases its survey data publicly as IBM SPSS files with the .sav extension. But if you don’t have access to SPSS, there are free, open-source tools available to analyze and make use of the data.
An introduction to the methodological musings, puzzles and tangles that you would see if you could flip those picture-perfect research products over.
To search or browse all of Pew Research Center findings and data by topic, visit pewresearch.org
This is a blog about research methods and behind-the-scenes technical matters at Pew Research Center. To get our latest findings, visit pewresearch.org.
Copyright 2024 Pew Research Center