Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping America and the world. It conducts public opinion polling, demographic research, media content analysis and other empirical social science research in the areas of U.S. politics and policy views; media and journalism; internet and technology; science and society; religion and public life; Hispanic trends; global attitudes and U.S. social and demographic trends. Pew Research Center does not take policy positions. It is a subsidiary of The Pew Charitable Trusts. The Center’s work is carried out by a staff of about 140.
The Data Scientist will be part of the Data Labs team, a new area of the Center that provides researchers intellectual and analytical expertise and support needed to experiment with newly available data sources and methodologies. Pew Research Data Labs uses cutting edge data science and computational methods to contribute to our ongoing research in our key areas: politics; religion; journalism; science and technology, Hispanics; social trends; and global attitudes.
The Data Scientist should be hungry and eager to be a part of a team using bleeding edge methods to contribute to research that serves the public good. He/she will be adept at extracting critical takeaways from complex analyses, and articulating them with concise prose and compelling visualizations. He/she should have experience designing PhD level social science research, and should have experience answering important questions using approaches from machine learning and computational social science. He/she should be adaptable and comfortable trying out new approaches and languages.
The Data Scientist will contribute on all aspects of a wide range of research projects. This includes development, original research and writing – and managing involvement in a number of different projects at once.
- Determine the best research methods and tools to answer the questions at hand
- Project design and development
- Conduct/Implement original research
- Coordinate with outside data suppliers
- Data management, statistical modeling
- Write research reports
- Write short form posts related to project work
- Stay abreast of developments in data science, new data sources, and methodologies
- PhD level research design experience is strongly preferred
- Experience interacting with web APIs, working with JSON data, and utilizing regex
- Experience with machine learning (e.g., SVM, Random Forests, GBRT/GBDT, ensemble methods, etc.)
- Experience working with datasets too large to fit into memory
Knowledge and Skills Requirements
- Proficiency in R and/or Python (including Pandas, Scikit-learn, SciPy + NumPy)
- Proficiency with SQL
- Familiarity with Natural Language Processing (preprocessing, term-document matrix representation, named entity recognition, POS taggers/parsers/etc.) preferred
- Familiarity working with image data, OpenCV, and/or convolutional neural networks for machine learning (e.g., Caffe) preferred
- Experience using crowd-sourcing (e.g., Mechanical Turk) to gather or make data preferred
- Experience scraping unstructured data from the web preferred
- Experience compiling and/or using network data preferred
FLSA Status: Exempt
Applicant should send a résumé, cover letter (indicating where you learned of the opening) with salary expectations to firstname.lastname@example.org. Responses can also be mailed to:
Human Resources Department
Pew Research Center
1615 L Street, NW Suite 800
Washington, DC 20036
We are an equal opportunity employer.