NOTE: This position has been deemed critical/has specific funding, has been approved by the Law School for posting, and is exempt from the hiring freeze.
The Regulation, Evaluation, and Governance Lab (RegLab) at Stanford University is looking for a Data Scientist to work across our research programs. This role will collaborate closely with teams of research fellows and students on projects related to machine learning and the public sector. We are looking for someone who is a self-learner, research-oriented, and excited by the mission of the Lab.
About us:
Stanford RegLab is a social impact lab that partners with government agencies and nonprofits to use machine learning and data science to modernize the public sector. We are an interdisciplinary team of lawyers, data scientists, social scientists, and engineers who are passionate about building high-impact demonstration projects for the future of governance. Our partners include the EPA, IRS, DOL, and Santa Clara County Public Health.
As a Data Scientist, you will:
- Support the collection, management, and cleaning of data sets from regulatory agencies; investigate discrepancies; and build and document a data dictionary to improve understanding of the different systems and how to manage discrepancies between them.
- Produce reports for our government partners and papers for academic publication, based on extensive data analysis.
- Work with the PI and Research Director to ensure that data management and collection are clear, secure, and robust; improve processes for collecting and managing data.
- Support training of research fellows on best practices and technical skills.
Core Duties:
- Identify and select usable data from subtle and complex data patterns. Assess and produce relevant, standard, or custom information (reports, charts, graphs and tables) from structured data sources by querying data repositories and generating the associated information.
- Design methods to validate data to ensure a high-quality product. Explore creative approaches to using data, based on technical expertise with the available data. Distribute reports to applicable agencies, researchers, management and other internal end-users and provide interpretation of data when needed.
- Collect, manage and clean datasets using an extraction and reporting programming language to ensure data integrity.
- Research and reconcile data discrepancies occurring among various information systems and reports.
- Collaborate with data managers to define and implement data standards and common data elements for data collection.
- Identify new sources of data and methods to improve data collection, analysis and reporting.
- May test prototype software and participate in approval and release process for new software.
- Other duties as assigned.