In 2004, Patil worked in the Advanced Systems and Concepts Office in the Department of Defense. He served as the project leader for the Threat Anticipation Project.[14] His role was to anticipate threats connected to and surrounding terrorism, weapons of mass destruction and failed states (with an emphasis on human rights violations),[15] employing social network analysis to help anticipate these threats.[16][10] Patil described himself as part of “the second wave of people”[17] who were to use data to detect signalled noise, a concern which grew following the 9/11 attacks.
2011
While a data-scientist-in-residence at Greylock Partners, Patil produced Building Data Science Teams.[18] The book provides advice and strategies on creating data science teams in business and technology. It was published by O'Reilly Media.[19]
2012
Patil wrote Data Jujitsu--The art of turning data into product, also while a data-scientist-in-residence at Greylock Partners.[20] The book gives instructions as to solving data science problems and whether they are "worth solving" at all. It was also published by O'Reilly Media.[21]
2015–2017
On February 18, 2015, the White House announced Patil would be the first U.S. Chief Data Scientist (Deputy Chief Technology Officer for Data Policy and Chief Data Scientist). In addresses to the public, Patil explained: "The mission of the U.S. Chief Data Scientist, simply put, is to responsibly unleash the power of data to benefit all Americans."[22] He added that his team's priority was to do so by creating data.[23]
In a memorandum on February 20, 2015, entitled “Unleashing the Power of Data to Serve the American People”, Patil outlined his goals as Chief Data Scientist:
Providing vision on how to provide a maximum social return on federal data.
Creating nationwide data policies that enable shared services and forward-leaning practices to advance our nation's leadership in the data age.
Working with agencies to establish best practices for data management and ensure the long-term sustainability of databases.
Recruiting and retaining the best minds in data science for public service to address these data science objectives and act as conduits among the government, academia, and industry.[24]
In his tenure, Patil helped launch the White House's Police Data Initiative as well as the White House's Data-Driven Justice Initiative, collecting data on police activities, and worked on the Precision Medicine Initiative, aiming to build the largest database on genetic information.[25]
Law Enforcement
In 2015, President Obama initiated The President's Task Force on 21st Century Policing in response to the Ferguson shootings in 2014. Included were the Police Data Initiative and the Data-Driven Justice Initiative.[17]
Police Data Initiative
The Police Data Initiative aimed to build trust between the police and their communities by releasing data sets on “stops and searches, uses of force, officer-involved shootings, or other police actions.”[26]
Open data was also thought to increase accountability within the police. The initiative had gathered 126 jurisdictions across the country.[27]
The information would enable investigation into patterns of injustice as well as the ability to see if systems in place were wanted or working. Police departments struggled to release their data, primarily because it was not “collected well”, as Patil had mentioned in an interview.[28] Disorganized datasets had prevented accurate assessments. Patil proposed that this was due to the absence of a dedicated technician whose purpose is to organize the data.[29] Efforts were taken to improve coding abilities of police superintendents[30] and homogenizing organization across the 18,000 jurisdictions on board.[29]
More than 170 datasets were released at the start of the initiative in 2015.[31]
Data-Driven Justice Initiative
With open police data, Patil saw that data helped track law enforcement actions’ disproportionate interactions with minorities, the poor and those with mental health concerns. In a memorandum to the American people, Patil said, “if all the data from law enforcement were effectively captured, analyzed, and shared, imagine how the effective and analysis of data could advance proven reforms, increase efficiency, and prevent injustice.”.[23] Efforts to decrease the injustice is considered the Data-Driven Justice Initiative (DDJ).
DDJ aimed to help governments from the county to the state level to filter low-level offenders and those with mental illness from higher level offenders and the criminal system at large. Changing the approach to pre-trial incarceration would allow low-risk offenders to shorten jail time especially if they cannot afford bail.[32][33]
At its start in 2015, more than 91 million people are covered by the DDJ.[33]
Medicine
Precision Medicine Initiative
Patil had a special interest in using data to improve the medical field by way of the Precision Medicine Initiative (PMI). PMI aimed to build the largest database of genomic information to increase understanding of cancer treatments, chronic disease, and rare diseases[34]
The initiative was started as there is not a current dataset in which analyses and correlations can be made across gender and ethnic diversities. This would help with curing diseases but also understand the health of the general population. Patil had often stated that it would be helpful because one may very well be able to see what diseases are not actually rare, but very common. A key issue in PMI that Patil worked on was of trust.[17]
Considering vulnerability and the issue of ethics that arises in pursuing an initiative PMI, Patil sought to develop workshops and “listening sessions” to hear specific concerns regarding the initiative. Based on the feedback, Patil devised the “privacy and trust principles.”[23]
The first principle in its section entitled “Data Sharing, Access, and Use”, is that “data access, use, and sharing should be permitted for authorized purposes only."[35]
Concerned with maintaining these principles grew for the future as Patil addressed concerns about the effect of American Healthcare Act (AHCA) on contributions to PMI. He feared that, with the AHCA, people will be scared to donate their genomic information because it will expose pre-existing conditions. While there are protections in place to secure data privacy such as the Genetic Information Nondiscrimination Act (GINA) instilled in 2008, Patil expressed a worry that the Act would not be heeded.[36]
^O'Reilly.com. "Building Data Science Teams". O'Reilly. O'Reilly Media, Inc. Archived from the original on November 19, 2017. Retrieved November 19, 2017.