Katherine Mangu-Ward from the March 2010 issue
New reporting requirements in Oklahoma could force women who receive abortions to have their private information entered in a public database. The rules are on hold pending a court hearing, which at press time had not yet been scheduled.
The new regulations would require doctors to collect and report information about every abortion in the state, including the mother’s age, marital status, race, number of children, education level, relationship to the father, and reason for the abortion, as well as the cost and method of payment. The form contains 37 questions in all, most with several subsections. One goal of the law, which also includes a ban on sex-selective abortions, is to make the data available to researchers and the general public on the state government’s website.
To keep such personal information private, the database would strip out women’s names and other obvious identifying information, theoretically “anonymizing” the data. But as Latanya Sweeney of Harvard’s Center for Research on Computation and Society told BioEdge, “data tend to flow around and get linked to other data.” Even when obvious identifying information is removed from a large data set, personal identities often can be cracked by a geek with time on his hands. Arvind Narayanan and Vitaly Shmatikov, for instance, broke the anonymity of a large set of Netflix movie preference data by comparing the ratings, and the dates they were entered, with similar ratings on the popular Internet Movie Database, where users reveal personal information in public profiles. Something similar happened when AOL released “anonymized” search queries that nonetheless made identifying some users quite simple, with potentially embarrassing results.
Paul Ohm, a law professor at the University of Colorado, summed up the problem in an interview with the tech website Ars Technica: “Data can either be useful or perfectly anonymous but never both.”
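For readers who want to see the mechanics, the kind of linkage described above for the Netflix data can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records, names, and the `count_matches` and `best_candidate` helpers are all hypothetical, and the simple overlap count used here is a stand-in, not the statistical method the researchers actually used.

```python
from datetime import date

# Hypothetical "anonymized" release: names replaced by opaque IDs.
# Each record is (title, stars, date entered).
anonymized = {
    "user_17": [
        ("Film A", 5, date(2005, 3, 1)),
        ("Film B", 2, date(2005, 3, 4)),
        ("Film C", 4, date(2005, 4, 10)),
    ],
    "user_42": [
        ("Film A", 3, date(2005, 6, 2)),
        ("Film D", 1, date(2005, 6, 9)),
    ],
}

# Hypothetical public profile on another site, posted under a real name.
public = {
    "Jane Example": [
        ("Film A", 5, date(2005, 3, 2)),
        ("Film C", 4, date(2005, 4, 11)),
    ],
}


def count_matches(anon_ratings, public_ratings, max_days=3):
    """Count public ratings that have a same-title, same-score entry
    in the anonymized record within max_days of each other."""
    matches = 0
    for title, stars, when in public_ratings:
        for a_title, a_stars, a_when in anon_ratings:
            same_item = title == a_title and stars == a_stars
            if same_item and abs((when - a_when).days) <= max_days:
                matches += 1
                break  # count each public rating at most once
    return matches


def best_candidate(public_ratings, anonymized_data, max_days=3):
    """Return the anonymized ID whose record overlaps most with a public profile."""
    scores = {
        uid: count_matches(ratings, public_ratings, max_days)
        for uid, ratings in anonymized_data.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


if __name__ == "__main__":
    for name, ratings in public.items():
        uid, score = best_candidate(ratings, anonymized)
        print(f"{name} most resembles {uid} ({score} matching ratings)")
```

In this toy data, two ratings entered within a day of each other are enough to tie a named profile to one "anonymous" record. In a large real data set, a handful of such coincidences can narrow the field just as sharply, which is why stripping names alone offers little protection.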
Arvind Narayanan|3.5.10 @ 2:36AM|#
Since you cited my paper, you may be interested to know that I think this is a giant overreaction: http://33bits.org/2009/10/09/o.....-it-wrong/
choice C|3.13.10 @ 2:30AM|#
Very interesting reading, Arvind. Thank you.