Most Canadians can be uniquely identified from their date of birth and postal code

Aug 08, 2011

There are increasing pressures for health care providers to make individual-level data readily available for research and policy making. But Canadians are more likely to allow the sharing of their personal data if they believe that their privacy is protected.

A new report by Dr. Khaled El Emam, the Canada Research Chair in Information at the University of Ottawa and the Children's Hospital of Eastern Ontario Research Institute, suggests that can be uniquely identified from their date of , postal code, and gender. This means if this triad of data exists in any database, even if it has no names or other identifying information, it would be possible to determine the identity of those individuals. The report is now available in BMC and Decision Making Journal.

"Most people tend to think twice before reporting their year of birth [to protect their privacy] but this report forces us all to think about the combination or the totality of data we share," said Dr. El Emam. "It calls out the urgency for more precise and quantitative approaches to measure the different ways in which individuals can be re-identified in databases – and for the general population to think about all of the pieces of personal information which in combination can erode their anonymity."

The research study used a sizable Montreal-based population. The provincial health insurance claims database of Quebec holds demographic information on all citizens that have health insurance. Because it is publicly financed insurance, it effectively captures the whole population. For the purpose of the investigation, only date of birth, gender and full postal code data were obtained.

Using only the postal codes, the proportion of individuals who are unique is significant. When the full date of birth is used together with the full postal code, then approximately 97% of the population are unique with only one year of data. When the full date of birth and a multi-year residential trail are considered, then almost 100% of the population is unique. Reducing the granularity of the postal code to 1 character together with the full date of birth does reduce the proportion uniqueness considerably.

"The findings are important because they offer yet another onion skin to peel back in the overarching dialogue about individual privacy rights. We need to continuously evaluate these risks to privacy, and put in place measures to protect anonymity, whether technological or policy-based. Failure to do so will result in a public unwilling for their health data to be used for secondary purposes, such as health research," said Dr. El Emam. "Take for example, if only a three character postal code is combined with the full date of birth, close to 80% of the is unique – or easily identifiable. I suspect this will surprise most people."

Explore further: Can science eliminate extreme poverty?

More information: BMC Medical Informatics and Decision Making 2011, 11:46 doi:10.1186/1472-6947-11-46

Provided by Children's Hospital of Eastern Ontario Research Institute

4.6 /5 (5 votes)

Related Stories

Privacy risks from geographic information

Apr 08, 2010

In today's world more geographic information is being collected about us, such as where we live, where the clinic we visited is located, and where we work. Web sites are also collecting more geographic information about their ...

New study looks at re-identification risks

Oct 14, 2009

A recent study led by Dr. Khaled El Emam, the Canada Research Chair in Electronic Health Information at the CHEO Research Institute, found that the information in hospital prescription records can quite easily re-identify ...

Don't stop anonymizing data

Jun 16, 2011

Canadian privacy experts have issued a new report today that strongly backs the practice of de-identification as a key element in the protection of personal information. The joint paper from Ontario's Information and Pri ...

File-sharing software potential threat to health privacy

Mar 01, 2010

The personal health and financial information stored in thousands of North American home computers may be vulnerable to theft through file-sharing software, according to a research study published online today in the Journal of ...

Novel K-anonimity algorithm safeguards access to data

Nov 20, 2009

As electronic health records become more widely deployed, increasing amounts of health information are being collected. This data has many beneficial applications, such as research, public health, and health system planning. ...

Recommended for you

Study finds law dramatically curbing need for speed

20 hours ago

Almost seven years have passed since Ontario's street-racing legislation hit the books and, according to one Western researcher, it has succeeded in putting the brakes on the number of convictions and, more importantly, injuries ...

Newlyweds, be careful what you wish for

Apr 17, 2014

A statistical analysis of the gift "fulfillments" at several hundred online wedding gift registries suggests that wedding guests are caught between a rock and a hard place when it comes to buying an appropriate gift for the ...

User comments : 0

More news stories

Study finds law dramatically curbing need for speed

Almost seven years have passed since Ontario's street-racing legislation hit the books and, according to one Western researcher, it has succeeded in putting the brakes on the number of convictions and, more importantly, injuries ...

Health care site flagged in Heartbleed review

People with accounts on the enrollment website for President Barack Obama's signature health care law are being told to change their passwords following an administration-wide review of the government's vulnerability to the ...

Airbnb rental site raises $450 mn

Online lodging listings website Airbnb inked a $450 million funding deal with investors led by TPG, a source close to the matter said Friday.