Data Ethics 101
These days, every organization is a data organization. It's estimated that 402 million terabytes of data are created each day, and that, as of 2024, 90% of the world's data was generated between 2022 and 2024. This massive surge in data creation isn't just a technical challenge—it's an ethical one. As we navigate this data-driven world, data ethics has become more crucial than ever, because it can guide us to use this wealth of information responsibly and justly.
What is data ethics?
Before getting into it, let's begin by defining data and ethics. Data refers to facts, figures, observations, or recordings that can take the form of an image, sound, text or physical measurement. Ethics refers to our moral judgments and behaviour.
Together, data ethics encompasses the ethical and moral responsibilities involved in the collection, sharing, and use of data. It ensures that data is collected and used fairly and for the greater good, and that it is used appropriately throughout all stages of the data lifecycle.
Data ethics guides data users to consider the ethical implications before data is collected, ensuring it serves a specific purpose beneficial to both society and individuals. By evaluating the moral issues related to data, data ethics formulates solutions that promote right conduct and values. This also extends to algorithms, such as those used in artificial intelligence and machine learning, as well as practices like responsible innovation and programming.
What are the guiding principles of data ethics?
There isn't a set list of guiding principles1 of data ethics. It's a "one size fits most" as opposed to a "one size fits all" type of situation. While the general themes tend to remain consistent, organizations will adapt them to suit the environment where they'll be applied. While there are others, here are some of the most common guiding principles and reflection questions to address ethical challenges:
Privacy and security
Is personal data being protected and kept secure, so that identifiable personal information or information that could negatively impact people or organizations is not available to unauthorized users? Has it been anonymized to further protect privacy?
How to ensure privacy and security
- Privacy principles: Before collecting data, consider if you really need it and if its benefits justify any privacy impacts.
- Minimal intrusion: Justify every intrusion, explaining why it is necessary and proportionate to the objectives.
- Informed consent: Respect privacy by obtaining individuals' consent before collecting, using, or disclosing their personal information.
- Security assessment: Conduct a high-level security assessment and protect data and information according to their security level (unclassified, classified, protected).
- Data anonymization: Use and implement techniques such as anonymization and pseudonymization to protect personal data.
Transparency and accountability
Is it clear what data is being used for, and how and where it is being stored? Is there strong oversight and management of data to ensure it is used ethically?
How to ensure transparency and accountability
- Open and clear communication: Establish clear communication channels to explain the purposes and uses of the data being collected to all stakeholders using simple language.
- Respond promptly: Address inquiries about data transparency promptly and transparently.
- Be transparent about privacy: Organizations have to explain the systems and processes for protecting individuals' privacy when handling sensitive data. Report on compliance with relevant laws and regulations, such as data protection laws.
- Implement data strategies: Get to know and implement your department's data strategies and priorities, the missions of the Data Strategy for the Federal Public Service, and the Indigenous Data Sovereignty provisions of the United Nations Declaration on the Rights of Indigenous Peoples Act Action Plan.
Fairness and inclusivity
Have you investigated any potentially negative, harmful or discriminatory outcomes from your use of particular data? Will your use of data have discriminatory consequences for particular groups, even if this is unintended?
How to ensure fairness and inclusivity
- Bias awareness and mitigation: Identify and understand various types of biases that can affect your work and implement proactive measures to mitigate biases in data collection, analysis, and interpretation.
- Representation and diversity: Ensure that the data collected reflects the diversity of the population it represents, avoiding over-representation or under-representation of any group.
- Participatory approach: Engage with communities, advocacy groups, and ethics experts to gain valuable insights, ensure fairness and equity in data, and make sure data aligns with community needs and values.
- Inclusivity and non-discrimination: Include diverse perspectives and stakeholders in designing, implementing, and evaluating data initiatives.
- Continuous monitoring and evaluation: Perform regular monitoring and impact assessments to evaluate how data use might affect different groups, and to identify and address disparities and inequities as they arise.
As public servants, but most importantly, as Canadians, these principles are the backbone of our shared values—respect for democracy, respect for people, integrity, stewardship, and excellence. They guide us in recognizing and tackling ethical challenges in our work and building public confidence.
Why is it important to understand data ethics as public servants?
Data ethics has always played a crucial role in government processes, from data collection to decision-making, but two factors have made it more important than ever: data maturity2 and the increase in public expectations for digital services.
The amount of data available to organizations has skyrocketed. This data often comes from multiple sources, and it's not always clear where it originated or whether proper permissions were granted for its use, especially with personal information. Navigating this complexity requires a strong ethical framework to ensure responsible data handling. Additionally, the rise of machine learning and AI algorithms has transformed how organizations process and interpret data. While these technologies offer significant benefits, they also introduce risks related to fairness and discrimination. Automated decisions made by AI, without human oversight, can unintentionally perpetuate biases and inequalities.
Without an understanding of data ethics, we risk unintentionally creating or perpetuating harmful biases or unfairness within our processes. Here are some of the main reasons for public servants to have an understanding of data ethics:
- Upholding the public trust: Knowledge of data ethics ensures that data collection and usage are conducted in ways that serve the public interest and prevent harm. By being transparent and ethical, public servants can maintain and strengthen public trust.
- Promoting fairness and equity: Data can reflect and amplify societal biases. Public servants who understand data ethics can work to ensure that data collection and analysis do not discriminate against certain groups, thereby promoting fairness and equity in public services. Otherwise, biased data or faulty analysis could lead to worse policy outcomes for some groups, for instance, in how government sets benefit eligibility or invests in regional economic activity.
- Encouraging transparency and accountability: Data-driven decisions must be transparent and decision-makers must be accountable to build trust in government. Openly sharing data sources, methods, and criteria allows public oversight and ensures officials are accountable for decisions and outcomes.
- Fostering innovation and progress: Ethical data practices pave the way for responsible innovation. By using data ethically, public servants can drive progress and enhance public services, all while maintaining the public's trust and upholding the public interest.
- Avoiding legal and reputational risks: Unethical data practices can lead to significant legal and reputational risks for the Government of Canada. A solid understanding of data ethics helps public servants navigate these potential pitfalls, ensuring that data usage complies with legal standards and maintains the government's integrity.
- Ensuring safety and security: Cyber attacks targeting government agencies have become increasingly common as malicious actors seek to exploit sensitive information. A strong understanding of data ethics fosters adherence to security protocols, preventing data breaches and safeguarding crucial data.
Conclusion
In essence, data ethics is about collecting, sharing, and using data responsibly. It applies to everyone, from public servants delivering services to policy-makers, and is crucial for delivering better programs, policies, and services that meet the diverse needs of Canadians. By reflecting on guiding principles and learning more about how to integrate them into your work through deeper learning (start by exploring the resources below), you can start to implement data ethics in a meaningful way. This ensures that your data practices are not only effective but also fair, transparent, and beneficial to society as a whole.
Definitions
Guiding principles: Fundamental beliefs or values that provide a framework for ethical decision-making and behaviour.
-
Data maturity: Data maturity refers to how advanced and capable an organization is in handling and using data effectively and ethically.
Resources