Dan Mazur, PhD

Personal website of Dan Mazur, PhD. Dan is a machine learning engineer in Vancouver, BC.

View My GitHub Profile

Statement of Data Science Philosophy

Within some professions, especially teaching, it is normal to write one’s reflections on the goals of their work and how they go about achieving them.

Data science has become very broadly defined. When you meet a data scientist, it isn’t clear what they do or how they go about solving problems. Different organizations have different job descriptions and standards of professional practice for data scientists. So, I think it is a useful exercise to reflect on my personal philosophy and be able to communicate it with colleagues, mentors, mentees, and employers.

Purpose of data science

The purpose of data science is to improve decision making using data by applying best practices from science, statistics, decision theory, software engineering, business, and related disciplines. The improvement may be in the quality of the outcomes of the decisions, in removing a burden on humans to make decisions, or it may be in the speed or ability to scale of the decision making process. This approach can have many benefits to an organization, to society, or in technology. It can help leaders of organizations make more effective decisions to benefit their customers and their employees. Or, it can be used to automate the many decisions involved in complex activities like driving a car or managing your personal finances.

Data Science Values

I believe that the most difficult aspect of data science is in managing the many ways that a data scientist can be fooled unintentionally. When that happens, the wrong decisions can be made and that has real-world consequences. Data can be very hard to interpret correctly and it requires great care to avoid fooling onesself in the face of many sources of uncertainty.

Therefore, when I work, I value the following things: