How to Present Data Science Findings to Clients – 3 Principles and a Case Study

Background

Students enrolled in Spark! courses and practicums usually have the technical skills needed to perform data analytics. However, there is often no consensus on how to present findings to a client. This blog post, which originated from a conversation with a FA23 Spark! team, offers some insights on how to present data science findings to clients.

3 Principles

  1. Use data-driven strategies for storytelling. Support your conclusions and hypotheses with data.
  2. Know your audience. Pitch your presentation at your clients’ level and explain in ways they can understand.
  3. Embrace the Ownership Principle. Your task shouldn’t be limited to completing the code, presentation, or report; avoid the mindset that the rest is “not my job”. Consider the bigger picture of how your findings could benefit the client.

Case Study

Case Study Background

The case study used in this blog post comes from a Spark! data science project centered on unanswered survey data, where the team wants to help analyze survey data from the past 8-9 years.

Case Study Practice

Before going deeper into the blog post, imagine you are given tabular survey data for 20 survey respondents across 10 years, e.g. “Effectiveness of head coach’s communication skills (on a scale from 1 to 4)”, and a list of the head coach changes for all the teams throughout the years. How would you use it to answer the following question: “How have athletes’ perceptions of coach communication evolved over the 8-9 year period?”

Two Options

Option A:

  1. A list of scatter plots of all teams throughout the 10 years
  2. Statistical test findings, presented as an ANOVA table with an F-test, showing whether there is a statistically significant difference between the years.
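For readers who want to see what sits behind the ANOVA table and F-test in Option A, here is a minimal sketch of a one-way ANOVA F statistic written with the Python standard library only. The function name `one_way_anova_f` and the ratings are hypothetical illustrations, not the project's actual code or data.

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA over `groups`."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    k = len(groups)          # number of groups (here: survey years)
    n = len(all_values)      # total number of ratings

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual ratings around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical ratings (1-4 scale) for three survey years; not real data.
ratings_by_year = {
    2015: [2, 3, 2, 3],
    2016: [3, 3, 2, 4],
    2017: [4, 3, 4, 4],
}

f_stat, df_b, df_w = one_way_anova_f(list(ratings_by_year.values()))
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

A larger F means the yearly averages differ more than the spread within each year would suggest. Turning F into a p-value requires the F distribution, which a statistics library would normally handle.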

Implementations:

[Figure: Scatter Plot]

[Figure: ANOVA Table]

Option B:

With the help of an interactive dashboard tool such as Tableau:

  1. A written conclusion indicating which teams are under-performing or over-performing, which teams have experienced a large spike (upward or downward), and whether a head coach change affects communication.
  2. An interactive dashboard showing each individual team’s survey ratings and head coach change history.

[Figure: Conclusion Panel]

[Figure: Dashboard Panel]

Which one would you choose?

Presentation Principles Breakdown

If you are not sure which one to choose, read on and reflect.

Role of a Data Scientist / Business Analyst

The first question that might pop into your head is: Why should I care about how I present my data findings? My model, report, or analysis speaks for itself! However, as I learned during my past tenure as a data scientist in marketing and product analysis, being a data scientist / business analyst is not all about deploying a model or building a dashboard. These tasks are repetitive and can be handed to anyone with the same skill set.

Instead, what you really want to do is use your data-driven conclusions to move your client (who might be your stakeholders, your manager, or your product team) toward data-driven decisions. That way, you become part of the decision-making process and your value is magnified.

So how do you convince your client that your idea or strategy is viable?

Modes of Persuasion

In Aristotle’s Rhetoric, the strategies of persuasion consist of Ethos (an appeal to the authority or credibility of the presenter), Pathos (an appeal to the audience’s emotions), and Logos (logical appeal or the simulation of it). They are often referred to as the modes of persuasion or the modes of appeal.

Although ancient, this framework provides a great guideline on how to persuade others. Placed in the context of data science and analytics, here are the 3 most important ways you can apply it:

Principle 1: Data-Driven

The most distinctive quality you possess as a data scientist or business analyst (compared to a product manager or a software engineer) is your ability to combine storytelling and data. When doing analysis, you are often flooded with all kinds of data and metrics, and you become familiar with the statistical definitions of those metrics. Finding the right metrics to support your claim is therefore a critical step toward your final deliverables. Hence the first principle, derived from Logos: be data-driven.

In our case study, both options present data and graphs to support their conclusions. However, Option A has no clearly defined conclusion, while Option B has a clear conclusion drawn from data: “The picking and changing of head coaches has been really successful”.

The power of being data-driven also lies in forming actionable plans or hypotheses during decision making. Suppose you are doing further correlation analysis on the survey data and need to find directions for improvement.

Anyone can propose something or make an educated guess, so what makes you different? What separates a good idea from a bad one? Data.

In the case study, the client wants to explore how to improve communication between athletes and coaches. Based on your understanding, you put forward a few hypotheses:

  1. We could keep the head coach and have him/her take a communication course;
  2. We could create an assistant position, filled by a third-party hire, to ensure that communication goes well;
  3. We could replace the head coach with someone with more experience.

All of the hypotheses seem helpful, but only the third can be supported by our survey data (see the chart below), which makes it more convincing in the decision-making process. While everyone else may have a fascinating idea with plenty of reasoning behind it, data provides a neutral, objective way to justify decisions, which is what makes storytelling with data so convincing.
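As a rough illustration of how survey data could back a hypothesis like the third one, the sketch below compares a team's average rating before and after a head coach change. The function `rating_shift`, the ratings, and the change year are all hypothetical; they only show the shape of the comparison, not the project's actual analysis.

```python
from statistics import mean

def rating_shift(yearly_ratings, change_year):
    """Mean rating from change_year onward minus mean rating before it.

    Returns None when either side of the change has no data.
    """
    before = [r for year, r in yearly_ratings.items() if year < change_year]
    after = [r for year, r in yearly_ratings.items() if year >= change_year]
    if not before or not after:
        return None
    return mean(after) - mean(before)

# Hypothetical team-level average ratings (1-4 scale); not real survey data.
team_ratings = {2015: 2.1, 2016: 2.0, 2017: 2.2, 2018: 3.1, 2019: 3.3}
coach_change_year = 2018  # hypothetical year the new head coach started

shift = rating_shift(team_ratings, coach_change_year)
print(f"Average rating moved by {shift:+.2f} points after the coach change")
```

A before/after difference like this does not prove that the coach change caused the shift, but it gives the client a concrete number to discuss instead of an opinion.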

To learn more about how to speak with data, the Lean book series, especially Lean Analytics, is a good place to look for inspiration.

[Figure]

The book provides many insights and frameworks that are suitable for analytics. For example, the AARRR(A) framework mentioned here is a common metrics framework that provides guidance for many products.

But is mastering data-driven storytelling enough? Not yet! In many cases, your clients have a different background than yours, which can lead to blockers in communication.

Principle 2: Know-Your-Audience

Compared to other technical positions such as software engineer or machine learning engineer, data scientists and business analysts communicate far more with clients from different backgrounds. Choosing the right way to communicate is therefore essential, and the key to choosing it, derived from Pathos, is understanding your clients and meeting them at their level.

[Figure: A data science Venn diagram I love to use]

Besides coding, statistics, and domain knowledge (business), communication is an important part of a data scientist’s responsibility. Imagine a scenario in which your client excels in business administration and hasn’t touched statistics for a long time. Now consider the following (also a very common data science interview question): How would you explain a p-value to your client?

Many answers pop into your head. You could cite Wikipedia’s definition: “In null-hypothesis significance testing, the p-value is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct.” But reciting the strict definition to your client is probably not the best approach, since it brings more jargon to the table: “test results at least as extreme”, “null hypothesis”, etc.

For this particular question, I would recommend checking out the blog post by Cassie Kozyrkov (Google’s former Chief Decision Scientist), in which she uses an excellent analogy to explain the concept: Explaining p-values with puppies.
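Another way to make the definition concrete for yourself before translating it for a client is to simulate it. The sketch below is a simple permutation test, chosen here purely for illustration: it shuffles the group labels many times (which is what “assuming the null hypothesis is correct” means in this setting) and counts how often the shuffled difference in means is at least as extreme as the observed one. The function name and the ratings are hypothetical.

```python
import random
from statistics import mean

def permutation_p_value(group_a, group_b, n_permutations=10_000, seed=0):
    """Estimate a two-sided p-value for the difference in group means."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)

    extreme = 0
    for _ in range(n_permutations):
        # Under the null hypothesis the labels are interchangeable,
        # so reassign them at random.
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            extreme += 1
    return extreme / n_permutations

# Hypothetical ratings (1-4 scale) under two different coaches; not real data.
old_coach = [1, 1, 2, 1, 2, 1]
new_coach = [4, 4, 3, 4, 3, 4]

p = permutation_p_value(old_coach, new_coach)
print(f"Estimated p-value: {p:.4f}")
```

In client-friendly terms, the result reads as: “If the coach made no difference at all, a gap this large would show up by chance only about this often.”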

However, knowing your audience doesn’t simply translate to “use the simplest words possible”. It is closer to “use the most accurate and concise words that your audience can understand”. If your audience comes from a STEM or technical background, there is no need to re-explain basic concepts. What you are looking for is the right balance.

In the case study, an ANOVA table was presented to show that there are statistically significant correlations between coaching communication and winning across all teams. Applying the data-driven and know-your-audience principles, this could instead be phrased as: “Our analysis shows that coaching communication is strongly correlated with seasonal placing [insert detailed metrics], and by changing the head coach for teams with poor communication scores, we are likely [insert probability or other statistical metrics] to improve our seasonal placing by a small/medium/large margin.”

Furthermore, a few presentation tricks, e.g. less is more, putting the conclusion first, repeating conclusions in different ways, and moving the details to an addendum, will greatly boost the effectiveness of a client meeting.

Now you have become an excellent data scientist. But is there room for improvement? That brings us to the last but most important principle:

Principle 3: Ownership

Ownership is discussed at many technology companies, but it is most commonly associated with Amazon’s Leadership Principles. In short, you shouldn’t stop working just because a task falls outside your scope, or use “not my business” as an excuse to avoid taking responsibility.

In the context of a data science project, although you spend a large amount of time on the technical pipeline, your analysis shouldn’t stop once you finish a data visualization or a statistical analysis. There is a huge difference between a mediocre presentation and an excellent one! (When startups and VCs were at their peak a few years ago, a team with no concrete product but a successful roadshow presentation could easily win over an investor.)

The Ownership principle can also be interpreted as “putting yourself in the client’s shoes”. In the context of the project, once the team presents its findings to the client, what will the client do with them? Will he/she use them to drive change and make decisions that help the 20 teams in Boston University’s Athletics department? If so, will the findings be sufficient for him/her to convince his/her supervisor or manager? Will he/she hesitate to use the jargon in the findings? How would he/she explain it?

If you show great ownership and leadership in your work, trust naturally builds between you and your client. This is where Ethos comes in: further communication becomes easier and your words become more convincing.

Summary

In summary, presenting data science findings effectively to clients involves three key principles:

  1. Data-Driven Approach: Always back your conclusions with solid data. This forms the core of your credibility and persuasiveness.

  2. Know Your Audience: Tailor your presentation to suit the background and understanding level of your clients. Avoid jargon and use clear, relatable language.

  3. Ownership: Go beyond just presenting data. Think about how your findings can be applied in practical scenarios to benefit the client. Show initiative and consider the broader impact of your work.

Applying these principles, as shown in the case study, will help you communicate your findings more effectively, ensuring they’re not only understood but also actionable. Remember, your role is to make data meaningful and useful for decision-making, bridging the gap between technical analysis and practical application.