gtag('config', 'G-6TW216G7W9', { 'user_id': wix.currentUser.id });
top of page

Biggest Challenges Faced By A Data Scientist At Cohere Health

Shailja's biggest challenges as a Data Scientist at Cohere Health involve "data wrangling," transforming messy healthcare data like "doctors' notes that are literally scribbled," into usable formats. A second key challenge is effectively communicating complex technical findings and limitations to non-technical stakeholders.

Data Wrangling, Data Analysis, Communication, Problem-Solving, Healthcare Data

Advizer Information

Name

Job Title

Company

Undergrad

Grad Programs

Majors

Industries

Job Functions

Traits

Shailja Somani

Data Scientist

Cohere Health

Johns Hopkins University, 2020

Currently pursuing my MS in Applied Data Science at the University of San Diego (part-time online while working full-time)

Psychology

Technology

Data and Analytics

Greek Life Member, LGBTQ

Video Highlights

1. Data cleaning and wrangling is a significant part of the job, requiring skills in handling imperfect and diverse data formats. This includes working with unstructured data like handwritten doctor's notes.

2. Translating complex technical findings into easily understandable explanations for non-technical stakeholders is crucial for effective communication and collaboration.

3. Healthcare data presents unique challenges due to its varied formats, inconsistencies, and the need to handle sensitive patient information responsibly and ethically

Transcript

What is your biggest challenge in your current role?

I think I would say two things. First, data is never perfect. There are always issues, and there are always things that we might want that we might not have access to.

Sometimes data is in vastly different formats than we expect. For example, I work in healthcare, so you get doctors' notes that are literally scribbled or just crazy chicken scratch. Or maybe they type things out in abbreviations that only their office understands. So there's a lot of data wrangling and figuring all of that out.

The second thing I would say is a little bit what I answered to an earlier question, but it's translating from technical work to non-technical stakeholders. Sometimes it's hard to explain what exactly we're working on. This includes explaining limitations and even, on the opposite side, explaining insights and things that we've understood from the data.

It's challenging to explain this to people who don't necessarily understand what's in the data, what it looks like, or how it might be able to specifically generate certain insights, or what algorithms we use to generate those.

bottom of page