Emily Oster

6 min Read Emily Oster

Emily Oster

More Studies on Coffee

Can it really save your life?

Emily Oster

6 min Read

If you are a regular reader, you’ll know that among my biggest frustrations are studies that confuse correlation and causality, especially when it comes to food. This is a topic that I research in my academic work and have written about frequently here (see, for example, this post on coffee, closely related to today’s rant).

Last week, we got a new study, reported in the New York Times with the characteristic headline “Coffee Drinking Linked to Lower Mortality Risk, New Study Finds.” I briefly pooh-poohed this headline (and the whole study) on Twitter, but I will take the opportunity here to expand on that criticism. Although I’ve talked through many of these points before in general, I want to pull out the details of how I unpack a particular study of this type.

To begin: here is the study. It’s published in the Annals of Internal Medicine, which is a prestigious journal (impact factor 25.39!).

The contours of the study are pretty typical. The researchers use a large sample (about 171,000 people) in the U.K., created as part of a broader study called the UK Biobank. They have information at baseline about the consumption of (among other things) coffee, including whether people drink their coffee unsweetened, sweetened, or artificially sweetened. They follow these individuals for seven years and look at the relationship between coffee consumption and death. They find that consumption of unsweetened or sugar-sweetened coffee is associated with a reduced risk of death. There doesn’t seem to be an association with artificially sweetened coffee.

The effects are sizable, up to a 30% reduction in the hazard rate of death over this period. The sweet spot seems to be between 1.5 and 3.5 cups a day.

The obvious concern with this paper is that it’s not the coffee but the other differences across people that drive the results. The approach this paper, and many others, take is to try to adjust for observable differences between individuals. In this case, the data set is detailed, and they are able to control for differences in underlying health, other dietary choices, and socioeconomic status.

The key question in these papers is: are these controls sufficient? Or, alternatively, are there other important unobserved controls that might be driving the results?

I have two ways into thinking about this question. The first is conceptual. Generating a causal estimate here is going to require isolating some variation in coffee consumption that is effectively random. That is: if we think these observational data generate a causal link, it must be that we think that once the controls are included, we’ve isolated variation in coffee consumption that isn’t related to other important characteristics. This could happen if, say, people choose how much coffee they drink at random, conditional on their observed characteristics, or if their choice was driven by some external factor unrelated to their health overall.

I find this idea implausible. I don’t think that coffee consumption is chosen at random, even conditional on controls; it’s part of a larger diet and lifestyle. And all of the external factors I can think of that might influence coffee consumption — stress, how busy you are, sleep — are also variables that influence health. However, this is an inherently untestable view. I find it implausible, but others might not.

The second way I try to evaluate this question is with data — specifically, looking at the differences across groups in the observed controls. Why? Generally, because I tend to think that any differences we see in the observed controls might be reflective of differences in characteristics that remain unobserved.

In the case of this particular paper, we can look at Table 1, which gives characteristics for the four groups considered (non-consumers, unsweetened consumers, sugar-sweetened consumers, and artificially sweetened consumers).

There are a few sizable differences I notice.

  • Gender: 60% of the sugar-sweetened coffee group identify as men, versus 40% to 44% in the other three groups.
  • Race: all coffee drinker groups are more likely to be white (with unsweetened much more likely).
  • Education: unsweetened coffee drinkers are much more likely to have a degree than any other group.
  • Smoking: large differences, in various directions (sugar-sweetened consumers are more likely to smoke; unsweetened are less likely).
  • Diabetes: sugar-sweetened consumers are much less likely to be diabetic; artificially sweetened are much more.
  • Blood-pressure-drug use: artificially sweetened are much more likely to use.
  • Sugar consumption: all three coffee groups, but especially the unsweetened group, report less sugar consumption.

When I look at this table, it paints a picture of quite different people in each group. Relative to the non-consumers, those who drink unsweetened coffee are better-educated, less likely to smoke, do more exercise, and consume less sugar. It’s a picture of a better-off, more health-conscious group.

In contrast, the group that consumes artificial sweetener seems to be less healthy (more diabetes, higher weight, more likely to be a former smoker). They look like a group that is consuming artificial sweetener in part due to trying to improve their health.

The group that consumes sugar-sweetened coffee actually looks most similar to the non-consumers, except they are 20 percentage points more likely to be men. Which … is a large difference.

The bottom line is that on observable dimensions, these groups have some very important differences. This makes me concerned about differences in characteristics we do not observe. Most of the controls we see are pretty coarse — two categories of education, some general measure of diet based on a small number of 24-hour recalls — and my main worry is that these do not capture anything like the full picture of individuals. To the extent that there are differences in unobserved features that are reflected in the differences in observed features, this could drive some of the results.

This concern is also fundamentally untestable. For me: I believe that observing these large differences across groups on variables we observe points to likely differences in variables we do not observe, which could generate bias. The authors may say they disagree; that while there are differences in observed variables, they don’t think there are important unobserved differences. Their argument becomes more persuasive as we observe more and more controls; mine becomes more as the differences on observable dimensions get larger. But we cannot be convinced for sure either way.

In the end, I find the idea that the findings in this paper are causal to be implausible for these two key reasons. What I cannot do — what no one can do — is prove that. Which may be why studies with this structure continue to be published, and people like me continue to complain.

A child measuring their height on the wall.

Nov 04 2022

3 min read

Will My Child Be Shorter If I Drink Caffeine in Pregnancy?

I read about a new study saying caffeine in pregnancy makes children shorter. True? —Anonymous Many of you read about Read more

Emily Oster
Pregnant person stands on a balcony while drinking a cup of coffee and smiling.

Feb 11 2022

2 min read

Another Bad Study on Caffeine in Pregnancy

This new paper on caffeine and brain scans didn’t get the play that baby metals did, but a number of Read more

Emily Oster
Person with a pregnant belly sits by the window drinking coffee in a cafe.

Aug 27 2020

4 min read

Scary but Bad Study on Caffeine in Pregnancy

In Expecting Better I review in detail the data on coffee in pregnancy, and discuss the evidence for links with Read more

Emily Oster
A close-up of a lime in a fizzing seltzer drink.

Aug 09 2023

2 min read

Should I Quit Drinking Seltzer?

Hi Emily, I’m newly postpartum, and my pelvic floor therapist suggested I stop drinking so much seltzer because it’s a Read more

Sara Reardon

Instagram

left right
I hear from many of you that the information on ParentData makes you feel seen. Wherever you are on your journey, it’s always helpful to know you’re not alone. 

Drop an emoji in the comments that best describes your pregnancy or parenting searches lately… 💤🚽🍻🎒💩

I hear from many of you that the information on ParentData makes you feel seen. Wherever you are on your journey, it’s always helpful to know you’re not alone.

Drop an emoji in the comments that best describes your pregnancy or parenting searches lately… 💤🚽🍻🎒💩
...

Milestones. We celebrate them in pregnancy, in parenting, and they’re a fun thing to celebrate at work too. Just a couple years ago I couldn’t have foreseen what this community would grow into. Today, there are over 400,000 of you here—asking questions, making others feel seen wherever they may be in their journey, and sharing information that supports data > panic. 

It has been a busy summer for the team at ParentData. I’d love to take a moment here to celebrate the 400k milestone. As I’ve said before, it’s more important than ever to put good data in the hands of parents. 

Share this post with a friend who could use a little more data, and a little less parenting overwhelm. 

📷 Me and my oldest, collaborating on “Expecting Better”

Milestones. We celebrate them in pregnancy, in parenting, and they’re a fun thing to celebrate at work too. Just a couple years ago I couldn’t have foreseen what this community would grow into. Today, there are over 400,000 of you here—asking questions, making others feel seen wherever they may be in their journey, and sharing information that supports data > panic.

It has been a busy summer for the team at ParentData. I’d love to take a moment here to celebrate the 400k milestone. As I’ve said before, it’s more important than ever to put good data in the hands of parents.

Share this post with a friend who could use a little more data, and a little less parenting overwhelm.

📷 Me and my oldest, collaborating on “Expecting Better”
...

I spend a lot of time talking people down after they read the latest panic headline. In most cases, these articles create an unnecessary amount of stress around pregnancy and parenting. This is my pro tip for understanding whether the risk presented is something you should really be worrying about.

Comment “link” for an article with other tools to help you navigate risk and uncertainty.

#emilyoster #parentdata #riskmanagement #parentstruggles #parentingstruggles

I spend a lot of time talking people down after they read the latest panic headline. In most cases, these articles create an unnecessary amount of stress around pregnancy and parenting. This is my pro tip for understanding whether the risk presented is something you should really be worrying about.

Comment “link” for an article with other tools to help you navigate risk and uncertainty.

#emilyoster #parentdata #riskmanagement #parentstruggles #parentingstruggles
...

Here’s why I think you don’t have to throw away your baby bottles.

Here’s why I think you don’t have to throw away your baby bottles. ...

Drop your toddlers favorite thing right now in the comments—then grab some popcorn.

Original thread source: Reddit @croc_docs

Drop your toddlers favorite thing right now in the comments—then grab some popcorn.

Original thread source: Reddit @croc_docs
...

Just keep wiping.

Just keep wiping. ...

Dr. Gillian Goddard sums up what she learned from the Hot Flash  S e x  Survey! Here are some key data takeaways:

🌶️ Among respondents, the most common s e x u a l frequency was 1 to 2 times per month, followed closely by 1 to 2 times per week
🌶️ 37% have found their sweet spot and are happy with the frequency of s e x they are having
🌶️ About 64% of respondents were very or somewhat satisfied with the quality of the s e x they are having

Do any of these findings surprise you? Let us know in the comments!

#hotflash #intimacy #midlifepleasure #parentdata #relationships

Dr. Gillian Goddard sums up what she learned from the Hot Flash S e x Survey! Here are some key data takeaways:

🌶️ Among respondents, the most common s e x u a l frequency was 1 to 2 times per month, followed closely by 1 to 2 times per week
🌶️ 37% have found their sweet spot and are happy with the frequency of s e x they are having
🌶️ About 64% of respondents were very or somewhat satisfied with the quality of the s e x they are having

Do any of these findings surprise you? Let us know in the comments!

#hotflash #intimacy #midlifepleasure #parentdata #relationships
...

Should your kid be in a car seat on the plane? The AAP recommends that you put kids under 40 pounds into a car seat on airplanes. However, airlines don’t require car seats.

Here’s what we know from a data standpoint:
✈️ The risk of injury to a child on a plane without a carseat is very small (about 1 in 250,000)
✈️ A JAMA Pediatrics paper estimates about 0.4 child air crash deaths per year might be prevented in the U.S. with car seats 
✈️ Cars are far more dangerous than airplanes! The same JAMA paper suggests that if 5% to 10% of families switched to driving, then we would expect more total deaths as a result of this policy. 

If you want to buy a seat for your lap infant, or bring a car seat for an older child, by all means do so! But the additional protection based on the numbers is extremely small.

#parentdata #emilyoster #flyingwithkids #flyingwithbaby #carseats #carseatsafety

Should your kid be in a car seat on the plane? The AAP recommends that you put kids under 40 pounds into a car seat on airplanes. However, airlines don’t require car seats.

Here’s what we know from a data standpoint:
✈️ The risk of injury to a child on a plane without a carseat is very small (about 1 in 250,000)
✈️ A JAMA Pediatrics paper estimates about 0.4 child air crash deaths per year might be prevented in the U.S. with car seats
✈️ Cars are far more dangerous than airplanes! The same JAMA paper suggests that if 5% to 10% of families switched to driving, then we would expect more total deaths as a result of this policy.

If you want to buy a seat for your lap infant, or bring a car seat for an older child, by all means do so! But the additional protection based on the numbers is extremely small.

#parentdata #emilyoster #flyingwithkids #flyingwithbaby #carseats #carseatsafety
...

SLEEP DATA 💤 PART 2: Let’s talk about naps. Comment “Link” for an article on what we learned about daytime sleep!

The first three months of life are a chaotic combination of irregular napping, many naps, and a few brave or lucky souls who appear to have already arrived at a two-to-three nap schedule. Over the next few months, the naps consolidate to three and then to two. By the 10-to-12-month period, a very large share of kids are napping a consistent two naps per day. Over the period between 12 and 18 months, this shifts toward one nap. And then sometime in the range of 3 to 5 years, naps are dropped. What I think is perhaps most useful about this graph is it gives a lot of color to the average napping ages that we often hear. 

Note: Survey data came from the ParentData audience and users of the Nanit sleep monitor system. Both audiences skew higher-education and higher-income than the average, and mostly have younger children. The final sample is 14,919 children. For more insights on our respondents, read the full article.

SLEEP DATA 💤 PART 2: Let’s talk about naps. Comment “Link” for an article on what we learned about daytime sleep!

The first three months of life are a chaotic combination of irregular napping, many naps, and a few brave or lucky souls who appear to have already arrived at a two-to-three nap schedule. Over the next few months, the naps consolidate to three and then to two. By the 10-to-12-month period, a very large share of kids are napping a consistent two naps per day. Over the period between 12 and 18 months, this shifts toward one nap. And then sometime in the range of 3 to 5 years, naps are dropped. What I think is perhaps most useful about this graph is it gives a lot of color to the average napping ages that we often hear.

Note: Survey data came from the ParentData audience and users of the Nanit sleep monitor system. Both audiences skew higher-education and higher-income than the average, and mostly have younger children. The final sample is 14,919 children. For more insights on our respondents, read the full article.
...

Happy Father’s Day to the Fathers and Father figures in our ParentData community! 

Tag a Dad who this holiday may be tricky for. We’re sending you love. 💛

Happy Father’s Day to the Fathers and Father figures in our ParentData community!

Tag a Dad who this holiday may be tricky for. We’re sending you love. 💛
...

“Whilst googling things like ‘new dad sad’ and ‘why am I crying new dad,’ I came across an article written by a doctor who had trouble connecting with his second child. I read the symptoms and felt an odd sense of relief.” Today we’re bringing back an essay by Kevin Maguire of @newfatherhood about his experience with paternal postpartum depression. We need to demystify these issues in order to change things for the better. Comment “Link” for a DM to read his full essay.

#parentdata #postpartum #postpartumdepression #paternalmentalhealth #newparents #emilyoster

“Whilst googling things like ‘new dad sad’ and ‘why am I crying new dad,’ I came across an article written by a doctor who had trouble connecting with his second child. I read the symptoms and felt an odd sense of relief.” Today we’re bringing back an essay by Kevin Maguire of @newfatherhood about his experience with paternal postpartum depression. We need to demystify these issues in order to change things for the better. Comment “Link” for a DM to read his full essay.

#parentdata #postpartum #postpartumdepression #paternalmentalhealth #newparents #emilyoster
...

What does the data say about children who look more like one parent? Do they also inherit more character traits and mannerisms from that parent? Let’s talk about it 🔎

#emilyoster #parentdata #parentingcommunity #lookslikedaddy #lookslikemommy

What does the data say about children who look more like one parent? Do they also inherit more character traits and mannerisms from that parent? Let’s talk about it 🔎

#emilyoster #parentdata #parentingcommunity #lookslikedaddy #lookslikemommy
...

SLEEP DATA 💤 We asked you all about your kids’ sleep—and got nearly 15,000 survey responses to better understand kids’ sleep patterns. Comment “Link” for an article that breaks down our findings!

This graph shows sleeping location by age. You’ll notice that for the first three months, most kids are in their own sleeping location in a parent’s room. Then, over the first year, this switches toward their own room. As kids age, sharing a room with a sibling becomes more common. 

Head to the newsletter for more and stay tuned for part two next week on naps! 🌙

#parentdata #emilyoster #childsleep #babysleep #parentingcommunity

SLEEP DATA 💤 We asked you all about your kids’ sleep—and got nearly 15,000 survey responses to better understand kids’ sleep patterns. Comment “Link” for an article that breaks down our findings!

This graph shows sleeping location by age. You’ll notice that for the first three months, most kids are in their own sleeping location in a parent’s room. Then, over the first year, this switches toward their own room. As kids age, sharing a room with a sibling becomes more common.

Head to the newsletter for more and stay tuned for part two next week on naps! 🌙

#parentdata #emilyoster #childsleep #babysleep #parentingcommunity
...

Weekends are good for extra cups of ☕️ and listening to podcasts. I asked our team how they pod—most people said on walks or during chores. What about you?

Comment “Link” to subscribe to ParentData with Emily Oster, joined by some excellent guests.

#parentdata #parentdatapodcast #parentingpodcast #parentingtips #emilyoster

Weekends are good for extra cups of ☕️ and listening to podcasts. I asked our team how they pod—most people said on walks or during chores. What about you?

Comment “Link” to subscribe to ParentData with Emily Oster, joined by some excellent guests.

#parentdata #parentdatapodcast #parentingpodcast #parentingtips #emilyoster
...