Original research

Netflix Hours as Study Hours: The Immersion Math

We already spend hundreds of hours a year watching TV. The research says that time, if it were understandable, could cover a real share of what fluency takes.

By Lachlan McRitchie, Founder, LingoBinge

Updated June 29, 2026

Learning a language is usually framed as finding extra time you do not have. But most of us already spend a startling number of hours in front of a screen. This study lines up how much TV people actually watch against how many hours it takes to learn a language, and against the research on whether watching helps at all. The numbers are larger, and smaller, than you might expect.

2.6 hrs
of TV a day for the average American, their single biggest leisure activity

BLS, 2025

~950 hrs
of TV a year per person, more than the FSI hours for an easy language

Derived from BLS, 2025

600 to 750
FSI classroom hours to professional working Spanish or French

US State Dept (FSI)

3,000
word families cover about 95% of the words on television

Webb and Rodgers, 2009

We watch an astonishing amount of television

The US Bureau of Labor Statistics runs the American Time Use Survey, a diary study of how people actually spend their days. In its 2025 results, watching TV was the single biggest leisure activity at 2.6 hours a day, about half of all leisure time. Over a year that is roughly 950 hours per person.

It is not just an American habit. Survey data put global daily TV viewing at about 3 hours 13 minutes, Indonesia at 2 hours 41 minutes, and total in-home video in the UK at around four and a half hours a day. The exact figures vary by method, but the headline is consistent: watching is one of the largest time commitments in modern life.

Average daily time watching TV, selected markets (methods differ by source).
CategoryMinutes per day
UK (broadcast)
2.4 hrs
United States
2.6 hrs
Indonesia
2.7 hrs
Global
3.2 hrs

Fluency takes fewer hours than people think

The US Foreign Service Institute has spent decades working out how long it takes its diplomats to learn languages. For an easy language like Spanish or French, it estimates about 600 to 750 classroom hours to reach professional working proficiency. Indonesian sits around 900 hours, and the hardest languages for English speakers, like Japanese and Arabic, around 2,200.

Conversational ability comes well before that professional bar. Cambridge English estimates roughly 350 to 400 guided hours to reach B1, the level where you can handle most everyday and travel situations, and 500 to 600 for B2. Those are approximations: the CEFR framework itself deliberately sets no fixed hours and defines levels by what you can do.

Approximate FSI classroom hours to professional working proficiency, by language.
CategoryHours
Spanish
~675
Indonesian
900
Russian
1,100
Japanese
2,200

The immersion math

Put the two numbers together. The average person watches about 950 hours of TV a year. The FSI bar for professional Spanish is 600 to 750 hours. On paper, a single year of normal watching exceeds the entire classroom requirement for an easy language.

Of course, passive watching in a language you do not understand teaches almost nothing, so the honest version uses a fraction. Suppose just one fifth of your watch time became real, understandable practice. That is about 190 hours a year. Against a conversational B1 (roughly 375 guided hours), that is more than half the way there in twelve months. Against the full FSI Spanish bar, it is about 28% a year, or the whole professional requirement in under four years, from time you are already spending.

~190 hrs
a year at just 20% conversion: more than half the way to a conversational B1, from TV time you already spend

LingoBinge analysis (inputs shown)

Does watching actually work? The research says yes, if you understand it

The catch in all of this is comprehension. Input only teaches you when you can understand most of it, which is why difficulty matching matters so much. The good news is that the research on understandable, captioned video is strong. A 2013 meta-analysis of 18 studies found that captions produce a large positive effect on both vocabulary learning and listening comprehension. Later work showed learners pick up vocabulary incidentally just from watching, with how often a word recurs being a key predictor of whether it sticks.

There is even a clean vocabulary target. Analysing a corpus of 88 television programmes, researchers found that knowing about 3,000 word families covers roughly 95% of the words on TV, and 7,000 covers about 98%. That is the entire case for learning words in frequency order: a few thousand of the right words unlock most of what you hear.

94 billion
hours watched on Netflix in just six months of 2024

Netflix engagement report, H2 2024

LingoBinge is built on this idea: it turns the shows you already watch into understandable practice, swapping the highest-frequency words first and saving them for spaced review.

Methodology and caveats

Watch-time figures come from the US Bureau of Labor Statistics American Time Use Survey (2025), a diary-based measure of primary activity, cross-checked against survey panels (DataReportal/GWI 2025, Ofcom 2025). The annual figure (about 950 hours) is 2.6 hours a day times 365 and is our arithmetic, not a BLS figure. Panel surveys count some concurrent and second-screen use, so they read higher than the BLS diary.

Language-hour benchmarks are the US Foreign Service Institute estimates to professional working proficiency, and Cambridge English guided-learning-hour estimates for CEFR levels. Both are approximations; the CEFR sets no fixed hours. FSI hours assume intensive instruction with daily self-study, so they describe a high bar under unusual conditions, not casual learning.

The immersion math uses an illustrative 20% conversion of watch time to genuine practice. That number is a deliberately conservative assumption, not a measured result. Passive watching is not the same as active study, and the gains depend entirely on the input being comprehensible. We do not attribute any per-member hours figure to Netflix; the 94 billion hours is the company's own reported total.

Sources

  1. US Bureau of Labor Statistics, American Time Use Survey (2025)
  2. US Department of State, Foreign Service Institute, Foreign Language Training
  3. Cambridge English, Guided Learning Hours
  4. Council of Europe, CEFR level descriptions
  5. Webb and Rodgers, Vocabulary demands of television programs, Language Learning (2009)
  6. Montero Perez et al., Captioned video for L2 listening and vocabulary, System (2013)
  7. Peters and Webb, Incidental vocabulary acquisition through viewing L2 television, SSLA (2018)
  8. Netflix, What We Watched: a Netflix engagement report (H2 2024)
  9. DataReportal, Digital 2025: TV trends and realities
  10. Ofcom, Media Nations UK 2025
  11. DataReportal, Digital 2024: Indonesia

This study and its data are free to cite and reuse under a Creative Commons Attribution 4.0 licence. Please link back to this page.

Frequently asked questions

How many hours does it take to learn a language?
The US Foreign Service Institute estimates about 600 to 750 hours of instruction for an easy language like Spanish, around 900 for Indonesian, and about 2,200 for a hard one like Japanese, to reach professional working proficiency. Conversational ability comes sooner, with Cambridge estimating roughly 350 to 400 guided hours to reach B1.
Can you really learn a language by watching TV?
The research supports it when the content is understandable. A 2013 meta-analysis found captioned video produces a large positive effect on vocabulary and listening, and later studies show learners absorb words incidentally just from viewing. The key is comprehension, which is why graded difficulty matters.
How much TV does the average person watch?
About 2.6 hours a day in the United States according to the 2025 American Time Use Survey, which is roughly 950 hours a year. Global and other-market surveys put the figure between about 2.4 and 4.5 hours a day depending on the country and how it is measured.

More research and references