What Is Statistics?
Medication dosages, IV drip rates, vital monitoring
Discounts, tax, tips, profit margins
Every time you check the weather forecast, read a nutrition label, or hear that a medication is “95% effective,” you are relying on statistics. Statistics is the foundation for making sense of data — and in a world that runs on data, understanding the basics gives you a real advantage in your career and daily decisions.
What Is Statistics?
Statistics is the branch of mathematics that deals with collecting, organizing, analyzing, and interpreting data. Its goal is to turn raw numbers into useful information that supports decision-making.
You encounter statistics constantly, even when you do not realize it:
- Weather forecasts — meteorologists analyze temperature, pressure, and humidity data from thousands of stations to predict a “70% chance of rain.”
- Medical research — clinical trials compare treatment groups to determine whether a new drug works better than a placebo.
- Sports analytics — a basketball player’s shooting percentage, rebounds per game, and plus/minus rating are all statistics that inform coaching decisions.
- Quality control — manufacturers sample products off the assembly line and use defect rates to decide whether an entire batch meets standards.
- Business — a retailer tracks average transaction size, customer return rates, and inventory turnover to make stocking decisions.
At its core, statistics answers a simple question: What does the data tell us, and how confident can we be in that answer?
Population vs Sample
Two of the most important terms in statistics are population and sample. Getting these right is the first step toward understanding everything else.
- Population — the entire group you want to study or draw conclusions about. It includes every individual, item, or measurement that fits your definition.
- Sample — a subset of the population that you actually observe or measure. You use the sample to make inferences about the larger population.
Why not just study the entire population? In most real-world situations, it is too expensive, too time-consuming, or physically impossible to measure every member. Sampling gives you a practical way to learn about a population without examining every single element.
Example 1: Hospital Patient Satisfaction
A regional hospital serves approximately 50,000 patients per year. The administration wants to know how satisfied patients are with their care.
- Population: All 50,000 patients treated at the hospital in a given year.
- Sample: 500 patients randomly selected to complete a satisfaction survey.
Surveying all 50,000 patients would require enormous staff time and cost. Instead, the hospital selects 500 patients at random. If the sample is representative — meaning it reflects the diversity of the full patient population in terms of age, department, and length of stay — the results from those 500 patients provide a reliable estimate of overall satisfaction.
Example 2: Quality Control in Manufacturing
A factory produces 10,000 light bulbs per day. To check quality, the factory tests a sample of bulbs for brightness and lifespan.
- Population: All 10,000 bulbs produced in one day.
- Sample: 200 bulbs pulled from the production line at regular intervals.
Here, testing is destructive — once you run a bulb until it burns out to measure its lifespan, you cannot sell it. Testing every bulb would leave nothing to ship. By testing 200 bulbs, the quality team can estimate the defect rate for the entire day’s production without destroying the inventory.
Population vs Sample at a Glance
| Feature | Population | Sample |
|---|---|---|
| Definition | The entire group of interest | A subset selected from the population |
| Size | Usually large (often impractical to measure) | Smaller and manageable |
| Purpose | What you want to learn about | What you actually measure |
| Data collection | Census (measure every member) | Survey or experiment (measure some members) |
| Cost and time | High | Lower |
| Example | All 50,000 hospital patients | 500 surveyed patients |
Parameters vs Statistics
Once you understand populations and samples, the next distinction is between the numbers that describe them.
- A parameter is a numerical measurement that describes a population. Parameters are usually unknown because you rarely measure the entire population.
- A statistic is a numerical measurement that describes a sample. Statistics are calculated from your collected data and used to estimate the corresponding parameter.
| Measure | Population (Parameter) | Sample (Statistic) |
|---|---|---|
| Mean | (mu) | (x-bar) |
| Standard deviation | (sigma) | |
| Size | ||
| Proportion | (p-hat) |
Key idea: You calculate a sample statistic (like ) as your best estimate of the population parameter (like ). The closer your sample represents the population, the better your estimate.
For example, if you measure the heights of 100 randomly selected adults in a city and calculate cm, you are using that sample mean to estimate , the true average height of all adults in the city.
Descriptive vs Inferential Statistics
Statistics as a field splits into two major branches: descriptive and inferential.
Descriptive statistics summarize and organize data you already have. They answer the question: What happened?
- Calculating an average test score for your class
- Creating a bar chart of monthly sales
- Reporting the range of temperatures recorded last week
- Finding the median household income in a county
Inferential statistics use sample data to make predictions or draw conclusions about a larger population. They answer the question: What can we conclude beyond our data?
- Using a poll of 1,200 voters to predict the outcome of a national election
- Testing whether a new drug lowers blood pressure more than the current standard
- Estimating the average commute time for all employees based on a survey of some employees
- Determining whether a difference in test scores between two groups is statistically significant or just due to chance
Example 3: Using Sample Data to Estimate a Population
A company with 2,000 employees wants to know the average commute time. Surveying everyone would disrupt operations, so the HR department randomly surveys 300 employees.
Results from the sample:
- Sample size:
- Sample mean commute time: minutes
- Sample standard deviation: minutes
Descriptive statistics tell us about these 300 employees: their average commute is 27 minutes, with most commutes falling between 15 and 39 minutes.
Inferential statistics let us go further: based on this sample, we estimate that the true mean commute time for all 2,000 employees is approximately 27 minutes. A 95% confidence interval might be minutes, meaning we are 95% confident the true average is between 25.6 and 28.4 minutes.
This is the power of inferential statistics — a well-chosen sample of 300 gives reliable information about a population of 2,000 without surveying everyone.
Descriptive vs Inferential at a Glance
| Feature | Descriptive | Inferential |
|---|---|---|
| Purpose | Summarize collected data | Draw conclusions about a larger population |
| Scope | Only the data in front of you | Beyond the data — generalizing to a population |
| Tools | Mean, median, mode, charts, tables | Confidence intervals, hypothesis tests, regression |
| Question answered | ”What happened?" | "What can we conclude?” |
| Example | ”The average score was 82." | "We are 95% confident the true average is between 80 and 84.” |
Real-World Application: Nursing — Understanding Patient Data
Suppose you are a nurse on a medical-surgical ward with 20 patients. You record the systolic blood pressure (BP) for each patient during morning rounds:
Descriptive statistics for your ward:
- Mean BP: Add all 20 values and divide by 20.
- Range: Highest value minus lowest value = mmHg.
- Summary: The 20 patients have an average systolic BP of 134.75 mmHg, with readings ranging from 119 to 150.
Inferential statistics let you take the next step:
- These 20 patients are a sample of the broader population of patients who pass through this ward.
- Based on your sample mean of , you can estimate , the average systolic BP for all patients admitted to this type of ward.
- If the hospital standard for this ward’s patient population is mmHg, your sample suggests that current patients may be running slightly higher. A formal hypothesis test would determine whether the difference (134.75 vs 130) is statistically significant or could be due to random variation.
This kind of reasoning — using the data you have to draw conclusions about a bigger picture — is what statistics is all about.
Practice Problems
Test your understanding with these problems. Click to reveal each answer.
Problem 1: A school district wants to know the average reading level of all 8,000 third-graders. They test 400 students selected at random. Identify the population and the sample.
- Population: All 8,000 third-graders in the school district.
- Sample: The 400 students who were tested.
Answer: The population is all 8,000 third-graders; the sample is the 400 tested students.
Problem 2: A researcher reports that the average cholesterol level in a sample of 250 adults is mg/dL. Is 198 a parameter or a statistic?
The value 198 was calculated from a sample of 250 adults, not from the entire population of all adults.
Answer: It is a statistic (a sample mean, denoted ). The corresponding population parameter would be , the true mean cholesterol for all adults.
Problem 3: A city reports: “The median home price in our city last year was $285,000.” Is this descriptive or inferential statistics?
The city is summarizing actual data — the prices of homes that were sold last year. It is not using a sample to predict something about a larger group.
Answer: This is descriptive statistics. It summarizes data that was already collected (all home sales in the city last year).
Problem 4: A polling firm surveys 1,500 registered voters and reports: “We estimate that 54% of all voters support the ballot measure, with a margin of error of ±3%.” Is this descriptive or inferential?
The firm is using data from 1,500 voters (a sample) to make a prediction about all registered voters (the population). The margin of error quantifies the uncertainty of that estimate.
Answer: This is inferential statistics. The sample result (54%) is being used to estimate a population proportion, and the margin of error reflects the uncertainty involved in generalizing from a sample.
Problem 5: A factory tests 150 batteries from a production run of 5,000. The sample mean lifespan is hours with hours. Identify: (a) the population, (b) the sample, (c) the parameter of interest, and (d) the statistic used to estimate it.
- (a) Population: All 5,000 batteries in the production run.
- (b) Sample: The 150 batteries that were tested.
- (c) Parameter of interest: , the true mean lifespan of all 5,000 batteries.
- (d) Statistic: hours, the sample mean lifespan, used as an estimate of .
Answer: The sample of 150 batteries (with hours) is used to estimate , the true mean lifespan for the entire production run of 5,000.
Key Takeaways
- Statistics is the science of collecting, organizing, analyzing, and interpreting data to support decisions.
- A population is the entire group you want to study; a sample is the subset you actually measure.
- Parameters (like and ) describe populations; statistics (like and ) describe samples and estimate parameters.
- Descriptive statistics summarize the data you have — averages, charts, and tables.
- Inferential statistics use sample data to draw conclusions about a larger population — confidence intervals, hypothesis tests, and predictions.
- Sampling is necessary because measuring every member of a population is usually too costly, too slow, or physically impossible.
- The quality of your conclusions depends on having a representative sample — one that accurately reflects the population’s characteristics.
Return to Statistics for more topics in this section.
Next Up in Statistics
All Statistics topicsLast updated: March 28, 2026