Subject:

## Frequency distribution and histogram pdf

By using our site, you acknowledge that you have read and understand our Cookie Policy , Privacy Policy , and our Terms of Service. If we want to visibly see the distribution of a continuous data, which one among histogram and pdf should be used?

The red line is the empirical density estimate, the blue line is the theoretical pdf of the underlying normal distribution. Note that the histogram is expressed in densities and not in frequencies here.

This is done for plotting purposes, in general frequencies are used in histograms. So to answer your question : you use the empirical distribution i.

A pdf, on the other hand, is a closed-form expression for a given distribution. That is different from describing your dataset with an estimated density or histogram. There's no hard and fast rule here. If you know the density of your population, then a PDF is better.

On the other hand, often we deal with samples and a histogram might convey some information that an estimated density covers up. For example, Andrew Gelman makes this point:.

Variations on the histogram.

A key benefit of a histogram is that, as a plot of raw data, it contains the seeds of its own error assessment. Or, to put it another way, the jaggedness of a slightly undersmoothed histogram performs a useful service by visually indicating sampling variability.

## Your Answer

That's why, if you look at the histograms in my books and published articles, I just about always use lots of bins. I also almost never like those kernel density estimates that people sometimes use to display one-dimensional distributions.

I'd rather see the histogram and know where the data are. Home Questions Tags Users Unanswered. Difference between histogram and pdf? Ask Question. Asked 9 years, 4 months ago.

## Making Frequency Distributions and Histograms by Hand

Active 4 years, 6 months ago. Viewed 48k times. What are the differences, not formula wise, between histogram and pdf? By definition, a pdf describes a theoretical probability distribution. Do you perhaps mean the edf empirical distribution function?

## Steps to Making Your Frequency Distribution

You could construct the following plot: The red line is the empirical density estimate, the blue line is the theoretical pdf of the underlying normal distribution. Joris Meys Joris Meys 5, 2 2 gold badges 27 27 silver badges 43 43 bronze badges. All frequencies summed equals the number of observations. Density is short for PDF probability density function , which is a proxy for the probability of having a certain value.

The area under the PDF sums to 1. A density estimate is an alternative.

## What to study next

These days we use both, and there is a rich literature about which defaults one should use. Dirk Eddelbuettel Dirk Eddelbuettel 8, 2 2 gold badges 25 25 silver badges 42 42 bronze badges. It will not necessarily "fit" the data. Now, there exist several kind of non-parametric density estimates, where you only use the data at hand plus some kernel specifications or window span, etc.

For example, Andrew Gelman makes this point: Variations on the histogram A key benefit of a histogram is that, as a plot of raw data, it contains the seeds of its own error assessment. But on the narrower comparison of histogram v.

## Use Excel 2016 to make Frequency distribution and Histogram for quantitative data

But, does this approach hold for simulations, in which case we are actually trying to estimate a density? Harsha Manjunath Harsha Manjunath 3 3 bronze badges. Sign up or log in Sign up using Google.

Sign up using Facebook. Sign up using Email and Password.

Post as a guest Name. Email Required, but never shown.

## Subscribe to RSS

This week, StackOverflowKnows syntactic sugar, overfit or nah, and the…. Featured on Meta. Thank you, Robert Cartaino. Change in roles for Jon Ericson leaving SE.

Reinstate Monica. Linked 0. Related 5. Hot Network Questions. Question feed. Cross Validated works best with JavaScript enabled.