Let's say I take a random sample from a full population of which I know average and median. Can I estimate the average and median for the sample from the average and median of the full population (without an explicit calculation)? I am assuming the median should be close, while the averages do not need to be close at all, depending on the distribution. In other words, if I select an hypothetical random element, the value for that element should be probabilistically closer to the median of the population rather than the average.
asked Jun 14, 2022 at 15:11 1,125 1 1 gold badge 6 6 silver badges 16 16 bronze badgesAs an example, consider a sample sized $5$ from a normal distribution with mean and median $0$ , and let's simulate that $10^5$ times using R. The sample median is closer to $0$ than the sample mean is in about $40\%$ of simulations, and the dispersion for the sample median (red) is wider than for the sample mean (blue):
set.seed(2022) samplem sims
Now consider a Laplace distribution (i.e. an exponential distribution with random signs). The sample median is closer to $0$ than the sample mean is in about $55\%$ of simulations, and the dispersion for the sample median (red) is narrower than for the sample mean (blue):
set.seed(2022) samplem sims