Sample-quartile I don't know : Is there a sample such that

Nylah Hendrix 2022-07-14 Answered
Sample-quartile
I don't know : Is there a sample such that the mean does not lie between the lower and upper quartile? Is there a sample such that the median does not lie between the lower and upper quartile?
You can still ask an expert for help

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

Solve your problem for the price of one coffee

  • Available 24/7
  • Math expert for every subject
  • Pay only if we can solve it
Ask Question

Answers (1)

Tristin Case
Answered 2022-07-15 Author has 15 answers
"Is there a sample such that the mean does not lie between the lower and upper quartile?"
One such sample is { 1 , 2 , 3 , 4 , 5 , 120 }, which has quartiles Q 1 = 2 and Q 3 = 5, but a mean of 22.5. Note that the mean of a data set is heavily influenced by outliers while the quartiles are generally resistant to extreme values.
"Is there a sample such that the median does not lie between the lower and upper quartile?"
Consider the definitions of the median and quartiles. The first quartile of a data set separates the bottom 25 % of observations from the top 75 %. Similarly, the third quartiles separates the bottom 75 % of observations from the top 25 %. As noted in the comments, the median is simply the "second quartile" of a data set and separates the bottom 50 % from the top 50 %. If there was a data set that had a median less than the first quartile, then the observation at the 50th percentile would be less than an observation at the 25th percentiles, which makes no sense. Similar logic shows that the median cannot be greater than the third quartile.
This can be shown more rigorously by providing a formal definition for the median of a data set. We can define median x ~ of a set of n observations by
x ~ = 1 2 ( a n / 2 + a ( n / 2 ) + 1 )
where a i is the ith element of the order sequence of observations and : R Z is the ceiling function. The first quartiles is median of the sequence a 1 , . . . , a n / 2 and the third quartiles is the median of the sequence a ( n / 2 ) + 1 , . . . , a n . Hence, we have the ordering Q 1 x ~ Q 3 , meaning that the median can equal the first or third quartiles, but cannot be less than the first nor more than the third.

We have step-by-step solutions for your answer!

Expert Community at Your Service

  • Live experts 24/7
  • Questions are typically answered in as fast as 30 minutes
  • Personalized clear answers
Learn more

You might be interested in

asked 2022-05-09
Calculating the position of the median
I know this is a simple question, but I cannot find a straight answer anywhere.
When calculating the medium of listed data, the formula is (n+1)/2. My Statistics teacher said for grouped data the position of the medium is n/2. However, this seems contradictory as for discrete grouped data, the data could be written in a list if the original values were known. Therefore two different values for the median are found.
I know this question is probably going to get flagged as it has already been asked, however, it has never been answered. I find it frustrating that such a fundamental concept in Statistics, what is supposed to be precise and never subjective has a wishy-washy answer.
Rant over, I think the position should be (n+1)/2 for discrete data and n/2 for continuous data. Moreover, this then leads into the obvious question of how we should calculate quartiles as well.
It would be nice if everyone could agree on a certain method for calculating the medium as all my textbooks are saying different things.
Someone please dispell the confusion and this statistical mess.
asked 2022-05-26
Hypothesis Testing help
Really have no idea where to start :(
In an experiment comparing two weight-loss regimes A and B 20 test subjects were matched into 10 pairs so that within each pair the subjects were as similar as possible. Then A was randomly allocated to one of the subjects in each pair, and then B allocated to the other. The number of kilograms lost for each person is obtained and then the ordered A−B differences for each pair (in kg) are given below in the object d, together with some summary statistics:
sort(d) [1] -0.9 -0.3 0.2 0.4 0.6 1.2 1.4 3.3 3.5 4.3
mean(d) [1] 1.37
sd(d) [1] 1.755025
By specifying and checking (with a boxplot) an appropriate normality assumption perform a formal hypothesis test of H0:“regimes the same” against H1:“regimes not the same”.
Any help is appreciated!
asked 2022-06-25
How do I calculate the quartiles for this problem?
I have the following list of numbers, and I'm trying to calculate the quartiles:
2, 4, 4, 5, 7, 7, 7, 7, 7, 7, 8, 8, 9, 9, 9, 9
I'm running into a bit of confusion because the median turns out to be 7. Now I don't know how to demarcate the lower group from the upper group since there are a whole bunch of 7's. Consequently, I don't know how to calculate Q 1 and Q 3
asked 2022-06-21
How to calculate lower & upper quartiles?
I'm sure this has been asked many times before but it's confusing me a lot so hopefully someone can help!
I am given this data set:
0.28 0.30 0.42 0.59 0.71 1.67 1.82 2.39 4.71 4.79 4.89 5.00 5.00 5.76 6.09 6.17 6.81 7.12 7.44 8.05 8.59 8.86 9.04 9.78
There are 24 numbers here. I need to find the lower & upper quartile. I figured that the median is 5.00
And looking at the answer sheet the Lower Quartile is 1.745 and the Upper quartile is 7.28
My question is how do i get to these answers!?
Thank you in advance.
asked 2022-05-02
Can anyone help me with statistics?
Question: Following is the distribution of marks obtained by 500 candidates in the statistics paper of a civil services examination:
+ marks more than: 0 10 20 30 40 50
+ number of candidates: 500 460 400 200 100 30
Calculate the lower quartile marks. If 70 % of the candidates pass in the paper, find the minimum marks obtained by a passing candidate.
In the above problem, I am able to find the first part to calculate lower quartile marks but not able to find the second section.
Anyone, please help I am a newbie to statistics.
Thanks in advance, please ignore the bad English.
asked 2022-06-19
Identifying the k points in 2D geographic space which are 'most distant' from each other
I have a set of DNA samples from Y plants in a given geographic area. I'm going to be doing DNA sequencing on individuals in this population (and a number of other, separate populations), however due to financial constraints I'm unable to perform sequencing on all Y individuals. I've decided that I can afford to sequence k (out of Y) in each separate area.
I'd like to select the k samples which are farthest apart/most geographically distributed within the Y samples I collected. Samples that are taken from plants in close proximity to each other are more likely to be closely related, and I'd like to sequence what are ultimately the most genetically diverse samples for my later analysis.
So, the question is: given a set of Y points/samples in 2d geographic space (lat/long coordinates), how do I select the k points that are most geographically distributed/distant from each other?
As I've explored this a bit, I think the problem I'm really having is how to define 'distance' or 'most distributed'. Some of the metrics I've though of (e.g. maximum average distance between the k points) result in really unintuitive point selection in certain cases (for example, if two points are right next to each other but far away from another cluster of all the other points, the two points will be included even if they're almost on top of each other).
I have a feeling that this is not an uncommon problem, and there must be good answers out there. I'll add that I have access to a large cluster and I can brute force the problem to some extent.
I'd appreciate any and all advice or discussion; this is an interesting theoretical topic to me.
asked 2022-07-07
Boxplot: whiskers and outliers doubt
I have a doubt on boxplot.
I'll expose my knowledge and then my doubt.
x = { x 1 , x 2 . . . x n }: the set of samples
q 1 , q 3 : the first and third quartiles
w l , w u : the lower and upper whiskers
I Q R = q 3 q 1
box extends from q 1 to q 3
w l = m a x ( m i n ( x ) , q 1 1.5 I Q R )
w u = m i n ( m a x ( x ) , q 3 + 1.5 I Q R )
o u t l i e r s = { x i x | x i < w l x i > w u }
Observations:
whiskers' distance from box are not symmetric ( w l = m i n ( x ) w u = m a x ( x ) )
w u q 3 < q 1 w l x i : x i o u t l i e r s x i > w u
w u q 3 > q 1 w l x i : x i o u t l i e r s x i < w l
My doubt: if all what I exposed is correct, how do you explain the presence of outliers in this speed of light boxplot (third experiment, lower outliers) and in this plot (see wednesday, lower outliers)?
In the case my reasoning is wrong, please provide a simple numeric counterexample.

New questions