Statistics Marathon & Questions (3 Viewers)

leehuan

Well-Known Member
Joined
May 31, 2014
Messages
5,805
Gender
Male
HSC
2015
Re: Statistics

Quick question regarding probability spaces:

 

Drongoski

Well-Known Member
Joined
Feb 22, 2009
Messages
4,255
Gender
Male
HSC
N/A
Re: Statistics

leehuan - what would you do if InteGrand decides to take a 6 month sabbatical?
 

leehuan

Well-Known Member
Joined
May 31, 2014
Messages
5,805
Gender
Male
HSC
2015
Re: Statistics

leehuan - what would you do if InteGrand decides to take a 6 month sabbatical?
I don't know; InteGrand can do as he wants. But why are you asking me this?
 
Last edited:

leehuan

Well-Known Member
Joined
May 31, 2014
Messages
5,805
Gender
Male
HSC
2015
Re: Statistics

I wasn't taught the hypergeometric distribution properly so can someone walk me through how to use it? Here's my question if it helps to refer to it.



A factory produces 80 items in a batch. To test if the batch is defective, an acceptance sampling scheme is adopted: a random sample of 10 items is selected, and if 2 or more items don’t meet customer specifications, the batch is considered defective.

If there are actually 11 defective items in the batch,

1i) What is the probability that 2 sampled items are defective?
ii) What is a general formula for x sampled items being defective?
 

Drongoski

Well-Known Member
Joined
Feb 22, 2009
Messages
4,255
Gender
Male
HSC
N/A
Re: Statistics

Doing this for the first time for a very very long long time. Not sure if correct.

 
Last edited:

leehuan

Well-Known Member
Joined
May 31, 2014
Messages
5,805
Gender
Male
HSC
2015
Re: Statistics

Doing this for the first time for a very very long long time. Not sure if correct.

I think the expressions are right.

But I'm not sure what the parameters mean either. This question is pretty much an example but for an arbitrary scenario how would I be able to tell what the parameters (N, K, n) actually meant?
 

InteGrand

Well-Known Member
Joined
Dec 11, 2014
Messages
6,109
Gender
Male
HSC
N/A
Re: Statistics

I think the expressions are right.

But I'm not sure what the parameters mean either. This question is pretty much an example but for an arbitrary scenario how would I be able to tell what the parameters (N, K, n) actually meant?
The parameters are as follows:

• N is the total population
• K is the number of "tagged" objects (defective objects in your example)
• n is the size of our sample.

The hypergeometric distribution pmf Drongoski wrote (in terms of the parameters N, K, n) then gives the probability that our sample has exactly k "tagged" (defective) objects present, under the assumption that we are sampling without replacement. This follows from basic combinatorics.
 

InteGrand

Well-Known Member
Joined
Dec 11, 2014
Messages
6,109
Gender
Male
HSC
N/A
Re: Statistics

The reason for terms like "population" and "tagged" is that one place this distribution comes up is in ecology when we tag some members of an animal population (like a fish population) and then later draw (without replacement) a random sample from the animal population and count how many are tagged. This can be used to try and estimate the total population for example (it is sometimes known as the "capture-recapture method", and you can read more about it here: https://en.wikipedia.org/wiki/Mark_and_recapture).
 

Flop21

Well-Known Member
Joined
May 12, 2013
Messages
2,807
Gender
Female
HSC
2015
Re: Statistics

Any tips for catching up on stats... like any resources you use?

Thanks I am behind lol.
 

leehuan

Well-Known Member
Joined
May 31, 2014
Messages
5,805
Gender
Male
HSC
2015
Re: Statistics

Can't deny it. I'm also a fair bit behind.

Been cramming a lot of the course pack tbh.
 

Flop21

Well-Known Member
Joined
May 12, 2013
Messages
2,807
Gender
Female
HSC
2015
Re: Statistics

Can't deny it. I'm also a fair bit behind.

Been cramming a lot of the course pack tbh.
what course pack? the questions that are on moodle or?
 

leehuan

Well-Known Member
Joined
May 31, 2014
Messages
5,805
Gender
Male
HSC
2015
Re: Statistics

what course pack? the questions that are on moodle or?
Course pack is just what I use to call the online 'textbook' thing. It's like what we had in first year except it's just not printed.
 

davidgoes4wce

Well-Known Member
Joined
Jun 29, 2014
Messages
1,877
Location
Sydney, New South Wales
Gender
Male
HSC
N/A
Re: University Statistics Discussion Marathon

Q1. If we have a data set where the median is significantly greater than the mean, which of the following is likely to be true?

A. The data is left skewed
B. There has been an error in data input
C. Categorical data is being treated as numeric data
D. The data is right skewed
 

davidgoes4wce

Well-Known Member
Joined
Jun 29, 2014
Messages
1,877
Location
Sydney, New South Wales
Gender
Male
HSC
N/A
Re: University Statistics Discussion Marathon

A national TV poll is run asking viewers to ring in regarding whether they think the
head of the Australian Bureau of Statistics should be sacked over the problems with the
census. 500,000 people ring in, with 83% of respondents claiming he should be sacked.
Which one of the following is correct?
(1 mark)

A. The biggest problem with this survey is that people under 18 may have responded.

B. We conclude that majority of Australians believe he should be sacked.

C. The sample size is large enough to overcome any doubts about the validity of this
sample.

D. The results are unreliable as they quite likely suffer from self-selection bias.
 
Last edited:

davidgoes4wce

Well-Known Member
Joined
Jun 29, 2014
Messages
1,877
Location
Sydney, New South Wales
Gender
Male
HSC
N/A
Re: University Statistics Discussion Marathon

Id rule out D straight away in that question.

This from Wikipedia:

"In statistics, self-selection bias arises in any situation in which individuals select themselves into a group, causing a biased sample with nonprobability sampling. "
 

Users Who Are Viewing This Thread (Users: 0, Guests: 3)

Top