















A comprehensive list of questions and answers for the AP Statistics exam, covering topics such as five number summary, z score, standard deviation, categorical and quantitative data, parameter, sample, bias, nonresponse, voluntary response bias, histogram, box plot, lurking variable, mean, median, mode, range, minimum, margin of error, statistical normal, simple random sample, stratified random sample, systematic sample, placebo, Type I error, Type II error, joint frequency, conditional probability, sample space, confounded variable, marginal frequency, coefficient of determination, law of large numbers, extrapolation, snowball, confidence interval, residual, convenience sample, two way table, spread, shape, discrete random variable, central limit theorem, standardized value, mutually exclusive, wording bias, causation, z test, t test, chi squared goodness of fit, stem and leaf display, multimodal, se, r2, leverage, influential point, census, prospective study, and statistic factor.
Typology: Exams
5 number summary - Answer>> The minimum value, lower quartile, median, upper quartile, and maximum value of a data set. These five numbers help describe the center, spread, and shape of the distribution and are used to make box plots.
z score - Answer>> The number of standard deviations a score lies above or below the mean (positive above, negative below).
standard deviation - Answer>> A measure of spread: the typical distance of the data points from the mean.
population - Answer>> (statistics) The entire aggregation of items from which samples can be drawn; what the sample in an experiment or study usually represents.
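The z score and standard deviation entries above can be made concrete with a short calculation. This is a minimal sketch, not a prescribed method: the data values and the helper name `z_score` are made up for illustration, and the sample standard deviation (dividing by n - 1) is assumed.

```python
import statistics


def z_score(value, mean, sd):
    """Return how many standard deviations `value` lies above (+) or below (-) the mean."""
    return (value - mean) / sd


# Hypothetical exam scores, used only for illustration.
scores = [70, 75, 80, 85, 90]

mean = statistics.mean(scores)   # center of the data
sd = statistics.stdev(scores)    # sample standard deviation (divides by n - 1)
z = z_score(90, mean, sd)        # positive, because 90 is above the mean

print(f"mean = {mean}, sd = {sd:.4f}, z(90) = {z:.4f}")
```

A score equal to the mean would give a z score of 0, and a score below the mean would give a negative value.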
categorical data - Answer>> Labels or names used to identify categories of like items. For example, "gender" is a categorical variable with categories such as "male" and "female". If you asked people in which month they were born or what their favorite class is, they would answer with names, which are categorical data. If you asked them how many siblings they have, they would answer with numbers, not categories.
quantitative data - Answer>> Numerical information describing how much, how little, how big, how tall, how fast, etc.; data that can be analyzed with mathematical models and statistical techniques. Age is quantitative.
bar graph - Answer>> A graph that uses the lengths of horizontal or vertical bars to represent and compare data in categories.
Undercoverage - Answer>> Bias that arises when some groups in the population are left out of the process of choosing the sample, so that part of the population gets less representation in the sample than it has in the population.
nonresponse - Answer>> Bias introduced to a sample when a large fraction of those sampled fail to respond.
voluntary response bias - Answer>> Bias introduced to a sample when individuals can choose on their own whether to participate in the sample.
statistic - Answer>> A number computed from sample data. (Statistics, the field, is the application of mathematics to describing and analyzing data.)
independent - Answer>> (statistics) A variable whose values are independent of changes in the values of other variables.
histogram - Answer>> A graphical representation of a frequency distribution using vertical bars; the bars touch each other because they cover adjacent intervals of a quantitative variable.
box plot - Answer>> A display that shows the distribution of values in a data set separated into four equal-sized groups. A box plot is constructed from the five number summary of the data.
scatterplot - Answer>> A graphed cluster of dots, each of which represents the values of two variables. The slope of the points suggests the direction of the relationship between the two variables. The amount of scatter suggests the strength of the correlation (little scatter indicates high correlation).
correlation - Answer>> A measure of the extent to which two factors vary together, and thus of how well either factor predicts the other. The correlation coefficient is the mathematical expression of the relationship, ranging from -1 to +1.
skewness - Answer>> The extent to which cases are clustered more at one or the other end of the distribution of a quantitative variable rather than in a symmetric pattern around its center.
variance - Answer>> A common measure of spread about the mean as center.
statistical significance - Answer>> A statistical statement of how likely it is that an obtained result occurred by chance; the condition that exists when the probability that the observed findings are due to chance is very low.
example, if a bag contains a red marble, a white marble and a blue marble then the probability of selecting a red marble is 1/3.
descriptive statistics - Answer>> Mathematical procedures for organizing collections of data, such as determining the mean, the median, the range, the variance, and the correlation coefficient.
mean - Answer>> A measure of center in a set of numerical data, computed by adding the values in a list and then dividing by the number of values in the list.
median - Answer>> A measure of center in a set of numerical data. The median of a list of values is the value appearing at the center of a sorted version of the list, or the mean of the two central values if the list contains an even number of values.
mode - Answer>> A measure of central tendency that uses the most frequently occurring score.
range - Answer>> The distance between the highest and lowest scores in a set of data.
data - Answer>> Facts and statistics collected together for reference or analysis.
Q1 - Answer>> A location measure such that one fourth (25%) of the data are smaller than it. Found by dividing the ordered data set in half (excluding the middle observation if n is odd) and finding the median of the lower half of the data.
Q3 - Answer>> A location measure such that three fourths (75%) of the data are smaller than it. Found like Q1, but as the median of the upper half of the ordered data.
minimum - Answer>> (n.) The smallest possible amount; (adj.) the lowest permissible or possible.
outlier - Answer>> A value much greater or much less than the others in a data set.
margin of error - Answer>> In statistical research, the range of outcomes we expect for a population, given the data revealed by a sample drawn from that population.
statistical normal - Answer>> Scoring in the middle of the bell curve; low, moderate, or high scoring.
simple random sample - Answer>> A sample selected in such a way that every element in the population or sampling frame has an equal probability of being chosen. More precisely, a sample of size n selected so that each possible sample of size n has an equal chance of being selected.
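The Q1 entry above describes a specific procedure: split the ordered data in half, leave out the middle observation when n is odd, and take the median of each half. A minimal sketch of that procedure follows. The function name `five_number_summary` and the data are illustrative assumptions, the input is assumed to hold at least two values, and other software may use a different quartile convention and give slightly different results.

```python
import statistics


def five_number_summary(data):
    """Return (min, Q1, median, Q3, max) using the median-of-halves method."""
    xs = sorted(data)
    n = len(xs)
    half = n // 2
    lower = xs[:half]                                # lower half of the ordered data
    upper = xs[half + 1:] if n % 2 else xs[half:]    # skip the middle value when n is odd
    return (
        xs[0],
        statistics.median(lower),   # Q1: median of the lower half
        statistics.median(xs),      # median of all the data
        statistics.median(upper),   # Q3: median of the upper half
        xs[-1],
    )


# Hypothetical data set with an odd number of values.
summary = five_number_summary([1, 3, 4, 7, 8, 10, 15])
print(summary)  # (1, 3, 7, 10, 15)
```

These five values are exactly what a box plot draws: the box runs from Q1 to Q3 with a line at the median.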
cluster sample - Answer>> A sample obtained by selecting all individuals within a randomly selected collection or group of individuals.
10% rule - Answer>> When sampling without replacement, the sample has to be less than 10% of the whole population.
Interpolation - Answer>> The estimation of an unknown number between known numbers. Interpolation is a way of approximating price or yield using bond tables that do not give the net yield on every amount invested at every rate of interest and for every maturity.
Qualitative - Answer>> Data in the form of recorded descriptions rather than numerical measurements.
theoretical probability - Answer>> A probability obtained by analyzing a situation. If all of the outcomes are equally likely, you can find the theoretical probability of an event by listing all of the possible outcomes and then finding the ratio of the number of outcomes producing the desired event to the total number of outcomes. For example, there are 36 possible equally likely outcomes (number pairs) when two fair number cubes are rolled. Of these, six have a sum of 7, so the probability of rolling a sum of 7 is 6/36 or 1/6.
experimental probability - Answer>>
block design - Answer>> The subjects in an experiment are first divided into groups (called "blocks") based on some common characteristic (such as gender) that is hypothesised to have an effect on the response. Randomization of treatments then happens within each block (each block is like its own mini-experiment).
blinding - Answer>> The practice of concealing group assignment from study subjects, investigators, and/or those who assess subject outcomes, typically in the context of a randomized controlled trial. For example, study subjects may receive capsules with identical appearance and taste; however, the treatment group receives the active drug, whereas the control group receives the placebo.
double blind - Answer>> An experiment in which neither the subjects nor the people who work with them know which treatment each subject is receiving.
placebo - Answer>> A fake treatment. A chemically inert substance that can produce real medical benefits because the patient believes it will help.
least squares regression line - Answer>> The line with the smallest sum of squared residuals.
type I error - Answer>> An error that occurs when a researcher concludes that the independent variable had
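The two-number-cube example in the theoretical probability entry above can be checked by listing every outcome. This is a small sketch of that counting argument; the variable names are illustrative only.

```python
from fractions import Fraction
from itertools import product

# All ordered pairs (first cube, second cube); each pair is equally likely.
outcomes = list(product(range(1, 7), repeat=2))

# Outcomes that produce the desired event: a sum of 7.
favorable = [pair for pair in outcomes if sum(pair) == 7]

# Theoretical probability = favorable outcomes / total outcomes.
p_seven = Fraction(len(favorable), len(outcomes))

print(len(outcomes), len(favorable), p_seven)  # 36 6 1/6
```

Using `Fraction` keeps the answer exact, so 6/36 is reduced to 1/6 rather than shown as a rounded decimal.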
marginal frequency - Answer>> The total count for one row or one column of a two-way table, written in the margins of the table.
coefficient of determination - Answer>> The statistic determined by squaring the correlation coefficient. Represents the proportion of variance accounted for by that correlation.
binomial - Answer>> Describes a setting with a fixed number of independent trials, each with two possible outcomes and the same probability of success.
unimodal - Answer>> Having one mode; a useful term for describing the shape of a histogram when it is generally mound-shaped. A normal distribution, for example, has only one mode.
bimodal - Answer>> A distribution with two modes, that is, two values or categories with more cases than the other categories.
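The coefficient of determination entry above says to square the correlation coefficient. A minimal sketch of that calculation is given below; the data and the helper name `correlation` are hypothetical, and the formula used is the usual sum-of-products form of the correlation coefficient.

```python
import math


def correlation(xs, ys):
    """Return the correlation coefficient r between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired data, for illustration only.
hours = [1, 2, 3, 4, 5]
points = [2, 4, 5, 4, 5]

r = correlation(hours, points)
r_squared = r ** 2   # coefficient of determination

print(f"r = {r:.4f}, r^2 = {r_squared:.2f}")
```

Here r squared is about 0.60, which reads as "about 60% of the variation in `points` is accounted for by the linear relationship with `hours`".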
experiment - Answer>> A kind of research in which the researcher controls the conditions and directly manipulates the independent variable in order to test a hypothesis.
law of large numbers - Answer>> (statistics) A law stating that a large number of items taken at random from a population will (on the average) have the population statistics.
extrapolation - Answer>> Calculation of the value of a function outside the range of known values.
snowball - Answer>> Huyen wanted to conduct market research to find out why students were unhappy with Marketing 431, probably the finest course ever to be offered by a university. In order to do this she needed to find people who were unhappy with the course. Figuring that these people would talk to each other, she used a sampling technique where she found one person who was unhappy with the course and, after asking her research questions, asked this person for the name of another person who was unhappy with the course. This technique is snowball sampling.
IQR - Answer>> A measure of variability based on dividing a data set into quartiles: the distance between Q1 and Q3.
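The law of large numbers entry above can be seen in a quick simulation: the mean of many random rolls of a fair die settles near the population mean of 3.5. This is only an illustrative sketch. The function name `mean_of_rolls`, the seed, and the roll counts are arbitrary choices, and a simulation shows the pattern without proving the law.

```python
import random


def mean_of_rolls(n_rolls, seed=0):
    """Return the average of `n_rolls` simulated rolls of a fair six-sided die."""
    rng = random.Random(seed)   # seeded so the run is repeatable
    total = sum(rng.randint(1, 6) for _ in range(n_rolls))
    return total / n_rolls


small = mean_of_rolls(10)        # few rolls: the mean can be far from 3.5
large = mean_of_rolls(100_000)   # many rolls: the mean should be close to 3.5

print(f"10 rolls: {small:.2f}   100,000 rolls: {large:.3f}")
```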
degrees of freedom - Answer>> The number of individual scores that can vary without changing the sample mean. Written as "N - 1", where N represents the number of subjects.
two way table - Answer>> A table containing counts for two categorical variables. It has r rows and c columns and describes the two categorical variables with a row variable and a column variable.
spread - Answer>> The visible variation in a sample distribution.
center - Answer>> The typical or middle value of a distribution, usually summarized by the mean or median.
shape - Answer>>
discrete random variable - Answer>>
central limit theorem - Answer>>
standardized value - Answer>>
mutually exclusive - Answer>>
wording bias - Answer>> Bias created in a sample when the way the survey question is worded favors one response.
causation - Answer>>
z test - Answer>>
t test - Answer>>
chi squared goodness of fit - Answer>> Tests how close the observed data are to what would be expected under the model; used with nominal data to determine whether the counts for one variable match the expectations for that distribution. If a significant difference is found between the two, the observed data are unlikely to have been generated by chance. Example: a gambler placed $1,000 into a game of greed and lost. He hopes to catch his opponent and bust him for loading the dice. He does this by choosing one die to roll 36 times. He knows that on a fair die each side has an equal chance of landing face up, and he hopes to get an outcome abnormal to this. Given the data below, can we prove that the dice are loaded?
frequency table - Answer>> A grouping of qualitative data into mutually exclusive classes showing the number of observations in each class.
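The die example in the chi squared goodness of fit entry above refers to data that is not included here, so the sketch below uses made-up counts purely to show the arithmetic. The observed counts and variable names are assumptions; the critical value 11.07 is the standard chi-square table value for 5 degrees of freedom at the 0.05 level.

```python
# Hypothetical counts for faces 1 through 6 over 36 rolls (not the original data).
observed = [3, 4, 5, 6, 8, 10]

# A fair die gives every face the same expected count: 36 / 6 = 6.
expected = [sum(observed) / len(observed)] * len(observed)

# Chi-square statistic: sum of (observed - expected)^2 / expected over all faces.
chi_sq = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

df = len(observed) - 1      # degrees of freedom = number of categories - 1
CRITICAL_05_DF5 = 11.07     # table value for df = 5, alpha = 0.05

reject_fair_die = chi_sq > CRITICAL_05_DF5
print(f"chi-square = {chi_sq:.3f}, df = {df}, reject fair die: {reject_fair_die}")
```

With these made-up counts the statistic is about 5.67, below the critical value, so the data would not give convincing evidence that the die is loaded. Note that a significance test can give evidence against the fair-die model but cannot prove the die is loaded.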
multimodal - Answer>> A distribution with more than two modes.
uniform - Answer>> A histogram that doesn't appear to have any mode and in which all the bars are approximately the same height; the data are evenly spread.
symmetric - Answer>> A distribution whose two sides are approximately mirror images of each other, as in a normal distribution.
time plot - Answer>> Displays data that change over time. Often, successive values are connected with lines to show trends more clearly. Sometimes a smooth curve is added to the plot to help show long-term patterns and trends.
se - Answer>> Standard deviation of the residuals.
r2 - Answer>> Overall measure of how successful the regression is in linearly relating y to x.
leverage - Answer>>
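The se entry above (standard deviation of the residuals) depends on first fitting a least squares line. A minimal sketch under stated assumptions: the data and the helper name `least_squares` are hypothetical, and the residual standard deviation divides by n - 2, the usual convention for simple linear regression.

```python
import math


def least_squares(xs, ys):
    """Return (slope, intercept) of the line that minimizes the sum of squared residuals."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x   # the line passes through (mean_x, mean_y)
    return slope, intercept


# Hypothetical paired data, for illustration only.
x_values = [1, 2, 3, 4, 5]
y_values = [2, 4, 5, 4, 5]

slope, intercept = least_squares(x_values, y_values)

# Residual = observed y minus predicted y.
residuals = [y - (intercept + slope * x) for x, y in zip(x_values, y_values)]

# Standard deviation of the residuals, dividing by n - 2.
s_e = math.sqrt(sum(e ** 2 for e in residuals) / (len(x_values) - 2))

print(f"y-hat = {intercept:.1f} + {slope:.1f}x, s_e = {s_e:.4f}")
```

The residuals of a least squares line always sum to zero (up to rounding), so s_e measures their typical size rather than their average.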
influential point - Answer>> A point that, when omitted, gives very different results.
census - Answer>> A survey that takes no sample but instead tests or surveys the entire population.
multistage sample - Answer>>
pilot - Answer>> A small trial run of a survey to see if the questions are clear.
convenience sample - Answer>> Choosing a sample because it is convenient, which often fails to give a proper representation of the population. If you survey everyone on your soccer team who attends tonight's practice, you are surveying a convenience sample.
response bias - Answer>> Anything in a survey design that influences or changes the responses. One typical response bias arises from the wording of questions, which may suggest a favored response. Voters, for example, are more likely to express support of "the president" than support of the particular person holding that office at the moment.