Virtual Sampling

Explore virtual sampling techniques in R with the moderndive package to mimic tactile sampling. Learn to generate repeated samples, compute proportions of red balls, and visualize sampling variation across multiple replicates. Understand how increasing the number of samples smooths the distribution of sample proportions, illustrating key concepts in statistical inference.

We'll cover the following...

Using the virtual shovel once
Using the virtual shovel 33 times
Using the virtual shovel 1,000 times

We performed this sampling activity by hand first so that we can develop a firm understanding of the root ideas behind sampling. Now, we’ll mimic this tactile sampling activity with a virtual sampling activity. In other words, we’ll use a virtual analog to the bowl of balls and a virtual analog to the shovel.

Using the virtual shovel once

Let’s start by performing the virtual analog of the tactile sampling exercise we performed. We first need a virtual analog of the bowl. To this end, we included a data frame named bowl in the moderndive package. The rows of bowl correspond exactly with the contents of the actual bowl.

Observe that bowl has 2,400 rows, which tells us that the bowl contains 2,400 equally sized balls. The first variable ball_ID is used as an identification variable. None of the balls in the actual bowl are marked with numbers. The second variable color indicates whether a particular virtual ball is red or white. We’ll view the contents of the bowl and scroll through the contents to convince ourselves that bowl is indeed a virtual analog of the actual bowl.

Now that we’ve a virtual analog of our bowl, we now need a virtual analog to the shovel to generate virtual samples of 50 balls. We’re going to use the rep_sample_n() function included in the moderndive package. This function allows us to take repeated, or replicated, samples of size n.

1.Getting Started with Data in R

2.Data Visualization

3. Data Wrangling

4.Data Importing and “Tidy” Data

5.Basic Regression

6.Multiple Regression

7.Statistical Inference with the infer Package

8.Bootstrapping and Confidence Intervals

9.Hypothesis Testing

10.Inference for Regression

Project

11. Tell a Story with Data

12.Appendix

Project

Virtual Sampling

Using the virtual shovel once