Hide Solution
SSW=g∑i=1ni∑j=1 (xi,j−ˉxi)2=4∑i=1ni∑j=1 (xi,j−ˉxi)2=12∑j=1 (x1,j−13.3333)2+9∑j=1 (x2,j−11)2+8∑j=1 (x3,j−16.5)2+8∑j=1 (x4,j−16.25)2=(x1,1−13.3333)2 +(x1,2−13.3333)2 +(x1,3−13.3333)2 +(x1,4−13.3333)2 +(x1,5−13.3333)2 +(x1,6−13.3333)2 +(x1,7−13.3333)2 +(x1,8−13.3333)2 +(x1,9−13.3333)2 +(x1,10−13.3333)2 +(x1,11−13.3333)2 +(x1,12−13.3333)2 +(x2,1−11)2 +(x2,2−11)2 +(x2,3−11)2 +(x2,4−11)2 +(x2,5−11)2 +(x2,6−11)2 +(x2,7−11)2 +(x2,8−11)2 +(x2,9−11)2 +(x3,1−16.5)2 +(x3,2−16.5)2 +(x3,3−16.5)2 +(x3,4−16.5)2 +(x3,5−16.5)2 +(x3,6−16.5)2 +(x3,7−16.5)2 +(x3,8−16.5)2 +(x4,1−16.25)2 + (x4,2−16.25)2 + (x4,3−16.25)2 + (x4,4−16.25)2 + (x4,5−16.25)2 + (x4,6−16.25)2 + (x4,7−16.25)2 + (x4,8−16.25)2=(20−13.3333)2 +(17−13.3333)2 +(10−13.3333)2 +(15−13.3333)2 +(14−13.3333)2 +(21−13.3333)2 +(12−13.3333)2 +(1−13.3333)2 +(14−13.3333)2 +(20−13.3333)2 +(15−13.3333)2 +(1−13.3333)2 +(4−11)2 +(5−11)2 +(21−11)2 +(5−11)2 +(15−11)2 +(5−11)2 +(6−11)2 +(15−11)2 +(23−11)2 +(24−16.5)2 +(16−16.5)2 +(29−16.5)2 +(19−16.5)2 +(17−16.5)2 +(18−16.5)2 +(9−16.5)2 +(0−16.5)2 +(16−16.25)2 + (13−16.25)2 + (15−16.25)2 + (7−16.25)2 + (38−16.25)2 + (5−16.25)2 + (20−16.25)2 + (16−16.25)2=(6.6667)2 +(3.6667)2 +(−3.3333)2 +(1.6667)2 +(0.6667)2 +(7.6667)2 +(−1.3333)2 +(−12.3333)2 +(0.6667)2 +(6.6667)2 +(1.6667)2 +(−12.3333)2 +(−7)2 +(−6)2 +(10)2 +(−6)2 +(4)2 +(−6)2 +(−5)2 +(4)2 +(12)2 +(7.5)2 +(−0.5)2 +(12.5)2 +(2.5)2 +(0.5)2 +(1.5)2 +(−7.5)2 +(−16.5)2 +(−0.25)2 + (−3.25)2 + (−1.25)2 + (−9.25)2 + (21.75)2 + (−11.25)2 + (3.75)2 + (−0.25)2=(44.4444) +(13.4444) +(11.1111) +(2.7778) +(0.4444) +(58.7778) +(1.7778) +(152.1111) +(0.4444) +(44.4444) +(2.7778) +(152.1111) +(49) +(36) +(100) +(36) +(16) +(36) +(25) +(16) +(144) +(56.25) +(0.25) +(156.25) +(6.25) +(0.25) +(2.25) +(56.25) +(272.25) +(0.0625) + (10.5625) + (1.5625) + (85.5625) + (473.0625) + (126.5625) + (14.0625) + (0.0625)=484.6667 +458 +550 +711.4375=2204.1667
From these calculations, the within sum of squares is SSW = 2204.1667.
Hide the R Code
There are two ways of performing these calculations in R. The method you select will depend on how your data are stored.
Method 1: Wide Format
Copy and paste the following code into your R script window, then run it from there.
## Import data
treatment1 = c(20, 17, 10, 15, 14, 21, 12, 1, 14, 20, 15, 1)
treatment2 = c(4, 5, 21, 5, 15, 5, 6, 15, 23)
treatment3 = c(24, 16, 29, 19, 17, 18, 9, 0)
treatment4 = c(16, 13, 15, 7, 38, 5, 20, 16)
## Change to Long Format
mmt = c( treatment1, treatment2, treatment3, treatment4 )
grp = c( rep("trt1",12), rep("trt2",9), rep("trt3",8), rep("trt4",8) )
## Model the data
mod = aov(mmt~grp)
summary(mod)
In the R output, the value of the sum of squares within is the number in the table under Sum Sq
and to the right of Residuals
. If you would like better precision for that value, or if you would like to have only that value, run the following code in addition to that above:
modSummary = summary(mod)
modSummary[[1]][2,2]
Here, the number outputted is the sum of squares between. How did you get the number? The summary table (also known as an ANOVA table) is just a table. Thus, the first line saves the table as the variable modSummary
the last line looks inside that variable, selects the ANOVA table ([[1]]
), and then selects the row 2, column 2 value.
Method 2: Long Format
Copy and paste the following code into your R script window, then run it from there.
## Import data
yields = c(20, 17, 10, 15, 14, 21, 12, 1, 14, 20, 15, 1, 4, 5, 21, 5, 15, 5, 6, 15, 23, 24, 16, 29, 19, 17, 18, 9, 0, 16, 13, 15, 7, 38, 5, 20, 16)
grp = c('trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt1', 'trt2', 'trt2', 'trt2', 'trt2', 'trt2', 'trt2', 'trt2', 'trt2', 'trt2', 'trt3', 'trt3', 'trt3', 'trt3', 'trt3', 'trt3', 'trt3', 'trt3', 'trt4', 'trt4', 'trt4', 'trt4', 'trt4', 'trt4', 'trt4', 'trt4')
## Model the data
mod = aov(yields~grp)
summary(mod)
As discussed above, in the R output, the value of the sum of squares within is the number in the table under Sum Sq
and to the right of Residuals
. If you would like better precision for that value, or if you would like to have only that value, run the following code in addition to that above:
modSummary = summary(mod)
modSummary[[1]][2,2]
Here, the number outputted is the sum of squares between. How did you get the number? The summary table (also known as an ANOVA table) is just a table. Thus, the first line saves the table as the variable modSummary
the last line looks inside that variable, selects the ANOVA table ([[1]]
), and then selects the row 2, column 2 value.
Note: The difference between wide and long formats is this: In wide formatted data, each group has its own variable. In long formatted data, the group number is a variable.