Differences between Percentages and Paired Alternatives   Lecture 6 
                 | 
              
              
                
                  Percentages
                 | 
              
              
                  
                  
                    - 
                      
Apply the same methodology to percentages 
                    
                  
                     
                   
                  
                    
                      - 
                        
 Where P is  the percentage calculated from the data while r is the Greek letter that represents the population parameter  
                       - 
                        
 Using the same logic, the standard error (SE) is: 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 The variances are 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 This test is approximate because we are calculating the variances from the data 
                       - 
                        
 Then calculate the z-statistic 
                      
                    
                   
                  
                    - 
                      
Example: 
                      
                        - 
                          
 300 students are randomly chosen (n 1) at Suleyman Demirol  
                          
                            - 
                              
 P 1 = 55% are women  
                             - 
                              
 1 – P 1 = 45% are men  
                            
                         
                        - 
                          
 400 students are randomly chosen (n 2) from the business school  
                          
                            - 
                              
 P 2 = 60% are women  
                             - 
                              
 1 – P 2 = 40% are men  
                            
                         
                        - 
                          
 Are the same percentage of women studying business is the same percentage as the student body? 
                         - 
                          
 The hypothesis is: 
                        
                     
                   
                  
                     
                   
                  
                    
                      - 
                        
 The standard error (SE) is: 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 The z-statistic 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 The critical z-value is 1.96. The p-value is 0.092669 
                        
                          - 
                            
 Excel, the p-value is calculated from =normdist(-1.3245, 0, 1, 1) 
                           - 
                            
 According to both the z and p values, fail to reject the null hypothesis and conclude both percentages are the same 
                           - 
                            
 Both the methods will give the same results. The z and p values are testing the same hypothesis from different angles 
                           - 
                            
 Note – Excel doubles the p-value for two-tail test 
                            
                           
                         
                       
                     
                   
                  
                    - 
                      
You can also use confidence intervals for hypothesis testing 
                    
                  
                     
                   
                  
                    
                    
                   
                    
                 | 
              
              
                 Poisson Distribution
               | 
              
              
                  
                  
                    - 
                      
The probability of a number of events that occur in a specific time period 
                      
                        - 
                          
Counting distribution 
                          
                            - 
                              
Number of events 
                             - 
                              
Number of deaths 
                             - 
                              
Number of births 
                             - 
                              
Number of accidents at a street intersection 
                            
                         
                       
                     
                    - 
                      
The PDF is 
                    
                  
                     
                   
                  
                    
                      - 
                        
 k is the number of occurrences, k = 1, 2, 3, … 
                       - 
                        
                          l is expected number of occurrences in interval
                         
                      
                    
                   
                  
                    - 
                      
This distribution is unique, the mean = variance = l
                       
                      
                     
                    - 
                      
Example: Heart disease 
                      
                        - 
                          
 In 2008, there were 543 deaths (n 1)  
                         - 
                          
 In 2009, there were 674 deaths (n 2)  
                         - 
                          
 Is this increase in deaths due to chance? 
                         - 
                          
 The hypothesis is: 
                        
                     
                   
                  
                     
                   
                  
                    
                      - 
                        
 The standard error (SE) is 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 The z-statistic is: 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 Using a = 0.05, the z c = 1.96  
                        
                          - 
                            
 Reject the null hypothesis and conclude the heart attack rate is higher 
                           - 
                            
 The z-statistic is approximate, because it came from a Poisson distribution 
                          
                       
                     
                   
                    
                 | 
              
              
                 McNeman’s Test
               | 
              
              
                  
                  
                    - 
                      
You will not be tested over this test 
                      
                        - 
                          
 It is an interesting test 
                         - 
                          
 You have an example where your sample has two treatments and the results are paired 
                         - 
                          
 Your sample had two experimental medications 
                         - 
                          
 A matrix of your results 
                        
                     
                   
                  
                    
                      
                         
                       | 
                      Treatment A | 
                      Treatment B | 
                     
                    
                      | Outcome 1 | 
                      Responded | 
                      Responded | 
                     
                    
                      | Outcome 2 | 
                      Responded | 
                      Did not respond | 
                     
                    
                      | Outcome 3 | 
                      Did not respond | 
                      Responded | 
                     
                    
                      | Outcome 4 | 
                      Did not respond | 
                      Did not respond | 
                     
                   
                  
                    - 
                      
You are interested if Treatment A is better than Treatment B? 
                      
                        - 
                          
 Ignore Outcomes 1 and 4 
                         - 
                          
 Focus on Outcomes 2 and 3 
                         - 
                          
 Observations have to be paired 
                         - 
                          
 Example: 
                          
                            - 
                              
 Each person gets both treatments 
                             - 
                              
 Or the sample is divided by 2 and then randomly pair one person to another 
                            
                         
                       
                     
                    - 
                      
Example: 200 people with heart problems 
                      
                        - 
                          
 Treatment A: Patients have to eat right and exercise 
                         - 
                          
 Treatment B: Patients take a drug, Plavix 
                         - 
                          
 Randomly pair sample into 100 pairs 
                        
                     
                   
                  
                    
                      
                         
                       | 
                      Treatment A | 
                      Treatment B | 
                      Observations | 
                     
                    
                      | Outcome 1 | 
                      Responded | 
                      Responded | 
                      15 | 
                     
                    
                      | Outcome 2 | 
                      Responded | 
                      Did not respond | 
                      30 | 
                     
                    
                      | Outcome 3 | 
                      Did not respond | 
                      Responded | 
                      45 | 
                     
                    
                      | Outcome 4 | 
                      Did not respond | 
                      Did not respond | 
                      10 | 
                     
                    
                      
                         
                       | 
                      
                         
                       | 
                      
                         
                       | 
                      100 | 
                     
                   
                  
                    
                      - 
                        
 The n 1 = 45 and n 2 = 30  
                       - 
                        
 Calculate the z-statistic 
                      
                    
                   
                  
                     
                   
                  
                    
                      - 
                        
 Fail to reject the treatments are the same, because a = 0.05 and the z c = 1.96  
                       - 
                        
 Note – I could give all patients both treatments, but I have to discern which treatment did what 
                      
                    
                   
                 |