Sampling And Confidence Intervals
Beyond ratios: sampling and confidence intervals

Problem: Suppose we have 100 log lines, each with a severity level: INFO, WARN, or SEVERE. Since processing all of them might be expensive, how do we sample a proportion of these log lines? And what can we say about the ‘population’ of log lines from this sample?

Confidence intervals

A confidence interval lets us make statements such as ‘with x% level of confidence, the number of severe lines in the overall population will be between y and z’...
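As a rough illustration of the problem above, here is a minimal Python sketch. The text does not say which sampling scheme or interval formula is intended, so this assumes a simple random sample without replacement and the normal-approximation (Wald) interval for a proportion, with a finite population correction. The log lines, the 70/20/10 severity mix, the sample size of 30, and the helper names `sample_lines` and `severe_count_interval` are all made up for the example.

```python
import math
import random
from statistics import NormalDist


def sample_lines(lines, sample_size, seed=None):
    """Draw a simple random sample without replacement."""
    rng = random.Random(seed)
    return rng.sample(lines, sample_size)


def severe_count_interval(sample, population_size, confidence=0.95, level="SEVERE"):
    """Estimate a (low, high) range for the number of `level` lines in the population.

    Assumption: normal-approximation (Wald) interval for a proportion, with a
    finite population correction because the sample is drawn without replacement.
    """
    n = len(sample)
    # Sample proportion of lines at the requested severity level.
    p_hat = sum(1 for line in sample if line.startswith(level)) / n
    # Two-sided critical value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    std_err = math.sqrt(p_hat * (1 - p_hat) / n)
    if population_size > 1:
        std_err *= math.sqrt((population_size - n) / (population_size - 1))
    low = max(0.0, p_hat - z * std_err)
    high = min(1.0, p_hat + z * std_err)
    # Scale the proportion bounds up to a count of lines in the population.
    return low * population_size, high * population_size


# Hypothetical data: 100 log lines with a made-up severity mix.
log_lines = (
    [f"INFO message {i}" for i in range(70)]
    + [f"WARN message {i}" for i in range(20)]
    + [f"SEVERE message {i}" for i in range(10)]
)

sample = sample_lines(log_lines, sample_size=30, seed=42)
low, high = severe_count_interval(sample, population_size=len(log_lines))
print(f"With 95% confidence, between {low:.1f} and {high:.1f} of the 100 lines are SEVERE")
```

The Wald interval is only one possible choice. It becomes unreliable when the sample is small or the severity level is rare (if the sample happens to contain no SEVERE lines, the interval collapses to zero width), and in those cases an alternative such as the Wilson interval may be more appropriate.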