Index

A

A/B testing, A/B Testing-For Further Reading
- control group, advantages of using, Why Have a Control Group?
- epsilon-greedy algorithm, Multi-Arm Bandit Algorithm
- importance of permissions, Why Just A/B? Why Not C, D…?
- traditional, shortcoming of, Multi-Arm Bandit Algorithm
accuracy, Evaluating Classification Models
- improving in random forests, Random Forest
Adaboost, Boosting
- boosting algorithm, The Boosting Algorithm
adjusted R-squared, Assessing the Model
adjustment of p-values, Multiple Testing, Multiple Testing
agglomerative algorithm, The Agglomerative Algorithm
AIC (Akaike's Information Criteria), Model Selection and Stepwise Regression, Selecting the Number of Clusters
- variants of, Model Selection and Stepwise Regression
Akaike, Hirotugu, Model Selection and Stepwise Regression
all subset regression, Model Selection and Stepwise Regression
alpha, Statistical Significance and P-Values, Alpha
- dividing up in multiple testing, Multiple Testing
alternative hypothesis, Hypothesis Tests, Alternative Hypothesis
American Statistical Association (ASA), statement on p-values, Value of the p-value
anomaly detection, Outliers, Regression and Prediction
ANOVA (analysis of variance)
- statistical test based on F-statistic, F-Statistic
ANOVA (analysis of variance), ANOVA-Further Reading
- computing ANOVA table in R, F-Statistic
- decomposition of variance, F-Statistic
- two-way, Two-Way ANOVA
arms (multi-arm bandits), Multi-Arm Bandit Algorithm
AUC (area under the ROC curve), AUC
average linkage, Measures of Dissimilarity

B

backward elimination, Model Selection and Stepwise Regression
backward selection, Model Selection and Stepwise Regression
bagging, The Bootstrap, Resampling, Statistical Machine Learning, Bagging
- better predictive performance than single trees, How Trees Are Used
- boosting vs., Boosting
- using with random forests, Random Forest
bandit algorithms, Multi-Arm Bandit Algorithm
- (see also multi-arm bandits)
bar charts, Exploring Binary and Categorical Data
Bayesian classification, Naive Bayes
- (see also naive Bayes algorithm)
- impracticality of exact Bayesian classification, Why Exact Bayesian Classification Is Impractical
Bayesian information criteria (BIC), Model Selection and Stepwise Regression, Selecting the Number of Clusters
beta distribution, Multi-Arm Bandit Algorithm
bias, Bias
- selection bias, Selection Bias-Further Reading
bias-variance tradeoff, Choosing K
biased estimates, Standard Deviation and Related Estimates
- from naive Bayes classifier, The Naive Solution
BIC (Bayesian information criteria), Model Selection and Stepwise Regression, Selecting the Number of Clusters
bidirectional alternative hypothesis, One-Way, Two-Way Hypothesis Test
big data
- and outliers in regression, Outliers
- use of regression in, Prediction versus Explanation (Profiling)
- value of, Size versus Quality: When Does Size Matter?
binary data, Elements of Structured Data
- exploring, Exploring Binary and Categorical Data-Correlation
binomial, Binomial Distribution
binomial distribution, Binomial Distribution-Further Reading
binomial trials, Binomial Distribution
bins
- hexagonal binning, Hexagonal Binning and Contours (Plotting Numeric versus Numeric Data)
- in frequency tables, Frequency Table and Histograms
- in histograms, Frequency Table and Histograms
bivariate analysis, Exploring Two or More Variables
black swan theory, Long-Tailed Distributions
blind studies, Why Have a Control Group?
boosting, Statistical Machine Learning, Tree Models, Boosting-Summary
- bagging vs., Boosting
- boosting algorithm, The Boosting Algorithm
- hyperparameters and cross-validation, Hyperparameters and Cross-Validation
- overfitting, avoiding using regularization, Regularization: Avoiding Overfitting
- XGBoost, XGBoost
bootstrap, The Bootstrap-Further Reading, Resampling
- confidence interval generation, Confidence Intervals, Confidence and Prediction Intervals
- permutation tests, Exhaustive and Bootstrap Permutation Test
- resampling vs. bootstrapping, Resampling versus Bootstrapping
- using with random forests, Random Forest
bootstrap sample, The Bootstrap
boxplots, Exploring the Data Distribution
- combining with a violin plot, example, Categorical and Numeric Data
- example, percent of airline delays by carrier, Categorical and Numeric Data
- outliers in, Outliers
- percentiles and, Percentiles and Boxplots
Breiman, Leo, Statistical Machine Learning
bubble plots, Influential Values

C

categorical data, Elements of Structured Data
- exploring, Exploring Binary and Categorical Data-Correlation
  - expected value, Expected Value
  - mode, Mode
  - numerical data as categorical data, Exploring Binary and Categorical Data
- exploring two categorical variables, Two Categorical Variables
- importance of the concept, Elements of Structured Data
- numeric variable grouped by categorical variable, Categorical and Numeric Data
- scaling and categorical variables, Scaling and Categorical Variables-Summary
  - dominant variables, Dominant Variables
  - Gower's distance, Categorical Data and Gower’s Distance
  - scaling the variables, Scaling the Variables
categorical variables, Factor Variables in Regression
- (see also factor variables)
causation, regression and, Prediction versus Explanation (Profiling)
central limit theorem, Sampling Distribution of a Statistic, Central Limit Theorem, Student’s t-Distribution
- data science and, Student’s t-Distribution
chi-square distribution, Chi-Square Test: Statistical Theory
chi-square statistic, Chi-Square Test
chi-square test, Chi-Square Test-Further Reading
- detecting scientific fraud, Fisher’s Exact Test
- Fisher's exact test, Fisher’s Exact Test
- relevance for data science, Relevance for Data Science
- resampling approach, Chi-Square Test: A Resampling Approach
- statistical theory, Chi-Square Test: Statistical Theory
class purity, Measuring Homogeneity or Impurity
classification, Classification-Summary
- discriminant analysis, Discriminant Analysis-Further Reading
  - covariance matrix, Covariance Matrix
  - Fisher's linear discriminant, Fisher’s Linear Discriminant
  - simple example, A Simple Example
- evaluating models, Evaluating Classification Models-Further Reading
  - AUC metric, AUC
  - confusion matrix, Confusion Matrix
  - lift, Lift
  - precision, recall, and specificity, Precision, Recall, and Specificity
  - rare class problem, The Rare Class Problem
  - ROC curve, ROC Curve
- K-Nearest Neighbors, K-Nearest Neighbors
- logistic regression, Logistic Regression-Further Reading
  - and the GLM, Logistic Regression and the GLM
  - assessing the model, Assessing the Model
  - comparison to linear regression, Linear and Logistic Regression: Similarities and Differences
  - interpreting coefficients and odds ratios, Interpreting the Coefficients and Odds Ratios
  - logistic response function and logit, Logistic Response Function and Logit
  - predicted values from, Predicted Values from Logistic Regression
- more than two possible outcomes, Classification
- naive Bayes algorithm, Naive Bayes-Further Reading
  - impracticality of exact Bayesian classification, Why Exact Bayesian Classification Is Impractical
  - using numeric predictor variables, Numeric Predictor Variables
- strategies for imbalanced data, Strategies for Imbalanced Data-Further Reading
  - cost-based classification, Cost-Based Classification
  - data generation, Data Generation
  - exploring the predictions, Exploring the Predictions
  - oversampling and up/down weighting, Oversampling and Up/Down Weighting
  - undersampling, Undersampling
- unsupervised learning as building block, Unsupervised Learning
cluster mean, K-Means Clustering, A Simple Example, Interpreting the Clusters
clustering, Unsupervised Learning
- application to cold-start problems, Unsupervised Learning
- cluster analysis vs. PCA, Interpreting the Clusters
- hierarchical, Hierarchical Clustering-Measures of Dissimilarity, Categorical Data and Gower’s Distance
  - agglomerative algorithm, The Agglomerative Algorithm
  - dendrogram, The Dendrogram
  - dissimilarity measures, Measures of Dissimilarity
  - simple example, A Simple Example
- K-means, K-Means Clustering-Selecting the Number of Clusters, Scaling the Variables
  - interpreting the clusters, Interpreting the Clusters
  - K-means algorithm, K-Means Algorithm
  - selecting the number of clusters, Selecting the Number of Clusters
  - simple example, A Simple Example-K-Means Algorithm
- model-based, Model-Based Clustering-Further Reading
  - mixtures of normals, Mixtures of Normals
  - selecting the number of clusters, Selecting the Number of Clusters
- problems with mixed data, Problems with Clustering Mixed Data
- standardizing data, Standardization (Normalization, Z-Scores)
clusters, K-Means Clustering
coefficient of determination, Assessing the Model
coefficients
- in logistic regression, Interpreting the Coefficients and Odds Ratios
- in simple linear regression, The Regression Equation
  - estimates vs. known, Fitted Values and Residuals
- interpretation in multiple linear regression, Example: King County Housing Data
complete linkage, The Agglomerative Algorithm
complexity parameter (cp), Stopping the Tree from Growing
conditional probabilities, Naive Bayes
conditioning variables, Visualizing Multiple Variables
confidence intervals, Confidence Intervals-Further Reading, Confidence and Prediction Intervals
- generating with bootstrap, Confidence Intervals
- level of confidence, Confidence Intervals
- prediction intervals vs., Confidence and Prediction Intervals
confidence level, Confidence Intervals-Confidence Intervals
confounding variables, Interpreting the Regression Equation, Confounding Variables
confusion matrix, Evaluating Classification Models-Confusion Matrix
contingency tables, Exploring Two or More Variables
- example, loan grade and status, Two Categorical Variables
continuous data, Elements of Structured Data
- continuous variable as test metric, A/B Testing
- predicting continuous value with a tree, Predicting a Continuous Value
contour plots, Exploring Two or More Variables
- using with hexagonal binning, Hexagonal Binning and Contours (Plotting Numeric versus Numeric Data)
contrast coding systems, Dummy Variables Representation
control group, A/B Testing
- advantages of using, Why Have a Control Group?
Cook's distance, Influential Values
correlated variables, Interpreting the Regression Equation
- multicollinearity, Multicollinearity
- predictor variables, Correlated Predictors
correlation, Correlation-Further Reading
- key terms for, Correlation
- regression vs., Simple Linear Regression
- scatterplots, Scatterplots
correlation coefficient, Correlation
- computing Pearson's correlation coefficient, Correlation
- key concepts, Scatterplots
- other types of, Correlation
correlation matrix, Correlation
- example, correlation between telecommunication stock returns, Correlation
cost-based classification, Cost-Based Classification
count data
- as test metric, A/B Testing
- Fisher's exact test for, Fisher’s Exact Test
covariance, Discriminant Analysis, Covariance Matrix, Computing the Principal Components
covariance matrix
- in discriminant analysis, Covariance Matrix
- in model-based clustering, Multivariate Normal Distribution
- using to compute Mahalanobis distance, Distance Metrics
cross-validation, Cross-Validation, Choosing K
- for selection of principal components, Interpreting Principal Components
- using for hyperparameters in boosting, Hyperparameters and Cross-Validation
- using to estimate value of complexity parameter, Stopping the Tree from Growing
cumulative gains charts, Lift

D

d.f. (degrees of freedom), Degrees of Freedom, Chi-Square Test
- (see also degrees of freedom)
data analysis, Exploratory Data Analysis
- (see also exploratory data analysis)
data distribution, Exploring the Data Distribution-Further Reading, Sampling Distribution of a Statistic
- frequency tables and histograms, Frequency Table and Histograms
- key terms for, Exploring the Data Distribution
- percentiles and boxplots, Percentiles and Boxplots
- sampling distribution vs., Sampling Distribution of a Statistic
data frames, Rectangular Data
- and indexes, Data Frames and Indexes
- typical data format, Rectangular Data
data generation, Strategies for Imbalanced Data, Data Generation
data snooping, Selection Bias
data types
- key terms for, Elements of Structured Data
- resources for further reading, Further Reading
database normalization, Standardization (Normalization, Z-Scores)
decile gains charts, Lift
decision trees, The Bootstrap, Statistical Machine Learning
- meaning in operations research, Tree Models
- recursive partitioning algorithm, Random Forest
decomposition of variance, ANOVA, F-Statistic
degrees of freedom, Standard Deviation and Related Estimates, Student’s t-Distribution, Degrees of Freedom-Further Reading
- in chi-square test, Chi-Square Test: Statistical Theory
dendrograms, Hierarchical Clustering
- example, dendrogram of stocks, The Dendrogram
- hierarchical clustering with mixed variable types, Categorical Data and Gower’s Distance
density plots, Exploring the Data Distribution, Density Estimates
- example, density of state murder rates, Density Estimates
dependent variable, The Regression Equation
- (see also response)
deviation coding, Factor Variables in Regression, Dummy Variables Representation
deviations, Estimates of Variability
- standard deviation and related estimates, Standard Deviation and Related Estimates
directional alternative hypothesis, One-Way, Two-Way Hypothesis Test
discrete data, Elements of Structured Data
discriminant analysis, Discriminant Analysis-Further Reading
- covariance matrix, Covariance Matrix
- extensions of, A Simple Example
- Fisher's linear discriminant, Fisher’s Linear Discriminant
- simple example, A Simple Example-A Simple Example
discriminant function, Discriminant Analysis
discriminant weights, Discriminant Analysis
dispersion, Estimates of Variability
- (see also variability, estimates of)
dissimilarity, Hierarchical Clustering
- common measures of, Measures of Dissimilarity
- measuring with complete-linkage method, The Agglomerative Algorithm
- metric in hierarchical clustering, A Simple Example
distance metrics, K-Nearest Neighbors, Hierarchical Clustering
- Gower's distance and categorical data, Categorical Data and Gower’s Distance
- in hierarchical clustering, A Simple Example, The Agglomerative Algorithm
- in K-Nearest Neighbors, Distance Metrics
Donoho, David, Exploratory Data Analysis
double blind studies, Why Have a Control Group?
dummy variables, Factor Variables in Regression
- representation of factor variables in regression, Dummy Variables Representation
- representing string factor data as numbers, One Hot Encoder
Durbin-Watson statistic, Heteroskedasticity, Non-Normality and Correlated Errors

E

EDA (see exploratory data analysis)
effect size, Power and Sample Size, Sample Size
elbow method, Selecting the Number of Clusters
ensemble learning, Statistical Machine Learning
- staged use of K-Nearest Neighbors, KNN as a Feature Engine
ensemble models, Boosting
entropy, Measuring Homogeneity or Impurity
epsilon-greedy algorithm, Multi-Arm Bandit Algorithm
errors, Normal Distribution
estimates, Estimates of Location
- indicated by hat notation, Fitted Values and Residuals
Euclidean distance, Distance Metrics
exact tests, Exhaustive and Bootstrap Permutation Test
Excel, pivot tables, Two Categorical Variables
exhaustive permutation tests, Exhaustive and Bootstrap Permutation Test
expectation or expected, Chi-Square Test
expected value, Exploring Binary and Categorical Data, Expected Value
- calculating, Expected Value
explanation vs. prediction (in regression), Prediction versus Explanation (Profiling)
exploratory data analysis, Exploratory Data Analysis-Summary
- binary and categorical data, Exploring Binary and Categorical Data-Correlation
- correlation, Correlation-Further Reading
- data distribution, Exploring the Data Distribution-Further Reading
- estimates of location, Estimates of Location-Further Reading
- estimates of variability, Estimates of Variability-Further Reading
- exploring two or more variables, Exploring Two or More Variables-Summary
- rectangular data, Rectangular Data-Estimates of Location
Exploratory Data Analysis (Tukey), Exploratory Data Analysis
exponential distribution, Poisson and Related Distributions
- calculating, Exponential Distribution
extrapolation
- dangers of, The Dangers of Extrapolation
- definition of, Prediction Using Regression

F

F-statistic, ANOVA, F-Statistic, Assessing the Model
facets, Visualizing Multiple Variables
factor variables, Factor Variables in Regression-Ordered Factor Variables
- different codings, Dummy Variables Representation
- dummy variables representation, Dummy Variables Representation
- handling in logistic regression, Fitting the model
- in naive Bayes algorithm, Naive Bayes
- ordered, Ordered Factor Variables
- reference coding, Interactions and Main Effects
- with many levels, Factor Variables with Many Levels
factors, conversion of text columns to, Elements of Structured Data
failure rate, estimating, Estimating the Failure Rate
false discovery rate, Multiple Testing, Multiple Testing
false positive rate, AUC
feature selection
- chi-square tests in, Relevance for Data Science
- using discriminant analysis, A Simple Example
features, Rectangular Data
- terminology differences, Data Frames and Indexes
field view (spatial data), Nonrectangular Data Structures
Fisher's exact test, Fisher’s Exact Test
Fisher's linear discriminant, Fisher’s Linear Discriminant
Fisher's scoring, Fitting the model
Fisher, R.A., Fisher’s Exact Test, Discriminant Analysis
fitted values, Simple Linear Regression, Fitted Values and Residuals
folds, Cross-Validation, Hyperparameters and Cross-Validation
forward selection and backward selection, Model Selection and Stepwise Regression
frequency tables, Exploring the Data Distribution
- example, population by state, Frequency Table and Histograms
Friedman, Jerome H. (Jerry), Statistical Machine Learning

G

gains, Lift
- (see also lift)
Gallup Poll, Random Sampling and Sample Bias
Gallup, George, Random Sampling and Sample Bias, Random Selection
Galton, Francis, Regression to the Mean
GAM (see generalized additive models)
Gaussian distribution, Normal Distribution
- (see also normal distribution)
generalized additive models, Polynomial and Spline Regression, Generalized Additive Models, Exploring the Predictions
generalized linear model (GLM), Logistic Regression and the GLM
Gini coefficient, Measuring Homogeneity or Impurity
Gini impurity, Measuring Homogeneity or Impurity
GLM (see generalized linear model)
Gosset, W.S., Student’s t-Distribution
Gower's distance, Scaling and Categorical Variables
- categorical data and, Categorical Data and Gower’s Distance
gradient boosted trees, Interactions and Main Effects
gradient boosting, The Boosting Algorithm
- definition of, Boosting
graphs, Nonrectangular Data Structures
- computer science versus statistics, Nonrectangular Data Structures
- lesson on misleading graphs, Further Reading
greedy algorithms, Multi-Arm Bandit Algorithm

H

hat notation, Fitted Values and Residuals
hat-value, Testing the Assumptions: Regression Diagnostics, Influential Values
heat maps, Hexagonal Binning and Contours (Plotting Numeric versus Numeric Data)
heteroskedastic errors, Heteroskedasticity, Non-Normality and Correlated Errors
heteroskedasticity, Testing the Assumptions: Regression Diagnostics, Heteroskedasticity, Non-Normality and Correlated Errors
hexagonal binning, Exploring Two or More Variables
- example, using with contour plot, Hexagonal Binning and Contours (Plotting Numeric versus Numeric Data)
hierarchical clustering, Hierarchical Clustering-Measures of Dissimilarity, Categorical Data and Gower’s Distance
- agglomerative algorithm, The Agglomerative Algorithm
- measures of dissimilarity, Measures of Dissimilarity
- simple example, A Simple Example
histograms, Exploring the Data Distribution
- example, population by state, Frequency Table and Histograms
homogeneity, measuring, Measuring Homogeneity or Impurity
hyperparameters
- and cross-validation in boosting, Hyperparameters and Cross-Validation
- for XGBoost, Hyperparameters and Cross-Validation
- in random forests, Hyperparameters
hypothesis tests, Hypothesis Tests-Further Reading
- alternative hypothesis, Alternative Hypothesis
- false discovery rate, Multiple Testing
- null hypothesis, The Null Hypothesis
- one-way and two-way tests, One-Way, Two-Way Hypothesis Test

I

impurity, Tree Models
- measuring, Measuring Homogeneity or Impurity
in-sample methods to assess and tune models, Model Selection and Stepwise Regression
independent variables, Simple Linear Regression, The Regression Equation
- main effects, Interactions and Main Effects
indexes, data frames and, Data Frames and Indexes
indicator variables, Factor Variables in Regression
inference, Exploratory Data Analysis, Statistical Experiments and Significance Testing
influence plots, Influential Values
influential values, Testing the Assumptions: Regression Diagnostics, Influential Values
information, Measuring Homogeneity or Impurity
interactions, Interpreting the Regression Equation
- and main effects, Interactions and Main Effects
- deciding which interaction terms to include in the model, Interactions and Main Effects
intercepts, Simple Linear Regression
- in cotton exposure and lung capacity example, The Regression Equation
Internet of Things (IoT), Elements of Structured Data
interquartile range (IQR), Estimates of Variability, Estimates Based on Percentiles
interval endpoints, Confidence Intervals

K

K (in K-Nearest Neighbors), K-Nearest Neighbors
k-fold cross-validation, Cross-Validation
K-means clustering, K-Means Clustering-Selecting the Number of Clusters
- interpreting the clusters, Interpreting the Clusters
- K-means algorithm, K-Means Algorithm
- selecting the number of clusters, Selecting the Number of Clusters
- simple example, A Simple Example-K-Means Algorithm
- using on unnormalized and normalized variables, Scaling the Variables
K-Nearest Neighbors, Predicted Values from Logistic Regression, K-Nearest Neighbors-KNN as a Feature Engine
- as a feature engine, KNN as a Feature Engine
- choosing K, Choosing K
- distance metrics, Distance Metrics
- example, predicting loan default, A Small Example: Predicting Loan Default
- one hot encoder, One Hot Encoder
- standardization, Standardization (Normalization, Z-Scores)
kernel density estimates, Density Estimates
KernSmooth package, Density Estimates
KNN (see K-Nearest Neighbors)
knots, Polynomial and Spline Regression, Splines
kurtosis, Frequency Table and Histograms

L

lambda, in Poisson and related distributions, Poisson and Related Distributions
Lasso regression, Model Selection and Stepwise Regression, Regularization: Avoiding Overfitting
Latent Dirichlet Allocation (LDA), Discriminant Analysis
leaf, Tree Models
least squares, Simple Linear Regression, Least Squares
leverage, Testing the Assumptions: Regression Diagnostics
- influential values in regression, Influential Values
lift, Evaluating Classification Models, Lift
lift curve, Lift
linear discriminant analysis (LDA), Discriminant Analysis, Exploring the Predictions
linear regression, Simple Linear Regression-Weighted Regression
- comparison to logistic regression, Linear and Logistic Regression: Similarities and Differences
- fitted values and residuals, Fitted Values and Residuals
- generalized linear model (GLM), Logistic Regression and the GLM
- least squares, Least Squares
- multiple, Multiple Linear Regression-Weighted Regression
  - assessing the model, Assessing the Model
  - cross-validation, Cross-Validation
  - example, King County housing data, Example: King County Housing Data
  - model selection and stepwise regression, Model Selection and Stepwise Regression
  - weighted regression, Weighted Regression
- prediction vs. explanation, Prediction versus Explanation (Profiling)
- regression equation, The Regression Equation
Literary Digest poll of 1936, Random Sampling and Sample Bias, Random Selection
loadings, Principal Components Analysis, A Simple Example
- for top five components (example), Interpreting Principal Components
log odds, Logistic Regression
log-odds function (see logit function)
log-odds ratio, Interpreting the Coefficients and Odds Ratios
logistic regression, Logistic Regression-Further Reading, Exploring the Predictions
- and the generalized linear model (GLM), Logistic Regression and the GLM
- assessing the model, Assessing the Model
- comparison to linear regression, Linear and Logistic Regression: Similarities and Differences
- interpreting the coefficients and odds ratios, Interpreting the Coefficients and Odds Ratios
- logistic response function and logit, Logistic Response Function and Logit
- predicted values from, Predicted Values from Logistic Regression
logit function, Logistic Regression, Logistic Response Function and Logit
long-tail distributions, Long-Tailed Distributions-Further Reading
loss, Tree Models
loss function, Oversampling and Up/Down Weighting

M

machine learning
- statistics vs., Statistical Machine Learning
machine learning, Statistical Machine Learning
- (see also statistical machine learning)
Mahalanobis distance, Covariance Matrix, Distance Metrics
main effects, Interpreting the Regression Equation
- interactions and, Interactions and Main Effects
Mallows Cp, Model Selection and Stepwise Regression
Manhattan distance, Distance Metrics, Regularization: Avoiding Overfitting, Categorical Data and Gower’s Distance
maximum likelihood estimation (MLE), Fitting the model
mean, Estimates of Location
- formula for, Mean
- regression to, Regression to the Mean
- sample mean vs. population mean, Sample Mean versus Population Mean
- trimmed mean, Mean
- weighted mean, Mean
mean absolute deviation, Estimates of Variability, A/B Testing
- formula for calculating, Standard Deviation and Related Estimates
median absolute deviation from the median (MAD), Standard Deviation and Related Estimates
median, Estimates of Location
- and robust estimates, Median and Robust Estimates
median absolute deviation, Estimates of Variability
metrics, Estimates of Location
minimum variance, Measures of Dissimilarity
MLE (see maximum likelihood estimation)
mode, Exploring Binary and Categorical Data
- examples in categorical data, Mode
model-based clustering, Model-Based Clustering-Further Reading
- limitations, Selecting the Number of Clusters
- mixtures of normals, Mixtures of Normals
- multivariate normal distribution, Multivariate Normal Distribution
- selecting the number of clusters, Selecting the Number of Clusters
moments, Frequency Table and Histograms
multi-arm bandits, Why Just A/B? Why Not C, D…?, Multi-Arm Bandit Algorithm-Further Reading
- definition of, Multi-Arm Bandit Algorithm
multicollinearity, Interpreting the Regression Equation, Multicollinearity
- problems with one hot encoding, One Hot Encoder
multicollinearity errors, Degrees of Freedom, Dummy Variables Representation
multiple linear regression (see linear regression)
multiple testing, Multiple Testing-Further Reading
- bottom line for data scientists, Multiple Testing
multivariate analysis, Exploring Two or More Variables
multivariate normal distribution, Multivariate Normal Distribution

N

n (sample size), Student’s t-Distribution
n or sample size, Degrees of Freedom
naive Bayes algorithm, Naive Bayes-Further Reading
- applying to numeric predictor variables, Numeric Predictor Variables
neighbors, K-Nearest Neighbors
network data structures, Nonrectangular Data Structures
nodes, Tree Models
non-normal residuals, Testing the Assumptions: Regression Diagnostics
nonlinear regression, Polynomial and Spline Regression-Further Reading
- definition of, Polynomial and Spline Regression
nonrectangular data structures, Nonrectangular Data Structures
normal distribution, Normal Distribution-Standard Normal and QQ-Plots
- key concepts, Standard Normal and QQ-Plots
- standard normal and QQ-Plots, Standard Normal and QQ-Plots
normalization, Standard Normal and QQ-Plots, Standardization (Normalization, Z-Scores), K-Means Clustering
- categorical variables before clustering, Scaling the Variables
- data distribution and, Standardization (Normalization, Z-Scores)
- in statistics vs. database context, Standardization (Normalization, Z-Scores)
null hypothesis, Hypothesis Tests, The Null Hypothesis
numeric variables
- grouped according to a categorical variable, Categorical and Numeric Data
- numeric predictor variables for naive Bayes, Numeric Predictor Variables
numerical data as categorical data, Exploring Binary and Categorical Data

O

object representation (spatial data), Nonrectangular Data Structures
Occam's razor, Model Selection and Stepwise Regression
odds, Logistic Regression, Logistic Response Function and Logit
odds ratios, Interpreting the Coefficients and Odds Ratios
- log-odds ratio and, Interpreting the Coefficients and Odds Ratios
omnibus tests, ANOVA
one hot encoder, Factor Variables in Regression, One Hot Encoder
one hot encoding, Dummy Variables Representation
one-way tests, Hypothesis Tests, One-Way, Two-Way Hypothesis Test
order statistics, Estimates of Variability, Estimates Based on Percentiles
ordered factor variables, Ordered Factor Variables
ordinal data, Elements of Structured Data
- importance of the concept, Elements of Structured Data
ordinary least squares (OLS), Least Squares, Heteroskedasticity, Non-Normality and Correlated Errors
- (see also least squares)
out-of-bag (OOB) estimate of error, Random Forest
outcome, Rectangular Data
outliers, Estimates of Location, Outliers, Testing the Assumptions: Regression Diagnostics
- in regression, Outliers
- sensitivity of correlation coefficient to, Correlation
- sensitivity of least squares to, Least Squares
- variance, standard deviation, mean absolute deviation and, Standard Deviation and Related Estimates
overfitting, Multiple Testing
- avoiding in boosting using regularization, Regularization: Avoiding Overfitting
- in linear regression, Model Selection and Stepwise Regression
oversampling, Strategies for Imbalanced Data, Oversampling and Up/Down Weighting

P

p-values, Statistical Significance and P-Values, P-Value
- adjusting, Multiple Testing
- data science and, Data Science and P-Values
- t-statistic and, Assessing the Model
- value of, Value of the p-value
pairwise comparisons, ANOVA
partial residual plots, Testing the Assumptions: Regression Diagnostics, Partial Residual Plots and Nonlinearity
- in logistic regression, Assessing the Model
PCA (see principal components analysis)
Pearson residuals, Chi-Square Test: A Resampling Approach
Pearson's chi-square test, Chi-Square Test: Statistical Theory
Pearson's correlation coefficient, Correlation
Pearson, Karl, Chi-Square Test, Principal Components Analysis
penalized regression, Model Selection and Stepwise Regression
percentiles, Estimates of Variability
- and boxplots, Percentiles and Boxplots
- estimates based on, Estimates Based on Percentiles
- precise definition of, Estimates Based on Percentiles
permission, obtaining for human subject testing, Why Just A/B? Why Not C, D…?
permutation tests, Resampling
- exhaustive and bootstrap, Exhaustive and Bootstrap Permutation Test
- for ANOVA, ANOVA
- value for data science, Permutation Tests: The Bottom Line for Data Science
- web stickiness example, Example: Web Stickiness
pertinent records (in searches), Size versus Quality: When Does Size Matter?
physical networks, Nonrectangular Data Structures
pie charts, Exploring Binary and Categorical Data
pivot tables (Excel), Two Categorical Variables
point estimates, Confidence Intervals
Poisson distributions, Poisson and Related Distributions, Generalized Linear Models
- calculating, Poisson Distributions
polynomial coding, Dummy Variables Representation
polynomial regression, Polynomial and Spline Regression, Polynomial
population, Random Sampling and Sample Bias
- sample mean vs. population mean, Sample Mean versus Population Mean
posterior probability, Naive Bayes, The Naive Solution
power and sample size, Power and Sample Size-Further Reading
precision, Evaluating Classification Models
- in classification models, Precision, Recall, and Specificity
predicted values, Fitted Values and Residuals
- (see also fitted values)
prediction
- explanation vs., in linear regression, Prediction versus Explanation (Profiling)
- harnessing results from multiple trees, How Trees Are Used
- K-Nearest Neighbors, K-Nearest Neighbors
  - using as first stage, KNN as a Feature Engine
- predicted values from logistic regression, Predicted Values from Logistic Regression
- unsupervised learning and, Unsupervised Learning
- using regression, Prediction Using Regression-Factor Variables in Regression
  - confidence and prediction intervals, Confidence and Prediction Intervals
  - dangers of extrapolation, The Dangers of Extrapolation
prediction intervals, Prediction Using Regression
- confidence intervals vs., Confidence and Prediction Intervals
predictor variables, Data Frames and Indexes, The Regression Equation
- (see also independent variables)
- correlated, Correlated Predictors
- in linear discriminant analysis, more than two, A Simple Example
- in naive Bayes algorithm, Naive Bayes
- main effects, Interactions and Main Effects
- numeric, applying naive Bayes to, Numeric Predictor Variables
- relationship between response and, Partial Residual Plots and Nonlinearity
principal components, Principal Components Analysis
principal components analysis, Principal Components Analysis-Further Reading
- cluster analysis vs., Interpreting the Clusters
- computing the principal components, Computing the Principal Components
- interpreting principal components, Interpreting Principal Components
- scaling the variables, Scaling the Variables
- simple example, A Simple Example-A Simple Example
- standardizing data, Standardization (Normalization, Z-Scores)
probability theory, Exploratory Data Analysis
profiling vs. explanation, Prediction versus Explanation (Profiling)
propensity score, Classification
proxy variables, Example: Web Stickiness
pruning, Tree Models, Stopping the Tree from Growing
pseudo-residuals, The Boosting Algorithm

Q

QQ-Plots, Normal Distribution
- example, returns for Netflix, Long-Tailed Distributions
- standard normal and, Standard Normal and QQ-Plots
quadratic discriminant analysis, A Simple Example
quantiles, Estimates Based on Percentiles
- R function, quantile, Estimates Based on Percentiles

R

R-squared, Multiple Linear Regression, Assessing the Model
random forests, Interactions and Main Effects, Tree Models, Random Forest-Hyperparameters
- better predictive performance than single trees, How Trees Are Used
- determining variable importance, Variable Importance
- hyperparameters, Hyperparameters
random sampling, Random Sampling and Sample Bias-Further Reading
- bias, Bias
- key terms for, Random Sampling and Sample Bias
- random selection, Random Selection
- sample mean vs. population mean, Sample Mean versus Population Mean
- size versus quality, Size versus Quality: When Does Size Matter?
random subset of variables, Random Forest
randomization, A/B Testing
randomization tests, Resampling
- (see also permutation tests)
randomness, misinterpreting, Hypothesis Tests
range, Estimates of Variability, Estimates Based on Percentiles
rare class problem, The Rare Class Problem
recall, Evaluating Classification Models, Precision, Recall, and Specificity
receiver operating characteristics (see ROC curve)
records, Rectangular Data, Simple Linear Regression
rectangular data, Rectangular Data-Estimates of Location
- terminology differences, Data Frames and Indexes
recursive partitioning, Tree Models, The Recursive Partitioning Algorithm, Random Forest
reference coding, Factor Variables in Regression-Dummy Variables Representation, Interactions and Main Effects, Logistic Regression and the GLM
regression, Regression and Prediction-Summary
- causation and, Prediction versus Explanation (Profiling)
- diagnostics, Testing the Assumptions: Regression Diagnostics-Polynomial and Spline Regression
  - heteroskedasticity, non-normality, and correlated errors, Heteroskedasticity, Non-Normality and Correlated Errors
  - influential values, Influential Values
  - outliers, Outliers
  - partial residual plots and nonlinearity, Partial Residual Plots and Nonlinearity
  - using scatterplot smoothers, Heteroskedasticity, Non-Normality and Correlated Errors
- different meanings of the term, Least Squares
- factor variables in, Factor Variables in Regression-Ordered Factor Variables
  - ordered factor variables, Ordered Factor Variables
  - with many levels, Factor Variables with Many Levels
- interpreting the regression equation, Interpreting the Regression Equation-Interactions and Main Effects
  - confounding variables, Confounding Variables
  - correlated predictors, Correlated Predictors
  - interactions and main effects, Interactions and Main Effects
  - multicollinearity, Multicollinearity
- KNN (K-Nearest Neighbors), KNN as a Feature Engine
- logistic regression, Logistic Regression-Further Reading
  - comparison to linear regression, Linear and Logistic Regression: Similarities and Differences
- multiple linear regression, Multiple Linear Regression-Weighted Regression
- polynomial and spline regression, Polynomial and Spline Regression-Summary
  - generalized additive models, Generalized Additive Models
  - polynomial regression, Polynomial
  - splines, Splines
- prediction with, Prediction Using Regression-Factor Variables in Regression
  - confidence and prediction intervals, Confidence and Prediction Intervals
  - dangers of extrapolation, The Dangers of Extrapolation
- ridge regression, Regularization: Avoiding Overfitting
- simple linear regression, Simple Linear Regression-Further Reading
  - fitted values and residuals, Fitted Values and Residuals
  - least squares, Least Squares
  - prediction vs. explanation, Prediction versus Explanation (Profiling)
  - regression equation, The Regression Equation
- unsupervised learning as building block, Unsupervised Learning
- with a tree, Predicting a Continuous Value
regression coefficient, Simple Linear Regression
- in cotton exposure and lung capacity example, The Regression Equation
regression to the mean, Regression to the Mean
regularization, Boosting
- avoiding overfitting with, Regularization: Avoiding Overfitting
replacement (in sampling), Random Sampling and Sample Bias
- bootstrap, The Bootstrap
representativeness, Random Sampling and Sample Bias
resampling, The Bootstrap, Resampling-For Further Reading
- bootstrapping vs., Resampling versus Bootstrapping
- permutation tests, Permutation Test
  - exhaustive and bootstrap tests, Exhaustive and Bootstrap Permutation Test
  - value for data science, Permutation Tests: The Bottom Line for Data Science
  - web stickiness example, Example: Web Stickiness
- using in chi-square test, Chi-Square Test: A Resampling Approach
residual standard error, Multiple Linear Regression, Assessing the Model
residual sum of squares, Least Squares
- (see also least squares)
residuals, Simple Linear Regression, Fitted Values and Residuals
- computing, Fitted Values and Residuals
- distribution of, Heteroskedasticity, Non-Normality and Correlated Errors
- standardized, Testing the Assumptions: Regression Diagnostics
response, Simple Linear Regression, The Regression Equation
- relationship between predictor variable and, Partial Residual Plots and Nonlinearity
ridge regression, Model Selection and Stepwise Regression, Regularization: Avoiding Overfitting
robust, Estimates of Location
robust estimates of location
- example, population and murder rate by state, Example: Location Estimates of Population and Murder Rates
- mean absolute deviation from the median, Standard Deviation and Related Estimates
- median, Median and Robust Estimates
  - outliers and, Outliers
ROC curve, ROC Curve
root mean squared error (RMSE), Multiple Linear Regression, Assessing the Model, Predicting a Continuous Value
RSE (see residual standard error)
RSS (residual sum of squares), Least Squares
- (see also least squares)

S

sample bias, Random Sampling and Sample Bias, Random Sampling and Sample Bias
sample statistic, Sampling Distribution of a Statistic
samples
- definition of, Random Sampling and Sample Bias
- sample size, power and, Power and Sample Size-Further Reading
- terminology differences, Data Frames and Indexes
sampling, Data and Sampling Distributions-Summary
- binomial distribution, Binomial Distribution-Further Reading
- bootstrap, The Bootstrap-Further Reading
- confidence intervals, Confidence Intervals-Further Reading
- long-tail distributions, Long-Tailed Distributions-Further Reading
- normal distribution, Normal Distribution-Standard Normal and QQ-Plots
- oversampling imbalanced data, Oversampling and Up/Down Weighting
- Poisson and related distributions, Poisson and Related Distributions-Summary
  - estimating failure rate, Estimating the Failure Rate
  - exponential distribution, Exponential Distribution
  - Poisson distribution, Poisson Distributions
  - Weibull distribution, Weibull Distribution
- population versus sample, Data and Sampling Distributions
- random sampling and sample bias, Random Sampling and Sample Bias-Further Reading
- sampling distribution of a statistic, Sampling Distribution of a Statistic-Further Reading
- selection bias, Selection Bias-Further Reading
- Student's t-distribution, Student’s t-Distribution-Further Reading
- Thompson sampling, Multi-Arm Bandit Algorithm
- undersampling imbalanced data, Undersampling
- with and without replacement, Random Sampling and Sample Bias, The Bootstrap, Resampling
sampling distribution, Sampling Distribution of a Statistic-Further Reading
- central limit theorem, Central Limit Theorem
- data distribution vs., Sampling Distribution of a Statistic
- standard error, Standard Error
scale parameter (Weibull distribution), Weibull Distribution
scaling and categorical variables, Scaling and Categorical Variables-Summary
- dominant variables, Dominant Variables
- Gower's distance and categorical data, Categorical Data and Gower’s Distance
- problems clustering mixed data, Problems with Clustering Mixed Data
- scaling the variables, Scaling the Variables
scatterplot smoothers, Heteroskedasticity, Non-Normality and Correlated Errors
scatterplots, Correlation
- example, returns for ATT and Verizon, Scatterplots
scientific fraud, detecting, Fisher’s Exact Test
screeplots, Principal Components Analysis, Interpreting Principal Components
- for PCA of top stocks, Dominant Variables
searches
- search queries on Google, Size versus Quality: When Does Size Matter?
- vast search effect, Selection Bias
selection bias, Selection Bias-Further Reading
- regression to the mean, Regression to the Mean
self-selection sampling bias, Random Sampling and Sample Bias
sensitivity, Evaluating Classification Models, Precision, Recall, and Specificity
shape parameter (Weibull distribution), Weibull Distribution
signal-to-noise ratio, Choosing K
significance level, Power and Sample Size, Sample Size
significance tests, Hypothesis Tests, Data Science and P-Values
- (see also hypothesis tests)
simple random sample, Random Sampling and Sample Bias
single linkage, Measures of Dissimilarity
skew, Long-Tailed Distributions
skewness, Frequency Table and Histograms
slope, Simple Linear Regression
- (see also regression coefficient)
- in regression equation, The Regression Equation
SMOTE algorithm, Data Generation
spatial data structures, Nonrectangular Data Structures
specificity, Evaluating Classification Models, Precision, Recall, and Specificity
spline regression, Polynomial and Spline Regression, Splines
splines, Splines
split value, Tree Models
square-root of n rule, Standard Error
SS (sum of squares), ANOVA
- within cluster sum of squares, K-Means Clustering
standard deviation, Estimates of Variability
- and related estimates, Standard Deviation and Related Estimates
- covariance matrix and, Covariance Matrix
- in statistical testing output, A/B Testing
- sensitivity to outliers, Standard Deviation and Related Estimates
- standard error vs., Standard Error
standard error, Sampling Distribution of a Statistic
- formula for calculating, Standard Error
- standard deviation vs., Standard Error
standard normal distribution, Normal Distribution, Standard Normal and QQ-Plots
standardization, Standard Normal and QQ-Plots, K-Nearest Neighbors, K-Means Clustering
- in K-Nearest Neighbors, Standardization (Normalization, Z-Scores)
standardized residuals, Testing the Assumptions: Regression Diagnostics
- examining to detect outliers, Outliers
statistical experiments and significance testing, Statistical Experiments and Significance Testing-Summary
- A/B testing, A/B Testing-For Further Reading
- chi-square test, Chi-Square Test-Further Reading
- degrees of freedom, Degrees of Freedom-Further Reading
- hypothesis tests, Hypothesis Tests-Further Reading
- multi-arm bandit algorithm, Multi-Arm Bandit Algorithm-Further Reading
- multiple tests, Multiple Testing-Further Reading
- power and sample size, Power and Sample Size-Further Reading
- resampling, Resampling-Statistical Significance and P-Values
- statistical significance and p-values, Statistical Significance and P-Values-Further Reading
  - alpha, Alpha
  - data science and p-values, Data Science and P-Values
  - p-values, P-Value
  - type 1 and type 2 errors, Type 1 and Type 2 Errors
  - value of p-values, Value of the p-value
- t-tests, t-Tests-Further Reading
statistical inference, classical inference pipeline, Statistical Experiments and Significance Testing
statistical machine learning, Statistical Machine Learning-Summary
- bagging and the random forest, Bagging and the Random Forest-Hyperparameters
- boosting, Boosting-Summary
  - avoiding overfitting using regularization, Regularization: Avoiding Overfitting
  - hyperparameters and cross-validation, Hyperparameters and Cross-Validation
  - XGBoost, XGBoost
- K-Nearest Neighbors, K-Nearest Neighbors-KNN as a Feature Engine
  - as a feature engine, KNN as a Feature Engine
  - choosing K, Choosing K
  - distance metrics, Distance Metrics
  - example, predicting loan default, A Small Example: Predicting Loan Default
  - one hot encoder, One Hot Encoder
  - standardization, Standardization (Normalization, Z-Scores)
- tree models, Tree Models-Further Reading
  - measuring homogeneity or impurity, Measuring Homogeneity or Impurity
  - predicting a continuous value, Predicting a Continuous Value
  - recursive partitioning algorithm, The Recursive Partitioning Algorithm
  - simple example, A Simple Example
  - stopping tree growth, Stopping the Tree from Growing
  - uses of trees, How Trees Are Used
statistical moments, Frequency Table and Histograms
statistical significance, Permutation Test
statistics vs. machine learning, Statistical Machine Learning
stepwise regression, Model Selection and Stepwise Regression
stochastic gradient boosting, The Boosting Algorithm
- definition of, Boosting
- XGBoost implementation, XGBoost-Hyperparameters and Cross-Validation
stratified sampling, Random Sampling and Sample Bias, Random Selection
structured data, Elements of Structured Data-Further Reading
Student's t-distribution, Student’s t-Distribution-Further Reading
subjects, A/B Testing
success, Binomial Distribution
sum contrasts, Dummy Variables Representation

T

t-distributions, Student’s t-Distribution-Further Reading, t-Tests
- data science and, Student’s t-Distribution
t-statistic, t-Tests, Multiple Linear Regression, Assessing the Model
t-tests, t-Tests-Further Reading
tail, Long-Tailed Distributions
target shuffling, Selection Bias
test sample, Evaluating Classification Models
test statistic, A/B Testing, t-Tests
- selecting before the experiment, Why Have a Control Group?
Thompson sampling, Multi-Arm Bandit Algorithm
time series data, Nonrectangular Data Structures
time-to-failure analysis, Weibull Distribution
treatment, A/B Testing
treatment group, A/B Testing
tree models, Interactions and Main Effects, Exploring the Predictions, Tree Models
- how trees are used, How Trees Are Used
- measuring homogeneity or impurity, Measuring Homogeneity or Impurity
- predicting a continuous value, Predicting a Continuous Value
- recursive partitioning algorithm, The Recursive Partitioning Algorithm
- simple example, A Simple Example
- stopping tree growth, Stopping the Tree from Growing
Trellis graphics, Visualizing Multiple Variables
trials, Binomial Distribution
trimmed mean, Estimates of Location
- formula for, Mean
Tukey, John Wilder, Exploratory Data Analysis
two-way tests, Hypothesis Tests, One-Way, Two-Way Hypothesis Test
type 1 errors, Statistical Significance and P-Values, Type 1 and Type 2 Errors, Multiple Testing
type 2 errors, Statistical Significance and P-Values, Type 1 and Type 2 Errors

U

unbiased estimates, Standard Deviation and Related Estimates
undersampling, Undersampling
uniform random distribution, Fisher’s Exact Test
univariate analysis, Exploring Two or More Variables
unsupervised learning, Unsupervised Learning-Summary
- and prediction, Unsupervised Learning
- hierarchical clustering, Hierarchical Clustering-Measures of Dissimilarity
  - agglomerative algorithm, The Agglomerative Algorithm
  - dendrogram, The Dendrogram
  - dissimilarity measures, Measures of Dissimilarity
  - simple example, A Simple Example
- K-means clustering, K-Means Clustering-Selecting the Number of Clusters
  - interpreting the clusters, Interpreting the Clusters
  - K-means algorithm, K-Means Algorithm
  - selecting the number of clusters, Selecting the Number of Clusters
  - simple example, A Simple Example-K-Means Algorithm
- model-based clustering, Model-Based Clustering-Further Reading
  - mixtures of normals, Mixtures of Normals
  - multivariate normal distribution, Multivariate Normal Distribution
  - selecting the number of clusters, Selecting the Number of Clusters
- principal components analysis, Principal Components Analysis-Further Reading
  - computing the principal components, Computing the Principal Components
  - interpreting principal components, Interpreting Principal Components
  - simple example, A Simple Example-A Simple Example
- scaling and categorical variables, Scaling and Categorical Variables-Summary
  - dominant variables, Dominant Variables
  - Gower's distance and categorical data, Categorical Data and Gower’s Distance
  - problems clustering mixed data, Problems with Clustering Mixed Data
  - scaling the variables, Scaling the Variables
up weight or down weight, Strategies for Imbalanced Data, Oversampling and Up/Down Weighting
uplift vs. lift, Lift

V

validation sample, Evaluating Classification Models
variability, estimates of, Estimates of Variability-Further Reading
- example, murder rate by state population, Example: Variability Estimates of State Population
- key terminology, Estimates of Variability
- percentiles, Estimates Based on Percentiles
- standard deviation and related estimates, Standard Deviation and Related Estimates
variables
- exploring two or more, Exploring Two or More Variables-Summary
  - categorical and numeric data, Categorical and Numeric Data
  - hexagonal binning and contours, Hexagonal Binning and Contours (Plotting Numeric versus Numeric Data)
  - key concepts, Visualizing Multiple Variables
  - visualizing multiple variables, Visualizing Multiple Variables
- importance of, determining in random forests, Variable Importance
- rescaling with z-scores, Standardization (Normalization, Z-Scores)
variance, Estimates of Variability
- analysis of (ANOVA), ANOVA
- formula for calculating, Standard Deviation and Related Estimates
- sensitivity to outliers, Standard Deviation and Related Estimates
vast search effect, Selection Bias
violin plots, Exploring Two or More Variables
- combining with a boxplot, example, Categorical and Numeric Data

W

Ward's method, Measures of Dissimilarity
web stickiness example (permutation test), Example: Web Stickiness
web testing
- bandit algorithms in, Multi-Arm Bandit Algorithm
- deciding how long a test should run, Power and Sample Size
Weibull distribution, Poisson and Related Distributions
- calculating, Weibull Distribution
weighted mean, Estimates of Location
- expected value, Expected Value
weighted median, Estimates of Location, Median and Robust Estimates
- formula for calculating, Mean
weighted regression, Multiple Linear Regression, Weighted Regression
weights, Simple Linear Regression
- component loadings, A Simple Example
whiskers (in boxplots), Percentiles and Boxplots
wins, Multi-Arm Bandit Algorithm
within cluster sum of squares (SS), K-Means Clustering

X

XGBoost, XGBoost-Hyperparameters and Cross-Validation
- hyperparameters, Hyperparameters and Cross-Validation

Z

z-distribution, Standard Normal and QQ-Plots
- (see also normal distribution)
z-score, Normal Distribution, Strategies for Imbalanced Data, K-Nearest Neighbors, Standardization (Normalization, Z-Scores)
- converting data to, Standard Normal and QQ-Plots
- rescaling variables, Standardization (Normalization, Z-Scores)
