Actual vs. predicted plot linear model
Actual vs. predicted plot quadratic polynomial model
Amazon Food Review
American Statistical Association (ASA)
An Exploratory Technique for Investigating Large Quantities of Categorical Data
Apache Pig
Apriori
Area Under the Curve (AUC)
Artificial intelligence (AI)
Artificial neural networks (ANN)
architecture
components
linear seperability
MLP
attribute importance
by Garson method
by Olden method
deep learning
applications
architecture
darch for classification
guidelines
hidden layers
multi-layer
multiple linear and non-linear transformations
mxNet image classification
mxNet package
normalized image
volcano picture, image recognition exercise
evolutionary methods
expectation maximization
feed-forward back-propagation
GEP
hidden layer
human cognitive learning
learning algorithms
machine learning
non-parametric methods
particle swarm optimization
perceptron
purchase prediction
sigmoid neuron
simulated annealing
supervised vs . unsupervised neural nets
Association rule mining (ARM)
algorithms
apriori
confidence
Eclat
IBCF
item frequency plot
lift
Market Basket data
POS
scarcity visualization
support
transactional data
UBCF
Autocorrelation
Auto-correlation function (ACF)
Automatic grid search optimization
Back-propagation learning
Back-propagation method
Back-propagation of errors
Bagging
bootstrap aggregating
CART
random forest
Bayes formula
Bayesian algorithms
Bayesian optimization, machine learning models
black box function
Gaussian processes
parameters
random tuning
RMSE, cost and Sigma space
sample t-test
Bayes rule
Bayes theorem
Bias and variance tradeoff
boosting
bootstrap aggregation
bulls eye plot
components
definition
graphical representation
model performance improvements
plot function
random variable
real model prototype
Bias-variance decomposition
Bivariate plots
actual probability
actual vs. predicted plot
CustomerPropensity
IncomeClass
MembershipPoints
frequency
predicted probability
Boosting
Bootstrap aggregation
Bootstrap sampling
advantages
arguments
coefficient
confidence band
density function
disadvantages
histogram
hypothesis testing
jackknife
jackknife estimate
linear regression model
mean and variance
metric estimation
normal distribution
QQ plot
sampling distribution
t.test()
Boxplots
interquartile range
outliers
population
Breush-Pagan test
Bubble charts
fertility rate vs. life expectancy
GDP per capita vs. life expectancy
Business implications of sampling
deciding factors
features
machine learning
methods and interpretation
shortcomings
C5.0 algorithm
attribute-value description
discrete classes
evaluation
Hunt’s approach
logical classification models
model building
model summary
predefined classes
pruning
purchase prediction dataset
Ross Quinlan’s web page
sufficient data
caretEmseble() function
Caret package
complex regression and classification problems
function/tools
trainControl() function
train() function algorithm
CART
Central Limit Theorem
Centroid-based clustering
Chi-Square Automated Interaction Detection (CHAID)
algorithm
building the model
decision tree
model evaluation
R code
splitting
stopping
Classification and Regression Tree (CART)
building the model
cp (complexity parameter)
Gini-Index
model evaluation
pseudo code
regression tree-based approach
rpart function
Classification matrix
Classification tree
Class imbalance
Cluster dendogram
Cluster sampling
advantages
conditional statement
disadvantages
international transactions
International transactions
k-means function
outstanding balance
population data
single-stage sampling
startum variable
stratified() function
subsets
two-stage sampling
t.test()
two-stage
Clustering algorithms
Clustering analysis
algorithms
applications
centroid-based clustering
centroid models
connectivity models
definition
density-based clustering
density models
distribution-based clustering
distribution models
Dunn index
external evaluation
hierarchal
internal evaluation
Jaccard index
k-means
machine learning
principle
rand measure
silhouette coefficient
types
unsupervised learning algorithm
Cohort diagrams
active credit cards volume
credit example
definition
Collaborative filtering-based approach
Comma-separated values (CSV)
Computational savings
linear regression model
population dataset
sys.time()
Conditional independence
Confidence interval
Continuous variables
Convenience sampling
Cook’s distance
Correlation, definition
Correlation analysis
features
observations
Pearson correlation
population correlation coefficient
scatter plot, HousePrice vs. StoreArea
statistical relationship
Correlation plots
description
positive or negative correlation
world development indicators
Credit card fraud
data description
data exploration
data import
data transformation
pooled mean and variance
population mean
population variance
sampling plan
statistical measures
Credit risk modeling
Custom search algorithms
Data formats
Data frames
Data mining
Data preparation and exploration
categorical variables
data and visualization
date variable
derived variables
markup language
model building
n-day averages
reshaping
semi-Structured
structured
unstructured
variables types
Data science
Dataset
house sale prices prediction
purchase preference prediction
Data visualization, R
Data visualization, R
benefits
boxplots
bubble charts
cohort diagrams
correlation plots
definition
dendograms
elements, data presentation
ggplot2 package
heatmaps
histograms and density plots
line chart
pie charts
Sankey plots
scatterplot
spatial maps
stacked column charts
time series graphs
waterfall chart
wordclouds
world development indicators
Dates and times
Daylight saving time (DST)
Decision trees
algorithms
bagging
boosting
classification
decision nodes
ensemble models
ID3
leaf nodes
learning methods
measures
entropy
Gini Index
information gain
non-parametric model
regression
Deep learning algorithms
Dendograms
clusters, species classification
definition
distance/height
ggdendro() and dendextend()
x-axis
y-axis
Density-based clustering
border points
core points
DBSCAN
EM algorithm
outliers
parameters
Density-based spatial clustering of applications with noise (DBSCAN)
Density plot
Dimensionality reduction
algorithms
description
orthogonality, principal components
PCA
principal component analysis
Directed Acyclic Graph (DAG)
Distance-based/event-based algorithms
Distributed processing and storage
GFS
MapReduce
parallel execution in R
cores setting
problem statement
random forest model
stopping clusters
Distribution-based clustering
Distribution of studentized residuals
dplyr
Dunn Index
Durbin Watson statistics bounds
Durbin Watson test
Eclat
EM algorithm
Empirical Distribution Function (EDF)
Ensemble learning
methods
bagging
boosting
model performance improvement
supervised learning algorithm
voting ensembles
Ensemble models
Ensemble techniques illustration, R
algorithms, purchase prediction data
bagging trees
blending KNN and Rpart
C5.0 decision tree model
Caret package
caretStack() function
GBM model
resamples() function
stacking, caretEnsemble
Entropy
Exploratory Data Analysis (EDA)
Exposure at Default (EAD)
Extensible Markup languages (XML)
Factor variables
False positive rate (FPR)
Feature engineering
checklist
dimensionality reduction
embedded methods
feature ranking
filter methods
selection problem checklist
variable subset selection
working data
continuous/categorical features
EAD
LGD
PD
willingness to pay and ability to pay
wrapper methods
Feature ranking
Feedforward Neural Networks (FFNN)
Fine needle aspirate (FNA)
Fuzzy C-means clustering
Gains charts, AUC
Gauss-Markov theorem
Gene expression programming (GEP)
Generalized Linear Model (GLM)
GFS
ggplot2 Package
description
R documentation
Gini-Index
Google file system (GFS)
Gradient Boosting Machine (GBM)
H2O, machine learning in R
clusters initialization
deep learning demo
documented materials
java virtual machine
package installation
running demo
testing data
Hadoop ecosystem
Apache Pig
command pig-x local connects
count and sort
flattening tokens
group words
load data into A1
tokenize each line
components and tools
hadoop distributed file system
Hadoop YARN
HBase
create and put data
data scanning
starting HBase
Hive
Apache
creating tables
data loading, Hive table
describing tables
generating data and storing
HDFS
large-scale data processing
query selection
SQL queries
MapReduce
code snippet
libraries rmr2 and rhdfs
procedures
shuffle
Word Count
wordcount function
spark
Heat maps
description
regions vs. world development indicators
Hierarchal clustering
Hinge loss
Histogram
construction
description
GDP and population
Homoscedasticity
House sale price dataset
Human cognitive learning
Hyper-parameters
Bayesian approach
decision points
“higher-level” properties
optimization
automatic grid search
custom search algorithms
manual grid search
manual search
optimal search
random search
properties
random forest algorithm
random forest models
Hypertext Markup Language (HTML)
Hypothesis testing
Independent events
Influence plot
Infographics
Information gain
Initial data analysis (IDA)
description
dplyr
multiple sources
naming convention
str() function
table(): pattern
Item-Based Collaborative Filtering (IBCF)
cosine/Pearson correlation
creation rating matrix
data preparation
distribution of ratings
evaluation
exploring, rating matrix
loading data
raw ratings by users
true positive ratio vs. false positive ratio
UBCF recommendation model
Iteration error
Iterative Dichotomizer 3 (ID3)
algorithm
commands
model building
model evaluation
RWeka
RWekajars
Jaccard index
JSON file
Kappa error metric
K-fold cross validation
K-Means Clustering Algorithm
Knowledge Discovery and Data Mining (KDD)
Kolmogorov-Smirnov tests (KS test)
Kurtosis
Law of Large Numbers (LLN)
strong law
weak law
Learning Vector Quantization (LVQ)
Least Absolute Shrinkage and Selection Operator (LASSO)
LGD
Lift chart
Linear predictors
bias of estimator
consistent estimator
efficient estimator
OLS
Linear regression
actual vs. predicted
affine function
definition
dependent and independent variable
diagnostics
estimated equation
estimation
Gauss-Markov theorem
lm() package
minimization problem
model diagnostics
homoscedasticity
influential point analysis
multicollinearity
normality of residuals
outliers
residual autocorrelation
OLS
parametric method
predicted values
residuals
standard error
t-value and p-value
Line chart
description
GDP growth, countries
melt() function
Link function
List
Logistic regression
analysis
binomial
binomially distributed
logit transformation
model diagnostics
bivariate plots
concordance and discordant ratios
cumulative gains and lift charts
deviance
log likelihoods
pseudo R-Square
wald test
multinomial
odds ratio
ordered
predictor variables
Logit function
Logit transformation
Loss Given Default (LGD)
LOWESS plot (Locally Weighted Scatterplot Smoothing)
Machine learning (ML)
abstraction layer
algorithms
ANN
association rule mining
Bayesian algorithms
clustering algorithms
deep learning
dimensionality reduction
distance-based/event-based algorithms
ensemble learning
regression-based methods
regularization methods
text mining
tree-based algorithms
case study
computer vision
3D approach
demo in R
real-world use case
statistical background
distributions
evaluation
exploration
feature engineering
friction-less pipeline
intelligent personal assistant/machines
PEBE framework
phase forms
plethora of algorithms
predictive models
process flow
probability
conditional independence
counting
independent events
notation
statistics
randomness
R-package
statistical concepts
statistical learning
statistical modeling
statistics and computer science
types
factors
reinforcement learning
semi-supervised learning
supervised learning
unsupervised learning
Manual grid search optimization
MapReduce
Market Basket Data
Matrix
Maximum likelihood estimation (MLE)
Mean
Mean absolute error
Mean Absolute Percentage Error (MAPE)
Mean Absolute Scaled Error (MASE)
Microsoft Excel
Model building checklist
Model evaluation
continuous output
mean absolute error
model performance metrics
RMSE
R-square
discrete output
classification matrix
ROC curve
sensitivity and specificity
kappa error metric
population stability index
probabilistic techniques
statistical methods
Model performance
Bayesian optimization
bias and variance tradeoff
Caret package
continuous output
discrete output
ensemble learning
evaluation
hyper-parameters
machine learning and statistical modeling
testing data
training data
validation data
Model performance
Model sampling
Model-selection process
Model suffering
from bias
from variance
Moment
Monte Carlo method
acceptance-rejection methods
beta density
EDF
random sampling techniques
stochastic calculus
Multicollinearity
Multi-Layer Perceptron (MLP)
Multinomial logistic regression
classifier
class imbalance
estimation process
multinom() function
probability/proportion
Naive Bayes method
Bayes theorem
chain rule
conditional probability
data preparation
likelihood and marginal likelihood
model
model evaluation
posterior probability
prior probability
purchase prediction dataset
National Sample Survey Organization (NNSO)
Natural Language Processing (NLP)
Neuron anatomy
Nonparametric Multiplicative Regression (NPMR)
Non-probability sampling
Not Available (NAs)
Online machine learning algorithms
benefits and challenges
fuzzy C-means clustering
tackling
Optimal search optimization
Ordinary Least Square (OLS)
Particle swarm optimization
Part-of-speech (POS)
categorization
extraction
frequency
mapping
pre-processing
Pearson Product-Moment Correlation Coefficient
Perceptron
Performance evaluation metrics
Permutation
Pie charts
Point-of-sale (POS)
Polynomial regression
Pooled mean
Pooled variance
Population stability index
continuous distribution
discrete cases
discrete distributions
ECDF plots, Set_1 and Set_2
Empirical Cumulative Distribution Function (ECDF)
KS test
threshold values
Principal component analysis (PCA)
advantages
orthogonality
steps
Probabilistic techniques
bootstrap sampling
K-fold cross validation
Probability
vs. non-probability sampling
sampling technique
data dimensions
histogram
population mean
population variance
sampling methods
Probability of default (PD)
Pseudo R-Square
Purposive sampling
Quantile
Quota sampling
R
building blocks
calculations
data frames
data structures
functions
GNU S
lists
matrixes
packages
statistics
subsetting
vectors
Radial basis function (RBF)
Rand index
Random Forest
Random search algorithms
Random search optimization
rbinom()
R code
Receiver operating characteristic (ROC) curve
Recommendation algorithm
Recursive binary split
Recursive partitioning
Regression analysis
causation
distributional assumptions
linear model
non-parametric methods
notation
parametric methods
prediction/forecasting
statistical learning and machine learning space
statistical model
variables
Regression-based methods
Regression trees
Regularization algorithms
Reinforcement learning
Relational Database Management Systems (RDBMS)
Residual Sum of Squares (RSS)
Residuals vs. fitted plot
River plots
RMSE
ROC curve
Root mean square error (RMSE)
Root node
Sample point
Sampling
bias
classification
description
distribution
error
fraction
objectives
population mean
population statistics
sources and storing
technological advancement
test statistics
variance
Sampling without replacement (SWOR)
Sampling with replacement (SWR)
Sankey plots
Scatterplots
description
higher dimensional
population vs. GDP relationship
Semi-supervised learning
Serial correlation
Shapiro-Wilk test
Sigmoid function
Sigmoid neurons
Silhouette coefficient
Simple random sampling
distribution of data
function
histograms
hypothesis
KS test
population
population average
population sampling
population size
p-value of t.test
replacement
sample and population
sample() function
summarise function
without replacement
Simulated annealing
Simulation
Skewness
Spark’s machine learning
algorithms
build, ML model
MLlib
preprocessing
SparkDataFrame creation
SparkR session, initializing
sparkR.stop()
system properties, setting
test dataset
tools
Spatial maps
data frame creation
ggmap()
ggplot() function
India map, robbery counts
Specialization vs. generalization
Squared Euclidean distance
Stacked column charts
age dependency ratio
contribution, sectors
description
working age ratio
Stacking
Statistical learning
Stratified random sampling
disadvantages
histograms
KS test
population
proportion
sample() function
stratified function
stratified sampling
stratum variables
sub-populations
summarise() function
t.test()
Summary statistics
Supervised learning
Supervised vs. unsupervised learning
Support vector machine (SVM)
binary classifier
data preparation
data summary
model building
model evaluation
classification
class separation
hard margins
linear
multi-class
nonlinearity
overlapping classes
soft margins
Systematic random sampling
business and computational capacity
circular sampling frame
EDF
formula
homogeneous sets
KS test
population variance
sample distribution
sample frame
skip factor
subsetting
Term Frequency/Inverse Term frequency (TF_IDF)
Text mining algorithms
Text-mining approaches
consumer behavior/product performance
data preparation
data summary
Microsoft Cognitive Services
analytics features
language detection
mscstexta4r
Project Oxford
sentiment analysis
summarization
third-party API
topic detection
twitterR() package
NLP
POS tagging
summarization
text analysis
text data
TF-IDF
Twitter statics
word cloud
Time series graphs
GDP growth, countries
GDP growth, recession
Torsten Hothorn
True Negative Rate (TNR)
True positive rate (TPR)
Twitter feeds and article
UCI Machine Learning Repository
Unsupervised Fuzzy Competitive Learning
Unsupervised learning
User-Based Collaborative Filtering (UBCF)
Variable subset selection
definition
embedded method
fit model
fitted Cross Validated Linear Model
glmnet fit model
logistic regression
misclassification error and log of penalization factor (lambda)
regularization
statistical approaches
filter method
CoV
Gini coefficient
statistical approaches
variance threshold
wrapper method
Variance
Variance inflation factor (VIF)
Vectors
Wald test
Waterfall charts
Within cluster sum of squares (WCSS)
Wordclouds
World development indicators (WDI)
XML