Index

A

A/B testing
- about / A/B testing
abstraction, Scala
- about / Abstraction
- higher-kind projection / Higher-kind projection
- covariant functors for vectors / Covariant functors for vectors
- contravariant functors for co-vectors / Contravariant functors for co-vectors
- monads / Monads
actions / Actions
- asynchronous actions / Asynchronous actions
actions engine
- about / Basic components of a data-driven system, Actions engine
Activity Monitor
- about / System monitoring
Actor model
- about / The Actor model
- components / The Actor model
actors
- as people / Actors as people
- constructing / Actor construction, Anatomy of an actor, Follower network crawler
- fetcher / Fetcher actors
- about / Scalability
adaptive modeling / Model categorization
aggregate functions
- URL / Aggregation operations
aggregation operations
- about / Aggregation operations
aggregations
- with Group by / Aggregations with "Group by"
Akka.io
- about / An overview
Akka documentation / What we have not talked about
Akka framework
- about / An overview, Akka
- URL / Akka
- master-workers / Master-workers
- futures / Futures
Akka library / Futures example – stock price fetcher
Algebird
- about / Abstraction
algebraic libraries
- about / Algebraic and numerical libraries
- jBlas 1.2.3 / Algebraic and numerical libraries
- Colt 1.2.0 / Algebraic and numerical libraries
- AlgeBird 2.10 / Algebraic and numerical libraries
- Breeze 0.8 / Algebraic and numerical libraries
Alternating least squares (ALS)
- about / ML libraries
alternative preprocessing techniques
- autoregressive models / Alternative preprocessing techniques
- curve-fitting algorithms / Alternative preprocessing techniques
- nonlinear dynamic systems / Alternative preprocessing techniques
- Hidden Markov models / Alternative preprocessing techniques
Amazon Web Services (AWS)
- URL / Running Spark applications on EC2
Analysis of Variance (ANOVA)
- about / Multivariate regression
AngularJS
- about / UI component
- URL / UI component
annotation
- about / Segmentation, annotation, and chunking
annual dividend yield
- about / Fundamental analysis
ANother Tool for Language Recognition (ANTLR)
- URL / Text analysis pipeline
Apache Commons Math
- URL / Don't reinvent the wheel!
- about / Apache Commons Math
- description / Description
- licensing / Licensing
- installation / Installation
- installation, for Mac OS X / Installation
- installation, for Windows / Installation
Apache Parquet
- about / Parquet files
Apache Spark
- about / Apache Spark
- features / Why Spark?
- design principles / Design principles
- deployment modes / Deploying Spark
- performance evaluation / Performance evaluation
- pros / Pros and cons
- cons / Pros and cons
Apache Spark (Akka)
- about / Scalability
APIs
- creating, with Play / Creating APIs with Play: a summary
application
- building / Building an application
applications
- bootstrapping / Bootstrapping the applications
architecture, Spark
- about / Understanding Spark architecture
- task scheduling / Task scheduling
- Spark components / Spark components
- MQTT / MQTT, ZeroMQ, Flume, and Kafka
- ZeroMQ / MQTT, ZeroMQ, Flume, and Kafka
- Flume / MQTT, ZeroMQ, Flume, and Kafka
- Kafka / MQTT, ZeroMQ, Flume, and Kafka
- HDFS / HDFS, Cassandra, S3, and Tachyon
- Cassandra / HDFS, Cassandra, S3, and Tachyon
- S3 / HDFS, Cassandra, S3, and Tachyon
- Tachyon / HDFS, Cassandra, S3, and Tachyon
- Mesos / Mesos, YARN, and Standalone
- YARN / Mesos, YARN, and Standalone
- Standalone / Mesos, YARN, and Standalone
arrays
- about / Complex data types – arrays, maps, and structs, Arrays
Arrays / A whirlwind tour of JSON
artificial neural networks
- feed-forward neural networks / Feed-forward neural networks
- advantages / Benefits and limitations
- disadvantages / Benefits and limitations
Aster Data
- URL / Sessionization
authentication
- HTTP headers, adding / Authentication – adding HTTP headers
autonomous systems / The problem
Autoregressive Integrated Moving Average (ARIMA) / Alternative preprocessing techniques
Autoregressive Moving Average (ARMA) / Alternative preprocessing techniques
Avro
- about / Other serialization formats
AvroParquet
- about / Other serialization formats
Azkaban
- about / Data transformation layer

B

backend
- need for / Do I need a backend?
Balancer
- about / Running Hadoop HDFS
basic sampling
- about / Basic, stratified, and consistent sampling
batch gradient descent algorithm / Selecting an optimizer
batch training / Online training versus batch training
Baum-Welch estimator
- about / The Baum-Welch estimator (EM)
Bayesian network
- about / Probabilistic graphical models
Berkeley Data Analytics Stack (BDAS)
- reference / Apache Spark
Bernoulli mixture model
- about / Model
Bernoulli model
- about / The Multivariate Bernoulli classification
bias-variance decomposition
- about / Bias-variance decomposition
bias input / Mathematical background
BinaryClassificationMetrics instance
- URL / Evaluation
binary SVC
- about / The binary SVC
- LIBSVM / LIBSVM
- design / Design
- configuration parameters / Configuration parameters
- interface to LIBSVM / Interface to LIBSVM
- training / Training
- classification / Classification
- c-penalty and margin / C-penalty and margin
- kernel evaluation / Kernel evaluation
- applications in risk analysis / Applications in risk analysis
BLAS library / Basic Breeze data types
Body Mass Index (BMI) / DataFrames – a whirlwind introduction
BooleanColumnExtensionMethods class
- URL / Operations on columns
Bootstrap layouts
- URL / Towards a web application: HTML templates
Box-Cox transformation
- about / Heteroscedasticity
Breeze
- code, examples / Code examples
- installing / Installing Breeze
- help, getting / Getting help on Breeze
- Wiki page, on GitHub / Getting help on Breeze
- data types / Basic Breeze data types
- alternatives / Alternatives to Breeze
- URL / References, Linear regression
- API documents, URL / References
- diving into / Diving into Breeze
Breeze-viz
- about / Managing without documentation
- URL / Managing without documentation
- reference / Breeze-viz reference
Breeze Scala libraries / Abstraction
Broyden-Fletcher-Goldfarb-Shanno (BFGS) / BFGS
build.sbt file
- about / SBT
- URL / SBT

C

C-Epsilon SVM formulation / The nonseparable case – the soft margin
cake pattern
- about / Configurability, Step 3 – instantiation
Casbah
- URL / Casbah query DSL, References
- about / Beyond Casbah
Casbah query DSL
- about / Casbah query DSL
case classes
- used, for pattern matching / JSON in Scala – an exercise in pattern matching
- used, for extraction / Extraction using case classes
- as messages / Case classes as messages
- versus companion objects / Companion objects versus case classes
- versus enumerations / Enumerations versus case classes
- advantages / Enumerations versus case classes
cash per share
- about / Fundamental analysis
Cassandra
- about / HDFS, Cassandra, S3, and Tachyon
categorical field
- distinct values / Distinct values of a categorical field
categories, NP problems
- about / NP problems
- P-problems / NP problems
- NP problems / NP problems
- NP-complete problems / NP problems
- NP-hard problems / NP problems
central limit theorem (CLT)
- about / Sequential trials and dealing with risk
centroid / K-means clustering
Cholesky decomposition
- about / Cholesky factorization
Cholesky factorization
- about / Cholesky factorization
chromosomes / Evolutionary computing
chunking
- about / Segmentation, annotation, and chunking
class constructor template
- about / Class constructor template
classification metrics
- about / Classification metrics
classification model, evaluation factors
- accuracy / Key quality metrics
- precision / Key quality metrics
- recall / Key quality metrics
- F-measure or F-score / Key quality metrics
- G-measure / Key quality metrics
classification model, terminology
- true positives (TP) / Key quality metrics
- true negatives (TN) / Key quality metrics
- false positives (FP) / Key quality metrics
- false negatives (FN) / Key quality metrics
class prior
- about / Formalism
class prior probability
- about / Formalism
Client
- about / Running Hadoop HDFS
client-server applications
- about / Client-server applications
client-side program
- architecture / Client-side program architecture
- model, designing / Designing the model
- event bus / The event bus
- AJAX calls, through JQuery / AJAX calls through JQuery
- response views / Response views
clique
- about / A quick introduction to graphs
Cloudera
- URL / Running Hadoop HDFS
cluster assignment, K-means clustering
- about / Step 2 – cluster assignment
cluster configuration, K-means clustering
- about / Step 1 – cluster configuration
- clusters, defining / Defining clusters
- clusters, initializing / Initializing clusters
clustering
- about / Clustering
- expectation-maximization algorithm / The expectation-maximization algorithm
clustering algorithms
- K-means clustering / Clustering, K-means clustering
- EM / Clustering
co-vector
- about / Higher-kind projection
code snippets
- format / Code snippets format
collision / Transformers
command and control (C2)
- about / Influence diagrams
common discriminative kernels
- about / Common discriminative kernels
companion objects
- versus case classes / Companion objects versus case classes
complex adaptive systems / Introduction to LCS
complex queries / Complex queries
complex types
- ARRAY / Hive and Impala
- MAP / Hive and Impala
- STRUCT / Hive and Impala
- UNIONTYPE / Hive and Impala
components, XCS
- about / XCS components
- application to portfolio management / Application to portfolio management, The XCS core data
- XCS rules / XCS rules
- covering / Covering
- implementation example / An implementation example
computational workflow
- overview / An overview of computational workflows
conditional dependency / Training
conditional independence / A model by any other name
- about / Probabilistic graphical models
conditional random field (CRF)
- about / Conditional random fields, Introduction to CRF
- linear chain CRF / Linear chain CRF
- potential functions / Linear chain CRF
- identity potential functions / Linear chain CRF
- transition feature functions / Linear chain CRF
- state feature functions / Linear chain CRF
- text analytics / Regularized CRFs and text analytics
- versus HMM / Comparing CRF and HMM
configurability
- about / Configurability
configuration options
- URL / Reducing logging output and Spark configuration
configuration parameters, SVM
- SVM formulation / The SVM formulation
- SVM kernel function / The SVM kernel function
- SVM execution / The SVM execution
confusion matrix / F-score for multinomial classification
conjugate directions
- about / Conjugate gradient
conjugate gradient
- about / Conjugate gradient
connected components
- about / A quick introduction to graphs, Connected components
Connection class
- API documentation, URL / References
connectionism
- about / The biological background
consistent sampling
- about / Basic, stratified, and consistent sampling
constructive tuning strategy / Regularization
consumer price index (CPI)
- about / Introducing the multinomial Naïve Bayes
Consumer Price Index (CPI)
- about / Fundamental analysis
context bound / Coding against type classes
continuation-passing style (CPS) / Beyond actors – reactive programming
continuous space
- about / Continuous space and metrics
control learning / A solution – Q-learning
convolution neural networks
- about / Convolution neural networks
- local receptive fields / Local receptive fields
- weights, sharing / Sharing of weights
- convolution layers / Convolution layers
- subsampling layers / Subsampling layers
- fully connected hidden layer and output layer / Putting it all together
core parking
- about / Performance evaluation
correlation engine
- about / Basic components of a data-driven system, Correlation engine
correlations
- about / Basic correlations
Counter class
- about / Counter
covariant functor
- about / Covariant functors for vectors
cross-validation
- and model selection / Cross-validation and model selection
cross-validation, model
- about / Cross-validation
- one-fold cross validation / One-fold cross validation
- K-fold cross validation / K-fold cross validation
crossover operator, genetic algorithm implementation
- about / Crossover
- population / Population
- chromosomes / Chromosomes
- genes / Genes
curve fitting
- about / Supervised learning
custom supervisor strategies / Custom supervisor strategies
custom type serialization
- about / Custom type serialization

D

Darwinian process / The origin
data, profiling
- about / Profiling data
- immutable statistics / Immutable statistics
- Z-score / Z-Score and Gauss
data-driven system
- data ingest / Basic components of a data-driven system
- data transformation layer / Basic components of a data-driven system
- data analytics / Basic components of a data-driven system
- machine learning engine / Basic components of a data-driven system
- UI component / Basic components of a data-driven system
- actions engine / Basic components of a data-driven system
- correlation engine / Basic components of a data-driven system
- monitoring / Basic components of a data-driven system
data access layer
- about / Creating a data access layer
data analysis life cycle / Linear models
data analytics
- about / Basic components of a data-driven system, Data analytics and machine learning
database metadata
- accessing / Accessing database metadata
data chunks / 0xdata Sparkling Water
data clustering
- about / Clustering
data elements / 0xdata Sparkling Water
data extraction
- about / Data extraction
DataFrame
- using / Spark SQL and DataFrame
data frames / 0xdata Sparkling Water
- reference link / PySpark
DataFrames
- about / DataFrames – a whirlwind introduction
- joining, together / Joining DataFrames together
- custom functions / Custom functions on DataFrames
- immutability / DataFrame immutability and persistence
- persistence / DataFrame immutability and persistence
- SQL statements / SQL statements on DataFrames
DataFrameStatFunctions
- URL / Working with Scala and Spark Notebooks
data ingest
- about / Basic components of a data-driven system, Data ingest
- Syslog / Data ingest
- Rsync / Data ingest
- Kafka / Data ingest
data mapper pattern
- URL / References
data partitioning
- about / Clustering
data rearranging
- about / Sessionization
data science
- about / Data science
- programming in / Programming in data science
data segmentation
- about / Clustering
dataset
- URL / Data preprocessing and feature engineering
data shuffling
- about / Data shuffling and partitions
DataSourceConfig class
- pathName parameter / Data extraction
- normalize parameter / Data extraction
- reverseOrder parameter / Data extraction
- headerLines parameter / Data extraction
data sources
- interacting with / Interacting with data sources
- JSON files / JSON files
- Parquet files / Parquet files
data transformation layer
- about / Basic components of a data-driven system, Data transformation layer
- Oozie / Data transformation layer
- Azkaban / Data transformation layer
- StreamSets / Data transformation layer
data types
- about / Complex data types – arrays, maps, and structs
data types, Breeze
- about / Basic Breeze data types
- vectors / Vectors
- matrices / Matrices
- vectors, building / Building vectors and matrices
- matrices, building / Building vectors and matrices
- indexing / Advanced indexing and slicing
- slicing / Advanced indexing and slicing
- vectors, mutating / Mutating vectors and matrices
- matrices, mutating / Mutating vectors and matrices
- matrix multiplication / Matrix multiplication, transposition, and the orientation of vectors
- matrix transposition / Matrix multiplication, transposition, and the orientation of vectors
- vectors, orientation / Matrix multiplication, transposition, and the orientation of vectors
- data preprocessing / Data preprocessing and feature engineering
- feature engineering / Data preprocessing and feature engineering
- function optimization / Breeze – function optimization
- numerical derivatives / Numerical derivatives
- regularization / Regularization
DBpedia / Basics of information retrieval
decision-making agent / Concepts
decision boundary / Plotting data
decision tree
- about / Decision tree
decision tree, parameters
- maxDepth / Decision tree
- minInstancesPerNode / Decision tree
- maxBins / Decision tree
- minInfoGain / Decision tree
- maxMemoryInMB / Decision tree
- subsamplingRate / Decision tree
- useNodeIdCache / Decision tree
- checkpointDir / Decision tree
- checkpointInterval / Decision tree
decoding, hidden Markov model (HMM)
- about / Decoding – CF-3
- Viterbi algorithm / The Viterbi algorithm
def
- about / Understanding the problem
DenseVector or DenseMatrix
- URL / Vectors
dependency injection
- about / Configurability
deployment modes, Spark
- standalone / Deploying Spark
- local / Deploying Spark
- YARN cluster manager / Deploying Spark
- Apache Mesos resource manager / Deploying Spark
descriptive models / Model categorization
descriptive statistics
- about / Working with Scala and Spark Notebooks
designing
- about / Model versus design
design principles, Spark
- about / Design principles
- in-memory persistency / In-memory persistency
- laziness / Laziness
- transforms / Transforms and actions
- actions / Transforms and actions
- shared variables / Shared variables
design template, for classifiers
- about / Design template for immutable classifiers
destructive tuning strategy / Regularization
DFT-based filtering
- about / DFT-based filtering
dimension reduction
- about / Dimension reduction, Dimension reduction
- principal components analysis / Principal components analysis
- non-linear models / Non-linear models
directed acyclic graph (DAG) / Lifting the hood
Directed Acyclic Graph (DAG)
- about / Graph constraints
directed graphical models
- about / Probabilistic graphical models
Dirichlet distribution
- about / LDA
discrete Fourier transform (DFT)
- about / Discrete Fourier transform, PCA
discrete Kalman filter
- about / The discrete Kalman filter
- recursive algorithm / The discrete Kalman filter, The recursive algorithm
- optimal estimator / The discrete Kalman filter
- state space estimation / The state space estimation
- benefits / Benefits and drawbacks
- drawbacks / Benefits and drawbacks
- alternative preprocessing techniques / Alternative preprocessing techniques
discretization / Value encoding
distributed algorithms
- reference link / LDA
dividend coverage ratio
- about / Fundamental analysis
DMatrix class
- about / DMatrix class
DNA / Evolutionary computing
documents
- inserting / Inserting documents
Domain Specific Languages (DSL)
- about / Maintainability
drivers
- URL / Importing Slick
Drools
- URL / Actions engine
Dropwizard
- URL / UI component
- about / UI component
Druid
- about / Data transformation layer
- URL / Data transformation layer
dynamic programming
- about / Overview of dynamic programming
dynamic routing
- about / Dynamic routing

E

e-mails
- obtaining / Who is getting e-mails?
earnings per share (EPS)
- about / Fundamental analysis
edge list
- about / A quick introduction to graphs
edges
- about / A quick introduction to graphs
- adding, to graph / Adding nodes and edges
Eigenvalue decomposition
- about / Eigenvalue decomposition
Elastic Net
- about / Regularization
element-wise operators
- pitfalls / Vectors
Emacs / SBT
encapsulation
- about / Encapsulation
- package scope / Encapsulation
- class or object scope / Encapsulation
encoding scheme, genetic encoding
- about / The encoding scheme
- flat encoding / Flat encoding
- hierarchical encoding / Hierarchical encoding
ensemble learning methods
- about / Bagging and boosting – ensemble learning methods
enumerations
- versus case classes / Enumerations versus case classes
- advantages / Enumerations versus case classes
epoch / The training epoch
Erlang programming language / The Actor model
error backpropagation, training epoch
- about / Step 2 – error backpropagation
- weights' adjustment / Weights' adjustment
- error propagation / The error propagation
- computational model / The computational model
error handling, monadic data transformation
- about / Error handling
- input value / Error handling
- output value / Error handling
error insensitive zone
- about / An overview
estimators
- about / Estimators
evaluation
- about / Evaluation, Evaluation
- execution profile / The execution profile
- impact of learning rate / Impact of the learning rate
- impact of momentum factor / The impact of the momentum factor
- impact of number of hidden layers / The impact of the number of hidden layers
- test case / Test case
evaluation, hidden Markov model (HMM)
- about / Evaluation – CF-1
- alpha algorithm / Alpha – the forward pass
- beta algorithm / Beta – the backward pass
event bus / The event bus
evidence
- about / Formalism
evolution
- about / Evolution
- origin / The origin
- NP problems / NP problems
- evolutionary computing / Evolutionary computing
example data
- acquiring / Acquiring the example data
exchange-traded funds (ETFs) / Test case
execution contexts
- parallel execution, controlling with / Controlling parallel execution with execution contexts
ExecutionContextTaskSupport
- about / Processing a parallel collection
expectation-maximization (EM)
- about / Training – CF-2
expectation-maximization algorithm
- about / The expectation-maximization algorithm
- Gaussian mixture models / Gaussian mixture models
- overview / Overview of EM
- implementation / Implementation
- classification / Classification
- testing / Testing
- online EM algorithm / The online EM algorithm
Expectation Maximization (EM) algorithm
- about / LDA
experimenting, with Spark
- about / Experimenting with Spark
- Spark, deploying / Deploying Spark
- Spark shell, using / Using Spark shell
- MLlib / MLlib
- RDD generation / RDD generation
- K-means, using Spark / K-means using Spark
exploration-exploitation trade-off
- about / Exploration and exploitation
exponential moving average
- about / The exponential moving average
exponential normalization / Softmax
extended Kalman filter (EKF) / Benefits and drawbacks
Extended Kalman Filters (EKF) / The discrete Kalman filter
extended learning classifier systems
- about / Extended learning classifier systems
- exploration phase / Extended learning classifier systems
- exploitation phase / Extended learning classifier systems
- components / XCS components
extract, transform, and load (ETL)
- about / Basic components of a data-driven system
extraction
- case classes, using / Extraction using case classes

F

K-fold cross validation / K-fold cross validation
F-score for binomial classification
- about / F-score for binomial classification
F-score for multinomial classification
- about / F-score for multinomial classification
- macro method / F-score for multinomial classification
- micro method / F-score for multinomial classification
FACTORIE toolkit
- URL / POS tagging
- binary image, URL / POS tagging
Fast Fourier Transform (FFT)
- about / Discrete Fourier transform
feature construction
- reference link / MLlib algorithms in Spark
features extraction
- about / Extracting features
features maps / Sharing of weights
features selection
- about / Selecting features
Federal Election Commission (FEC)
- about / FEC data
- URL / FEC data
Federal Election Commission (FEC) data
- about / FEC data
- URL / FEC data
- Slick, importing / Importing Slick
- schema, defining / Defining the schema
- database, connecting to / Connecting to the database
- tables, creating / Creating tables
- inserting / Inserting data
- querying / Querying data
Federal Fund rate
- about / Fundamental analysis
Federal fund rate (FDF)
- about / Introducing the multinomial Naïve Bayes
feed-forward neural network (FFNN) / The biological background
feed-forward neural networks
- about / Feed-forward neural networks
- biological background / The biological background
- mathematical background / Mathematical background
FFNN without a hidden layer / The multilayer perceptron
finances 101
- about / Finances 101
- fundamental analysis / Fundamental analysis
- technical analysis / Technical analysis
- options trading / Options trading
- financial data sources / Financial data sources
first order predicate logic
- about / First order predicate logic
fitness functions, genetic algorithms
- about / The fitness score
- fixed fitness function / The fitness score
- evolutionary fitness function / The fitness score
- approximate fitness function / The fitness score
fixed lag smoothing / Fixed lag smoothing
Flex
- URL / Text analysis pipeline
floating point format
- URL / Defining the schema
Flume
- about / Data ingest, MQTT, ZeroMQ, Flume, and Kafka
- URL / MQTT, ZeroMQ, Flume, and Kafka
follower network crawler / Follower network crawler, Fault tolerance
fork-join pool
- about / Processing a parallel collection
ForkJoinTaskSupport
- about / Processing a parallel collection
Fourier analysis
- about / Fourier analysis
- discrete Fourier transform (DFT) / Discrete Fourier transform
- DFT-based filtering / DFT-based filtering
- market cycles, detecting / Detection of market cycles
Fourier transform
- about / Fourier analysis
frameworks
- about / Tools and frameworks
frequency domain
- about / Discrete Fourier transform
fully connected neural network / The network topology
functional approach
- versus object-oriented approach / Other serialization formats
function approximation
- about / Supervised learning, Quantization
function optimization / Breeze – function optimization
functors
- about / Abstraction
fundamental analysis
- about / Fundamental analysis
futures
- about / Futures
- URL / Futures, References
- result, using / Future composition – using a future's result
- blocking until completion / Blocking until completion
- parallel execution, controlling with execution contexts / Controlling parallel execution with execution contexts
- stock price fetchers example / Futures example – stock price fetcher
- concurrency and exception handling / Concurrency and exception handling with futures
futures, Akka framework
- about / Futures
- Actor life cycle / The Actor life cycle
- blocking on / Blocking on futures
- future callbacks, handling / Handling future callbacks

G

Ganglia
- URL / Monitoring, System monitoring
- about / System monitoring
Gauss-Newton technique
- about / Gauss-Newton
Gaussian mixture
- about / Unsupervised learning
generalization error
- about / Generalization error and overfitting
generalized autoregressive conditional heteroscedasticity (GARCH) / Alternative preprocessing techniques
generic Lp-norm
- about / Ln roughness penalty
genes / Evolutionary computing
genetic algorithms
- about / Genetic algorithms and machine learning
- discrete model parameters / Genetic algorithms and machine learning
- reinforcement learning / Genetic algorithms and machine learning
- neural network architecture / Genetic algorithms and machine learning
- ensemble learning / Genetic algorithms and machine learning
- components / Genetic algorithm components
- fitness score / The fitness score
- implementation / Implementation
- tests / Tests
- advantages / Advantages and risks of genetic algorithms
- disadvantages / Advantages and risks of genetic algorithms
genetic algorithms, for trading strategies
- about / GA for trading strategies
- trading strategies, defining / Definition of trading strategies
- test case / A test case
genetic encoding
- about / Genetic algorithm components, Encoding
- value encoding / Value encoding
- predicate encoding / Predicate encoding
- solution encoding / Solution encoding
- encoding scheme / The encoding scheme
genetic fitness functions
- about / Genetic algorithm components
genetic operators
- about / Genetic algorithm components, Genetic operators
- selection / Genetic operators, Selection
- crossover / Genetic operators, Crossover
- mutation / Genetic operators, Mutation
- transposition operator / Genetic operators
GitHub
- follower's graph / GitHub follower graph
- URL / JavaScript dependencies through web-jars
GitHub API
- URL / References
GitHub servers
- URL / Client-server applications
GitHub user data
- about / GitHub user data
- URL / GitHub user data
GNU Lesser General Public License (LGPL) / Licensing
GoogleFinancials / Data sources
gradient descent / Ordinary least squares regression
gradient descent methods
- about / Steepest descent
- steepest descent / Steepest descent
- conjugate gradient / Conjugate gradient
- stochastic gradient descent / Stochastic gradient descent
graph
- about / A quick introduction to graphs
graph-structured CRF / Introduction to CRF
graph algorithms
- about / Graph algorithms – GraphX and GraphFrames
- GraphX / Graph algorithms – GraphX and GraphFrames
- GraphFrames / Graph algorithms – GraphX and GraphFrames
Graph for Scala
- graph, creating / Graph for Scala
- reference link / Graph for Scala
- nodes, adding / Adding nodes and edges
- edges, adding / Adding nodes and edges
- constraints, setting / Graph constraints
- support for JSON / JSON
GraphFrames
- about / Graph algorithms – GraphX and GraphFrames
graphical models / Probabilistic graphical models
Graphite
- URL / Monitoring
GraphX
- about / Graph algorithms – GraphX and GraphFrames, GraphX
- node IDs / GraphX
- e-mails, obtaining / Who is getting e-mails?
- connected components / Connected components
- triangle counting algorithm / Triangle counting
- strongly connected components / Strongly connected components
- PageRank algorithm / PageRank
- SVD++ / SVD++
gross domestic product (GDP)
- about / Introducing the multinomial Naïve Bayes
Group by
- aggregations with / Aggregations with "Group by"
Gross Domestic Product (GDP)
- about / Fundamental analysis

H

Hadoop Distributed File System (HDFS) / Step 2 – loading data
- about / Task scheduling
Hadoop distributed file system (HDFS) / Apache Spark
Hadoop HDFS
- executing / Running Hadoop HDFS
- URL / Running Hadoop HDFS
hard margin / The separable case – the hard margin
HashingTF / Transformers
HDFS
- about / HDFS, Cassandra, S3, and Tachyon
headers
- adding, to HTTP requests in Scala / Adding headers to HTTP requests in Scala
Hello world
- with Akka / Hello world with Akka
Hessian matrix
- about / Jacobian and Hessian matrices
heteroscedasticity
- about / Heteroscedasticity
hidden layers / The multilayer perceptron
hidden Markov model (HMM)
- about / The hidden Markov model
- components / The hidden Markov model
- canonical forms / The hidden Markov model
- notations / Notations
- lambda model / The lambda model
- design / Design
- evaluation / Evaluation – CF-1
- training / Training – CF-2
- decoding / Decoding – CF-3
- canonical forms, implementing / Putting it all together
- training, test case 1 / Test case 1 – training
- evaluation, test case 2 / Test case 2 – evaluation
- as filtering technique / HMM as a filtering technique
- performance consideration / Performance consideration
Hidden Naïve Bayes (HNB) / Training
hinge loss / The nonseparable case – the soft margin
Hive
- about / Hive and Impala
- URL, for downloading / Hive and Impala
HMM constructor
- config / Putting it all together
- xt / Putting it all together
- form / Putting it all together
- quantize / Putting it all together
- f / Putting it all together
Homebrew package
- installation link / Setting up Python
HTML templates
- about / Towards a web application: HTML templates
HTTP
- about / HTTP – a whirlwind overview
HTTP headers
- adding / Authentication – adding HTTP headers
hyperplane / Binomial classification

I

Ignite File System (IGFS)
- URL / HDFS, Cassandra, S3, and Tachyon
Impala
- about / Hive and Impala
implementation, genetic algorithms
- about / Implementation
- software design / Software design
- key components / Key components
- selection operator / Selection
- population growth, controlling / Controlling the population growth
- GA configuration / The GA configuration
- crossover operator / Crossover
- mutation operator / Mutation
- reproduction / Reproduction
- solver / Solver
implementation, Q-learning
- about / Implementation
- software design / Software design
- states and actions / The states and actions
- search space / The search space, The policy and action-value
- Q-learning components / The Q-learning components
- Q-learning training / The Q-learning training
- tail recursion to rescue / Tail recursion to the rescue
- validation / The validation
- prediction / The prediction
indexing / Advanced indexing and slicing
influence diagrams
- about / Influence diagrams
- demonstration / Influence diagrams
information retrieval and text mining
- about / Basics of information retrieval
input forward propagation, training epoch
- about / Step 1 – input forward propagation
- computational flow / The computational flow
- error functions / Error functions
- operating modes / Operating modes
- softmax / Softmax
insensitive error
- about / An overview
interactivity
- about / Optimization and interactivity
invokers
- about / Invokers
Iris dataset
- about / Iris dataset
- URL / Iris dataset

J

Jacobian matrix
- about / Jacobian and Hessian matrices
Java
- about / Java
java.sql.Types package
- API documentation, URL / JDBC summary
Java Management Extensions (JMX)
- about / Process monitoring
Java Mission Control (JMC)
- about / System monitoring
JavaScript dependencies
- through web-jars / JavaScript dependencies through web-jars
Java Specification Request (JSR) / Linear models
- about / Process monitoring
JBlas/Linpack
- URL / Don't reinvent the wheel!
JDBC
- about / Interacting with JDBC
- first steps / First steps with JDBC
- database server, connecting to / Connecting to a database server
- tables, creating / Creating tables
- data, inserting / Inserting data
- data, reading / Reading data
- summary / JDBC summary
- functional wrappers / Functional wrappers for JDBC
- connections, with loan pattern / Safer JDBC connections with the loan pattern
- connections enriching, with pimp my library pattern / Enriching JDBC statements with the "pimp my library" pattern
- result sets in stream, wrapping / Wrapping result sets in a stream
- API documentation, URL / References
- versus Slick / Slick versus JDBC
JFreeChart
- about / JFreeChart
- description / Description
- licensing / Licensing
- installation / Installation
- installation, for Mac OS X / Installation
- installation, for Windows / Installation
JFreeChart documentation
- URL / Customizing plots
JFreeChart library
- about / Bias-variance decomposition
joda-time library
- about / GraphX
JSON
- about / A whirlwind tour of JSON
- interacting with / Interacting with JSON
- external APIs, querying / Querying external APIs and consuming JSON
- consuming / Querying external APIs and consuming JSON
- parsing / Parsing JSON
JSON4S types / JSON4S types
JSON files
- about / JSON files
JSON format
- about / Other serialization formats
JSON in Scala
- about / JSON in Scala – an exercise in pattern matching
- JSON4S types / JSON4S types
- fields extracting, XPath used / Extracting fields using XPath
JSON package
- URL / JSON
JSON support
- about / JSON
JSR110
- about / Process monitoring
JSR 223
- reference link / Jython and JSR 223
- about / Jython and JSR 223
Jython
- reference link / Jython and JSR 223

K

k-fold cross-validation / Cross-validation and model selection
K-fold cross-validation scheme / Assessing a model
k-means clustering
- about / Unsupervised learning
K-means clustering
- about / K-means clustering
- similarity, measuring / Measuring similarity
- algorithm, defining / Defining the algorithm
- cluster configuration / Step 1 – cluster configuration
- cluster assignment / Step 2 – cluster assignment
- reconstruction/error minimization / Step 3 – reconstruction/error minimization
- classification / Step 4 – classification
- curse of dimensionality / The curse of dimensionality
- evaluation, setting up / Setting up the evaluation
- results, evaluating / Evaluating the results
- number of clusters, tuning / Tuning the number of clusters
- validation / Validation
Kafka
- about / Data ingest, MQTT, ZeroMQ, Flume, and Kafka
Kalman smoothing
- about / Kalman smoothing
Kamon
- about / Monitoring
- URL / Monitoring
Kelly Criterion
- about / Sequential trials and dealing with risk
kernel functions
- about / Kernel functions, An overview
- common discriminative kernels / Common discriminative kernels
- linear kernel (dot product) / Common discriminative kernels
- polynomial kernel / Common discriminative kernels
- radial basis function (RBF) / Common discriminative kernels
- sigmoid kernel / Common discriminative kernels
- Laplacian kernel / Common discriminative kernels
- log kernel / Common discriminative kernels
- kernel monadic composition / Kernel monadic composition
kernel trick
- about / The kernel trick
key components, genetic algorithm implementation
- population / Population
- chromosomes / Chromosomes
- genes / Genes
key quality metrics
- about / Key quality metrics
Kryo
- about / Other serialization formats
Kudu
- URL / HDFS, Cassandra, S3, and Tachyon
Kullback-Leibler (KL) distance
- about / Continuous space and metrics

L

L-BFGS method / Breeze – function optimization
L1 regularization / Ln roughness penalty
L2 regularization / Ln roughness penalty
labeled point
- about / Labeled point
- reference link / Labeled point
LabeledPoint
- about / Nested data
Lagrange multipliers
- about / Lagrange multipliers
LAPACK library / Basic Breeze data types
Laplace / The zero-frequency problem
lasso regularization
- about / Ln roughness penalty
Latent Dirichlet allocation (LDA)
- about / Probabilistic graphical models, ML libraries
Latent Dirichlet Allocation (LDA)
- about / Unsupervised learning, LDA
lazy computation
- about / Towards re-usable code
lazy methods
- about / Computation on demand
LDL decomposition / LDL decomposition
learning classifier systems (LCS)
- about / Learning classifier systems, Introduction to LCS
- components / Introduction to LCS
- features / Why LCS?
- terminology / Terminology
- benefits / Benefits and limitations of learning classifier systems
- limitations / Benefits and limitations of learning classifier systems
learning vector quantization / Clustering
Least Absolute Shrinkage and Selection Operator (LASSO)
- about / Regularization
least squares problem / Numerical optimization
lemmatization / Basics of information retrieval
LET IT CRASH blog
- URL / References
Levenberg-Marquardt
- about / Levenberg-Marquardt
Levenshtein distance / Basics of information retrieval
libraries
- about / Other libraries and frameworks
libraries directory
- about / List of libraries and tools
LIBSVM
- about / LIBSVM
- URL, for downloading / LIBSVM
- URL, for documentation / LIBSVM
- benefits / LIBSVM
LIBSVM, Java classes
- svm_model / LIBSVM
- svm_node / LIBSVM
- svm_parameters / LIBSVM
- svm_problem / LIBSVM
- svm / LIBSVM
LIBSVM format
- need for / Labeled point
Lidstone / The zero-frequency problem
life-cycle hooks
- about / Life-cycle hooks
Lift
- about / UI component
lift-json library
- about / GraphX
Lift framework
- about / SBT
likelihood
- about / Formalism
Limited-Memory BFGS (L-BFGS)
- about / ML libraries
Limited memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) / L-BFGS
linear algebra
- about / Linear algebra
- QR decomposition / QR decomposition
- LU factorization / LU factorization
- LDL decomposition / LDL decomposition
- Cholesky factorization / Cholesky factorization
- singular value decomposition (SVD) / Singular Value Decomposition
- Eigenvalue decomposition / Eigenvalue decomposition
- algebraic libraries / Algebraic and numerical libraries
- numerical libraries / Algebraic and numerical libraries
linear chain CRF / Introduction to CRF
linear chain structured graph CRF / Introduction to CRF
linear regression
- about / Linear regression, Linear regression
- one-variate linear regression / One-variate linear regression
- ordinary least squares regression / Ordinary least squares regression
- versus SVR / SVR versus linear regression
Linear Support Vector Machine (SVM)
- about / SVMWithSGD
linear SVM
- about / The linear SVM
- separable case (hard margin) / The separable case – the hard margin
- nonseparable case (soft margin) / The nonseparable case – the soft margin
line type
- customizing / Customizing the line type
Ling-Spam dataset
- URL / Reference, Introducing MLlib – Spam classification
Ling-Spam email dataset
- URL / Acquiring the example data, Spam filtering
loan pattern / Reading data
- JDBC connections with / Safer JDBC connections with the loan pattern
LogBinRegression constructor
- obsSet / Step 5 – implementing the classifier
- expected / Step 5 – implementing the classifier
- maxIters / Step 5 – implementing the classifier
- eta / Step 5 – implementing the classifier
- eps / Step 5 – implementing the classifier
logistic regression
- about / An example – logistic regression, Beyond logistic regression, Logistic regression, Logistic regression, Logistic regression
- regularization / Regularization in logistic regression
- logistic function / Logistic function
- binomial classification / Binomial classification
- design / Design
- training workflow / The training workflow
- classification / Classification
looser coupling
- with type classes / Looser coupling with type classes
- type classes / Type classes
- coding, against type classes / Coding against type classes
- type classes, using / When to use type classes
- type classes, benefits / Benefits of type classes
loss functions
- about / Linear regression
low-band filter
- about / The exponential moving average
LU factorization
- about / LU factorization
- basic LU factorization / LU factorization
- with pivot / LU factorization

M

machine learning
- features / Why machine learning?
Machine Learning (ML)
- about / Data analytics and machine learning
machine learning algorithms
- taxonomy / Taxonomy of machine learning algorithms
Machine Learning course
- URL / References
machine learning engine
- about / Basic components of a data-driven system, Data analytics and machine learning
machine learning problems
- classification / Classification
- prediction / Prediction
- optimization / Optimization
- regression / Regression
maintainability
- about / Maintainability
map optimization
- about / Using word2vec to find word relationships
maps
- about / Maps
Markov Chain Decision Process
- about / Influence diagrams
Markov decision processes
- about / Markov decision processes
- Markov property / Markov decision processes, The Markov property
- first order discrete Markov chain / The first order discrete Markov chain
master-workers, Akka
- about / Master-workers
- exchange of messages / Exchange of messages
- worker actors / Worker actors
- workflow controller / The workflow controller
- master actor / The master actor
- master with routing / Master with routing
- discrete Fourier transform (DFT) / Distributed discrete Fourier transform
- limitations / Limitations
mathematical abstractions
- about / Supporting mathematical abstractions
- variable declaration / Step 1 – variable declaration
- model definition / Step 2 – model definition
- instantiation / Step 3 – instantiation
mathematical concepts
- about / Mathematics
- linear algebra / Linear algebra
- first order predicate logic / First order predicate logic
- Jacobian matrix / Jacobian and Hessian matrices
- Hessian matrix / Jacobian and Hessian matrices
- optimization techniques / Summary of optimization techniques
- dynamic programming / Overview of dynamic programming
mathematical notation / Mathematical notation for the curious
matrices
- about / Matrices
- building / Building vectors and matrices
- mutating / Mutating vectors and matrices
maximum margin classifiers
- kernel trick / Max-margin classification
mean squared error (MSE) / One-variate linear regression
measurement noise covariance / The measurement equation
Mesos
- URL / Task scheduling
- about / Mesos, YARN, and Standalone
message
- passing, between actors / Message passing between actors
message-passing mechanisms
- fire-and-forget or tell / The Actor model
- send-and-receive or ask / The Actor model
message sender
- accessing / Accessing the sender of a message
metaphor for graphical models / Probabilistic graphical models
methodology
- defining / Defining a methodology
metrics
- about / Continuous space and metrics
Michigan approach / Why LCS?
micro-batch processing
- about / Streaming word count
mirrors
- reference link / Linux
mixins
- about / Composing mixins to build a workflow
mixins, composing for building workflow
- about / Composing mixins to build a workflow
- problem, understanding / Understanding the problem
- modules, defining / Defining modules
- workflow, instantiating / Instantiating the workflow
MLlib / Breeze – function optimization
- spam classification / Introducing MLlib – Spam classification
MLlib algorithms
- about / MLlib algorithms in Spark
- Term Frequency Inverse Document Frequency (TF-IDF) / TF-IDF
- Latent Dirichlet Allocation (LDA) / LDA
ML libraries
- about / ML libraries
- SparkR / SparkR
- graph algorithms / Graph algorithms – GraphX and GraphFrames
model
- about / A model by any other name
- features / A model by any other name
- attributes / A model by any other name
- variables / A model by any other name
- parametric / A model by any other name
- differential / A model by any other name
- probabilistic / A model by any other name
- graphical / A model by any other name
- directed graphs / A model by any other name
- numerical method / A model by any other name
- chemistry / A model by any other name
- taxonomy / A model by any other name
- grammar and lexicon / A model by any other name
- inference logic / A model by any other name
- versus design / Model versus design
- features, selecting / Selecting features
- features, extracting / Extracting features
model, assessing
- about / Assessing a model
- validation / Validation
- cross-validation / Cross-validation
- bias-variance decomposition / Bias-variance decomposition
- overfitting / Overfitting
Model-View-Controller (MVC)
- architecture / Model-View-Controller architecture
model categorization
- about / Model categorization
- predictive models / Model categorization
- descriptive models / Model categorization
- adaptive modeling / Model categorization
modeling
- about / Modeling, Model versus design
model monitoring
- about / Model monitoring
- performance, monitoring / Performance over time
- model, retiring criteria / Criteria for model retiring
- A/B testing / A/B testing
modular JavaScript
- through RequireJS / Modular JavaScript through RequireJS
monadic composition
- about / Monads
monadic data transformation
- about / Monadic data transformation
- explicit model / Monadic data transformation, Explicit models
- implicit model / Monadic data transformation, Implicit models
- error handling / Error handling
monads
- about / Abstraction, Monads
MongoDB
- about / MongoDB
- manual installation, URL / MongoDB
- connecting, with Casbah / Connecting to MongoDB with Casbah
- authentication, connecting with / Connecting with authentication
- reference documentation, URL / Complex queries
Monitor class
- about / Monitor
monitoring
- about / Basic components of a data-driven system, Monitoring
Monthly Active Users (MAU)
- about / Linear regression
morphism / Error handling
moving averages
- about / Moving averages
- simple moving average / The simple moving average
- weighted moving average / The weighted moving average
- exponential moving average / The exponential moving average
MQTT
- about / MQTT, ZeroMQ, Flume, and Kafka
MTable instances
- URL / Accessing database metadata
multiclass problems
- about / Multiclass problems
multilayer perceptron
- about / The multilayer perceptron
- activation function / The activation function
- network topology / The network topology
- design / Design
- UML class diagram / Design
- configuration / Configuration
- network components / Network components
- model / The model
- problem types (modes) / Problem types (modes)
- online training, versus batch training / Online training versus batch training
- training epoch / The training epoch
- training and classification / Training and classification
Multilayer Perceptron Classifier (MLPC)
- about / Perceptron
multinomial Naïve Bayes model
- about / Introducing the multinomial Naïve Bayes
- formalism / Formalism
- frequentist perspective / The frequentist perspective
- predictive model / The predictive model
- zero-frequency problem / The zero-frequency problem
Multivariate Analysis of Variance (MANOVA)
- about / Multivariate regression
Multivariate Bernoulli classification
- about / The Multivariate Bernoulli classification
- model / Model
- implementation / Implementation
multivariate regression
- about / Multivariate regression
MurmurHash function
- about / Basic, stratified, and consistent sampling
mutation operator, genetic algorithm implementation
- about / Mutation
- population / Population
- chromosomes / Chromosomes
- genes / Genes
Mutual Information (MI) / Spam filtering

N

.NET MyMediaLite library
- reference link / SVD++
n-grams / Basics of information retrieval
NameNode
- about / Running Hadoop HDFS
Namenode UI
- URL / Running Hadoop HDFS
natural language processing (NLP) / The feature functions model
Naïve Bayes
- applying, to text mining / Naïve Bayes and text mining
Naïve Bayes algorithm
- pros / Pros and cons
- cons / Pros and cons
Naïve Bayes classifiers
- about / Naïve Bayes classifiers
- multinomial Naïve Bayes / Introducing the multinomial Naïve Bayes
Naïve Bayes classifiers implementation
- about / Implementation
- design / Design
- training / Training
- classification / Classification
- F1 validation / F1 validation
- feature extraction / Feature extraction
- testing / Testing
Naïve Bayes models
- about / Probabilistic graphical models
- mathematical notation / Formalism
nested data
- about / Nested data
- working with / Nested data
net profit margin
- about / Fundamental analysis
net sales
- about / Fundamental analysis
network components, multilayer perceptron
- about / Network components
- network topology / The network topology
- input and hidden layers / Input and hidden layers
- output layer / The output layer
- synapses / Synapses
- connections / Connections
- initialization weights / The initialization weights
NodeJS
- about / UI component
Node Manager
- about / Mesos, YARN, and Standalone
nodes
- adding, to graph / Adding nodes and edges
non-linear models, dimension reduction
- about / Non-linear models
- kernel PCA / Kernel PCA
- manifolds / Manifolds
nonlinear least squares minimization
- about / Nonlinear least squares minimization
- Gauss-Newton / Gauss-Newton
- Levenberg-Marquardt / Levenberg-Marquardt
nonlinear SVM
- about / The nonlinear SVM
- max-margin classification / Max-margin classification
- kernel trick / The kernel trick
NP problems
- categories / NP problems
- about / NP problems
Nu-SVM / The nonseparable case – the soft margin
numerical optimization
- about / Numerical optimization
- Newton / Numerical optimization
- Quasi-Newton / Numerical optimization
NumericColumnExtensionMethods class
- URL / Operations on columns
numeric field
- summarization / Summarization of a numeric field
- grepping, across multiple fields / Grepping across multiple fields
NVD3
- used, for drawing plots / Drawing plots with NVD3
- URL / Drawing plots with NVD3

O

object-oriented approach
- versus functional approach / Other serialization formats
object-oriented design patterns
- URL / References
objects
- extracting, from database / Extracting objects from the database
Objects / A whirlwind tour of JSON
observation
- about / Extracting features
one-class SVC
- used, for anomaly detection / Anomaly detection with one-class SVC
one-variate linear regression
- about / One-variate linear regression
- implementation / Implementation
- test case / Test case
online training / Online training versus batch training
Online Transaction Processing (OLTP)
- about / Hive and Impala
Oozie
- about / Data transformation layer
operating income
- about / Fundamental analysis
operating profit margin
- about / Fundamental analysis
operations
- on columns / Operations on columns
optimal substructures
- about / Overview of dynamic programming
optimization
- about / Optimization and interactivity
- feedback loops / Feedback loops
optimization techniques
- about / Summary of optimization techniques
- gradient descent methods / Steepest descent
- Quasi-Newton algorithms / Quasi-Newton algorithms
- nonlinear least squares minimization / Nonlinear least squares minimization
- Lagrange multipliers / Lagrange multipliers
OptionModel class / The OptionModel class
OptionProperty class / The OptionProperty class
options trading
- about / Options trading
option trading, with Q-learning
- about / Option trading using Q-learning
- OptionProperty class / The OptionProperty class, The OptionModel class
- quantization / Quantization
Ordering
- URL / Transformations and actions on RDDs
ordinary least squares regression
- about / Ordinary least squares regression
- design / Design
- implementation / Implementation
- trending, test case 1 / Test case 1 – trending
- feature selection, test case 2 / Test case 2 – feature selection
outputs, linear models
- residuals / Linear models
- coefficients / Linear models
- residual standard error / Linear models
- multiple R-squared / Linear models
- F-statistic / Linear models
overfitting
- about / Overfitting, The frequentist perspective, Generalization error and overfitting
overlapping substructures
- about / Overview of dynamic programming
overload operators
- about / Overloading
- += / Overloading
- + / Overloading

P

package.scala source file
- URL / Breeze-viz reference
padding / Value encoding
PageRank algorithm
- about / PageRank
PaintScale.scala source file
- URL / More advanced scatter plots
parallel collections
- about / Parallel collections
- limitations / Limitations of parallel collections
- error handling / Error handling
- parallelism level, setting / Setting the parallelism level
- cross-validation with / An example – cross-validation with parallel collections
parallel collections, Scala
- about / Processing a parallel collection
- benchmark framework / The benchmark framework
- performance evaluation / Performance evaluation
Parallel Colt
- URL / Don't reinvent the wheel!
parallel execution
- controlling, with execution contexts / Controlling parallel execution with execution contexts
parameters, SparkR glm implementation
- formula / Generalized linear model
- family / Generalized linear model
- data / Generalized linear model
- lambda / Generalized linear model
- alpha / Generalized linear model
- standardize / Generalized linear model
- solver / Generalized linear model
Pareto chart
- about / Working with Scala and Spark Notebooks
Parquet
- about / Nested data, Other serialization formats
- reference link / Nested data
parquet file
- about / Nested data
- URL / Nested data
Parquet files
- URL / References
parsers
- URL / Understanding and parsing the request
Partial Least Square Regression (PLSR) / Evaluation
partially connected neural networks / The network topology
pattern matching
- case classes used / JSON in Scala – an exercise in pattern matching
Pattern matching
- for comprehensions / Pattern matching in for comprehensions
- internals / Pattern matching internals
- URL / Reference
pattern matching
- working with / Working with pattern matching
pay-out ratio
- about / Fundamental analysis
Pearson correlation coefficient
- about / Working with Scala and Spark Notebooks
penalized least squares regression / Ln roughness penalty
perceptron
- about / Perceptron
performance considerations
- about / Performance considerations
- K-means / K-means
- EM / EM
- PCA / PCA
performance evaluation, Spark
- about / Performance evaluation
- parameters, tuning / Tuning parameters
- tests / Tests
- performance considerations / Performance considerations
permanence spectrum / Programming in data science
persistence level
- URL / Persisting RDDs
Pimp my Library pattern
- URL / References
pimp my library pattern
- URL / Enriching JDBC statements with the "pimp my library" pattern
- used, for enriching JDBC statements / Enriching JDBC statements with the "pimp my library" pattern
pipeline
- about / Pipeline components
- transformers / Transformers
- estimators / Estimators
pipeline API
- URL / References
Pittsburgh approach / Why LCS?
Play
- about / UI component
Play framework / Futures example – stock price fetcher
- about / The Play framework, SBT
- URL / Dynamic routing
plots
- customizing / Customizing plots
- drawing, with NVD3 / Drawing plots with NVD3
Poisson distribution
- about / Heteroscedasticity
Pool
- about / Key components
Porter Stemmer
- implementation / Simple text analysis, A Porter Stemmer implementation of the code
- URL / Simple text analysis
- reference link / A Porter Stemmer implementation of the code
POS (part-of-speech) tagging
- about / POS tagging
posterior probability
- about / Formalism
Power Iteration Clustering (PIC)
- about / ML libraries, Unsupervised learning
Predicted Residual Error Sum of Squares (PRESS) / Evaluation
predictive model
- about / The predictive model
predictive models / Model categorization
PreparedStatement API documentation
- URL / Inserting data
PreparedStatement class
- API documentation, URL / References
price/book value ratio (PB)
- about / Fundamental analysis
price/earnings ratio (PE)
- about / Fundamental analysis
price/sales ratio (PS)
- about / Fundamental analysis
price patterns
- about / Price patterns
Price to Earnings/Growth (PEG)
- about / Fundamental analysis
primal problem / The nonseparable case – the soft margin
Principal Component Analysis (PCA)
- about / ML libraries
principal components analysis, dimension reduction
- about / Principal components analysis
- algorithm / Algorithm
- implementation / Implementation
- test case / Test case
- evaluation / Evaluation
probabilistic graphical models
- about / Probabilistic graphical models
probabilistic kernels
- about / Common discriminative kernels
probabilistic reasoning
- about / Probabilistic graphical models
probabilistic structures
- about / Probabilistic structures
problem dimensionality
- about / Problem dimensionality
process monitoring
- about / Process monitoring
Project Gutenberg
- URL / Simple text analysis
projections
- about / Projections
propositional logic
- about / First order predicate logic
protein sequence annotation
- about / An overview
Protobuf
- about / Other serialization formats
pseudo-regret
- about / Exploration and exploitation
PySpark / PySpark
Python
- integrating with Scala / Integrating with Python
- setting up / Setting up Python
- PySpark / PySpark
- calling from Java/Scala / Calling Python from Java/Scala
Python, calling from Java/Scala
- about / Calling Python from Java/Scala
- sys.process._, using / Using sys.process._
- Spark pipe / Spark pipe
- Jython / Jython and JSR 223
- JSR 223 / Jython and JSR 223

Q

Q-learning
- about / A solution – Q-learning
- Bellman optimality equations / The Bellman optimality equations
- temporal difference, for model-free learning / Temporal difference for model-free learning
- action-value iterative update / Action-value iterative update
- implementation / Implementation
- for option trading / Option trading using Q-learning
- implementing / Putting it all together
- evaluation / Evaluation
QR decomposition / Ordinary least squares regression
QStar class / The Viterbi algorithm
quantization / Value encoding
Quasi-Newton algorithms
- about / Quasi-Newton algorithms
- Broyden-Fletcher-Goldfarb-Shanno (BFGS) / BFGS
- Limited memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) / L-BFGS
queue control
- and pull pattern / Queue control and the pull pattern

R

R
- Scala, integrating with / Integrating with R
- setting up / Setting up R and SparkR
- setting up, on Linux / Linux
- setting up, on Mac OS / Mac OS
- for Mac OS, download link / Mac OS
- for Windows, download link / Windows
- setting up, on Windows / Windows
read-evaluate-print-loop (REPL)
- about / Getting started with Scala
real-world Bayesian network
- example / Probabilistic graphical models
Receiver Operating Characteristic (ROC)
- about / SVMWithSGD
receiver operating characteristic (ROC) curve / Evaluation
recombination
- about / Evolutionary computing
reconstruction/error minimization, K-means clustering
- about / Step 3 – reconstruction/error minimization
- K-means components, creating / Creating K-means components
- tail recursive implementation / Tail recursive implementation
- iterative implementation / Iterative implementation
recursive algorithm, discrete Kalman filter
- about / The recursive algorithm
- prediction phase / Prediction
- correction / Correction
- Kalman smoothing / Kalman smoothing
- fixed lag smoothing / Fixed lag smoothing
- experimentation / Experimentation
regression
- about / What regression stands for?
regression model / Design
regression trees
- about / Regression trees
regression weights
- about / One-variate linear regression
regularization / Regularization
- in logistic regression / Regularization in logistic regression
- about / Regularization, Ln roughness penalty, Regularization
- Ln roughness penalty / Ln roughness penalty
- ridge regression / Ridge regression
reinforcement learning
- about / Model categorization, Reinforcement learning
- problem / The problem
- Q-learning / A solution – Q-learning
- terminologies / Terminology
- value of a policy / Value of a policy
- pros / Pros and cons of reinforcement learning
- cons / Pros and cons of reinforcement learning
reinforcement learning agent
- overview architecture / Concepts
Remote Procedure Call (RPC)
- about / Other serialization formats
reproducing kernel Hilbert spaces
- about / Common discriminative kernels
request
- parsing / Understanding and parsing the request
RequireJS
- modular JavaScript through / Modular JavaScript through RequireJS
residuals mean square (RMS) / Step 5 – minimizing the sum of square errors
resilient applications
- building / Futures
resilient distributed dataset (RDD) / Apache Spark
- transformation / Apache Spark
- action / Apache Spark
Resilient Distributed Dataset (RDD)
- about / Task scheduling
Resilient Distributed Datasets (RDD)
- about / Computation on demand
Resilient distributed datasets (RDD)
- about / Resilient distributed datasets
- immutability / RDDs are immutable
- operations, executing / RDDs are lazy
- constructing / RDDs know their lineage
- resiliency / RDDs are resilient
- distribution / RDDs are distributed
- transformations / Transformations and actions on RDDs
- actions / Transformations and actions on RDDs
- operations, URL / Transformations and actions on RDDs
- persisting / Persisting RDDs
- Key-value / Key-value RDDs
- double / Double RDDs
Resource Manager
- about / Mesos, YARN, and Standalone
response
- composing / Composing the response
response views / Response views
Rest APIs
- about / Rest APIs: best practice
results
- URL / Composing the response
ResultSet interface
- API documentation, URL / References
ridge regression
- about / Ln roughness penalty, Ridge regression
- design / Design
- implementation / Implementation
- test case / Test case
Riemann metric
- about / Kernel monadic composition
risk handling
- about / Sequential trials and dealing with risk
ROC
- about / SVMWithSGD
routing
- about / Routing
Rsclient/Rserve
- reference link / Using Rserve
RStudio
- reference link / Running Spark via R's command line
Rsync
- about / Data ingest
Run-Length Encoding (RLE)
- about / Nested data

S

S3
- about / HDFS, Cassandra, S3, and Tachyon
SBT
- about / SBT
- features / SBT
- URL / SBT
sbteclipse project
- URL / SBT
Scala
- and data science / Data science
- uses / Why Scala?, Scala encourages immutability, Easier parallelism
- static typing and type inference / Static typing and type inference
- and functional programs / Scala and functional programs
- null pointer uncertainty / Null pointer uncertainty
- interoperability, with Java / Interoperability with Java
- drawbacks / When not to use Scala
- references / References
- URL / References
- about / Why Scala?, Scala, Scala
- features / Why Scala?
- abstraction / Abstraction
- scalability / Scalability
- configurability / Configurability
- maintainability / Maintainability
- computation / Computation on demand
- time series / Time series in Scala
- object creation / Object creation
- streams / Streams
- parallel collections / Parallel collections
- URL, for downloading / Getting started with Scala
- installing / Getting started with Scala
- working with / Working with Scala and Spark Notebooks
- integrating, with R / Integrating with R
- big data / Generalized linear model
- nulls / Generalized linear model
- invoking from R / Invoking Scala from R
- Rserve, using / Using Rserve
- integrating, with Python / Integrating with Python
Scala, integrating with Python
- about / Integrating with Python
- PySpark / PySpark
Scala, integrating with R
- DataFrames / DataFrames
- linear models / Linear models
- generalized linear model / Generalized linear model
- JSON files, reading in SparkR / Reading JSON files in SparkR
- Parquet files, writing in SparkR / Writing Parquet files in SparkR
Scala API
- reference link / PySpark
scalability
- about / Scalability
scalability, with Actors
- about / Scalability with Actors
- Actor model / The Actor model
- partitioning / Partitioning
- reactive programming / Beyond actors – reactive programming
Scalable frameworks
- about / An overview
Scala constructs
- URL / Reference
Scala plugin for Eclipse
- reference / Scala
Scala plugin for IntelliJ IDEA
- reference / Scala
Scala programming
- about / Scala programming
- libraries directory / List of libraries and tools
- code snippets format / Code snippets format
- encapsulation / Encapsulation
- class constructor template / Class constructor template
- companion objects, versus case classes / Companion objects versus case classes
- enumerations, versus case classes / Enumerations versus case classes
- overload operators / Overloading
- design template, for classifiers / Design template for immutable classifiers
- data extraction / Data extraction
- financial data sources / Data sources
- document extraction / Extraction of documents
- DMatrix class / DMatrix class
- Counter class / Counter
- Monitor class / Monitor
scalastyle plugin
- URL / SBT
Scala Swing
- about / UI component
Scalate template
- URL / Process monitoring
Scalatra
- URL / Process monitoring
Scalaz
- about / Abstraction
scatter plot matrix plots
- about / Multi-plot example – scatterplot matrix plots
scatter plots
- about / More advanced scatter plots
schema
- defining / Defining the schema
Secondary Namenode
- about / Running Hadoop HDFS
segmentation
- about / Segmentation, annotation, and chunking
semantic URLs / Dynamic routing, References
semi-supervised learning
- about / Semi-supervised learning
sequences
- extracting / Extracting sequences
Sequential Minimal Optimization (SMO) / The nonseparable case – the soft margin
- about / LIBSVM
sequential trials
- managing / Sequential trials and dealing with risk
serialization
- about / Simple text analysis
serialization formats
- about / Other serialization formats
- XML / Other serialization formats
- JSON / Other serialization formats
- YAML / Other serialization formats
- Protobuf / Other serialization formats
- Avro / Other serialization formats
- Thrift / Other serialization formats
- Parquet / Other serialization formats
- Kryo / Other serialization formats
sessionization
- about / Sessionization
short interest
- about / Fundamental analysis
short interest ratio
- about / Fundamental analysis
shrinkage
- about / Ln roughness penalty
shuffling / Data shuffling and partitions
Simple Build Tool (SBT)
- about / Scala
simple build tool (sbt) / Deploying Spark
simple moving average
- about / The simple moving average
simple workflow
- writing / Writing a simple workflow
- problem, scoping / Step 1 – scoping the problem
- data loading / Step 2 – loading data
- data, preprocessing / Step 3 – preprocessing the data
- immutable normalization / Immutable normalization
- patterns, discovering / Step 4 – discovering patterns
- data, analyzing / Analyzing data
- data, plotting / Plotting data
- classifier, implementing / Step 5 – implementing the classifier
- optimizer, selecting / Selecting an optimizer
- model, training / Training the model
- observations, classifying / Classifying observations
- model, evaluating / Step 6 – evaluating the model
single page applications
- about / Single page applications
singular value decomposition / Ordinary least squares regression
Singular Value Decomposition (SVD)
- about / ML libraries, SVD++
singular value decomposition (SVD) / PCA
- about / Singular Value Decomposition
slicing / Advanced indexing and slicing
Slick
- importing / Importing Slick
- arguments, URL / Defining the schema
- joins, URL / Invokers
- versus JDBC / Slick versus JDBC
- URL / References
- about / UI component
smoothing factor for counters
- about / The zero-frequency problem
smoothing kernels
- about / Common discriminative kernels
soft margin / The nonseparable case – the soft margin
source code
- about / Source code
- context, versus view bounds / Context versus view bounds
- presentation / Presentation
- primitive types / Primitive types
- type conversions / Type conversions
- implicit conversion / Type conversions
- immutability / Immutability
- Scala iterators, performance / Performance of Scala iterators
spam filtering
- about / Spam filtering
Spark
- installing / Installing Spark
- URL / Installing Spark, SQL statements on DataFrames, Setting up Spark
- on EC2, URL / Running Spark applications on EC2
- data shuffling / Data shuffling and partitions
- Web UI, URL / Reference
- internals, URL / Reference
- setting up / Setting up Spark
- architecture / Understanding Spark architecture
- components / Spark components
- performance, tuning / Spark performance tuning
Spark, applications
- word count / Word count
- word count, streaming / Streaming word count
- Spark SQL, using / Spark SQL and DataFrame
- DataFrame, using / Spark SQL and DataFrame
SPARK-3703
- reference link / Bagging and boosting – ensemble learning methods
Spark applications
- running, locally / Running Spark applications locally
- URL / Running Spark applications locally
- running, on EC2 / Running Spark applications on EC2
Spark ecosystem
- about / Apache Spark
Sparkling Water
- about / 0xdata Sparkling Water
Spark Master
- about / Task scheduling
Spark Notebook
- URL / Working with Scala and Spark Notebooks
Spark notebooks
- URL / Data visualization beyond breeze-viz
Spark Notebooks
- working with / Working with Scala and Spark Notebooks
SparkR
- about / SparkR
- setting up / Setting up R and SparkR
- setting up, on Linux / Linux
- Linux setup, reference link / Linux
- setting up, on Mac OS / Mac OS
- setting up, on Windows / Windows
- running, via scripts / Running SparkR via scripts
- running, via R command line / Running Spark via R's command line
- JSON files, reading / Reading JSON files in SparkR
- Parquet files, writing / Writing Parquet files in SparkR
Spark RDDs
- reference link / PySpark
Spark SQL
- using / Spark SQL and DataFrame
spectral density estimation
- purpose / Fourier analysis
SQL statements
- on DataFrames / SQL statements on DataFrames
stackable trait injection / Composing mixins to build a workflow
stand-alone programs
- building / Building and running standalone programs
Standalone
- URL / Task scheduling
- about / Mesos, YARN, and Standalone
standalone programs
- about / Standalone programs
Stanford NLP toolkit
- URL / Spam filtering
stateful actors / Stateful actors
state space estimation, discrete Kalman filter
- about / The state space estimation
- transition equation / The transition equation
- measurement equation / The measurement equation
steepest descent
- about / Steepest descent
stemming / Basics of information retrieval
stimuli / The biological background
stochastic gradient descent / Ordinary least squares regression
- about / Stochastic gradient descent
Stochastic Gradient Descent (SGD)
- about / ML libraries
Stochastic Gradient Descent (SGD) algorithm
- about / Logistic regression
stratified sampling
- about / Basic, stratified, and consistent sampling
streaming k-means
- about / Unsupervised learning
StreamSets
- about / Data transformation layer
StringColumnExtensionMethods class
- URL / Operations on columns
strongly connected components
- about / Strongly connected components
structs
- about / Structs
substructures
- about / Overview of dynamic programming
sum of squared errors (SSE) / One-variate linear regression
supervised learning
- about / Supervised learning, Records and supervised learning
- Iris dataset / Iris dataset
- labeled point / Labeled point
- SVMWithSGD / SVMWithSGD
- logistic regression / Logistic regression
- decision tree / Decision tree
- ensemble learning methods / Bagging and boosting – ensemble learning methods
supervised machine learning algorithms
- about / Supervised learning
- generative models / Generative models
- discriminative models / Discriminative models
support vector machines (SVMs)
- about / Support vector machines
- linear SVM / The linear SVM
- nonlinear SVM / The nonlinear SVM
SVC
- about / Support vector classifiers – SVC
- binary SVC / The binary SVC
- one-class SVC / Anomaly detection with one-class SVC
SVD++
- about / SVD++
SVM
- components / Design
- configuration parameters / Configuration parameters
- performance considerations / Performance considerations
SVM dual problem
- kernel trick / Max-margin classification
SVMLight
- about / LIBSVM
SVMWithSGD
- about / SVMWithSGD
SVR
- about / Support vector regression
- overview / An overview
- versus linear regression / SVR versus linear regression
Syslog
- about / Data ingest
system monitoring
- about / System monitoring

T

Tachyon
- about / HDFS, Cassandra, S3, and Tachyon
tagging model / Basics of information retrieval
task scheduling
- about / Task scheduling
TaskSupport
- about / Processing a parallel collection
taxonomy, machine learning algorithms
- about / Taxonomy of machine learning algorithms
- unsupervised learning / Unsupervised learning
- supervised learning / Supervised learning
- semi-supervised learning / Semi-supervised learning
- reinforcement learning / Reinforcement learning
technical analysis
- about / Technical analysis
- trading data / Trading data
- trading signal and strategy / Trading signals and strategy
- price patterns / Price patterns
technical analysis, terminology
- bearish or bearish position / Terminology
- bullish or bullish position / Terminology
- long position / Terminology
- neutral position / Terminology
- oscillator / Terminology
- overbought / Terminology
- oversold / Terminology
- relative strength index (RSI) / Terminology
- resistance / Terminology
- short position / Terminology
- support / Terminology
- technical indicator / Terminology
- trading range / Terminology
- trading signal / Terminology
- volatility / Terminology
temporal difference
- about / Temporal difference for model-free learning
Term Frequency Inverse Document Frequency (TF-IDF)
- about / TF-IDF
terminology, LCS
- environment / Terminology
- agent / Terminology
- predicate / Terminology
- compound predicate / Terminology
- action / Terminology
- rule / Terminology
- classifier / Terminology
- rule fitness or score / Terminology
- sensors / Terminology
- input data stream / Terminology
- rule matching / Terminology
- covering / Terminology
- predictor / Terminology
terminology, reinforcement learning
- environment / Terminology
- agent / Terminology
- state / Terminology
- goal / Terminology
- absorbing state / Terminology
- terminal state / Terminology
- action / Terminology
- policy / Terminology
- best policy / Terminology
- reward / Terminology
- episode / Terminology
- horizon / Terminology
test case, evaluation
- about / Test case
- implementation / Implementation
- evaluation of models / Evaluation of models
- impact of the hidden layers' architecture / Impact of the hidden layers' architecture
test case, trading strategy
- about / A test case
- trading strategies, creating / Creating trading strategies
- optimizer, configuring / Configuring the optimizer
- best trading strategy, finding / Finding the best trading strategy
testing, Naïve Bayes
- about / Testing
- textual information, retrieving / Retrieving the textual information
- text mining classifier, evaluating / Evaluating the text mining classifier
tests, genetic algorithms
- about / Tests
- weighted score / The weighted score
- unweighted score / The unweighted score
text analysis pipeline
- about / Text analysis pipeline
- simple text analysis / Simple text analysis
text analytics, conditional random field (CRF)
- about / Regularized CRFs and text analytics
- feature functions model / The feature functions model
- design / Design
- implementation / Implementation
- CRF classifier, configuring / Configuring the CRF classifier
- CRF model, training / Training the CRF model
- CRF model, applying / Applying the CRF model
- tests / Tests
- training convergence profile / The training convergence profile
- impact, of size of training set / Impact of the size of the training set
- impact, of L2 regularization factor / Impact of the L2 regularization factor
text mining
- about / Naïve Bayes and text mining
- Naïve Bayes, applying to / Naïve Bayes and text mining
text mining methodology
- implementing / Implementation
- documents, analyzing / Analyzing documents
- frequency of relative terms, extracting / Extracting the frequency of relative terms
- features, generating / Generating the features
ThreadPoolTaskSupport
- about / Processing a parallel collection
Thrift
- about / Other serialization formats
- reference link / Other serialization formats
time series, in Scala
- about / Time series in Scala
- types and operations / Types and operations
- magnet pattern / The magnet pattern
- transpose operator / The transpose operator
- differential operator / The differential operator
- lazy views / Lazy views
tokenization
- about / Transformers, Text analysis pipeline
tokens
- URL / Authentication – adding HTTP headers
tools
- about / Tools and frameworks
trading signal / Trading signals and strategy
trading strategies
- about / Definition of trading strategies
- trading operators / Trading operators
- cost function / The cost function
- trading signals / Trading signals
- trading strategies / Trading strategies
- trading signal encoding / Trading signal encoding
training, hidden Markov model (HMM)
- about / Training – CF-2
- Baum-Welch estimator / The Baum-Welch estimator (EM)
training, Naïve Bayes classifiers implementation
- about / Training
- class likelihood / Class likelihood
- binomial model / Binomial model
- multinomial model / The multinomial model
- classifier components / Classifier components
training and classification, multilayer perceptron
- about / Training and classification
- regularization / Regularization
- model generation / The model generation
- Fast Fisher-Yates shuffle / The Fast Fisher-Yates shuffle
- prediction / Prediction
- model fitness / Model fitness
training epoch, multilayer perceptron
- about / The training epoch
- input forward propagation / Step 1 – input forward propagation
- error backpropagation / Step 2 – error backpropagation
- exit condition / Step 3 – exit condition
- implementing / Putting it all together
training workflow, logistic regression
- about / The training workflow
- optimizer, configuring / Step 1 – configuring the optimizer
- Jacobian matrix, computing / Step 2 – computing the Jacobian matrix
- convergence of optimizer, managing / Step 3 – managing the convergence of the optimizer
- least squares problem, defining / Step 4 – defining the least squares problem
- sum of square errors, minimizing / Step 5 – minimizing the sum of square errors
- binomial multivariate logistic regression, testing / Test
traits
- working with / Working with traits
transformations
- URL / Key-value RDDs
transformers
- about / Transformers
- URL / References
trending / Test case 1 – trending
triangle counting algorithm
- about / Triangle counting
triangle inequality
- about / Continuous space and metrics
try/catch statements
- versus Try type / Error handling
Try type
- versus try/catch statements / Error handling
- URL / References
tuning memory usage
- URL / Persisting RDDs
Turkey paradox
- about / Unknown unknowns
two-step lag smoothing algorithm / Experimentation
type classes
- loose coupling with / Looser coupling with type classes
- about / Type classes
- coding against / Coding against type classes
- usage / When to use type classes
- benefits / Benefits of type classes
- URL / References
Typesafe Activator
- URL / Akka
Typesafe activators
- about / The Play framework
- URL / The Play framework

U

UI component
- about / Basic components of a data-driven system, UI component
- Scala Swing / UI component
- Lift / UI component
- Play / UI component
- Dropwizard / UI component
- Slick / UI component
- NodeJS / UI component
- AngularJS / UI component
unknown unknowns
- about / Unknown unknowns
unstructured data
- usage / Other uses of unstructured data
unsupervised learning
- about / Unsupervised learning, Unsupervised learning
- data clustering / Clustering
- dimension reduction / Dimension reduction
URL design / Dynamic routing
user-defined function (UDF) / Custom functions on DataFrames
user-defined functions (UDFs) / Custom functions on DataFrames

V

validation, model
- about / Validation
- key quality metrics / Key quality metrics
- F-score for binomial classification / F-score for binomial classification
- F-score for multinomial classification / F-score for multinomial classification
Vapnik-Chervonenkis (VC) dimension
- about / Problem dimensionality
variance-bias trade-off
- about / Bias-variance decomposition
Vector
- about / Nested data
- SparseVector / Nested data
- DenseVector / Nested data
vector quantization
- about / Clustering
vectors
- about / Vectors
- dense / Dense and sparse vectors and the vector trait
- sparse / Dense and sparse vectors and the vector trait
- trait / Dense and sparse vectors and the vector trait
- building / Building vectors and matrices
- mutating / Mutating vectors and matrices
vertices
- about / A quick introduction to graphs
vi / SBT
view bounds / Context versus view bounds
Viterbi algorithm
- about / The Viterbi algorithm
- psi / The Viterbi algorithm
- qStar / The Viterbi algorithm
- delta / The Viterbi algorithm
ViterbiPath class / Putting it all together
ViterbiPath object / Putting it all together

W

web-jars
- JavaScript dependencies through / JavaScript dependencies through web-jars
web APIs
- querying / Querying web APIs
web application
- about / Towards a web application: HTML templates
web frameworks
- about / Introduction to web frameworks
web services
- external web services, calling / Calling external web services
weighted graph
- about / A quick introduction to graphs
weighted moving average
- about / The weighted moving average
word2vec
- using / Using word2vec to find word relationships
- Porter Stemmer / A Porter Stemmer implementation of the code
WordNet / Basics of information retrieval
workflow computational model
- about / A workflow computational model
- mathematical abstractions, supporting / Supporting mathematical abstractions
- mixins, combining to build workflow / Composing mixins to build a workflow
- modularization / Modularization

X

0xdata H2O / 0xdata Sparkling Water
0xdata Sparkling Water
- about / 0xdata Sparkling Water
XML format
- about / Other serialization formats
XPath
- used, for extracting fields / Extracting fields using XPath
XPath DSL / Extracting fields using XPath

Y

1-year Treasury bill (1yTB)
- about / Introducing the multinomial Naïve Bayes
Yahoo Finances / Step 1 – scoping the problem
YahooFinancials / Data sources
YAML format
- about / Other serialization formats
YARN
- URL / Task scheduling
- about / Mesos, YARN, and Standalone

Z

zero-frequency problem
- about / The zero-frequency problem
ZeroMQ
- about / MQTT, ZeroMQ, Flume, and Kafka
