ACID
Actor model
Akka installation
Akka logos
OOP vs. actors
thread-based concurrency
Agents server
Aggregation techniques
materialized views
probabilistic data structures
windowed events
Akka Actors
actor communication
actor lifecycle methods
actor monitoring
actor reference
actorSelection() method
actor system
BadPerformer
deadlock
GoodPerformer
GreeterActor
import akka.actor.Actor
installation
kill Actors
match expression
receive() method
shutdown() method
starting actors
stopping actors
Thread.sleep
Apache Cassandra
cassandra.yaml
client-server architecture
driver
service petitioners
service providers
via CQLs
cluster booting
cluster setting
connection establishment
data model
GitHub
gossip
installation
CQL commands
CQL shell
DESCRIBE command
execution
file download
requirements
validation
memory access
column-family
key-value
NoSQL
characteristics
data model
Apache Kafka
add servers
amazingTopic
cluster mirroring
headers
Kafka topics
reAmazingTopic
reassign-partition tool
remove configuration
replication factor
architecture
design
goals
groups
leaders
log compaction
message compression
offset
replication modes
segment files
cluster
broker property
components
multiple broker
single broker
consumer
consumer API
multithreaded consumer
properties
Scala consumer
GitHub project
Gradle compilation
installation
importing
install Java 1.7
Linux
integration
Apache Spark
consumer parameters
data processing algorithms
JDK validation
libmesos
message broker
CEP
distributed
multiclient
persistent
scenario
types of, actors
uses
producers
custom partitioning
Producer API
Properties
Scala Kafka producer
tools
Apache Mesos
clusters
Apache Kafka
Apache Spark
indicators
MASTER
SLAVES
concurrency
coordinators
distributed systems
characteristics
complexity
models
types of, processes
dynamic process
Framework
abstraction levels
architecture
implementation
Mesos 101
Aurora framework
Chronos framework
installation
Marathon framework
ZooKeeper framework
rule
Apache Spark
Amazon S3
architecture
metadata
methods
object creation
SparkContext
cluster manager
administration commands
Amazon EC2
architecture
cluster mode
deploy-mode option
driver
environment variables
execution
master flag
Mesos
scheduling data
Spark Master UI
spark-submit flags
spark-submit script
variables
core module
download page
GraphX module
MLlib module
modern shells
Parallelism
RDDs
DataFrames API
goals
operations
rules
standalone applications
types
SQL module
Streaming
24/7 Spark Streaming
architecture
batch size
checkpointing
garbage collector
module
operation
parallelism techniques
Transformations
testing
Upload text file
Application programming interface (API)
Big Data
Akka model
Apache Cassandra
Apache Hadoop
Apache Kafka
Apache Mesos
data center operation
DevOps
open source technology
data engineers
ETL
infrastructure needs
lambda architecture
OLAP
prediction
SMACK stack
vs. Modern Big Data
vs. Traditional Big Data
vs. Traditional Data
Business intelligence (BI)
Cassandra Query Language (CQL)
cassandra.yaml
Chronos framework
architecture
installation process
.jar file
web interface
Client-server
Cloud
Cluster
Commutative operations
Complex event processing (CEP)
Concurrency
Conflict-free replicated data types (CRDTs)
Consistent, Available, and Partition Tolerant (CAP)
Coordinator
Cqlsh
Custom Partitioning
compile
consumer program
create topic
CustomPartitionProducer.scala
import
properties
RUN command
SimplePartitioner class
Dashboard
Data allocation
Data analyst
Data architects
Data feed
Data gravity
Data pipelines
Akka and Cassandra
CassandraCluster
ConfigCassandraCluster App
TestActorRef class
TweetScanActor downloads
TweetWriteActor writes
TwitterReadActor reads
Akka and Kafka
Akka and Spark
ReceiverInputDStream
remote actor system
ssc.start() method
StreamingContext
asynchronous message passing
checkpointing
consensus
data locality
data parallelism
Dynamo system
failure detection
gossip protocol
HDFS implementations
isolation
kafka-connect-cassandra
bulk mode
CQL types
SinkRecords
timestamp based mode
location transparency
masterless
network partition
replication
scalable infrastructure
shared nothing architecture
Spark-Cassandra connector
Cassandra function
CassandraOption.deleteIfNone
CassandraOption.unsetIfNone
collection of, Objects
collection of, Tuples
Enable Spark Streaming
modify CQL collections
save RDD
saving data
setting up Spark Streaming
Stream creation
user-defined types
SPOF
Data recovery
DBMS
Determinism
Development operations (DevOps)
Dimension data
Directed acyclic graph (DAG)
Distributed computing
Eventual consistency (EC)
Exponential backoff
Extract, Transform, and Load (ETL)
Failover
Fast data
ACID vs. CAP
consistency
CRDT
properties
theorem
Apache Hadoop
applications
big data
characteristics
analysis streaming
direct ingestion
message queue
per-event transactions
data enrichment
advantages
capacity
data pipelines
data recovery
data streams analysis
queries
real-time user interaction
Streaming Transformations
Tag data identifiers
avoid idempotency
idempotent operation
ordered requests
timestamp resolution
unique id
unordered requests
use offset
use upsert
gossip
Graph database
Hadoop Distributed File System (HDFS)
Hybrid Transaction Analytical Processing (HTAP)
Infrastructure as a Service (IaaS)
In-memory data grid (IMDG)
Internet of Things (IoT)
Java Message Service (JMS)
Keyspace
Key-value
Lambda architecture
Latency
Lazy evaluation
Literal functions
Map() method
Maps
immutable maps
mutable maps
master-slave
Mesos installation
libraries
master server
missing dependency
slave server
step-by-step installation
Metadata
Multiple broker
consumer client
reAmazingTopic
server.properties
start producers
ZooKeeper running
Multithreaded consumer
amazingTopic
Compile
import
MultiThreadConsumer class
properties
Run MultiThreadConsumer
Run SimpleProducer
NoSQL
Online analytical processing (OLAP)
Online transaction processing (OLTP)
Operational analytics
Platform as a Service (PaaS)
Probabilistic data structures
Relational database management system (RDBMS)
RDD operations
main Spark actions
persistence levels
Transformations
Real-time analytics
Recovery time objective (RTO)
reduce() method
Replication
modes
asynchronous replication process
synchronous replication process
Resilient distributed datasets (RDDs)
Software as a Service (SaaS)
Scala
Array
creation
type
ArrayBuffer
extract subsequences
filtering
flattening
functional programming
implicit loops
literal functions
predicate
hierarchy collections
map
sequences
set
Lazy evaluation
mapping
merging and subtracting
queues
ranges
sort method
split method
stacks
streams
traversing collections
for loop
foreach method
iterators
unicity
Scalability
Scala consumer
amazingTopic
Compile
Import
properties
Run command
Run SimpleConsumer
SimpleConsumer class
Scala Kafka producer
compile command
consumer program
create topic
define properties
import
metadata.broker.list
request.required.acks
Run command
serializer.class
SimpleProducer.scala code
Sequence collections
immutable sequences
mutable sequences
Sets
immutable sets
mutable sets
Shared nothing
Single broker
amazingTopic
consumer client
producer.properties
start producers
start ZooKeeper
Single point of failure (SPOF)
SMACK stack model
Spark-Cassandra Connector
Streaming analytics
Streaming Transformations
Synchronization
Transformations
output operations
stateful transformations
updateStateByKey() method
Windowed operations
stateless transformations
Unstructured data