Introducing Big Data | 29
Indeed, one should write SQL queries for them. Also, unlike MapReduce, there is no Reduce
step, although the queries can always do aggregational work. Even though they are really a grid
of multiple databases, they present an interface where one sends them a single query and one
gets back a single result set. Thus, it abstracts the complexity of parallel processing. Typically,
MPP data warehouses are packaged as physical appliances, and typically, they do not use direct
attached storage, but instead they use a more enterprise-oriented network storage scheme.
2.7.5 NoSQL
NoSQL stands for ‘not only SQL’. NoSQL databases are different from relational databases, where
they do not mandate a predened structured model for the data. This makes it a better choice to
store any information that does not render itself easily into the relational table and record format.
The key differentiating characteristics of NoSQL databases are as follows.
• Most NoSQL databases allow the schema from row to row to differ quite substantially.
• They do not have the concept of referential integrity. In fact, most NoSQL databases do not
even support an ACID compliance. From the perspective of consistency, NoSQL databases
do not support immediate consistency in the database. However, by doing that, they do allow
for greater availability. They allow write’s to be buffered and thus, read’s to be less blocked.
NoSQL databases are quite popular with a wide variety of industrial applications. There are four
types of NoSQL databases and they are briey explained as follows.
• Key-value stores: In key-value stores, the data is stored in key-value pairs. Look at the exam-
ple in Figure 2.8. Here, we have a Customer table and an Order table. Look at the Customer
table. For a Row ID of 101, we have a few keys, such as First_Name, Last_Name, Address
and Last_Order_ID. Then, there is another row with an ID of 102.
FIGURE 2.8 Example of key-value stores
Database
Table: Customers
Table: Orders
Row ID: 101
First_Name: John
Last_Name: Doe
Address: 123 Park Street
Last_Order_ID: 1701
Row ID: 102
First_Name: Jane
Last_Name: Doe
Address: 456 Green Street
Last_Order_ID: 1702
Row ID: 1701
Price: 1000 USD
Item_ID: 2345
Item_ID: 7890
Row ID: 1702
Price: 700 USD
Item_ID: 4321
Item_ID: 5446
M02 Big Data Simplified XXXX 01.indd 29 5/10/2019 9:56:53 AM