Data is the heart and lifeblood of every business. Data volumes in every industry are growing at a phenomenal pace, sometimes more than doubling from one year to the next. Data warehousing is a common requirement for organizations of any size. At the same time, configuring and maintaining very large databases (VLDBs) and extremely large databases (XLDBs) are challenging tasks that require advanced skills. This chapter presents a 360-degree overview of managing VLDBs, offering best-practice tips and strategies for configuration, performance, backup and recovery, and maintenance.
As many Oracle and data warehousing gurus have voiced, there is no fixed definition or standard rule to categorize a certain sized database as a VLDB. Not that long ago, databases that were only hundreds of gigabytes in size were considered to be VLDBs; however, over the past few years, the definition of VLDBs has changed significantly due to enormous data growth. It is not unusual to encounter databases that have grown to the size of several terabytes or even petabytes and that hold billions or even trillions of records. VLDBs and XLDBs therefore demand highly efficient software and hardware resources and require extremely large amounts of storage capacity.
VLDBs and XLDBs used for decision support systems (DSS) are often referred to as data warehouse (DW) databases. DSSs are critical to an organization’s business management to analyze the functionality and business growth of the products and services that the organization offers. Data warehouses generally hold historical data that is loaded from various data sources, including another database, a flat file, an Excel spreadsheet, and other sources. Extract-transform-load (ETL) tools are typically used to load data from those sources, while applications and utilities such as business intelligence (BI) and business objects are used to generate management reports and dashboards for the business’s needs.
VLDBs and XLDBs therefore engender a unique set of challenges and can raise some daunting tasks for the information technology (IT) department of an organization. They demand state-of-the-art technologies to meet myriad business needs while simultaneously delivering optimal performance. When designing such a database, you will almost certainly encounter the need for high-end hardware resources, huge storage capacities, large network bandwidths, and special considerations for backup and recovery. The following sections present best practices and discuss how to apply the tools, tips, and tricks to ensure a solid VLDB and XLDB foundation so that you can manage them with ease.
One of the key factors in having an optimal setup is getting the basics right and building a solid foundation. This segment briefly reviews some of the initial configuration tips for VLDBs:
Choosing the right database configuration template in DBCA
Selecting the optimal data block size
Sizing adequate system global area (SGA), program global area (PGA), and other memory components
Leveraging data compression
Using temporary tablespaces effectively
Implementing partitioning for easy data management
Making the right choices for partitioned indexes (especially global vs. local)
Enabling parallelism to take advantage of multiple CPUs
Verifying application code for effectiveness before its production deployment
Implementing appropriate backup and recovery strategies
When deploying a new database, it is advisable to follow some basic rules specific to the application category: online transaction processing (OLTP), DSS, or a combination of both. To create a new Oracle database, you can use the Database Configuration Assistant (DBCA), which is a Java-based GUI tool, or you can use the CREATE DATABASE
statement, a manual approach that requires running scripts. Depending on the nature of the application—that is, whether it’s an OLTP or a DSS application—you should choose the appropriate database creation template through DBCA, as shown in Figure 7.1. If the database is intended for DSS or VLDB, choose the Data Warehouse template so that the appropriate database initialization parameters, online redo log sizing, and tablespace sizing are set properly.
It is essential to choose the right database block size, especially for VLDBs and XLDBs, because the data block size has an impact on overall database read and write performance. Although the default 8 KB block size is most appropriate and can meet the demands of an OLTP application, DSS systems generally require a larger block size. If the database is already configured with the default 8 KB block size, the DBA can use Oracle's multiple block size feature. Under some circumstances, using a 16 KB or even a 32 KB database block size for VLDBs and XLDBs yields performance benefits over a 4 KB or 8 KB block size. The default block size is controlled with the db_block_size
initialization parameter.
Oracle supports multiple data block sizes within the same database, but the practice is not encouraged unless your application demands it. A VLDB might use data block sizes of 16 KB or even 32 KB, but only the DBA can determine which block size best supports the application's performance and behavior. Remember, once a database is created with a default block size, that default block size cannot be changed unless the database is re-created.
Following are a few tips for choosing the right block size that meets the application’s needs:
A smaller block size is efficient when rows are smaller and data access is random.
Choose a larger block size to improve the read performance when the rows are smaller and access is sequential or when you have a mixture of random and sequential reads.
A larger block size yields significant read performance gains when rows are large, such as rows containing large object (LOB) columns.
With a large block size on high-concurrency systems, ensure that you have set appropriate values for the INITRANS
and MAXTRANS
parameters.
A larger block size has proportionally less overhead and stores more rows in a single block.
With a larger block size, many rows are brought into the buffer cache with a single read.
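For example, to use a nondefault 16 KB block size for selected tablespaces, you must first allocate a buffer cache for that block size and then create the tablespace with it. The following is a sketch; the cache size, file path, and tablespace name are illustrative:

SQL> ALTER SYSTEM SET db_16k_cache_size = 256M;
SQL> CREATE TABLESPACE data_16k DATAFILE '/oradata/data_16k01.dbf' SIZE 10G BLOCKSIZE 16K;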
Today’s VLDBs and XLDBs commonly grow into terabyte (TB) or even petabyte (PB) sizes. Traditionally, the larger the database, the more tablespaces and datafiles will be required to support the increasing data demands. Depending on the operating system platform, the datafiles of a smallfile tablespace are allowed to grow to a maximum size of 32 GB for an 8 KB block size (128 GB for a 32 KB block size).
To limit the number of datafiles for VLDBs, Oracle introduced the bigfile tablespace concept, which allows a single large datafile per tablespace. However, that single datafile can grow to a maximum size of 32 TB for an 8 KB block size (and up to 128 TB for a 32 KB block size). This feature eliminates the need for numerous datafiles and simplifies tablespace management. Keep in mind that bigfile tablespaces are valid only for locally managed tablespaces with automatic segment space management.
When you create bigfile tablespaces, you need to ensure there is enough free space for growth on the storage/filesystem and that the system supports striping.
SQL> CREATE BIGFILE TABLESPACE data_ts DATAFILE SIZE 10G;
SQL> CREATE BIGFILE TABLESPACE data_ts DATAFILE '/oradata/data_ts01.dbf' SIZE 10G;
Refer to the BIGFILE
column in the V$TABLESPACE
dictionary view to identify whether the tablespace is a traditional smallfile or bigfile tablespace.
SQL> select name,bigfile from v$tablespace;
NAME BIG
------------------------------ ---
SYSTEM NO
UNDOTBS1 NO
SYSAUX NO
USERS NO
TEMP NO
DATA_TS YES
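Because a bigfile tablespace consists of exactly one datafile, it can be resized and autoextended at the tablespace level rather than the datafile level. The sizes in this sketch are illustrative:

SQL> ALTER TABLESPACE data_ts RESIZE 20T;
SQL> ALTER TABLESPACE data_ts AUTOEXTEND ON NEXT 10G;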
Another critical decision a DBA must make concerns the amount of memory resources that should be assigned for the system and program global areas for the database instance used for a VLDB or XLDB. The applications that run against VLDBs typically require much higher rates of data access and process complex calculations, so it is essential to have optimal SGA and PGA to avoid performance pitfalls.
Fortunately, Oracle provides multiple options to conquer memory management and configuration hassles for large-sized database demands. Although you can manually configure memory components such as SGA and PGA separately, you might want to take advantage of the automatic memory management (AMM) feature to handle the SGA and PGA together automatically. It is also critical to ensure there is no frequent dynamic memory allocation and deallocation happening among the SGA components, especially during peak business hours. Review V$SGA_DYNAMIC_COMPONENTS
, V$SGA_CURRENT_RESIZE_OPS
, V$SGA_RESIZE_OPS
, and V$SGA_DYNAMIC_FREE_MEMORY
dynamic performance views to analyze whether the current SGA sizing is optimal for your environment.
There is no rule of thumb to define the perfect memory allocation for SGA and PGA, but you may consider starting with an optimal size and then monitoring the SGA and PGA statistics to adjust these settings as needed. Since VLDB applications tend to perform a lot of sorting, ensure that you configure adequate PGA as well. You can use the V$MEMORY_TARGET_ADVICE
dynamic view to adjust the value of AMM if it is in use; also, the V$SGA_TARGET_ADVICE
and V$PGA_TARGET_ADVICE
dynamic views are helpful for tuning and adjusting the values for SGA_TARGET
and PGA_AGGREGATE_TARGET
, respectively. You can always adjust the SGA settings; sometimes a database restart is necessary for the new values to take effect.
Managing and configuring the proper sizes of temporary tablespaces can become a significant challenge for a DBA as VLDB/DSS applications tend to perform heavy sorting operations quite often.
Essentially, the Oracle database uses temporary tablespaces to segregate temporary work from the real work in the system. Performance bottlenecks on the temporary tablespace are often caused by undersizing, which causes the system to run out of temporary space when concurrent user queries increase the demand for it. Temporary tablespace groups (TTGs), introduced in Oracle Database 10g, provide performance benefits by spreading I/O across multiple temporary tablespaces. A TTG can be created initially with a single temporary tablespace, and more temporary tablespaces can be added later. When configuring a TTG, use equally sized temp files and the same number of temp files for each temporary tablespace in the group. A TTG is a better option than having one large temporary tablespace or assigning multiple separate temporary tablespaces to different users in the database.
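A TTG is created implicitly the first time its name is referenced in a CREATE or ALTER TEMPORARY TABLESPACE statement. The following sketch creates a two-member group and assigns it as the database default; all names, paths, and sizes are illustrative:

SQL> CREATE TEMPORARY TABLESPACE temp1 TEMPFILE '/oradata/temp101.dbf' SIZE 10G TABLESPACE GROUP temp_grp;
SQL> CREATE TEMPORARY TABLESPACE temp2 TEMPFILE '/oradata/temp201.dbf' SIZE 10G TABLESPACE GROUP temp_grp;
SQL> ALTER DATABASE DEFAULT TEMPORARY TABLESPACE temp_grp;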
Managing the massive amounts of data typically stored in a VLDB or XLDB is not a simple task: it demands manageability, availability, and good performance. Oracle's partitioning features offer a wide range of partitioning types to support your data and application needs. Data partitioning eases data management, improves data availability, and delivers significant performance gains by dividing huge amounts of data into much smaller segments.
Oracle continues to improve and expand partitioning capabilities in each new release of the database. Oracle supports table-level, index-level, and index-organized table (IOT)-level partitioning. When partitioning is applied to an object, each partition is treated as a separate segment. According to the need of your application, you can implement an appropriate partition strategy; when you can’t decide on a strategy, or when your data doesn’t call for range or list partitioning, you may choose to go with hash partitioning.
Oracle Database 11g made partitioning management easier with the introduction of interval partitioning, an extension of range partitioning, which allows a DBA to define automated partitioning of data on the basis of specified timestamp or numeric intervals. With this new feature, partitions are created automatically whenever the data demands. And when existing columns do not present an appropriate partitioning key, Oracle 11g also supports the ability to partition data on the basis of the value of an expression that can be stored in a virtual column.
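The following sketch of interval partitioning (table and column names are illustrative) defines one initial range partition and lets Oracle create a new monthly partition automatically whenever incoming rows demand it:

SQL> CREATE TABLE sales_hist (sale_id NUMBER, sale_date DATE, amount NUMBER)
     PARTITION BY RANGE (sale_date)
     INTERVAL (NUMTOYMINTERVAL(1,'MONTH'))
     (PARTITION p_initial VALUES LESS THAN (TO_DATE('01-01-2015','DD-MM-YYYY')));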
DBAs also often agonize over when partitioning should be applied to a table, which column is the best choice for the table’s partitioning key, and whether partitioning should be applied to a table in an OLTP environment. There are many opinions on how best to answer these questions, but a good rule of thumb is that if the table is huge and often requires complex data maintenance, it makes excellent sense to at least consider partitioning regardless of whether an OLTP or DSS application is accessing the table.
One of the tricky tasks that a DBA typically has to address when dealing with DSS applications is deciding between the local and global partitioned indexes for a partitioned table. The type of index partitioning serves an important role in data access, data loading, and partition maintenance, so it generally pays to invest considerable time when making these decisions.
The general perception is that global indexes deliver better performance for OLTP applications, whereas local indexes provide better performance benefits for DSS applications running against VLDBs.
Local index partitions not only provide better performance but also are easier to maintain and offer great application availability. Because each local index partition maps to a single corresponding individual table partition, this generally provides good application availability when a MERGE
, SPLIT
, EXCHANGE
, DROP
, or TRUNCATE
partition operation is applied against one or more table partitions. However, you can't use a local index to enforce a primary key or unique constraint unless the partitioning key is part of the constraint's column list. For global indexes, when MERGE, SPLIT, EXCHANGE, DROP, and similar operations are performed, you need to either rebuild the indexes afterward or add the UPDATE GLOBAL INDEXES clause to the partition maintenance statement.
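For example (object names are illustrative), a local index is created with the LOCAL keyword, and a partition maintenance operation can keep global indexes usable by adding the UPDATE GLOBAL INDEXES clause:

SQL> CREATE INDEX sales_date_idx ON sales(sale_date) LOCAL;
SQL> ALTER TABLE sales DROP PARTITION sales_dec14 UPDATE GLOBAL INDEXES;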
Data retention may differ dramatically among organizations according to their different business needs, regulatory requirements, and even the data owner's desire to keep historical data long past its perceived value. When the retention period is longer, a VLDB tends to encompass huge volumes of historical data, which of course needs a correspondingly huge amount of physical storage capacity. DBAs must therefore carefully consider exactly how much data is being retained and implement appropriate maintenance policies to ensure that storage space is not exhausted unexpectedly or prematurely.
Oracle’s various data compression features may provide significant storage savings. If you are fortunate enough to be using Exadata or the ZFS logical volume manager for your VLDB, those storage platforms offer Hybrid Columnar Compression (HCC) features through which you may be able to save significant storage capacity. If your VLDB is not using these storage platforms, however, you may still be able to make use of Oracle Advanced Compression features to compress data within the database.
Oracle Advanced Compression is a separately licensed feature that provides comprehensive storage savings. As of Oracle Database 12c, Advanced Compression offers myriad features, including Advanced Data Compression, Heat Map, Automatic Data Optimization (ADO), Advanced Row Compression, Advanced Index Compression, and Advanced Network Compression.
Table compression allows you to compress the data within a table. Basic table compression, which doesn't require any additional license, can substantially compress a table's data but applies only to initial direct-path loading. In contrast, Advanced Compression's Advanced Row Compression, called OLTP compression in earlier releases, compresses table rows during all data manipulation language (DML) operations. If you are using either Exadata or ZFS storage, then HCC offers warehouse and archive compression methods in addition to basic and advanced compression.
Here are some examples of table-level compression options:
SQL> CREATE TABLE table_name (column_list...) ROW STORE COMPRESS ADVANCED;
SQL> ALTER TABLE table_name ROW STORE COMPRESS BASIC;
SQL> ALTER TABLE table_name ROW STORE COMPRESS ADVANCED;
SQL> ALTER TABLE table_name MOVE PARTITION partition1 ROW STORE COMPRESS BASIC | ADVANCED;
SQL> ALTER TABLE table_name NOCOMPRESS;
The following query lists the objects on which compression is applied:
SQL> SELECT table_name,compression, compress_for FROM user_tables WHERE compression = 'ENABLED';
Often, historical data in a VLDB becomes less active as it grows older. If this is true of data contained in a partitioned table, Oracle 12c’s new (and complementary) Heat Map and ADO features may be particularly useful. You can define ADO policies and conditions under which data should be moved to a different storage tier or compressed at a higher compression level; ADO will then automatically apply the appropriate compression based on heat map statistics, but only when the corresponding conditions are satisfied.
To enable the Heat Map feature, all that is required is to set the HEAT_MAP
initialization parameter to ON
(the default value is OFF
):
SQL> ALTER SESSION | SYSTEM SET HEAT_MAP = ON;
SQL> ALTER SESSION | SYSTEM SET HEAT_MAP = OFF; -- disable Heat Map
To list the statistical information about the Heat Map settings for database objects, you can use views V$HEAT_MAP_SEGMENT
and USER_HEAT_MAP_SEGMENT
. Views DBA_HEAT_MAP_SEGMENT
and DBA_HEAT_MAP_SEG_HISTOGRAM
provide additional information about the individual data segments and their corresponding heat map settings.
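For example, the following sketch query reports when each segment in a schema was last written, last read, and last full scanned (the schema name is illustrative):

SQL> SELECT object_name, subobject_name, segment_write_time, segment_read_time, full_scan
     FROM dba_heat_map_segment WHERE owner = 'SCHEMA';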
ADO allows you to define the policies at table level, tablespace level, and row level for smart compression and automatic data movement. The following examples demonstrate how to deploy ADO policies at various levels for a table. In this example, when 60 days has elapsed since any DML has been performed against any data in the sales
table, that table’s segments will be automatically compressed using ADVANCED
compression:
SQL> ALTER TABLE sales
ILM ADD POLICY
ROW STORE
COMPRESS ADVANCED
SEGMENT
AFTER 60 DAYS OF NO MODIFICATION;
In the subsequent example, when 30 days has elapsed since any DML has been performed against any data in the sales_dec14
partition of the sales
table, it will be automatically compressed:
SQL> ALTER TABLE sales
MODIFY PARTITION sales_dec14
ILM ADD POLICY
ROW STORE
COMPRESS ADVANCED
ROW
AFTER 30 DAYS OF NO MODIFICATION;
To disable and then remove all ADO policies against the sales
table, use the following syntax:
SQL> ALTER TABLE sales ILM DISABLE_ALL;
SQL> ALTER TABLE sales ILM DELETE_ALL;
Use the following example to query the information on ADO:
SQL> SELECT policy_name, policy_type, enabled FROM user_ilm_policies;
Oracle Database 12c supports Advanced Index Compression, which reduces index storage requirements. The feature supports unique and nonunique indexes while maintaining good performance during index access operations. Compression can be applied at the partition level, as demonstrated in the following:
SQL> CREATE INDEX index_name ON table_name(column_name) COMPRESS ADVANCED HIGH LOCAL
(PARTITION ip1 COMPRESS ADVANCED LOW, PARTITION ip2 COMPRESS HIGH, PARTITION ip3 NOCOMPRESS);
SQL> CREATE INDEX index_name ON table_name(column_name) LOCAL (PARTITION ip1 COMPRESS ADVANCED LOW,
PARTITION ip2 COMPRESS HIGH, PARTITION ip3);
SQL> CREATE INDEX index_name ON table_name(columns_list...) COMPRESS ADVANCED LOW;
Delivering optimal performance for applications that access a VLDB is essential. If Oracle database best practices are applied, many tuning nightmares may disappear. Following are the most common performance issues that can affect VLDBs:
Suboptimal application coding
Inefficient application and database design
Inaccurate or missing optimizer statistics
Lack of indexes or too many indexes
Limited hardware resources
Bad network design
To demonstrate the issue of suboptimal application coding, let’s look at a classic example of a poor performance incident—an issue that was ultimately resolved with a simple code modification.
A reconciliation batch scheduled to run at the end of each business day was performing inconsistently. The job sometimes completed within only 2 hours and sometimes took more than 6 hours; during the database’s worst performance, it took nearly 24 hours to complete. This inconsistent performance not only gave the DBA team nightmares, it also significantly impacted the subsequent dependent jobs, and as a result, several key business users were unable to get their daily reports on time.
The job itself was straightforward, involving no complex queries or convoluted logic. The job was to perform the following simple tasks:
1. Delete data from the target tables.
2. Load data from text files into the target tables.
3. Run queries against the target tables to generate reports.
During the course of investigation, the following behavior was observed:
The database was growing in size at a uniform rate each day.
Important queries were favoring nested loop joins in their execution plans.
As a temporary workaround the following action plan was used:
1. Reorganize the target tables using the alter table ... move
command.
2. Rebuild all indexes in the schema.
3. Gather application schema statistics.
4. If the preceding steps fail to resolve the inconsistent performance, export the schema into another database residing on a different server that has more CPU and memory capacity.
Despite these workarounds, the problem reappeared and gradually returned to its worst state (taking nearly 24 hours to complete). Interestingly, when the person responsible for the application was contacted, he mentioned that data in all tables was deleted and subsequently reloaded from different sources and that the observed growth was inconsistent with the amount of data he was loading daily.
Then the focus shifted to the data load script. Upon reviewing the code, a classic mistake was observed: data was removed from the tables with the DELETE
command followed by direct path loads.
If you know the basics, you can easily spot the reason for the consistent growth in the database. When DELETE
is used to remove all data from a table, the high-watermark is not reset; therefore, subsequent direct-path loads insert data above the high-watermark, ignoring the free data blocks below it. This approach caused continuing data growth and consumed significant amounts of free space in the datafiles; hence, queries on these tables were performing badly.
The simple solution to this issue was to replace the DELETE
command with TRUNCATE TABLE
. Once this change was applied, the job finished in just 2 hours. There have been no issues with performance in the 6 years since the problem was encountered.
The moral of the story is that if you get the basics right, tuning the code just a little could offer significant benefits for many performance issues.
Data loading is a common practice in data warehouse databases. In general, indexes improve queries’ data selection performance; however, having too many indexes sometimes hampers DML (insert, update, and delete) performance. When data loading doesn’t produce expected throughput and when a DBA receives performance-related concerns, one of the key areas to investigate is the number of indexes on the table.
One of the typical methods used to improve the performance of bulk data loading is to disable indexes and constraints on the table and then rebuild them after data loading completes. Although this may seem like a good workaround, it can be quite time consuming to rebuild indexes on extremely large tables. Therefore, it is recommended to minimize the number of indexes on the table to improve the data loading process.
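One common sketch of this approach (the index name and degree of parallelism are illustrative) is to mark the indexes unusable, tell the session to skip them, perform the load, and then rebuild:

SQL> ALTER INDEX sales_date_idx UNUSABLE;
SQL> ALTER SESSION SET skip_unusable_indexes = TRUE;
-- perform the direct-path load here, then rebuild:
SQL> ALTER INDEX sales_date_idx REBUILD PARALLEL 8 NOLOGGING;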
There are a few approaches to find out which indexes are being used and which are not. One of the methods is to enable monitoring of index usage with the ALTER INDEX ... MONITORING USAGE
command. Once you activate monitoring for indexes, you can verify their usage through the V$OBJECT_USAGE
dynamic view; if monitoring shows that some of the indexes are not being used, you may consider putting them in invisible mode and dropping them after obtaining permission from the application owner/developers.
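For example (the index name is illustrative):

SQL> ALTER INDEX sales_date_idx MONITORING USAGE;
SQL> SELECT index_name, monitoring, used FROM v$object_usage;
SQL> ALTER INDEX sales_date_idx NOMONITORING USAGE;
SQL> ALTER INDEX sales_date_idx INVISIBLE;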
Oracle Database 12c gives you the ability to create partial indexes on a partitioned table. This cool feature provides the flexibility to turn on and off local and global indexing for individual partitions of a table. The following example demonstrates how to create a table with an INDEXING OFF|ON
option:
SQL> CREATE TABLE emp (eno number(9),ename varchar2(100),dob date, dept number(2), salary
number(6)) INDEXING OFF
PARTITION BY RANGE (salary)
(PARTITION p10000 values less than (10001) INDEXING OFF,
partition p20000 values less than (20001) INDEXING ON,
partition p30000 values less than (30001));
SQL> CREATE INDEX index_p1 ON emp(eno) GLOBAL INDEXING FULL|PARTIAL;
SQL> CREATE INDEX index_p2 ON emp(salary) LOCAL INDEXING PARTIAL;
SQL> ALTER TABLE emp MODIFY PARTITION p10000 INDEXING ON;
When indexes are created with the PARTIAL
option on a table:
For a local index, partitions that are tagged as INDEXING OFF
will have an UNUSABLE
index status, and partitions that are tagged as INDEXING ON
will have a USABLE
index status.
For a global index, only those partitions that are tagged as INDEXING ON
will be active, and the rest will be excluded.
To view an index’s status, refer to the INDEXING
column in DBA|USER_PART_TABLES
, DBA|USER_TAB_PARTITIONS
, and DBA|USER_TAB_SUBPARTITIONS
. Keep in mind that a partial index can’t be applied to unique indexes.
Two things are very common in VLDB environments: huge hardware resources and massive data. The queries that access VLDBs often either scan large amounts of data or perform massive computations on huge data sets. When we have a solid hardware presence, it is up to us to make the best use of those resources to improve overall application performance. One of the best ways to do this is to divide and conquer: that is, split the workload of a query into smaller pieces so that it completes as quickly as possible. Don't panic. We are not talking about rocket science here, just leveraging Oracle's capabilities in regard to parallelism.
When parallelism is applied at any level—database, table, or query—Oracle distributes the workload of the query among multiple slave processes and completes the job more quickly than if it were executed serially. Parallel execution utilizes multiple CPU and I/O resources on the servers to achieve improved performance by reducing the response time needed to execute the query.
As just mentioned, you can define parallelism at various levels in an Oracle database:
Object level (table or index)
Session level
Database level
At statement runtime
If you are uncertain about the degree of parallelism to use, you can leave this to Oracle, as Oracle will automatically determine the number of parallel processes to use during execution based on the settings for initialization parameters cpu_count
and parallel_threads_per_cpu
. Following are some of the key initialization parameters pertaining to parallel execution and their default values:
parallel_degree_policy string MANUAL
parallel_execution_message_size integer 16384
parallel_max_servers integer 240
parallel_min_percent integer 0
parallel_min_servers integer 0
parallel_server boolean TRUE
parallel_servers_target integer 96
parallel_threads_per_cpu integer 2
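The following sketch shows parallelism being requested at some of these levels; the table name and degree of parallelism are illustrative:

SQL> ALTER TABLE sales PARALLEL 8;                         -- object level
SQL> ALTER SESSION ENABLE PARALLEL DML;                    -- session level
SQL> SELECT /*+ PARALLEL(s, 8) */ COUNT(*) FROM sales s;   -- statement level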
Using parallelism is generally advised while dealing with huge amounts of data. For example, the following are some of the statements that might benefit from parallelism:
CTAS (CREATE TABLE AS SELECT)
ALTER TABLE ... MOVE | SPLIT PARTITION
ALTER INDEX REBUILD
CREATE INDEX
INSERT AS SELECT
INSERT /*+ APPEND */ (direct-path INSERTs)
If you are working with an Oracle Real Application Clusters (RAC) database, when you set the PARALLEL_FORCE_LOCAL
initialization parameter value to TRUE
, it restricts the SQL statement parallel execution to a single instance.
Maintaining accurate and up-to-date optimizer statistics is always critical to enabling the Oracle optimizer to generate optimal SQL execution plans. Many performance issues can be resolved simply by gathering fresh optimizer statistics. The challenge comes when you have to gather optimizer statistics on large tables. You must consider the time required to gather the statistics, determine how often statistics must be gathered, and find the most efficient method of gathering them. The following sections provide some best practices for collecting accurate optimizer statistics quickly on large tables and table partitions.
Accurate and consistent optimizer statistics on tables helps to improve query performance; however, that need must be balanced against how to collect and maintain accurate statistics consistently on large partitioned tables in a timely manner. Although several workarounds and methods exist to accomplish this, we focus on the benefits of gathering incremental statistics and look at real-world examples.
The concept of gathering incremental statistics was first introduced in Oracle Database 11g with the intention to enhance the performance of gathering statistics at global and partition levels on large partitioned tables. Incremental statistics on partitioned tables are gathered at the global, partition, and subpartition levels. When incremental preferences are enabled, the statistical metadata for each partition of the table are stored in the form of a synopsis in the SYSAUX
tablespace.
The synopsis helps to produce accurate global-level statistics by aggregating partition-level statistics, which eliminates the need to scan the full table. In this context, whenever a new partition is added, the global statistics are updated automatically using the existing partitions' synopses and the new partition's synopsis. Whenever 10 percent of the data in a partitioned table changes, the statistics become stale and need to be regathered.
Note
Bug 16851194 reports continued growth in the SYSAUX
tablespace without growth in actual data. Therefore, it is advisable to monitor the SYSAUX
tablespace’s growth periodically to ensure you are not hitting this bug. The bug was fixed in Oracle 12.1.0.2.
Starting in Oracle Database 12c, the default behavior for stale statistics is controlled with the new INCREMENTAL_STALENESS
argument. The default value for INCREMENTAL_STALENESS
is NULL
, which invokes the same behavior as in Oracle 11g. Also in Oracle 12c, global-level statistics are automatically calculated for any table whose statistics are either locked or stale.
Additionally, the USE_STALE_PERCENT
value for INCREMENTAL_STALENESS, used together with the STALE_PERCENT preference, can define a different threshold for statistics staleness. The default threshold is 10 percent, but it can be overridden so that statistics will not be considered stale until the percentage of changed rows has reached the specified value:
SQL> exec dbms_stats.set_database_prefs('INCREMENTAL_STALENESS', 'USE_STALE_PERCENT');
SQL> exec dbms_stats.set_database_prefs('STALE_PERCENT', '25');
Use the following example to get the current value for STALE_PERCENT
:
SQL> SELECT dbms_stats.get_prefs('STALE_PERCENT', 'SCHEMA', 'TABLENAME') stale_value FROM dual;
To maintain and gather incremental statistics on a partitioned table, you must run through the following procedure first:
1. The INCREMENTAL
statistics preference is not enabled by default; verify its current setting using this code:
SQL> SELECT dbms_stats.get_prefs('INCREMENTAL', 'SCHEMA', 'TABLENAME')
tab_incr_prefs FROM dual;
TAB_INCR_PREFS
-------------------------------------------------------------------
FALSE
2. Turn on the INCREMENTAL
statistics preference on the table:
SQL> exec dbms_stats.set_table_prefs('SCHEMA', 'TABLENAME', 'INCREMENTAL', 'TRUE');
3. Gather statistics on the table, and then verify that the incremental flag is turned on:
SQL> exec dbms_stats.gather_table_stats('SCHEMA',
'TABLENAME', ESTIMATE_PERCENT=>dbms_stats.auto_sample_size);
SQL> SELECT dbms_stats.get_prefs('INCREMENTAL', 'SCHEMA', 'TABLENAME')
tab_inc_perf FROM dual;
TAB_INC_PERF
-------------------------------------------------------------------
TRUE
The following query identifies which tables have the INCREMENTAL statistics flag turned on:
SQL> SELECT owner, object_name
FROM dba_objects
WHERE object_id IN
(SELECT DISTINCT(obj#)
FROM sys.optstat_user_prefs$
WHERE PNAME='INCREMENTAL'
AND VALCHAR='TRUE');
This example shows how to activate incremental statistics at the schema level:
SQL> exec dbms_stats.set_schema_prefs('SCHEMA', 'INCREMENTAL', 'TRUE');
Use the following syntax to disable the incremental statistics for an individual table:
SQL> exec dbms_stats.set_table_prefs('SCHEMA', 'TABLENAME', 'INCREMENTAL', 'FALSE');
Use the following syntax to disable the incremental statistics at the schema level:
SQL> exec dbms_stats.set_schema_prefs('SCHEMA', 'INCREMENTAL', 'FALSE');
Gathering statistics in a schema sometimes can be time consuming because of the large number of tables and indexes. Before Oracle 11.2.0.2, there was no direct method available to gather statistics concurrently on multiple objects. The concurrent statistics-gathering feature (also known as interobject parallelism) in Oracle 11.2.0.2 provides the ability to gather statistics on multiple tables or partitions concurrently. The objective of this feature is to fully utilize the database server’s hardware resources to improve the efficiency of gathering statistics and thus reduce the overall time required to gather them.
This behavior is controlled by the CONCURRENT global preference of the DBMS_STATS package. By default, the value of CONCURRENT is FALSE; in order to enable this feature, it must be explicitly set to TRUE:
SQL> exec dbms_stats.set_global_prefs('CONCURRENT', 'TRUE');
SQL> SELECT dbms_stats.get_prefs('CONCURRENT') FROM dual;
DBMS_STATS.GET_PREFS('CONCURRENT')
----------------------------------------------------------------
TRUE
When the value is set to TRUE, Oracle takes advantage of the Job Scheduler and Advanced Queuing (AQ) mechanisms to perform concurrent statistics gathering. The number of tables processed concurrently is directly proportional to the value of the JOB_QUEUE_PROCESSES initialization parameter. When schema-level statistics gathering is triggered, the number of tables for which statistics will be gathered concurrently is determined by the JOB_QUEUE_PROCESSES value; the remaining tables in the schema are queued until statistics gathering for the current batch of tables is completed. When schema-level or database-level statistics gathering is triggered, a separate job is created for each partitioned and nonpartitioned table.
In Oracle 12c, the CONCURRENT preference can be set to MANUAL, AUTOMATIC, ALL, or OFF (the default value). When concurrency is enabled for a partitioned table, a separate job is triggered for each partition of the table. Note that CONCURRENT is a global preference; it cannot be set at the table or schema level. When you have a schema with a large number of tables and partitions, make use of the CONCURRENT feature to reduce the overall time required to gather statistics. Setting the value to AUTOMATIC allows the automatic statistics-gathering job to take advantage of concurrent statistics gathering.
Also, ensure that the RESOURCE_MANAGER_PLAN and JOB_QUEUE_PROCESSES initialization parameters are set to appropriate values.
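For example, both parameters can be set with ALTER SYSTEM; the plan name and process count shown here are illustrative only and should be sized for your own server:
SQL> ALTER SYSTEM SET resource_manager_plan = 'DEFAULT_PLAN';
SQL> ALTER SYSTEM SET job_queue_processes = 16;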
To review or monitor the concurrent statistics-gathering jobs, use the following examples:
SQL> SELECT owner,job_name,state,start_date,max_run_duration
FROM dba_scheduler_jobs
WHERE job_class like 'CONCURRENT%' AND state in ('RUNNING', 'SCHEDULED');
SQL> SELECT job_name, state, comments
FROM dba_scheduler_jobs
WHERE job_class LIKE 'CONC%' and STATE = 'RUNNING';
SQL> SELECT job_name, elapsed_time
FROM dba_scheduler_running_jobs WHERE job_name LIKE 'ST$%';
Oracle Database 12c offers a significant enhancement: statistics gathering for very small and empty tables is consolidated into a single batch job, which reduces the overhead of statistics maintenance.
Another vital consideration when gathering table statistics is to decide on an appropriate estimate percentage. Although gathering statistics against 100 percent of a table’s data will certainly provide accurate statistics, the time and computing resources required for this task—especially for extremely large tables—may have a significant negative impact on your database’s performance.
Obviously, it can be tough to settle on a value for the estimate percentage, as both large and small values have their own sets of benefits and disadvantages. The AUTO sampling statistics-gathering feature of Oracle 11g helps to automatically determine an appropriate estimate percentage while gathering table statistics. The accuracy of AUTO sampling is nearly that of a 100 percent sample size; additionally, it takes less time to complete statistics gathering using AUTO sampling. The new AUTO sample algorithm—now implemented by default in Oracle Database 11g Release 2—also influences how index statistics are gathered. The AUTO sample algorithm performs a full table scan instead of using a sampling clause to build a synopsis per column for column-level number of distinct values (NDV) calculations.
The following example demonstrates how to use the AUTO sampling size with the DBMS_STATS package. In order to use the incremental statistics-gathering feature mentioned previously, ESTIMATE_PERCENT must be specified as AUTO_SAMPLE_SIZE:
SQL> exec dbms_stats.gather_table_stats(null, 'TABLENAME',
     estimate_percent=>dbms_stats.auto_sample_size);
It doesn’t matter whether your database is supporting OLTP or DSS workloads—the data stored within is critical to your organization’s business continuity. One of the prime responsibilities of a DBA is to protect the data from any potential disasters and, when necessary, recover the database according to the application service level agreement (SLA). For VLDBs and XLDBs, backup and recovery thus becomes a prime concern not only to the DBA but also to the business organization’s management. Following are several strategies that will help improve performance for backup and recovery, including reducing overall backup time, optimizing backup duration and size, and reducing database recovery time:
Configure fast incremental backups using the block change tracking (BCT) feature. This saves a good amount of backup time because RMAN scans only the blocks that have been modified since the previous incremental backup.
Perform a weekly full backup followed by cumulative daily incremental backups.
Use RMAN incremental updates/merge backups to significantly reduce database recovery time.
If sufficient disk space is not a barrier, make backups to disk first and then copy them to tape.
For tablespaces that contain historical or static data, place them in read-only mode, back them up once, and then exclude them from the daily backups.
Make use of multisection backups to back up large datafiles so that data in the files can be backed up in parallel.
Recovering VLDBs and XLDBs can be extremely time consuming. Therefore, activate flashback logging and use guaranteed restore points to overcome known logical errors.
Compress historical data to gain storage savings, which ultimately reduces backup time and size.
If you have implemented Oracle Data Guard, back up your database from the physical standby database instead of the primary database to reduce resource demands.
Use RMAN compressed backups to reduce backup sizes.
If your VLDB or XLDB is extremely large and backup processing exceeds the time window allotted, use the PARTIAL option in concert with the DURATION directive to permit RMAN to retain backups of the database even though all datafiles haven’t been backed up. This practice essentially spreads backup creation over multiple days within the backup execution window defined by DURATION.
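As an illustration, the following sketch combines several of these recommendations; the tracking-file path, tablespace name, section size, and duration window are placeholders to adapt to your environment:
SQL> ALTER DATABASE ENABLE BLOCK CHANGE TRACKING
     USING FILE '/u01/app/oracle/bct.chg';
SQL> ALTER TABLESPACE hist_data READ ONLY;
RMAN> CONFIGURE EXCLUDE FOR TABLESPACE hist_data;
RMAN> BACKUP AS COMPRESSED BACKUPSET INCREMENTAL LEVEL 1 CUMULATIVE
      DATABASE SECTION SIZE 64G;
RMAN> BACKUP DURATION 04:00 PARTIAL MINIMIZE TIME DATABASE;
The first backup statement combines compression, a cumulative incremental level, and multisection processing of large datafiles; the second constrains the backup to a four-hour window while keeping whatever was completed.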
When the Exadata Database Machine was introduced, it was marketed as an excellent solution to deliver extreme performance for data warehouse databases. Although later versions of Exadata support OLTP workloads as well as DSS systems, it is still highly recommended to consider Exadata solutions for VLDBs and XLDBs.
With its state-of-the-art technologies and massive hardware resources, Exadata genuinely delivers extreme performance for DSS applications. Exadata combines a significant amount of hardware resources (CPU and memory), fast networking components (40 Gb/s InfiniBand), flash disks, and numerous software features such as query offloading, HCC, and smart features (smart flash cache, smart scan) to deliver excellent performance. If an Exadata solution fits your organization’s budget for a data warehouse, VLDB, or XLDB, use it and get the extreme performance benefits.
Many companies implement a Data Guard configuration for their data warehousing databases. If you work for one of these companies, consider using the existing Data Guard environment to maximize the return on that investment by offloading workload from the production database.
If you have an Active Data Guard physical standby configured for your primary production VLDB, you can leverage it to offload or redirect some of the workload to that standby database. For instance, if the business department wants to generate daily reports after the nightly batch or data loading, using aggregate or complex queries against huge amounts of data, you can shift those queries to run against the physical standby database instead. This approach reduces resource utilization on the production database server as well as balancing the workload.
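One common way to implement this (the database and service names here are hypothetical) is to define a database service that is active only when the database runs in the physical standby role, and point the reporting tool’s connection at that service:
$ srvctl add service -d dwprod -s rpt_svc -l PHYSICAL_STANDBY
$ srvctl start service -d dwprod -s rpt_svc
Because the service starts only on the standby, report sessions never land on the primary, and the routing follows the databases automatically after a role switchover.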
Configuring and managing a VLDB is a challenging task. The DBA needs a set of advanced skills to keep the database application operating smoothly. This chapter addressed the most common recommendations for the basic setup, configuration, performance, and maintenance of very large and extremely large databases.