Chapter 8. `string`s, `string_view`s, Text Files, CSV Files and Regex

Objectives

In this chapter, you’ll:

■ Determine string characteristics.

■ Find, replace and insert characters in strings.

■ Use C++11 numeric conversion functions.

■ See the C++20 update to how string member function reserve modifies string capacity.

■ Use C++17 string_views for lightweight views of contiguous characters.

■ Write and read sequential files.

■ Perform input from and output to strings in memory.

■ Do an objects-natural case study using an object of an open-source-library class to read and process data about the Titanic disaster from a CSV (comma-separated values) file.

■ Do an objects-natural case study using C++11 regular expressions (regex) to search strings for patterns, validate data and replace substrings.

Outline

8.1 Introduction

8.2 string Assignment and Concatenation

8.3 Comparing strings

8.4 Substrings

8.5 Swapping strings

8.6 string Characteristics

8.6.1 C++20 Update to string Member-Function reserve

8.7 Finding Substrings and Characters in a string

8.8 Replacing Characters in a string

8.9 Inserting Characters into a string

8.10 C++11 Numeric Conversions

8.11 C++17 string_view

8.12 Files and Streams

8.13 Creating a Sequential File

8.14 Reading Data from a Sequential File

8.15 C++14 Reading and Writing Quoted Text

8.16 Updating Sequential Files

8.17 String Stream Processing

8.18 Raw String Literals

8.19 Objects Natural Case Study: Reading and Analyzing a CSV File Containing Titanic Disaster Data

8.19.1 Using rapidcsv to Read the Contents of a CSV File

8.19.2 Reading and Analyzing the Titanic Disaster Dataset

8.20 Objects Natural Case Study: Introduction to Regular Expressions

8.20.1 Matching Complete Strings to Patterns

8.20.2 Replacing Substrings

8.20.3 Searching for Matches

8.21 Wrap-Up

8.1 Introduction

This chapter discusses additional std::string features and introduces C++17 string_views, text-file processing, CSV-file processing and regular expressions.

`std::string`s

We’ve been using std::string objects since Chapter 2. Here, we introduce many more std::string manipulations, including assignment, comparisons, extracting substrings, searching for substrings, modifying std::string objects and converting std::string objects to numeric values. We also introduce a C++20 change to the mechanics of the std::string member function reserve.

C++17 `string_view`s

We introduce C++17’s string_views, which are read-only views of C-strings or std::string objects. Like std::span, a string_view does not own the data it views. You’ll see that string_views have many of the same capabilities as std::strings, making them appropriate for many cases in which you do not need modifiable strings.

Text Files

Data storage in memory is temporary. Files are used for data persistence, that is, the permanent retention of data. Computers store files on secondary storage devices, such as flash drives, and, increasingly, in the cloud. In this chapter, we explain how to build C++ programs that create, update and process sequential text files. We also show how to output data to and read data from a std::string in memory using ostringstreams and istringstreams.

Objects Natural Case Study: CSV Files and the Titanic Disaster Dataset

In this chapter’s first objects-natural case study, we introduce the CSV (comma-separated values) file format. CSV is popular for datasets used in big data, data analytics and data science, and artificial intelligence applications like natural language processing, machine learning and deep learning.

One of the most commonly used datasets for data analytics and data science beginners is the Titanic disaster dataset. It lists all the passengers and whether they survived when the ship Titanic struck an iceberg and sank on its maiden voyage April 14–15, 1912. We use a class from the open-source rapidcsv library to create an object that reads the Titanic dataset from a CSV file. Then, we view some of the data and perform some basic data analytics.

Objects Natural Case Study: Using Regular Expressions to Search Strings for Patterns, Validate Data and Replace Substrings

In this chapter’s second objects-natural case study, we introduce regular expressions, which are particularly crucial in today’s data-rich applications. We’ll use C++11 regex objects to create regular expressions, then use them with various functions in the <regex> header to match patterns in text. In earlier chapters, we mentioned the importance of validating user input in industrial-strength code. The std::string, string stream and regular expression capabilities presented in this chapter are frequently used to validate data.

8.2 `string` Assignment and Concatenation

Figure 8.1 demonstrates std::string assignment and concatenation.
