【C++】12.string类的使用

文章目录

1. 为什么学习string类？
- 1.1 C语言中的字符串
- 1.2 两个面试题(暂不做讲解)
2. 标准库中的string类
- 2.1 string类(了解)
- 2.2 auto和范围for
3. 查看技术文档
4. string的访问
5. 如何读取每个字符呢？
6. auto语法糖（C++11）
7. 范围for 语法糖（C++11）
8. reserve
9. resize
10. Modifiers
- insert，erase（偶尔使用）
- replace（偶尔使用）
11. find
12.c_str，rfind，find_first_not_of
13. 需要掌握的部分

1. 为什么学习string类？

1.1 C语言中的字符串

C语言中，字符串是以'\0'结尾的一些字符的集合，为了操作方便，C标准库中提供了一些str系列的库函数，但是这些库函数与字符串是分离开的，不太符合OOP的思想，而且底层空间需要用户自己管理，稍不留神可能还会越界访问。

1.2 两个面试题(暂不做讲解)

字符串相加

在OJ中，有关字符串的题目基本以string类的形式出现，而且在常规工作中，为了简单、方便、快捷，基本都使用string类，很少有人去使用C库中的字符串操作函数。

2. 标准库中的string类

2.1 string类(了解)

string类的文档介绍

在使用string类时，必须包含#include头文件以及using namespace std;

2.2 auto和范围for

auto关键字

在这里补充2个C++11的小语法，方便我们后面的学习。

在早期C/C++中auto的含义是：使用auto修饰的变量，是具有自动存储器的局部变量，后来这个不重要了。C++11中，标准委员会变废为宝赋予了auto全新的含义即：auto不再是一个存储类型指示符，而是作为一个新的类型指示符来指示编译器，auto声明的变量必须由编译器在编译时期推导而得。

用auto声明指针类型时，用auto和auto*没有任何区别，但用auto声明引用类型时则必须加&

当在同一行声明多个变量时，这些变量必须是相同的类型，否则编译器将会报错，因为编译器实际只对第一个类型进行推导，然后用推导出来的类型定义其他变量。

auto不能作为函数的参数，可以做返回值，但是建议谨慎使用

auto不能直接用来声明数组

#include<iostream>
using namespace std;
int func1()
{
    return 10;
}
// 不能做参数
void func2(auto a)
{}
// 可以做返回值，但是建议谨慎使用
auto func3()
{
    return 3;
}
int main()
{
    int a = 10;
    auto b = a;
    auto c = 'a';
    auto d = func1();
    // 编译报错:rror C3531: “e”: 类型包含“auto”的符号必须具有初始值设定项
    auto e;
    cout << typeid(b).name() << endl;
    cout << typeid(c).name() << endl;
    cout << typeid(d).name() << endl;
    int x = 10;
    auto y = &x;
    auto* z = &x;
    auto& m = x;
    cout << typeid(x).name() << endl;
    cout << typeid(y).name() << endl;
    cout << typeid(z).name() << endl;
    auto aa = 1, bb = 2;
    // 编译报错：error C3538: 在声明符列表中，“auto”必须始终推导为同一类型
    auto cc = 3, dd = 4.0;
    // 编译报错：error C3318: “auto []”: 数组不能具有其中包含“auto”的元素类型
    auto array[] = { 4, 5, 6 };
    return 0;
}

#include<iostream>
#include <string>
#include <map>
using namespace std;
int main()
{
    std::map<std::string, std::string> dict = { { "apple", "苹果" },{ "orange", 
                                                                   "橙子" }, {"pear","梨"} };
    // auto的用武之地
    //std::map<std::string, std::string>::iterator it = dict.begin();
    auto it = dict.begin();
    while (it != dict.end())
    {
        cout << it->first << ":" << it->second << endl;
        ++it;
    }
    return 0;
}

3. 查看技术文档

网址：https://legacy.cplusplus.com/reference/
896896a9ab70440e3c484718b19c40da

848e7892cf9965b8d418248a4a07a8b3

#include <iostream>
#include <string>

using namespace std;

int main()
{
	string s1;//构造一个没有有效字符的string
	string s2("1111122222");//字符串初始化
	string s3("1111111111", 3);//字符串的前3个初始化
	string s4(100, 'x');//100个x字符初始化
	string s5(s2, 4, 3);//将string从pos开始的len个字符拷贝给它
	string s6(s2, 4);//没有传第三个参数就拷贝到s2的结束
	string s7(s2, 4, 20);//传递的第三个参数比剩下的字符长，就拷贝到结束

	cout << s1 << endl;//（什么都不打印）
	cout << s2 << endl;//1111122222
	cout << s3 << endl;//111
	cout << s4 << endl;//xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
	cout << s5 << endl;//122
	cout << s6 << endl;//122222
	cout << s7 << endl;//122222

	return 0;
}

f970bc64da1180333e54ba3e470df593

b7a4feac66f55e01c39bbe134c6935ac

d30c02b045c40d8057f9e8cac6c047ab

这个npos是string里面的一个静态成员变量，所以在这里可以做缺省参数，他给的是-1，其实他不是-1，是整形的最大值。因为这里他的类型是size_t也就是unsigned int。

所以第三个参数不写就会一直走下去，因为是整型的最大值。所以会走到字符串结束。

4. string的访问

edfc246cd7c4ce0996abaa4087b87652

访问第pos位置的字符。

string为了和C兼容，末尾是添加了\0的。string是在STL之前就有的，所以有的接口会比较繁琐。

operator[]的底层类似于

class string
{
public:
	char& operator[] (size_t pos)
	{
		assert(pos < _size);//防止越界
		return _str[pos];
	}
private:
	char* _str;
	size_t _size;
	size_t _capacity;
};

这里面牛逼的地方在于是传引用返回，而不是传值返回。

这样做有一个好处，如果想修改一个字符，就可以直接修改。

#include <iostream>
#include <string>

using namespace std;

int main()
{
	string s2("1111122222");//字符串初始化
	cout << s2 << endl;//1111122222
	
	//s2.operator[](0) = 'x';
	s2[0] = 'x';
	cout << s2 << endl;//x111122222

	return 0;
}

但他不是数组，相当于调用s2.operator[](0) = 'x';

这个使用起来很方便，就像数组一样使用。

我们想遍历这个string的每个字符，就可以像数组一样去遍历每个字符。

#include <iostream>
#include <string>

using namespace std;

int main()
{
	string s2("1111122222");//字符串初始化
	cout << s2 << endl;//1111122222
	
	s2[0] = 'x';
	s2[5] = 'x';
	cout << s2 << endl;//x1111x2222

	for (size_t i = 0; i < s2.size(); i++)
	{
		s2[i]++;
	}
	cout << s2 << endl;//y2222y3333
	
	return 0;
}

可以很好的解决越界问题，因为里面有assert断言

d57e85b6664f3c2ca3b8bec291ffa828

5. 如何读取每个字符呢？

下标+[]

int main()
{
	//auto ret1 = func2();
	string s1("hello world");

	//1. 下标+[]
	for (size_t i = 0; i < s1.size(); i++)
	{
		s1[i]++;
	}

	for (size_t i = 0; i < s1.size(); i++)
	{
		cout << s1[i] << " ";
	}
	cout << endl;
	
	return 0;
}

710d197d4c9ae8ded131484448d8a54b

迭代器

int main()
{
	string s1("hello world");

	//2. 迭代器 -- 像指针一样的对象
	string::iterator it1 = s1.begin();//begin返回第一个类型的迭代器
	while (it1 != s1.end())//end返回最后一个数据的下一个位置的迭代器
	{
		(*it1)--;//读+修改

		++it1;
	}
	cout << endl;

	it1 = s1.begin();
	while (it1 != s1.end())
	{
		cout << *it1 << " ";//读
		++it1;
	}
	cout << endl;
	return 0;
}

迭代器是所有容器通用的，只是在string这里面看起来麻烦点，但是下标+[]不是通用的。

while (it1 != s1.end())这里面推荐写!=，虽然这里用<也可以，但这是因为这里的空间是连续的，但是别的地方不一定可以。

链表的迭代器的定义：

int main() {
	list<int> lt;
	lt.push_back(1);
	lt.push_back(2);
	lt.push_back(3);
	lt.push_back(4);

	list<int>::iterator it = lt.begin();
	while (it != lt.end())
	{
		cout << *it << " ";//这里的*是运算符重载，不是解引用
		++it;//这里的++是运算符重载
	}
	cout << endl;

	return 0;
}

打印：

1 2 3 4

6. auto语法糖（C++11）

int main()
{
	int i = 0;
	int j = i;
	// auto会自动推导类型 C++11
	auto z = i;   // z是int
	auto x = 1.1; // x是double
	auto p = &i;  // p是int*
    
	int& r1 = i;
	auto r2 = r1;  // r2是int，因为本质上r1是i的引用，改变r1就改变i，r1就是int类型的
	auto& r3 = r1; // r3是int&
	//auto r4;       // 报错
    
	return 0;
}

auto看起来没什么用，那么auto真正的价值是什么呢？

list<int>::iterator it = lt.begin();
auto it = lt.begin();//简化代码，替代写起来长的类型

迭代器比较长的就可以用auto来简化代码。

7. 范围for 语法糖（C++11）

对于一个有范围的集合而言，由程序员来说明循环的范围是多余的，有时候还会容易犯错误。因此C++11中引入了基于范围的for循环。for循环后的括号由冒号“ ：”分为两部分：第一部分是范围内用于迭代的变量，第二部分则表示被迭代的范围，自动迭代，自动取数据，自动判断结束。

范围for可以作用到数组和容器对象上进行遍历

范围for的底层很简单，容器遍历实际就是替换为迭代器，这个从汇编层也可以看到。

适用于：容器遍历和数组遍历

自动取容器的数据赋值给左边的对象

自动++，自动判断结束

原理：范围for底层是迭代器

string s1("hello world");

for (auto ch : s1)
{
    cout << ch << " ";
}
cout << endl;

for (auto e : lt)
{
    cout << e << " ";
}
cout << endl;

// 修改
for (auto& ch : s1)
{
    ch++;
}

总结：对string来说有三种遍历方法：

下标+[]
迭代器
范围for

之前这么写：

int main(){
    int a[] = { 1,2,3,4,5,6 };
    for (size_t i = 0; i < sizeof(a)/sizeof(int); i++)
    {
        cout << a[i] << " ";
    }
    cout << endl;
    
	return 0;
}

现在这么写：

int main(){
    int a[] = { 1,2,3,4,5,6 };
    for (auto e : a)
    {
        cout << e << " ";
    }
    cout << endl;

    return 0;
}

简单多了。

迭代器可以倒着遍历吗？

可以使用反向迭代器，倒着走。

int main()
{
	string s1("hello world");
	string::reverse_iterator rit = s1.rbegin();
	while (rit != s1.rend())
	{
		cout << *rit << " ";
		++rit;
	}
	cout << endl;
    return 0;
}

ff007fe0b3e1c16860c99185fb37a673

const迭代器

int main(){
    const string s2(s1);
    //string::const_iterator it1 = s2.begin();
    auto it1 = s2.begin();//用auto替代
    while (it1 != s2.end())
    {
        //*it1 += 1;//const_iterator不能给常量赋值，*it1是常量
        cout << *it1 << " ";
        ++it1;
    }
    cout << endl;

    //string::const_reverse_iterator rit1 = s2.rbegin();
    auto rit1 = s2.rbegin();//用auto替代
    while (rit1 != s2.rend())
    {
        cout << *rit1 << " ";
        ++rit1;
    }
    cout << endl;

    return 0;
}

迭代器有四种：

iterator
reverse_iterator
const_iterator
const_reverse_iterator

8. reserve

2fa98b2f1fee61ab3146ebc9c64ec4e7

代码1：

int main()
{
	try
	{
		string s1("hello world11111");
		cout << s1.size() << endl;//16
		cout << s1.length() << endl;//16
		cout << s1.max_size() << endl;//9223372036854775807
		cout << s1.capacity() << endl;//31（扩容一次会多扩容一点），capacity没计算\0的空间
		cout << endl << endl;

		s1.clear();//把所有数据清除，但是不清除空间
		cout << s1.size() << endl;//0
		cout << s1.capacity() << endl;//31

		//s1[20];//越界了会断言报错

		s1.at(20);//at会捕获异常
	}
	catch (const exception& e)
	{
		cout << e.what() << endl;//invalid string position
	}

	return 0;
}

代码2：

int main()
{
	string s1("hello");
	s1.push_back(',');//尾插
	s1.push_back('w');
	cout << s1 << endl;//hello,w

	s1.append("orld");//尾插字符串
	cout << s1 << endl;//hello,world

	s1.append(10, '!');
	cout << s1 << endl;//hello,world!!!!!!!!!!

	string s2("hello bit hello world");

	s1.append(s2.begin()+6, s2.end());
	cout << s1 << endl;//hello,world!!!!!!!!!!bit hello world

	string s3("hello");
	s3 += ',';
	s3 += "world";
	cout << s3 << endl;//hello,world

	return 0;
}

代码3-1：

int main()
{
	string s1;

	size_t old = s1.capacity();//记录之前的旧的容量
	cout << "capacity:" << old << endl;
	for (size_t i = 0; i < 200; i++)
	{
		s1 += 'x';
		if (s1.capacity() != old)//如果容量发生变化，就说明扩容了
		{
			cout << "capacity:" << s1.capacity() << endl;//capacity:207
			old = s1.capacity();
		}
	}

	return 0;
}

打印：

capacity:15
capacity:31
capacity:47
capacity:70
capacity:105
capacity:157
capacity:235

差不多是1.5倍扩容

代码3-2：

//reserve  保留 预留
//reverse  反向 翻转
int main()
{
	string s1;
	//提前开空间，避免扩容
	s1.reserve(200);

	size_t old = s1.capacity();//记录之前的旧的容量
	cout << "capacity:" << old << endl;
	for (size_t i = 0; i < 200; i++)
	{
		s1 += 'x';
		if (s1.capacity() != old)//如果容量发生变化，就说明扩容了
		{
			cout << "capacity:" << s1.capacity() << endl;//capacity:207
			old = s1.capacity();
		}
	}

	string s3("11111111");
	string s4("111111112222222222222222222222222222222222222222222");
	cout << sizeof(s3) << endl;//40

	return 0;
}

9. resize

46dcc5aea9ca93a3e2d1096dce72cd74

6ec519458b9addbbafe6b5ddadd64896

int main()
{
	string s1("11111111111111111111");
	cout << s1 << endl;
	cout << s1.size() << endl;
	cout << s1.capacity() << endl;

	// 删除
	// n < size
	s1.resize(15);
	cout << s1 << endl;
	cout << s1.size() << endl;
	cout << s1.capacity() << endl;

	// 插入
	// size < n < capacity
	s1.resize(25, 'x');
	cout << s1 << endl;
	cout << s1.size() << endl;
	cout << s1.capacity() << endl;

	// n > capacity
	s1.resize(40, 'x');
	cout << s1 << endl;
	cout << s1.size() << endl;
	cout << s1.capacity() << endl;

	return 0;
}

打印：

11111111111111111111//20个1
20
31
111111111111111//15个1
15
31
111111111111111xxxxxxxxxx//15个1，10个x
25
31
111111111111111xxxxxxxxxxxxxxxxxxxxxxxxx//15个1，25个x
40
47

f48fb4d4aadf92bb28fa45f0d2b5c02f

10. Modifiers

5b0907f63f66425430082365c95b172d

insert，erase（偶尔使用）

int main()
{
	string s1("hello world");
	s1.insert(5, "xxxx");
	cout << s1 << endl;

	s1.erase(5, 5);
	cout << s1 << endl;

	s1.erase(0, 1);
	cout << s1 << endl;

	s1.erase(s1.begin());
	cout << s1 << endl;

	s1.erase(5, 10);
	cout << s1 << endl;

	return 0;
}

打印：

helloxxxx world
helloworld
elloworld
lloworld
llowo

replace（偶尔使用）

ba8761bce0e4632440ea2c2be5c9dd4c

int main()
{
	string s1("hello                          world hello bit");
	cout << s1 << endl;
	size_t i = s1.find(' ');
	while (i != string::npos)
	{
		s1.replace(i, 1, "%%");
		i = s1.find(' ', i+2);
	}
	cout << s1 << endl;

	string s2;
	for (auto ch : s1)
	{
		if (ch != ' ')
			s2 += ch;
		else
			s2 += "%%";
	}
	cout << s2 << endl;
	//s1.swap(s2);


	return 0;
}

打印：

hello                          world hello bit
hello%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%world%%hello%%bit
hello%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%world%%hello%%bit

11. find

int main()
{
	string s1("hello                          world hello bit");
	cout << s1 << endl;
    
	size_t i = s1.find(' ');
	while (i != string::npos)
	{
		s1.replace(i, 1, "%%");
		i = s1.find(' ', i+2);
	}
	cout << s1 << endl;
    
    return 0;
}

打印：

hello                          world hello bit
hello%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%world%%hello%%bit

12.c_str，rfind，find_first_not_of

可以更好地兼容C的接口

ab5850464539a2698461835170122c69

2200ed3a3c1b50ae5c1767158496e9f0

04ffcfe14182d31d766e7577f28a1fb8

int main()
{
	string s1("hello world");
	cout << s1 << endl;//hello world
	cout << s1.c_str() << endl;//hello world

	string s2("Test.cpp");
	FILE* fout = fopen(s2.c_str(), "r");
	char ch = fgetc(fout);
	/*while (ch != EOF)
	{
		cout << ch;
		ch = fgetc(fout);
	}*/

	const char* p1 = "xxxxx";
	int* p2 = nullptr;
	cout << (void*)p1 << endl;//00007FF64CEC1648
	cout << p2 << endl;//0000000000000000

    //rfind
	string s3("test.cpp.zip");
	size_t pos = s3.rfind('.');
	if (pos != string::npos)
	{
		string sub = s3.substr(pos);
		cout << sub << endl;//.zip
	}

	//find_first_not_of
	std::string str("Please, replace the vowels in this sentence by asterisks.");
	std::size_t found = str.find_first_not_of("aeiou");
	while (found != std::string::npos)
	{
		str[found] = '*';
		found = str.find_first_not_of("aeiou", found + 1);
	}

	std::cout << str << '\n';//**ea*e***e**a*e***e**o*e***i****i***e**e**e****a**e*i****

	return 0;
}

打印：

hello world
hello world
00007FF64CEC1648
0000000000000000
.zip
**ea*e***e**a*e***e**o*e***i****i***e**e**e****a**e*i****

13. 需要掌握的部分

6ea8cebf60431a8fdf983c3631927c59