python数字和字符串的各种操作（深度总结并实例演示）

程序员文章站 2023-11-21 16:47:52

数字1.数字和字符串2 字符串的方法使用 str.format() 进行字符串格式化。1.数字和字符串解释器就像一个简单的计算器一样：你可以在里面输入一个表达式然后它会写出答案。表达式的语法很直接：运算符 +、-、*、/ 的用法和其他大部分语言一样（比如 Pascal 或者 C 语言）；括号 (()) 用来分组。比如:>>> 2 + 24>>> 50 - 5*620>>> (50 - 5*6) / 45.0>>> 8...

数字

1.数字和字符串
2 字符串的方法
使用 str.format() 进行字符串格式化。

1.数字和字符串

解释器就像一个简单的计算器一样：你可以在里面输入一个表达式然后它会写出答案。表达式的语法很直接：运算符 +、-、*、/ 的用法和其他大部分语言一样（比如 Pascal 或者 C 语言）；括号 (()) 用来分组。比如:

>>> 2 + 2 4 >>> 50 - 5*6 20 >>> (50 - 5*6) / 4 5.0 >>> 8 / 5 # division always returns a floating point number 1.6

整数（比如 2、4、20 ）的类型是 int，有小数部分的（比如 5.0、1.6 ）的类型是 float。在这个手册的后半部分我们会看到更多的数字类型。

除法运算 (/) 永远返回浮点数类型。如果要做 floor division 得到一个整数结果（忽略小数部分）你可以使用 // 运算符；如果要计算余数，可以使用 %

>>> 17 / 3 # classic division returns a float 5.666666666666667 >>> >>> 17 // 3 # floor division discards the fractional part 5 >>> 17 % 3 # the % operator returns the remainder of the division 2 >>> 5 * 3 + 2 # result * divisor + remainder 17

在Python中，可以使用 ** 运算符来计算乘方

>>> 5 ** 2 # 5 squared 25 >>> 2 ** 7 # 2 to the power of 7 128

因为 ** 比 - 有更高的优先级, 所以 -3**2 会被解释成 -(3**2) ，因此结果是 -9. 为了避免这个并且得到结果 9, 你可以用这个式子 (-3)**2.

如果一个变量未定义（未赋值），试图使用它时会向你提示错误:

>>> n # try to access an undefined variable Traceback (most recent call last): File "<stdin>", line 1, in <module> NameError: name 'n' is not defined

除了数字，Python 也可以操作字符串。字符串有多种形式，可以使用单引号（'...'），双引号（"..."）都可以获得同样的结果。反斜杠 \ 可以用来转义:

>>> 'spam eggs' # single quotes 'spam eggs' >>> 'doesn\'t' # use \' to escape the single quote... "doesn't" >>> "doesn't" # ...or use double quotes instead "doesn't" >>> '"Yes," they said.' '"Yes," they said.' >>> "\"Yes,\" they said." '"Yes," they said.' >>> '"Isn\'t," they said.' '"Isn\'t," they said.'

在交互式解释器中，输出的字符串外面会加上引号，特殊字符会使用反斜杠来转义。虽然有时这看起来会与输入不一样（外面所加的引号可能会改变），但两个字符串是相同的。如果字符串中有单引号而没有双引号，该字符串外将加双引号来表示，否则就加单引号。 print() 函数会生成可读性更强的输出，即略去两边的引号，并且打印出经过转义的特殊字符:

>>> '"Isn\'t," they said.' '"Isn\'t," they said.' >>> print('"Isn\'t," they said.') "Isn't," they said. >>> s = 'First line.\nSecond line.' # \n means newline >>> s # without print(), \n is included in the output 'First line.\nSecond line.' >>> print(s) # with print(), \n produces a new line First line. Second line.

和其他语言不一样的是, 特殊字符比如说 \n 在单引号 ('...') 和双引号 ("...") 里有一样的意义. 这两种引号唯一的区别是，你不需要在单引号里转义双引号 " (但是你必须把单引号转义成 \') ，反之亦然.

如果你不希望前置了 \ 的字符转义成特殊字符，可以使用原始字符串方式，在引号前添加 r 即可:

>>> print('C:\some\name') # here \n means newline! C:\some
ame >>> print(r'C:\some\name') # note the r before the quote C:\some\name

字符串字面值可以跨行连续输入。一种方式是用三重引号：“”“......"""或 '''......'''。字符串中的回车换行会自动包含到字符串中，如果不想包含，在行尾添加一个 \ 即可。如下例:

print("""\
Usage: thingy [OPTIONS]
     -h                        Display this usage message
     -H hostname               Hostname to connect to
""")

将产生如下输出（注意最开始的换行没有包括进来）:

Usage: thingy [OPTIONS] -h                        Display this usage message -H hostname               Hostname to connect to

如果字符串末尾不想包含回车，在行尾添加一个 \ 即可。

print("""
Usage: thingy [OPTIONS]
     -h                        Display this usage message\
     -H hostname               Hostname to connect to
""") Usage: thingy [OPTIONS] -h                        Display this usage message -H hostname               Hostname to connect to

字符串可以用 + 进行连接（粘到一起），也可以用 * 进行重复:

>>> # 3 times 'un', followed by 'ium' >>> 3 * 'un' + 'ium' 'unununium'

相邻的两个或多个字符串字面值（引号引起来的字符）将会自动连接到一起.

>>> 'Py' 'thon' 'Python' text = ('Put' 'several' 'strings within parentheses ''to have them joined together.') text Out[34]: 'Putseveralstrings within parentheses to have them joined together.'

把很长的字符串拆开分别输入的时候尤其有用:

>>> text = ('Put several strings within parentheses ' ... 'to have them joined together.') >>> text 'Put several strings within parentheses to have them joined together.'

只能对两个字面值这样操作，变量或表达式不行:

>>> prefix = 'Py' >>> prefix 'thon' # can't concatenate a variable and a string literal File "<stdin>", line 1 prefix 'thon' ^ SyntaxError: invalid syntax >>> ('un' * 3) 'ium' File "<stdin>", line 1 ('un' * 3) 'ium' ^ SyntaxError: invalid syntax

如果你想连接变量，或者连接变量和字面值，可以用 + 号:

>>> prefix + 'thon' 'Python'

字符串是可以被 索引 （下标访问）的，第一个字符索引是 0。单个字符并没有特殊的类型，只是一个长度为一的字符串:

>>> word = 'Python' >>> word[0] # character in position 0 'P' >>> word[5] # character in position 5 'n'

索引也可以用负数，这种会从右边开始数:

>>> word[-1] # last character 'n' >>> word[-2] # second-last character 'o' >>> word[-6] 'P'

注意 -0 和 0 是一样的，所以负数索引从 -1 开始。

除了索引，字符串还支持 切片。索引可以得到单个字符，而 切片 可以获取子字符串:

>>> word[0:2] # characters from position 0 (included) to 2 (excluded) 'Py' >>> word[2:5] # characters from position 2 (included) to 5 (excluded) 'tho'

注意切片的开始总是被包括在结果中，而结束不被包括。这使得 s[:i] + s[i:] 总是等于 s

>>> word[:2] + word[2:] 'Python' >>> word[:4] + word[4:] 'Python'

切片的索引有默认值；省略开始索引时默认为0，省略结束索引时默认为到字符串的结束:

>>> word[:2] # character from the beginning to position 2 (excluded) 'Py' >>> word[4:] # characters from position 4 (included) to the end 'on' >>> word[-2:] # characters from the second-last (included) to the end 'on'

您也可以这么理解切片：将索引视作指向字符之间，第一个字符的左侧标为0，最后一个字符的右侧标为 n ，其中 n 是字符串长度。例如:

 +---+---+---+---+---+---+ | P | y | t | h | o | n | +---+---+---+---+---+---+ 0 1 2 3 4 5 6 -6 -5 -4 -3 -2 -1

试图使用过大的索引会产生一个错误:

>>> word[42] # the word only has 6 characters Traceback (most recent call last): File "<stdin>", line 1, in <module> IndexError: string index out of range

但是，切片中的越界索引会被自动处理:

>>> word[4:42] 'on' >>> word[42:] ''

Python 中的字符串不能被修改，它们是 immutable 的。因此，向字符串的某个索引位置赋值会产生一个错误:

>>> word[0] = 'J' Traceback (most recent call last): File "<stdin>", line 1, in <module> TypeError: 'str' object does not support item assignment >>> word[2:] = 'py' Traceback (most recent call last): File "<stdin>", line 1, in <module> TypeError: 'str' object does not support item assignment

如果需要一个不同的字符串，应当新建一个:

>>> 'J' + word[1:] 'Jython' >>> word[:2] + 'py' 'Pypy'

内建函数 len() 返回一个字符串的长度:

>>> s = 'supercalifragilisticexpialidocious' >>> len(s) 34

2 字符串的方法

字符串的方法: link.

count

str.count(sub[, start[, end]])

返回子字符串 sub 在 [start, end] 范围内非重叠出现的次数。可选参数 start 与 end 会被解读为切片表示法。

find

str.find(sub[, start[, end]])

返回子字符串 sub 在 s[start:end] 切片内被找到的最小索引。可选参数 start 与 end 会被解读为切片表示法。如果 sub 未被找到则返回 -1。

str1= 'dsdsdsfapapgkgfgkrpg' str1.count('ds',0,len(str1)-1) Out[41]: 3 str1.find('ds',0,len(str1)-1) Out[42]: 0

使用 str.format() 进行字符串格式化。

格式字符串语法: link.

本节包含 str.format() 语法的示例以及与旧式 % 格式化的比较。

该语法在大多数情况下与旧式的 % 格式化类似，只是增加了 {} 和 : 来取代 %。 例如，，'%03.2f' 可以被改写为 '{:03.2f}'。

新的格式语法还支持新增的不同选项，将在以下示例中说明。

按位置访问参数:

>>> '{0}, {1}, {2}'.format('a', 'b', 'c') 'a, b, c' >>> '{}, {}, {}'.format('a', 'b', 'c') # 3.1+ only 'a, b, c' >>> '{2}, {1}, {0}'.format('a', 'b', 'c') 'c, b, a' >>> '{2}, {1}, {0}'.format(*'abc') # unpacking argument sequence 'c, b, a' >>> '{0}, {1}, {2}'.format(*'abc') # unpacking argument sequence 'a, b, c' # 字符串前面加*，并且利用位置访问参数 >>> '{0}{1}{0}'.format('abra', 'cad') # arguments' indices can be repeated 'abracadabra'

按名称访问参数:

>>> 'Coordinates: {latitude}, {longitude}'.format(latitude='37.24N', longitude='-115.81W') 'Coordinates: 37.24N, -115.81W' >>> coord = {'latitude': '37.24N', 'longitude': '-115.81W'} >>> 'Coordinates: {latitude}, {longitude}'.format(**coord) #字符串前面加**，并且利用位名称访问参数 'Coordinates: 37.24N, -115.81W'

c = 3-5j ('The complex number {0} is formed from the real part {0.real} ' ... 'and the imaginary part {0.imag}.').format(c) Out[70]: 'The complex number (3-5j) is formed from the real part 3.0 and the imaginary part -5.0.' class Point1: def __init__(self,x,y): self.x, self.y=x, y def __str__(self): return 'Point({self.x}, {self.y})'.format(self=self) >>>print(str(Point1(11,22))) Point(11, 22)

访问参数的项:

>>> coord = (3, 5) >>> 'X: {0[0]};  Y: {0[1]}'.format(coord) 'X: 3;  Y: 5'

替代 %s 和 %r:

>>> "repr() shows quotes: {!r}; str() doesn't: {!s}".format('test1', 'test2') "repr() shows quotes: 'test1'; str() doesn't: test2"

对齐文本以及指定宽度:

>>> '{:<30}'.format('left aligned') 'left aligned                  ' >>> '{:>30}'.format('right aligned') '                 right aligned' >>> '{:^30}'.format('centered') '           centered           ' >>> '{:*^30}'.format('centered') # use '*' as a fill char '***********centered***********'

替代 %+f, %-f 和 % f 以及指定正负号:

>>> '{:+f}; {:+f}'.format(3.14, -3.14) # show it always '+3.140000; -3.140000' # +号仅对正数有效 >>> '{: f}; {: f}'.format(3.14, -3.14) # show a space for positive numbers ' 3.140000; -3.140000' >>> '{:-f}; {:-f}'.format(3.14, -3.14) # show only the minus -- same as '{:f}; {:f}' ，-号仅对负数有效 '3.140000; -3.140000'

替代 %x 和 %o 以及转换基于不同进位制的值:

>>> # format also supports binary numbers >>> "int: {0:d};  hex: {0:x};  oct: {0:o};  bin: {0:b}".format(42) 'int: 42;  hex: 2a;  oct: 52;  bin: 101010' >>> # with 0x, 0o, or 0b as prefix: >>> "int: {0:d};  hex: {0:#x};  oct: {0:#o};  bin: {0:#b}".format(42) 'int: 42;  hex: 0x2a;  oct: 0o52;  bin: 0b101010'

使用逗号作为千位分隔符:

>>> '{:,}'.format(1234567890) '1,234,567,890'

表示为百分数:

>>> points = 19 >>> total = 22 >>> 'Correct answers: {:.2%}'.format(points/total) 'Correct answers: 86.36%'

使用特定类型的专属格式化:

>>> import datetime >>> d = datetime.datetime(2010, 7, 4, 12, 15, 58) >>> '{:%Y-%m-%d %H:%M:%S}'.format(d) '2010-07-04 12:15:58'

本文地址：https://blog.csdn.net/yalipf/article/details/107059362