Python中fnmatch模块的使用详情

程序员文章站 2022-03-11 17:25:47

fnamtch就是filenamematch, 在python中利用符合linuxshell风格的匹配模块来进行文件名的匹配筛选工作。 fnmatch()函数匹配能力介于...

fnamtch就是filenamematch, 在python中利用符合linuxshell风格的匹配模块来进行文件名的匹配筛选工作。

fnmatch()函数匹配能力介于简单的字符串方法和强大的正则表达式之间，如果在数据处理操作中只需要简单的通配符就能完成的时候，这通常是一个比较合理的方案。此模块的主要作用是文件名称的匹配，并且匹配的模式使用的unix shell风格。源码很简单：

"""filename matching with shell patterns.

fnmatch(filename, pattern) matches according to the local convention.
fnmatchcase(filename, pattern) always takes case in account.

the functions operate by translating the pattern into a regular
expression. they cache the compiled regular expressions for speed.

the function translate(pattern) returns a regular expression
corresponding to pattern. (it does not compile it.)
"""
import os
import posixpath
import re
import functools

__all__ = ["filter", "fnmatch", "fnmatchcase", "translate"]

def fnmatch(name, pat):
  """test whether filename matches pattern.

  patterns are unix shell style:

  *    matches everything
  ?    matches any single character
  [seq]  matches any character in seq
  [!seq] matches any char not in seq

  an initial period in filename is not special.
  both filename and pattern are first case-normalized
  if the operating system requires it.
  if you don't want this, use fnmatchcase(filename, pattern).
  """
  name = os.path.normcase(name)
  pat = os.path.normcase(pat)
  return fnmatchcase(name, pat)

@functools.lru_cache(maxsize=256, typed=true)
def _compile_pattern(pat):
  if isinstance(pat, bytes):
    pat_str = str(pat, 'iso-8859-1')
    res_str = translate(pat_str)
    res = bytes(res_str, 'iso-8859-1')
  else:
    res = translate(pat)
  return re.compile(res).match

def filter(names, pat):
  """return the subset of the list names that match pat."""
  result = []
  pat = os.path.normcase(pat)
  match = _compile_pattern(pat)
  if os.path is posixpath:
    # normcase on posix is nop. optimize it away from the loop.
    for name in names:
      if match(name):
        result.append(name)
  else:
    for name in names:
      if match(os.path.normcase(name)):
        result.append(name)
  return result

def fnmatchcase(name, pat):
  """test whether filename matches pattern, including case.

  this is a version of fnmatch() which doesn't case-normalize
  its arguments.
  """
  match = _compile_pattern(pat)
  return match(name) is not none


def translate(pat):
  """translate a shell pattern to a regular expression.

  there is no way to quote meta-characters.
  """

  i, n = 0, len(pat)
  res = ''
  while i < n:
    c = pat[i]
    i = i+1
    if c == '*':
      res = res + '.*'
    elif c == '?':
      res = res + '.'
    elif c == '[':
      j = i
      if j < n and pat[j] == '!':
        j = j+1
      if j < n and pat[j] == ']':
        j = j+1
      while j < n and pat[j] != ']':
        j = j+1
      if j >= n:
        res = res + '\\['
      else:
        stuff = pat[i:j].replace('\\','\\\\')
        i = j+1
        if stuff[0] == '!':
          stuff = '^' + stuff[1:]
        elif stuff[0] == '^':
          stuff = '\\' + stuff
        res = '%s[%s]' % (res, stuff)
    else:
      res = res + re.escape(c)
  return r'(?s:%s)\z' % res

fnmatch的中的5个函数["filter", "fnmatch", "fnmatchcase", "translate"]

filter 返回列表形式的结果

def gen_find(filepat, top):
  """
  查找符合shell正则匹配的目录树下的所有文件名
  :param filepat: shell正则
  :param top: 目录路径
  :return: 文件绝对路径生成器
  """
  for path, _, filenames in os.walk(top):
    for file in fnmatch.filter(filenames, filepat):
      yield os.path.join(path, file)

fnmatch

# 列出元组中所有的python文件
pyfiles = [py for py in ('restart.py', 'index.php', 'file.txt') if fnmatch(py, '*.py')]
# 字符串的 startswith() 和 endswith() 方法对于过滤一个目录的内容也是很有用的

fnmatchcase 区分大小写的文件匹配

# 这两个函数通常会被忽略的一个特性是在处理非文件名的字符串时候它们也是很有用的。 比如，假设你有一个街道地址的列表数据
address = [
  '5412 n clark st',
  '1060 w addison st',
  '1039 w granville ave',
  '2122 n clark st',
  '4802 n broadway',
]
print([addr for addr in address if fnmatchcase(addr, '* st')])

translate 这个似乎很少有人用到，前面说了fnmatch是unix shell匹配风格，可以使用translate将其转换为正则表达式，举个栗子

shell_match = 'celery_?*.py'
print(translate(shell_match))
# 输出结果：(?s:celery_..*\.py)\z

celery_..*\.py就是正则表达式的写法。

以上就是本文的全部内容，希望对大家的学习有所帮助，也希望大家多多支持。

上一篇：软文营销平台的流量来源对比，哪个更适合未来的发展?

下一篇：营销软文写作技巧：这样可以提高转化率

Python中fnmatch模块的使用详情

Python的Django框架中的Context使用

Python的Django框架中if标签的相关使用

在Python的Django框架中创建和使用模版

Python中@property的理解和使用示例

使用Python实现将list中的每一项的首字母大写

Python中的模块导入和读取键盘输入的方法

在Python中定义和使用抽象类的方法

Python中subprocess的简单使用示例

Python中functools模块的常用函数解析

Python中的getopt函数使用详解