基于Python怎么实现文件分类器
通过自定义需要整理的文件目录,将该目录下面的全部文件按照文件格式完成分类操作。
实现逻辑使用的python技术栈就是os、glob、shutil三个标准库的综合运用,完成自动化的文件整理。
分别将这三个文件处理模块导入代码块中,进入后续的开发操作。
# It imports the os module. import os # Shutil is a module that provides a number of high-level operations on files and collections of files. import shutil # The glob module finds all the pathnames matching a specified pattern according to the rules used by the Unix shell, # although results are returned in arbitrary order. No tilde expansion is done, but *, ?, and character ranges expressed # with [] will be correctly matched. import glob import sys
将需要分类的文件目录uncatched_dir以及分类后文件存放目录target_dir设置为可以手动输入的方式。
# Asking the user to input the path of the directory that contains the files to be sorted. uncatched_dir = input('请输入待分类的文件路径:\n') # It checks if the uncatched_dir is empty. if uncatched_dir.strip() == '': print('待分类的文件夹路径不能为空!') sys.exit() # Asking the user to input the path of the directory that contains the files to be sorted. target_dir = input('请输入分类后文件存放的目标路径:\n') # It checks if the target_dir is empty. if target_dir.strip() == '': print('分类后的文件存放路径不能为空!') sys.exit()
检验输入的分类后文件存放目录路径是否存在,因为很可能是输入一个新的路径,不存在时则新建一个该路径。
# It checks if the target_dir exists. If it does not exist, it creates a new directory in the current working directory. if not os.path.exists(target_dir): # It creates a new directory in the current working directory. os.mkdir(target_dir)
定义一个文件移动数量的变量file_move_num,以及一个新建的文件夹数量的变量dir_new_num用于记录文件整理的结果记录。
# A variable that is used to count the number of files that have been moved. file_move_num = 0 # A variable that is used to count the number of new directories that have been created. dir_new_num = 0
遍历需要整理的文件夹目录uncatched_dir,对该目录下面的所有类型的文件进行自动整理操作。
# A for loop that iterates through all the files in the uncatched_dir directory. for file_ in glob.glob(f'{uncatched_dir}/**/*', recursive=True): # It checks if the file is a file. if os.path.isfile(file_): # It gets the file name of the file. file_name = os.path.basename(file_) # Checking if the file name contains a period. if '.' in file_name: # Getting the suffix of the file. suffix_name = file_name.split('.')[-1] else: # Used to classify files that do not have a suffix. suffix_name = 'others' # It checks if the directory exists. If it does not exist, it creates a new directory in the current working # directory. if not os.path.exists(f'{target_dir}/{suffix_name}'): # It creates a new directory in the current working directory. os.mkdir(f'{target_dir}/{suffix_name}') # Adding 1 to the variable dir_new_num. dir_new_num += 1 # It copies the file to the target directory. shutil.copy(file_, f'{target_dir}/{suffix_name}') # Adding 1 to the variable file_move_num. file_move_num += 1
注意:为了避免移动文件夹而造成的异常,尤其是系统盘,因此这里用的是复制,也就是shutil.copy函数使用。
最后,将文件分类数量、文件夹新建数量使用print函数进行打印即可。
print(f'整理完成,有{file_move_num}个文件分类到了{dir_new_num}个文件夹中!\n') input('输入任意键关闭窗口...')
为了避免程序执行完成后直接将命令窗口关闭,上面使用了input函数来保持窗口暂停的效果。
以上就是基于Python怎么实现文件分类器的详细内容,更多请关注www.sxiaw.com其它相关文章!