Crawling and searching directories
In this recipe, we'll learn how to scan a directory recursively to get all the files contained there. That will include all the files in subdirectories. The matched files can be of a particular kind, like text files, or every single one of them.
This is normally a starting operation when dealing with files, to detect all the existing ones.
Getting ready
Let's start by creating a test directory with some file information:
$ mkdir dir
$ touch dir/file1.txt
$ touch dir/file2.txt
$ mkdir dir/subdir
$ touch dir/subdir/file3.txt
$ touch dir/subdir/file4.txt
$ touch dir/subdir/file5.pdf
$ touch dir/file6.pdf
All the files will be empty; we will use them in this recipe only to discover them. Notice there are four files that have a .txt
extension, and two that have a .pdf
extension.
The files are also available in the GitHub repository here: https://github.com/PacktPublishing/Python-Automation-Cookbook-Second...