Step 1: Select a source (hard disk). There will be many such sources, both local and network.
Step 2: List all files, statistics, folder hierarchy, and file types in each source, together with file properties, in xls/csv/json.
Step 3: Extract the metadata of each individual file and store it in xls/csv/json.
Step 4: Compare the metadata of files from all sources and list the metadata (field-value pair) similarities between files and folders across all sources (e.g., same date, same time, same file name, same camera model, or all metadata matching). We have simple pseudocode for the similarity comparison logic.
Step 5: Visualize and list them in social graphs (e.g., Neo4j).
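
To make Steps 2 to 4 concrete, here is a minimal sketch using only the Python standard library. It is an illustration under stated assumptions, not the pseudocode mentioned in Step 4. The function names (scan_source, write_csv, shared_fields, compare_sources), the chosen fields, and the min_matches threshold are all hypothetical. It only reads filesystem properties; camera model and other embedded metadata would need an EXIF reader, which is outside this sketch. It also assumes that only files from different sources are compared.

```python
import csv
import os
from datetime import datetime, timezone
from itertools import combinations
from pathlib import Path


def scan_source(root):
    """Walk one source and return one metadata dict per file (Steps 2 and 3)."""
    records = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = Path(dirpath) / name
            info = path.stat()
            modified = datetime.fromtimestamp(info.st_mtime, tz=timezone.utc)
            records.append({
                "source": str(root),
                "folder": str(Path(dirpath).relative_to(root)),
                "name": name,
                "type": path.suffix.lower(),
                "size": info.st_size,
                "date": modified.date().isoformat(),
                "time": modified.time().isoformat(timespec="seconds"),
            })
    return records


def write_csv(records, out_path):
    """Store the inventory as CSV; does nothing for an empty inventory."""
    if not records:
        return
    with open(out_path, "w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=list(records[0]))
        writer.writeheader()
        writer.writerows(records)


def shared_fields(a, b, fields=("name", "type", "size", "date", "time")):
    """Return the field-value pairs on which two records agree."""
    return {f: a[f] for f in fields if a[f] == b[f]}


def compare_sources(records, min_matches=2):
    """Pair up files from different sources that share enough fields (Step 4).

    Assumption: min_matches=2 is an arbitrary illustrative threshold.
    """
    matches = []
    for a, b in combinations(records, 2):
        if a["source"] == b["source"]:
            continue  # assumption: only cross-source pairs are of interest
        shared = shared_fields(a, b)
        if len(shared) >= min_matches:
            matches.append((a, b, shared))
    return matches
```

A typical call would be records = scan_source(disk_a) + scan_source(disk_b), followed by write_csv(records, "inventory.csv") and compare_sources(records). Comparing every pair is quadratic in the number of files, so a real implementation over many large disks would probably index records by field value first.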
You will need to share the full Python code with us.
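
For the graph in Step 5, one possible hand-off format is a weighted edge list in which each edge links two similar files and records the fields they share. The sketch below assumes the comparison step yields (file_a, file_b, shared_fields) tuples. The function names (build_edges, clusters, edges_to_csv) and the column names are hypothetical. It uses only the standard library and does not talk to Neo4j itself; the resulting CSV would still have to be loaded into a graph database in a separate step.

```python
import csv
import io
from collections import defaultdict


def build_edges(matches):
    """Turn (file_a, file_b, shared_fields) tuples into weighted edge rows."""
    return [
        {
            "from": a,
            "to": b,
            "weight": len(shared),                # number of matching fields
            "fields": ";".join(sorted(shared)),   # which fields matched
        }
        for a, b, shared in matches
    ]


def clusters(edges):
    """Group files into connected components of the similarity graph."""
    neighbours = defaultdict(set)
    for edge in edges:
        neighbours[edge["from"]].add(edge["to"])
        neighbours[edge["to"]].add(edge["from"])
    seen, groups = set(), []
    for start in sorted(neighbours):
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:  # depth-first walk from the first unseen file
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(neighbours[node] - group)
        seen |= group
        groups.append(sorted(group))
    return groups


def edges_to_csv(edges):
    """Serialize the edge list as CSV text for import into a graph tool."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["from", "to", "weight", "fields"])
    writer.writeheader()
    writer.writerows(edges)
    return buf.getvalue()
```

Each group returned by clusters is a set of files linked directly or indirectly by shared metadata, which gives a quick textual listing alongside the visual graph.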