If an attacker were able to get into your email what PII could they gather? Your email and address are probably in many emails but how about your social security number?
Let’s find out.
Extract the Zip.
There should be a .mbox
file (or several) in there which is a standard file format of email text concatenated together. PII Crawler understands this format and will scan all emails and attachments separately within.
Create an exact-match.json
file to help PII Crawler find your social. It can do it without this but it helps in certain circumstances.
{
"me": ["whitcher", "078-05-1120"],
"me2": ["whitcher", "078051120"]
}
Note: This is no longer a real SSN but there is an interesting story behind it.
./piicrawler scanfile ~/Downloads/gmail-export/All\ mail\ Including\ Spam\ and\ Trash-002.mbox > results.txt
Results in results.txt
:
... (many results) ...
{
"path": "Loan documents from ******* MORTGAGE CO.::LoanDocs.pdf",
"mime_type": "application/pdf",
"csz_clusters": 12,
"unique_csz_clusters": 2,
"unique_common_first_names": 2,
"unique_common_last_names": 10,
"potential_tax_ids_or_ssns": 2,
"unique_addresses": 1,
"matches": {
"address": [
"**** W 2nd Street"
],
"csz": [
"Kalispell MT 59901",
"*** MT *****"
],
"email": null,
"ssn": [
"***-**-****",
"***-**-****"
]
},
"parent_path": "/home/me/Downloads/gmail-export/All mail Including Spam and Trash-002.mbox",
"exact_matches": 1
},
...
The path format for this file is <subject>::<file>
. From these results you can then go to Gmail, search for those subject lines and delete the emails. Be sure to also clear the trash folder.
💌 Get notified on new features and updates