Mbox
From Christoph's Personal Wiki
Mbox is a generic term for a family of related file formats used for holding collections of email messages. All messages in an mbox mailbox are concatenated and stored as plain text in a single file. Each message starts with the four characters "From" followed by a space (the so named "From_ line") and the sender's email address. RFC 4155 defines that a UTC timestamp follows after another separating space character.
Working with mbox files
- Create an example mbox file using Python:
import mailbox
import email.utils
from_addr = email.utils.formataddr(('Author', 'author@example.com'))
to_addr = email.utils.formataddr(('Recipient', 'recipient@example.com'))
mbox = mailbox.mbox('example.mbox')
mbox.lock()
try:
msg = mailbox.mboxMessage()
msg.set_unixfrom('author Thus Feb 16 17:10:02 2017')
msg['From'] = from_addr
msg['To'] = to_addr
msg['Subject'] = 'Sample message 1'
body = "This is the body.\n"
body += "From (should be escaped).\n"
body += "There are 3 lines.\n"
msg.set_payload(body)
mbox.add(msg)
mbox.flush()
msg = mailbox.mboxMessage()
msg.set_unixfrom('author')
msg['From'] = from_addr
msg['To'] = to_addr
msg['Subject'] = 'Sample message 2'
msg.set_payload('This is the second body.\n')
mbox.add(msg)
mbox.flush()
finally:
mbox.unlock()
print open('example.mbox', 'r').read()
Run the above Python script:
$ python mbox_create.py
From MAILER-DAEMON Thu Feb 16 17:15:22 2017 From: Author <author@example.com> To: Recipient <recipient@example.com> Subject: Sample message 1 This is the body. >From (should be escaped). There are 3 lines. From MAILER-DAEMON Thu Feb 16 17:15:22 2017 From: Author <author@example.com> To: Recipient <recipient@example.com> Subject: Sample message 2 This is the second body.
- Print out all of the Subject fields:
import mailbox
mbox = mailbox.mbox('example.mbox')
for message in mbox:
print message['subject']