Parser Cleaning
MailAtlas exposes a ParserConfig for high-noise email workflows.
Available controls
Section titled “Available controls”strip_forwarded_headersstrip_boilerplatestrip_link_only_linesstop_at_footerstrip_invisible_charsnormalize_whitespace
Example
Section titled “Example”from mailatlas import ParserConfig, parse_eml
document = parse_eml( "data/fixtures/atlas-founder-forward.eml", parser_config=ParserConfig( strip_forwarded_headers=True, strip_boilerplate=True, stop_at_footer=True, ),)Why it matters
Section titled “Why it matters”These controls make it easier to benchmark parser behavior on synthetic fixtures and adapt the output for different downstream use cases:
- preserve wrappers for forensic workflows
- remove boilerplate for cleaner exports and retrieval chunks
- stop at footers for cleaner RAG chunks