Click here to Skip to main content
Rate this: bad
good
Please Sign up or sign in to vote.
See more: Algorithms
I have an application that pulls messages from a POP3/IMAP server and processes the content into a ticket system. It works beautifully.
 
I would love to be able to identify and remove quoted text and e-mail signatures. I'd consider myself a regex expert but this is one case in which I don't want to reinvent the wheel if I don't have to.
 
If anyone has seen code to do this, please share.
 
Some e-mail clients do this with various degrees of success. If you've seen one that works well and is open-source, please let me know.
 
In the form of a question for Sergey: Is there an algorithm to identify the signature or quoted part of an e-mail? or; How can I identify the signtaure or quoted part of an e-mail?
Posted 26-Feb-13 11:26am
Edited 26-Feb-13 15:13pm
v4
Comments
Sergey Alexandrovich Kryukov at 26-Feb-13 22:02pm
   
Thank you for the form of question specially for me, I really appreciate it. But please understand: we are overwhelmed by non-questions these days. You are the one who is interested the most in making a distinct form of a question. At least your question won't be removed immediately by accident.
 
Thank you for understanding...
—SA
Sergey Alexandrovich Kryukov at 26-Feb-13 22:07pm
   
Now, what do you mean by both? You don't mean digital signatures, do you?
What is the "quoted pair", exactly? If I put some clause in quotation mark, will it be a "quoted pair".
 
I'm afraid, no matter what algorithm you are going to invent, there is no a reliable solution, by one simple reason: there is nearly nothing in the e-mail content standards. Only headers, sections, and remaining content, without any further structure. Whatever criteria you apply, there will be false negatives and false positives.
 
In other words, the problem is not the algorithms, but lack of information...
 
Do I miss anything?
—SA

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



Advertise | Privacy | Mobile
Web02 | 2.8.1411022.1 | Last Updated 26 Feb 2013
Copyright © CodeProject, 1999-2014
All Rights Reserved. Terms of Service
Layout: fixed | fluid

CodeProject, 503-250 Ferrand Drive Toronto Ontario, M3C 3G8 Canada +1 416-849-8900 x 100