[clug-talk] searching attachments
robinsonkd at shaw.ca
Wed Jan 4 20:39:58 PST 2006
the problem is separating the document from the e-mail. Once they are
separated I can use grep. grep works on a Word doc. I have a maildir folder
with one file per e-mail/attachment(s). Hundreds of e-mails in this folder. I
need to be able to search them all.
I think Nick hit the nail on the head that my problem isn't the binary Word
format but the MIME encoding.
On January 4, 2006 09:17 pm, Marcel Lecker wrote:
> Hi Kieth,
> Give 'antiword' a try.
> antiword -t document.doc
> ...will convert it to text on a console. From there pipe it, save it,
> search it, ...plenty of options.
> On Wed, 2006-01-04 at 13:17 -0700, Keith Robinson wrote:
> > is "tree" the path to my Mail folder? (btw, I understood why the tools
> > weren't working)
> > --
> > Keith
> > On January 4, 2006 12:19 pm, Andrew J. Kopciuch wrote:
> > > > My set up is kmail/ Maildir. AFAIK attachments are kept with the
> > > > mail in Kmail, instead of separately. But when I use my normal
> > > > search tools -- grep, Konqueror Tools, I can not find known strings.
> > >
> > > That's because word documents are binary data.
> > >
> > > You could try using strings on each file, and piping the ouput through
> > > grep.
> > >
> > > for f in `tree -if`; do echo $f; strings $f | grep whatIAmLookingFor;
> > > done;
> > >
> > > _______________________________________________
> > > clug-talk mailing list
> > > clug-talk at clug.ca
> > > http://clug.ca/mailman/listinfo/clug-talk_clug.ca
> > > Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
> > > **Please remove these lines when replying
> > _______________________________________________
> > clug-talk mailing list
> > clug-talk at clug.ca
> > http://clug.ca/mailman/listinfo/clug-talk_clug.ca
> > Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
> > **Please remove these lines when replying
> clug-talk mailing list
> clug-talk at clug.ca
> Mailing List Guidelines (http://clug.ca/ml_guidelines.php)
> **Please remove these lines when replying
More information about the clug-talk