Pages: 1
RSS
Extract URLS from body
 
Is there any way to extract all URLs (or better yet, domain names) from the body of an email and write them to a file?

I probably have to set up a "export message" filter, but I cant figure out how to set up the template for this task.
 
Do the messages, you want to extract from, have a set format or do you want to extract domain names from random formatted messages?
__________________________________
I'm just a user of The Bat! I don't work for Ritlabs.
 
Random format. I'm trying to create a list of domains frequently used in spam emails so I can filter out any email linking to these domains.
 
I'd say that that can't be done with TB. A proper spam filter would be the way to go.
__________________________________
I'm just a user of The Bat! I don't work for Ritlabs.
 
you can simply export all the spam messages. Either add all (append) to one file, or create a new file for each.
After you collect a good amount of crap, use some regexp tool (powergrep or alike) to get a list of domain names contained in the exported files. After you have the list, delete exported files and start a new cycle :)
Pages: 1