How to extract email from websites online?

I have a list of about a million websites. How can I extract the bulk email address from their about or contact page automatically without writing the web scraping agent for each website? Let us see how we can scrape online emails in bulk through email extractor software.

Here is basic approach to extract email from website:

To extract email addresses, use this REGEX option with this expression: ([\w.-]+@(?=[a-z\d][^.]*\.)[a-z\d.-]*[^.])

This will find and scrape all valid emails on any website URL you crawl, here is my test on Rubular site - Rubular: (^[\w.-]+@(?=[a-z\d][^.]*\.)[a-z\d.-]*[^.]$) with this test string which extracted all 6 valid emails.

email@domain.com
my.email@domain.com
my_email@domain.com
first_middle_last@domain.com

If you want to send an email, please email us at info@domain.com

Contact us for business inquiry on business@domain.com

Github link with example HTML - https://agenty.github.io/Agenty.TestData/forum/forum-33.html

Then create an agent to extract emails online (you may clone any sample agent from here) and follow the steps:

  1. Go to Configuration tab
  2. Add a field and change the Type : REGEX
  3. Enter your regex expression
  4. Add ‘1’ as the Group Name

  1. Save it and enter(or upload) the URL to extract emails from all the websites.

  1. After the email scraping is complete, now you can download the result in CSV file or use the API to retrieve the data programmatically.

Signup now to get 100 pages credit free

14 days free trial, no credit card required!