 |
1. To quickly start mining for web data create a new
project-> Menu -> File -> Manage project. |
 |
2. Click on Add a project to give your project a name. |
 |
2a. Manage projects lets you remove, add, delete or open a
project. |
 |
3. Click on the Automate Icon. |
 |
4. Type in a Search Phrase (just like typing search terms
into Google). |
 |
5. Set "watch Output" to No --> It will work much faster. |
 |
6. Set Extraction Type. |
 |
7. Click Retrieve. |
 |
Edge helps you mine for data with the professional's tool of
choice for extracting & managing all your data mining tasks. |
 |
Easy access and navigation of most popular features
available from the large square icons on the tool bar. |
 |
Save your resulting export lists either as CVS or txt files. |
 |
Save any file or web page to local machine. This provides
the option to snapshot an active page for a fast loading page
that does not overload your database. |
 |
Open icon lets you quickly open your lists or folders. |
 |
Open List->opens a text file that contains a list of items
(i.e., emails, fax #'s, URLs etc.) and populates the listbox in
the main screen. |
 |
Export to list of text or csv files using
Menu->File->Export
Many Extraction Modes:Email Addresses, Meta Keywords, Links
(URLs), Images, IP Addresses, Fax #'s, Phone #'s, & much more
using Custom Extraction & Expressions. |
 |
Select Mode ->Email Addresses.
This will browse to destination pages and extract emails (see
Extraction Mode). |
 |
Google Extractor steps through the page result listings of
Google.com and retrieves the highest ranked listings for a
particular keyword provided.
To set your Extraction Mode: Click on the Extract button.
There are two options:
1. Manually - the software extracts content to the listbox only
when the extract button is pushed. This mode fits a more
selective type of extraction.
2. Automatically - the software extracts content to the listbox
whenever a new page is loaded. No need to click the extract
button. |
 |
Save all web site images to your local machine. |
 |
Extraction Options:
You may restrict to one email per page. Check this option when
you use automatic modes or Google extraction. This option
enables you a way to avoid spamming by picking up the first
email encountered from each site. This option also increases
Google extraction speed performance. |
 |
Script Builder
Make your extraction a daily routine with Script Extraction.
Various tasks can be written to a file, which can be loaded and
ran systematically. Extraction output can be saved to files
automatically. |
 |
Extraction Types -select output mode (i.e.. emails,
fax #'s, URLs, images, etc.)
Set the number of listings - number of websites to be queried.
Suggested settings: Restrict extraction to one email per page. |
 |
Tip: Activate the Popup Blocker for smoothest
results. Tools -> Popup Blocker - run this utility while using
the various extraction tools to block unwanted popups that will
slow your extraction down. |
 |
Watch Output - Yes: the software browses the sites
while extracting, so you can see where it goes in real time. No:
the software does not display sites visited in the browser, but
works much faster. This option is recommended for speedier
extraction. |
 |
PageRank Network
Search for selected sites listed in our database by keywords,
users and PR qualifications. Add your own URLs to increase
exposure to your sites. |
 |
Filters
- Filter out pattern-specific items from the listbox.
Right-click the listbox and click Filter.
- Starts With - filter items that start with a particular
substring.
- Contains Substring - filter items that contain a particular
substring.
- Ends With - filter items that end with a particular substring.
- Use this option to filter out emails that start with
'webmaster' contain '.edu' or end with '.org' |
 |
Exclusion List
When listbox is filled with email addresses right-click the
listbox and select exclusion list. You can then choose a text
file from you local machine that contains an email exclusion
list. When you've chosen 'open' the emails in your exclusion
list will be removed from the listbox. |
 |
Sort
Sorts incoming items into the listbox either alphabetically or
in First In order. |
 |
Save Page/Images As
Extract images and/or URLs to the listbox. Then right-click the
listbox and select 'Save Page/Images As'. The software will save
the pages and/or images that are selected to the local machine.
A preview of each item will appear, and the user is prompted
with the save as dialog box. |