WWWGrab 1.33
WWWGrab is a web page data extraction and database generation tool, or "web scraper". It scans URL lists in a database, fetches the web pages and parses them with DTBuild transformations. It runs sequences of URL scans and SQL database operations.
Last update
9 Apr. 2009
Licence
Free to try | $100.00
OS Support
Windows
Downloads
Total: 476 | Last week: 2
Ranking
#635 in Internet Tools
Publisher
Dtutilities.com
WWWGrab Publisher's Description
WWWGrab is a web page data extraction and database generation tool, or "web scraper". It scans URL lists in a database, fetches the listed web pages and parses them with the DTBuild data transformation engine. WWWGrab can run sequences of URL scans and SQL database operations, allowing for multiple passes over data generated "on the fly" (at run time).
WWWGrab parsers are created with the DTBuild data transformation workshop. At run time WWWGrab gets a web page and sends it to the DTBuild engine, which transforms the web page with the specified parser.
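The fetch-then-parse step described above can be sketched roughly as follows. This is only an illustration in Python, not WWWGrab or DTBuild code: `LinkParser`, `fetch_page`, `parse_links` and `scan_url` are hypothetical names, and the link-collecting parser merely stands in for a parser built in the DTBuild workshop.

```python
from html.parser import HTMLParser
from typing import Callable, List
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Stand-in for a DTBuild parser (assumption): collects href values of <a> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch_page(url: str) -> str:
    """Download a page and decode it as text (UTF-8 assumed for simplicity)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def parse_links(html: str) -> List[str]:
    """Run the stand-in parser over one page and return what it recognized."""
    parser = LinkParser()
    parser.feed(html)
    parser.close()
    return parser.links


def scan_url(url: str,
             fetch: Callable[[str], str] = fetch_page,
             parse: Callable[[str], List[str]] = parse_links) -> List[str]:
    """Get one web page, then hand it to the specified parser."""
    return parse(fetch(url))
```

Passing `fetch` and `parse` as arguments keeps the two stages separate, so a different parser can be swapped in without touching the download code, and the pipeline can be tried on canned HTML without network access.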
WWWGrab is controlled by a list of tasks specified in a database. There are two types of task:
1. scan a URL list,
2. execute an SQL list.
The user can combine any number of URL scans and SQL executions in a task list. For example, a task list could:
* scan an initial list of URLs,
* generate a new list of URLs,
* modify the generated URL list with SQL,
* scan the generated+modified URL list,
* generate another URL list,
* and so on.
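A task list of this kind might be modelled as in the sketch below. Everything in it is an assumption made for illustration: the table names (`tasks`, `urls`, `sql_lists`), their columns, and the `run_tasks` function are hypothetical and do not reflect WWWGrab's actual control tables. An in-memory SQLite database stands in for the real database, and a small dictionary stands in for fetching and parsing web pages.

```python
import sqlite3
from typing import Callable, List


def run_tasks(db: sqlite3.Connection, scan: Callable[[str], List[str]]) -> None:
    """Run every task in sequence order. `scan` maps one URL to the URLs found on it."""
    tasks = db.execute(
        "SELECT kind, source, target FROM tasks ORDER BY seq").fetchall()
    for kind, source, target in tasks:
        if kind == "scan":
            # Read the whole source list first, then store results in the target list.
            urls = [row[0] for row in db.execute(
                "SELECT url FROM urls WHERE list_name = ? ORDER BY rowid",
                (source,)).fetchall()]
            for url in urls:
                for found in scan(url):
                    db.execute(
                        "INSERT INTO urls (list_name, url) VALUES (?, ?)",
                        (target, found))
        elif kind == "sql":
            # Execute the statements of the named SQL list in order.
            statements = db.execute(
                "SELECT stmt FROM sql_lists WHERE list_name = ? ORDER BY seq",
                (source,)).fetchall()
            for (stmt,) in statements:
                db.execute(stmt)
        else:
            raise ValueError(f"unknown task kind: {kind!r}")
    db.commit()


def demo() -> List[str]:
    """Scan an initial list, prune the generated list with SQL, scan it again."""
    db = sqlite3.connect(":memory:")
    db.executescript("""
        CREATE TABLE tasks (seq INTEGER, kind TEXT, source TEXT, target TEXT);
        CREATE TABLE urls (list_name TEXT, url TEXT);
        CREATE TABLE sql_lists (list_name TEXT, seq INTEGER, stmt TEXT);
    """)
    db.execute("INSERT INTO urls VALUES (?, ?)",
               ("initial", "http://example.invalid/index"))
    db.execute("INSERT INTO sql_lists VALUES (?, ?, ?)", (
        "cleanup", 1,
        "DELETE FROM urls WHERE list_name = 'generated' AND url LIKE '%/skip'"))
    db.executemany("INSERT INTO tasks VALUES (?, ?, ?, ?)", [
        (1, "scan", "initial", "generated"),
        (2, "sql", "cleanup", None),
        (3, "scan", "generated", "second_pass"),
    ])
    # Fake "web site": which URLs a scan of each page would produce.
    site = {
        "http://example.invalid/index": [
            "http://example.invalid/a", "http://example.invalid/skip"],
        "http://example.invalid/a": ["http://example.invalid/a/detail"],
    }
    run_tasks(db, lambda url: site.get(url, []))
    return [row[0] for row in db.execute(
        "SELECT url FROM urls WHERE list_name = 'second_pass' ORDER BY rowid")]
```

In this sketch the second scan only sees the URLs that survived the SQL step, which is the "multiple passes over data generated at run time" idea in miniature.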
The combined flexibility of WWWGrab and DTBuild enables a wide variety of web data transformation tasks. Consult DTBuild help for more information.
WWWGrab / DTBuild features:
* Recursive capabilities (enabling parsing of nested HTML/XML tags, comments, etc.)
* Wide-string (Unicode) input / output capability
* ODBC interface that displays database layout info (table and field names) to the user
* ODBC interface allowing construction of SQL statements with a combination of user-defined data and recognized data
* Trace mode to show correspondence between input and nodes (for debugging)
* User-defined function interface allowing execution of custom DLL code ...
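As a rough illustration of building an SQL statement from a mix of user-defined data and recognized data, consider the sketch below. It is an assumption-laden example, not the DTBuild ODBC interface: `build_insert`, the `quotes` table and its columns are hypothetical, and Python's built-in `sqlite3` module stands in for an ODBC connection.

```python
import sqlite3
from typing import Dict, List, Tuple


def build_insert(table: str,
                 user_data: Dict[str, str],
                 recognized: Dict[str, str]) -> Tuple[str, List[str]]:
    """Return (sql, params) for an INSERT mixing fixed and parsed values."""
    row = {**user_data, **recognized}  # recognized values win on a name clash
    for name in [table, *row]:
        # Table and column names cannot be bound as parameters, so validate them.
        if not name.isidentifier():
            raise ValueError(f"unsafe identifier: {name!r}")
    columns = ", ".join(row)
    placeholders = ", ".join("?" for _ in row)
    sql = f"INSERT INTO {table} ({columns}) VALUES ({placeholders})"
    return sql, list(row.values())


def store(db: sqlite3.Connection, table: str,
          user_data: Dict[str, str], recognized: Dict[str, str]) -> None:
    """Build the statement and execute it with bound parameters."""
    sql, params = build_insert(table, user_data, recognized)
    db.execute(sql, params)
```

Here `user_data` plays the role of constants supplied by the person configuring the run (for example a source label), while `recognized` holds values a parser extracted from a page. Binding the values as parameters avoids quoting problems when page text contains apostrophes.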
Configuration assistance is available.
What's New in Version 1.33 of WWWGrab
Control table indexing fix. Enhancement to Quotes sample.