|
|
webbase |
|
|
5.1 |
|
|
Loic Dachary |
|
|
Free |
|
|
C and C++ / Searching |
|
|
Click to Visit |
|
|
Click to Download |
|
|
19 |
webbase is an internet web crawler written in C and later ported to C++. It uses a MySQL database to store information about crawled URLs. It is available as a command line program or as a library (shared or static). It has two main functions: crawl the WEB to get documents and build a full text database with these documents. The crawler part visits the documents and stores intersting information about them locally. It visits the document on a regular basis to make sure that it is still there and updates it if it changes. The full text database uses the local copies of the document to build a searchable index. The full text indexing functions are not included in webbase.
|
| Top C and C++ scripts |
1).
Larbin Larbin is a web crawler (also called (web) robot, spider, scooter, etc).
2).
DataparkSearch Engine DataparkSearch Engine is a browser based search engine software and is written in C that helps users to search keywords on their websites and organize them.
3).
Namazu Namazu is a full-text search engine intended for easy use.
4).
dtSearch Desktop with Spider dtSearch Desktop with Spider is a simple and an easy to use searching program that helps users to search any text on their systems instantly. This program supports all file formats.
5).
dtSearch Publish Publishes an instantly searchable database to CD/DVD, effectively adding dtSearch "powerful Web-based engines" (eWEEK) to a CD/DVD. Has a dozen indexed & fielded data search options. Highlights hits in HTML, XML & PDF, displaying links & images.
6).
Alkaline Search Engine All-in-one index and search server.
7).
SWISH-E SWISH-Enhanced is a fast, powerful, flexible, and easy to use system for indexing collections of Web pages or other text files.
|
|
| New C and C++ scripts |
1).
ht://Dig The ht://Dig system is a complete world wide web indexing and searching system for a small domain or intranet.
2).
dtSearch Desktop with Spider dtSearch Desktop with Spider is a simple and an easy to use searching program that helps users to search any text on their systems instantly. This program supports all file formats.
3).
Larbin Larbin is a web crawler (also called (web) robot, spider, scooter, etc).
4).
DataparkSearch Engine DataparkSearch Engine is a browser based search engine software and is written in C that helps users to search keywords on their websites and organize them.
5).
webbase webbase is an internet web crawler written in C and later ported to C++.
6).
SWISH-E SWISH-Enhanced is a fast, powerful, flexible, and easy to use system for indexing collections of Web pages or other text files.
7).
Glimpse Glimpse (which stands for GLobal IMPlicit SEarch) is a popular UNIX indexing and query system that allows you to search through a large set of files very quickly.
|
|