| harvest | 1.7.28 | Kang-Jin Lee | Free | Unix | C and C++ / Searching |
Harvest is a system that collects information and makes it searchable through a web interface. Harvest can collect information from the Internet and intranets using HTTP, FTP, and NNTP, as well as from local files such as data on hard disks, CD-ROMs, and file servers. In addition to HTML, the currently supported formats include DVI, PostScript, full text, mail, man pages, news, troff, WordPerfect, C sources, and many more. Adding support for a new format is easy thanks to Harvest's modular design.
| Top C and C++ scripts |
1). Larbin: Larbin is a web crawler (also called a web robot, spider, scooter, etc.).
2). DataparkSearch Engine: DataparkSearch Engine is browser-based search engine software, written in C, that helps users search for keywords on their websites and organize them.
3). Namazu: Namazu is a full-text search engine intended for easy use.
4). dtSearch Desktop with Spider: dtSearch Desktop with Spider is a simple, easy-to-use search program that helps users instantly search any text on their systems. This program supports all file formats.
5). dtSearch Publish: Publishes an instantly searchable database to CD/DVD, effectively adding dtSearch "powerful Web-based engines" (eWEEK) to a CD/DVD. It has a dozen indexed and fielded data search options and highlights hits in HTML, XML, and PDF, displaying links and images.
6). Alkaline Search Engine: All-in-one index and search server.
7). SWISH-E: SWISH-Enhanced is a fast, powerful, flexible, and easy-to-use system for indexing collections of Web pages or other text files.
| New C and C++ scripts |
1). ht://Dig: The ht://Dig system is a complete World Wide Web indexing and searching system for a small domain or intranet.
2). dtSearch Desktop with Spider: dtSearch Desktop with Spider is a simple, easy-to-use search program that helps users instantly search any text on their systems. This program supports all file formats.
3). Larbin: Larbin is a web crawler (also called a web robot, spider, scooter, etc.).
4). DataparkSearch Engine: DataparkSearch Engine is browser-based search engine software, written in C, that helps users search for keywords on their websites and organize them.
5). webbase: webbase is an Internet web crawler written in C and later ported to C++.
6). SWISH-E: SWISH-Enhanced is a fast, powerful, flexible, and easy-to-use system for indexing collections of Web pages or other text files.
7). Glimpse: Glimpse (which stands for GLobal IMPlicit SEarch) is a popular UNIX indexing and query system that allows you to search through a large set of files very quickly.