cattaSearch backend consists of 3 parts:
In the first backend part you define the websites to be searched:
You can define as many websites you like. The search data are stored in a separate database table for each website.
A defined website in cattaSearch can also be just a part of your actual website. Then you also have to enter the web path. An example: If you have a photo part of your website, starting with the directory /photos from the root of your website, and you want to create a specific search for this part of your website, then enter '/photos' as your web path.
You can change already defined websites - and delete them when they are no longer needed.
In the second backend part you do the actual crawl for a defined website. What it does is:
You should crawl your website every time you make changes to it. In every crawl all the old search data are deleted and replaced with new search data.
The following document formats are currently supported for text search by cattaSearch:
Please note that the package needed for extracting text from PDF documents, pdftotext, is not included in the cattaSearch download file. It is specific to your operating system. Ie you have to download and install it yourself.
The third and last backend part of cattaSearch is the websites crawl log where you can see all your crawls with date/time, crawl time in seconds and the number of files found.