| Page Item | Value |
| Title | The Index Server Companion |
| Description | A look at the Index Server Companion - a product that allows Microsoft Index Server to index content from remote websites and ODBC databases. |
| Keywords | index server, site server, search, searching, unix, remote, database, odbc |
| Robots Meta Tag | |
| Page Content | The Index Server Companion This article describes the Index Server Companion, a Windows application I have created that allows Microsoft Index Server to index content from remote websites and ODBC databases. The Problem Index Server is a great product! On the administrative side of things, it is easy to install, performance is good, and once installed maintenance tasks are minimal. The development of search applications using ASP is also made fairly straightforward through the use of the Query and Utility server components. The main limitation of Index Server is that it can really only be used to index content hosted on servers on the same machine or network as the machine hosting the Index Server service. Although it is possible to set up a share to a Unix/Linux Apache webserver using a file sharing solution such as SAMBA, this isn't always satisfactory because Index Server is not case sensitive with respect to filenames, so this can cause problems when displaying search results. Another issue is that it can be a chore to prevent Index Server from indexing certain content on a server. Unlike a web robot, it has no concept of the Robots Exclusion Standard specification (i.e. robots.txt files) and is unaffected by the 'robots' meta tag. The Solution Retrieving and indexing content from a web server by use of a web robot is the solution. The web robot is able to mimic a web browser, starting at one page in the site and traversing the links in the site until it has retrieved all of the pages of the site. The robot will potentially be able to retrieve content from any webserver, regardless of the platform it is hosted on. Two products that allow you to do this are Microsoft's Site Server 3.0 and the author's own Index Server Companion. 
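The crawl described under The Solution (start at one page, follow links, respect robots.txt) can be sketched in a few lines of Python. This is only an illustration of the general technique, not the Index Server Companion's own code: the `crawl` function, the `LinkExtractor` class, the `example-robot` agent name and the caller-supplied `fetch` function are all hypothetical names chosen for this sketch.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit
from urllib.robotparser import RobotFileParser


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def crawl(start_url, fetch, robots_txt="", agent="example-robot"):
    """Breadth-first crawl of one site; returns the URLs retrieved, in order.

    fetch(url) is supplied by the caller and returns the page HTML,
    or None when the page cannot be retrieved.
    """
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    site = urlsplit(start_url).netloc
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    while queue:
        url = queue.popleft()
        if not rules.can_fetch(agent, url):
            continue  # excluded by robots.txt
        html = fetch(url)
        if html is None:
            continue
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop #fragments.
            absolute = urldefrag(urljoin(url, href)).url
            if urlsplit(absolute).netloc == site and absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return visited
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP library; a real robot would also need to honour the 'robots' meta tag and handle frames, which are left out here.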
Microsoft Site Server 3.0 Microsoft's Site Server 3.0 software suite has a Search application that enhances Index Server by allowing you to (amongst other things) retrieve and index content from remote websites using an integrated web robot. For an overview of Site Server 3.0 Search, take a look at an article I wrote for ariadne.ac.uk. Unfortunately Site Server 3.0 Search has a few shortcomings, including: Site Server 3.0 isn't the easiest of applications to install. The product wasn't really designed for Windows 2000 Server. It doesn't appear that the product is still in active development. It isn't very useful if your websites are hosted by a third party, and they don't have Site Server 3.0 installed. Site Server 3.0 costs a lot of money, which cannot always be justified if you only want to use the Search application of the software suite. Index Server Companion The Index Server Companion is a cost-effective method of retrieving content from remote webservers for Index Server to index. It also allows retrieval of content from ODBC databases, which can subsequently be indexed by Index Server. Features The main features of the Index Server Companion are: Enables Index Server to allow searching of potentially any web server or ODBC compliant database. Integrated web robot extracts content from websites. Includes support for robots.txt files and robots meta tags. Robot can negotiate sites using HTML Frames. Optional mode allows QueryStrings to be treated as distinct URLs (e.g. treat http://www.aspalliance.com/brettb/WebJobMarket.asp?Skill=ASP as being a distinct URL from http://www.aspalliance.com/brettb/WebJobMarket.asp?Skill=JSP). Ability to retrieve binary files from servers, including Adobe Acrobat PDF, Microsoft Office documents and even images. Support for full or incremental project updates of both web and database content, meaning that Index Server only has to re-index content that has changed. 
Configuration of the Index Server Companion is through the editing of a plain text configuration file. Index Server Companion can be run from the command line, and scheduled using the Windows Task Scheduler. Full reporting of activity to an external plain text log file. Flexible output options mean that administrative access to Index Server is not necessarily required. Facility for creating a basic table of contents page for the sites that are crawled [ sample from my own website , sample from myCDE.com , sample from Lachesis.biz ]. Facility for creating a site summary page that can be used to optimise sites for search engines, by showing what keywords are used on a site, the title tags used by each page etc. The site summary page shows: The top 40 most used keywords listed in the keywords meta tags in all of the pages crawled. The body of the summary page contains the following from the pages encountered in the web crawl: A list of page titles. A list of page headings (H1 to H3 inclusive). A list of bold (and strong) text. A list of external hyperlinks in the site. A list of italicised text. A selection of relevant paragraph text. Paragraphs are assumed to be relevant if they contain one or more of the 40 most frequently used keywords. A list of page description meta tags. A list of page keywords. A list of page description meta tags hyperlinked back to the URL from which each description was extracted. A list of page keywords hyperlinked back to the URL from which it was extracted. Fully documented VBScript examples show how to make use of the Index Server Companion in ASP pages. Detailed documentation in Microsoft's HTML Help format. Fully documented Perl source code. Access to product updates and technical support. 
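The optional QueryString mode in the feature list comes down to how a robot decides whether two URLs are the same page. The sketch below shows one plausible way to do that in Python; `crawl_key` and `unique_urls` are hypothetical helper names, and the product's actual rules may differ.

```python
from urllib.parse import urlsplit, urlunsplit


def crawl_key(url, distinct_query_strings):
    """Return the key used to decide whether a URL has already been seen."""
    parts = urlsplit(url)
    # Keep the query string only when the optional mode is switched on.
    query = parts.query if distinct_query_strings else ""
    # Fragments are never sent to the server, so drop them in both modes.
    return urlunsplit((parts.scheme, parts.netloc.lower(), parts.path, query, ""))


def unique_urls(urls, distinct_query_strings):
    """Keep the first URL seen for each crawl key, preserving order."""
    first_seen = {}
    for url in urls:
        first_seen.setdefault(crawl_key(url, distinct_query_strings), url)
    return list(first_seen.values())
```

With the mode on, the two WebJobMarket.asp URLs from the feature list are kept as two pages; with it off, they collapse to one.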
`<head> <meta name="ISC_title_id" content="MC2222"> <meta name="ISC_title" content="Silicon Valley Gastronomic Treats"> <meta name="ISC_type" content="mod_cook"> <meta name="ISC_price" content="19.99"> <meta name="ISC_pubdate" content="6/9/1991 12:00:00 AM"> <meta name="ISC_notes" content="Favorite recipes for quick, easy, and elegant meals."> <meta name="description" content="Favorite recipes for quick, easy, and elegant meals."> </head> <title>Silicon Valley Gastronomic Treats</title> <body> </body> </html>` In this example, the title field is optionally used to give the page a title, and the notes field is used for the description meta tag. Each of the custom ISC_ prefixed meta tags can be queried using Index Server, although to retrieve their contents a minor configuration change to Index Server is required. It is straightforward to create a page which for example, will return the records where the value of the ISC_type meta tag is mod_cook. The Index Server Companion can also modify the HTML's title tag to include the table name and row ID, e.g.: `<title>ISC_Table=titles ISC_KeyField=title_id ISC_RowNumber=MC2222 Silicon Valley Gastronomic Treats</title>` Summary The Index Server Companion allows Microsoft Index Server to index content from remote websites and ODBC databases, making it a cost effective way of significantly extending the functionality of Index Server. Comments/Suggestions? I've released the Index Server Companion in the hope that other users may find it useful! I'd love to hear what you think of it. Is it useful? What new features do you Downloads Index Server Companion Evaluation Version (1.1Mb zip file). Index Server Companion Documentation (121K zip file). Purchase the Index Server Companion ($39.99). [ ...more details ] Special offers on both Index Server Companion and the ASP Documentation Tool. Download this article in Adobe Acrobat PDF format. Download this article in Microsoft Word 97 format. 
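When the title tag carries the table name and row ID as in the example earlier in this article, a search results page can split that prefix off to link a hit back to its database row. The Python sketch below assumes the layout is exactly the one shown in that single example (three space-separated `ISC_` fields followed by the title text); `parse_isc_title` is a hypothetical helper, not part of the product.

```python
import re

# Assumed layout, taken from the one example in the article:
# ISC_Table=<table> ISC_KeyField=<field> ISC_RowNumber=<id> <title text>
_ISC_TITLE = re.compile(
    r"^ISC_Table=(\S+) ISC_KeyField=(\S+) ISC_RowNumber=(\S+) (.*)$"
)


def parse_isc_title(title):
    """Split an ISC-prefixed title into its parts, or return None if no prefix."""
    match = _ISC_TITLE.match(title.strip())
    if match is None:
        return None
    table, key_field, row_id, text = match.groups()
    return {"table": table, "key_field": key_field, "row_id": row_id, "title": text}
```

A results page could then show only the `title` part to the user and use `table`, `key_field` and `row_id` to build a link to the underlying record.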
Further information A more detailed version of this article was published in Ariadne.ac.uk . The Microsoft Site Server Search Facility . Searching Index Server With ASP . Introductory guide to using Index Server from ASP. More about Searching Index Server With ASP . More advice and source code. Need assistance with your Index Server application development? I'm available for hire at reasonable rates . Useful Development Tools ASP Documentation Tool Automatically creates developer documentation for ASP 2.0 and 3.0 web applications written in VBScript and JScript. Documentation for Microsoft Access, SQL Server 7/2000 databases and Visual Basic 6.0 components associated with the web application can also be incorporated into the reports. Documentation is created in HTML, HTML Help and plain text formats. View Sample Output (HTML Help format). View Sample Output (HTML Format). Download Trial Version (5.2Mb ZIP file). Index Server Companion The Index Server Companion is a Windows application that extends the functionality of Microsoft Index Server so that it is able to index content from remote websites and also from ODBC databases. As such it can be used as a low cost alternative to Site Server 3.0 Search. View Product Documentation (119K ZIP file). Try Sample Search Facility . Download Trial Version (1.7Mb ZIP file). ASP.NET Documentation Tool Automatically creates developer documentation for ASP.NET web applications written in C# or VB.NET. Documentation for SQL Server 7/2000 databases and C#/VB.NET components associated with the web application can also be incorporated into the reports. Documentation is created in HTML, HTML Help and plain text formats. View Sample Output (HTML Help format). View Sample Output (HTML Format). Download Trial Version (727K ZIP file). SQL Documentation Tool The SQL Documentation Tool creates technical documentation for Microsoft SQL Server 7.0 and 2000 databases. Technical documentation is created in HTML and HTML Help formats. 
The HTML Help format documentation is fully searchable and cross referenced. The SQL Documentation Tool documents SQL Server Tables, Views, Stored Procedures, Triggers and Table Relationships. View Sample Output (HTML Help format). View Sample Output (HTML Format). Download Trial Version (10.3Mb ZIP file). The Website Utility The Website Utility examines websites for errors and areas that need to be optimised for search engines by using a built in web crawling engine. Errors checked for include broken or moved hyperlinks, missing page titles and missing meta tags. It also generates HTML for use in creating website site maps (table of contents pages - like this one ), and is able to create both client-side JavaScript Search Engines and server-side ASP Search Engines for a website. View Sample Output (HTML Format). Download Trial Version (3Mb ZIP file). Text Workbench Text Workbench is a file search and replacement utility for text files and Microsoft Office documents. Make rapid file replacements on multiple files and folders full of files. Advanced replacement options include regular expressions support. It even works on remote file systems via FTP. A Regular Expression Laboratory allows advanced pattern matching and replacement expressions to be built and tested. This great utility will make your everyday development tasks much easier! Download Trial Version (3Mb ZIP file; you have the option to either install directly from this link or save the file for later installation). Author details Brett Burridge spent two years working in the University of Essex Computing Service, before moving to The Internet Applications Group in the Autumn of 1999, where he developed e-Business applications for a range of corporate clients and dot-com start ups. Brett is presently employed as an Internet developer and technical writer through his own company, Winnersh Triangle Web Solutions Limited . 
The company produces a number of innovative products, including the popular ASP Documentation Tool, the Index Server Companion, the ASP.NET Documentation Tool, the SQL Server Documentation Tool and The Website Utility. The company is also available for web application design and development at reasonable rates, primarily using Microsoft technologies (ASP, ASP.NET, Visual Basic, SQL Server) but also using open source technologies such as PHP, MySQL and Perl. Specialist services include development of search solutions using Microsoft's Index Server and Site Server 3.0 Search. As well as the ASPAlliance, Brett has written articles for Ariadne.ac.uk and ASPToday, and has contributed recipes to the ASP.NET Developer's Cookbook. Outside web development, Brett is interested in digital photography (here's my photo gallery), tropical fishkeeping and collecting contemporary works of art by artists such as Doug Hyde. Article history The Index Server Companion published on ASPAlliance.com on 30 August 2002. Last revised 13 May 2003.
page content copyright Brett Burridge 1998 - 2004. |
| Image Alt Tags | ASPAlliance The Index Server Companion contains fully searchable documentation in Microsoft's HTML Help format Tell a friend about the Index Server Companion, and win a CD! View Sample Output (HTML Help format) View Sample Output (HTML Format) Download Trial Version View Product Documentation Try Sample Search Facility Download Trial Version View Sample Output (HTML Help format) View Sample Output (HTML Format) Download Trial Version View Sample Output (HTML Help format) View Sample Output (HTML Format) Download Trial Version View Sample Output (HTML Format) Download Trial Version Download Trial Version of Text Workbench Index Server Companion - allows Index Server to index content from remote websites and ODBC databases!!! Download a Free ASP Documentation Tool Now! Search Engine Builder - Build a search engine for your website! |
| Internal Links | http://authors.aspalliance.com/brettb/IndexServerCompanion.asp (5 links in this page) [ Robot View of this URL ] http://authors.aspalliance.com/brettb/Default.asp (3 links in this page) [ Robot View of this URL ] http://authors.aspalliance.com/brettb/ClassicASPArticles.asp (3 links in this page) [ Robot View of this URL ] http://authors.aspalliance.com/brettb/TableOfContents.asp (2 links in this page) [ Robot View of this URL ] http://authors.aspalliance.com/brettb/ASP.NetArticles.aspx [ Robot View of this URL ] http://authors.aspalliance.com/brettb/Tools.asp [ Robot View of this URL ] http://authors.aspalliance.com/brettb/Links.asp [ Robot View of this URL ] http://authors.aspalliance.com/brettb/What'sNew.aspx [ Robot View of this URL ] http://authors.aspalliance.com/brettb/SearchingIndexServerWithASP.asp [ Robot View of this URL ] http://authors.aspalliance.com/ [ Robot View of this URL ] http://authors.aspalliance.com/brettb/MoreIndexServerWithASP.asp [ Robot View of this URL ] |
Report generated by The Website Utility 2.8