When you think ASP, think...
Recent Articles
All Articles
ASP.NET Articles [1.x] [2.0]
ASPFAQs.com
Message Board
Related Web Technologies
User Tips!
Coding Tips
Search

Sections:
Book Reviews
Sample Chapters
Commonly Asked Message Board Questions
Headlines from ASPWire.com
JavaScript Tutorials
MSDN Communities Hub
Official Docs
Security
Stump the SQL Guru!
Web Hosts
XML Info
Information:
Advertise
Feedback
Author an Article
Technology Jobs

















internet.com
IT
Developer
Internet News
Small Business
Personal Technology
International

Search internet.com
Advertise
Corporate Info
Newsletters
Tech Jobs
E-mail Offers
ASP ASP.NET ASP FAQs Message Board Feedback ASP Jobs
Print this page.

Windows Systems Administrator
Jupitermedia
US-CT-Darien

Justtechjobs.com Post A Job | Post A Resume

Published: Tuesday, November 30, 1999

SyteKit - Part One
By Marc Draco


Sytekit is a suite of ASP routines that everyone can use no matter what their level of skill to improve their pages. Although Sytekit is being marketed as ASPExpressware (more of which, later) I'm presenting a large part of it on 4Guys for its educational value. Putting away my soapbox, it's off to first utility. This recursive engine forms the driving force behind Floyd, the site search engine part of Sytekit. You can see Floyd in action at www.nima99.co.uk.

- continued -

Floyd, presented in part 2 of this series (coming soon!), spiders an entire website - either locally or on the server - and maps it ready for high-speed searching just like a real-live search engine. At Floyd's heart is a version of this nifty little routine which locates its position on the server and scans all the folders and subfolders for objects of interest.

If you're new to programming you might not have come across recursion before. It's a well-understood and widely-used technique for examining trees and other binary structures. Confused? Don't worry. Most modern operating systems have something like Windows Explorer which shows the folder (directory) list as a tree. Each branch sub-divides into zero or more branches which eventually lead to directories. It's just like a family tree with great, great... grandaddy at the top and his descendants below. In fact, you can read a previously written article on 4Guys dealing with recursion: Recursion: Why It's Cool.

Figure 1 shows this in more detail. Folders (directories) are shown in green and files are shown as grey boxes; this also proves I cannot draw for toffee. Each folder can have zero or more descendants; it may even be completely empty like the one on the extreme right. It looks very complex and, in reality, it's usually a lot worse than this!

A graphical represenation of a directory hierarchy

Searching this lot iteratively (that's in a repeating loop) would be a nightmare to program, but using a divide-and-conquer approach it's a breeze. Here's what we do:

    1) Look in a directory and get the list of its contents.
    2) Repeat
    3) Display the name of the file/folder we find.
    4) If we find a directory, make a note of where we are and return to Step 1.
    5) Until we've run out of things to process.
    6) Return to where we were called from.

It's this last bit that confuses people when they first see it: because this is the essence of recursion. At step 6, the program "unwinds" back until it reaches the point just after it branched; Step 5. Eventually, the program runs out of things to process and "falls off" Step 6.

If all that left you feeling that you wished you hadn't started, read it again and you'll get it. When you get back here you understand recursion. I know that because if you follow that instruction implicitly, you're recursing through this article!

Dazed and Coded
OK, let's take a closer look at how this all works (note that the complete code is available for downloading). This:

Path = server.mappath(".")

Tells us where we are executing on the server. Don't be fooled into thinking that the directory you're executing in is the same one you FTP to, it probably isn't! (If you're using this on a desktop, you could set an absolute path like this: Path = "c:" and get a listing of your entire hard disk - at least, until the script times out. More of that later.)

Set Fso = CreateObject("Scripting.FileSystemObject")

How many times have you seen that one? Straight out of Microsoft's documentation it creates an instance of a FilingSystemObject which we'll need to access files and folders. If you're new to the FileSystemObject, there's a great FileSystemObject FAQ on 4Guys. We'll be using the FileSystemObject in ScanFolders like this:

Set FolderInfo = Fso.GetFolder(PathSpec)
Set FileList = FolderInfo.Files
For Each File in FileList

Ask the FSO we created earlier for details on the current folder... note that this is handed in to the function as a parameter -that's important. The list of files within the folder is determined from the Files method of the FolderInfo object. Next, we iterate through each file displaying its name (coloured according to extension for a bit of pazazz). For the sake of this example, I've kept it very simple. A more complex scan will be found in Floyd; Part Two of this series.

Finally the following piece of code performs the recursion

Set f = Fso.GetFolder(PathSpec) 
Set fc = f.SubFolders 
For Each Folder in fc 
   %><BLOCKQUOTE><%
   Call ScanFolders(PathSpec & "\" & Folder.Name)	
   %></BLOCKQUOTE><%
Next 

Use of blockquotes indents the text further each level. You may want to examine the HTML generated by this code to see how it all fits together. Recursion isn't so much tricky as one of those things that suddenly clicks into place, so if you don't grab it straight away, don't worry, you will in time.

Time for Bed
IIS only allows scripts to execute for specified amount of time before it considers they have got lost and stops them with a message like this:

The maximum amount of time for a script to execute was exceeded. You can change this limit by specifying a new value for the property Server.ScriptTimeOut or by changing the value in the IIS administration tools

While you can increase the timeout I don't recommend it; only very large sites (thousands of pages) will take that long to scan. This only really becomes an issue for Floyd, as we'll see in the next part. Until then, happy programming!


Attachments

  • Download the code in text format


    So What's ASPExpressware?
    I use David Wier's ASP Express to develop all my ASP code but it's Shareware and I'm running out of evaluation time. Being a programmer of more years than I care to recall, I believe that it's good karma to pay for tools we use. In order to do this on my limited income, I'm asking a small fee for Sytekit to cover the expense of registering ASP Express. Registration details to follow.


    Windows Internet Technology | ASP.NET [1.x] [2.0] | ASPMessageboard.com | ASPFAQs.com | Advertise | Feedback | Author an Article



  • JupiterOnlineMedia

    internet.comearthweb.comDevx.commediabistro.comGraphics.com

    Search:

    Jupitermedia Corporation has two divisions: Jupiterimages and JupiterOnlineMedia

    Jupitermedia Corporate Info


    Legal Notices, Licensing, Reprints, & Permissions, Privacy Policy.

    Advertise | Newsletters | Tech Jobs | Shopping | E-mail Offers

    Solutions
    Whitepapers and eBooks
    Microsoft Article: HyperV-The Killer Feature in WinServer ‘08
    Avaya Article: How to Feed Data into the Avaya Event Processor
    Microsoft Article: Install What You Need with Win Server ‘08
    HP eBook: Putting the Green into IT
    Whitepaper: HP Integrated Citrix XenServer for HP ProLiant Servers
    Intel Go Parallel Portal: Interview with C++ Guru Herb Sutter, Part 1
    Intel Go Parallel Portal: Interview with C++ Guru Herb Sutter, Part 2--The Future of Concurrency
    Avaya Article: Setting Up a SIP A/S Development Environment
    IBM Article: How Cool Is Your Data Center?
    Microsoft Article: Managing Virtual Machines with Microsoft System Center
    HP eBook: Storage Networking , Part 1
    Microsoft Article: Solving Data Center Complexity with Microsoft System Center Configuration Manager 2007
    MORE WHITEPAPERS, EBOOKS, AND ARTICLES
    Webcasts
    Intel Video: Are Multi-core Processors Here to Stay?
    On-Demand Webcast: Five Virtualization Trends to Watch
    HP Video: Page Cost Calculator
    Intel Video: APIs for Parallel Programming
    HP Webcast: Storage Is Changing Fast - Be Ready or Be Left Behind
    Microsoft Silverlight Video: Creating Fading Controls with Expression Design and Expression Blend 2
    MORE WEBCASTS, PODCASTS, AND VIDEOS
    Downloads and eKits
    Sun Download: Solaris 8 Migration Assistant
    Sybase Download: SQL Anywhere Developer Edition
    Red Gate Download: SQL Backup Pro and free DBA Best Practices eBook
    Red Gate Download: SQL Compare Pro 6
    Iron Speed Designer Application Generator
    MORE DOWNLOADS, EKITS, AND FREE TRIALS
    Tutorials and Demos
    How-to-Article: Preparing for Hyper-Threading Technology and Dual Core Technology
    eTouch PDF: Conquering the Tyranny of E-Mail and Word Processors
    IBM Article: Collaborating in the High-Performance Workplace
    HP Demo: StorageWorks EVA4400
    Intel Featured Algorhythm: Intel Threading Building Blocks--The Pipeline Class
    Microsoft How-to Article: Get Going with Silverlight and Windows Live
    MORE TUTORIALS, DEMOS AND STEP-BY-STEP GUIDES