Having search engine friendly sites these days is no longer a good idea...it's a MUST!

More and more SE's are ignoring the META tags we have all come to know and love and are concentrating more and more on the actual content of your page. But what happens if the page is called dynamically? Answer - The bot falls on its bot! Most SE bots will not follow a URL once it hits the ? or & which indicates the dynamic content and will grind to a halt. This tutorial will show you a method to make sure the SE bots really dig into your site and index every possible page!

The bones of the script below are very simple and lightning fast. In a nutshell the script interrogates the URL you pass it to determine where the URL ends and the dynamic content begins. Take this example:

http://www.yourdomain.com/page.cfm?Var1=2&Var2=2&Var3=3

If you rewrote the above to get rid of the ? and & and replace them all with / to make it look like folders you would get:

http://www.yourdomain.com/page.cfm/Var1/1/Var2/2/Var3/3

Marvellous!! We now have a URL that the SE bot will follow as it looks like a bunch of folders!

But how do we use this in our query? That?s where the script comes in. On the page where you are passing this new URL to i.e. page.cfm simply add the script BEFORE the queries and write your query as normal.

The script will basically scan through the URL looking for the extension, the .cfm in this case. Once it finds it, it then starts looking for another /. Now it knows that if it finds another one it?s a dynamic URL and it needs to start collecting the variables.

Obviously each variable name will have a value so it knows that after the very next / will be the value for the variable. It will then continue to decipher the rest of the URL until it runs out of /?s.

Your query will then accept those variables in the normal way and process.

Here?s the script:

<cfscript>
   debug = 0;
   valid_extensions = "html,htm,cfm,asp,jsp";
   url_suffix = ".html";
   path_to_parse = replacenocase(cgi.path_info, cgi.script_name, "");
   if (listlen(path_to_parse, "/") gte 2) {
      var_name = "";
      for (x = 1; x lte listlen(path_to_parse, "/"); x = x + 1) {
      if (var_name eq "") {
         var_name = trim(listgetat(path_to_parse, x, "/"));
         if (not refind("^[A-Za-z][A-Za-z0-9_]*$", var_name)) {
            var_name = "";
            x = x + 1;
         }
      }
      else {
           value_to_set = listgetat(path_to_parse, x, "/");
           if (trim(valid_extensions) neq "" and x eq listlen(path_to_parse, "/")) {
               for (ext = 1; ext lte listlen(valid_extensions); ext = ext + 1) {
                     extension = "." & listgetat(valid_extensions, ext);
                     if (right(value_to_set, len(extension)) eq extension) {
                          value_to_set = left(value_to_set, len(value_to_set) - len(extension));
                          url_suffix = extension;
                          break;
                     }
            }
      }
  setvariable(var_name, value_to_set);
  if (isdefined("debug") and debug) {
     writeoutput("<!-- " & var_name & " = " & value_to_set & " -->" & chr(10));
  }
  var_name = "";
  }
 }
}

</cfscript>

So where to use it? Well the easiest method is to use it as an include in each page that has your SE friendly URL passed to it. You could just add it to the Application.cfm file as well.

The down side is that SE bots can go a bit mental as happened to me last night! My normal page views are around 2-3k per day. Yesterday it shot up to 11.5K!! Why? Because the Inkotomi bot indexed every resource on the site (around 900) and all the associated pages!!

So there you go! SE bots will now trawl your site until they are full to the brim! For a working example, check out www.webmasteredge.com and just watch the URL?s as they are passed from page to page in the resources section.

Amendment: Some users have reported a bug with the above tutorial. If you copy and paste the script direct into your page you may find that the script will not work correctly and comes up with a compilation error.
If this happens, simply strip out the whitespace before each line of code in the script and this should fix it.

About This Tutorial
Author: Phil Williams
Skill Level: Intermediate 
 
 
 
Platforms Tested: CF4,CF5,CFMX
Total Views: 113,685
Submission Date: February 12, 2003
Last Update Date: June 05, 2009
All Tutorials By This Autor: 8
Discuss This Tutorial
  • This thing is AWESOME! It actually worked! You didn't even have to edit the code or take out the lines in front. But be sure to have the base href tag otherwise it might not work.

  • No worries! Glad you liked it...

  • Wow, this is great. I had some problems because I implemented the base tag incorrectly and then once i got that right everything worked perfect. I have it in my Application.cfm and have not had any bugs once I got the implementation correct. Thank you so much,

  • I see no reason why this shouldn't work but I've never tried! Best thing to do is give it a whirl...

  • Hi, Would you please tell me if I can use the same script in my jsp pages to use the URLs like the above to be search engine friendly. Thank you, Kaju

  • You shouldn't need to make any changes to the Apache config. The one thing I noticed in your example is that you still have the = symbol in the url. Replace these with / like so: /productsearch.cfm/ispeak/8/iwanttolearn/10 That should fix you up! ;-)

  • Doesn't work for me (running CFMX 6.1, on Apache 2 on RedHat Enterprise), I get a lovely 404: 404 /productsearch.cfm/ispeak=8/iwanttolearn=10 Which is what I thought would happen... Do I need to do something to the Apache config?

  • Mike, Send me your code and I'll look it over for you. Cheers, Phil

  • I am not getting any errors, and including it before my queries, as well as using . Is there any changes I should be making to the script to get it to work, as in the site webmasteredge.com. I see it works well on this site, so this is something at my end for sure.

  • Rather than use the scrip in the application file as I have found this gets a little buggy, just include it before your queries on the page you want to use dynamically. You also need to make sure that you are using the tag in your head. Phil

Advertisement

Sponsored By...
Mobile App Development (IOS, Android, Cordova, Phonegap, Objective-C, Java) - Austin, Texas Mobile Apps - Touch512, LLC.