Having search engine friendly sites these
days is no longer a good idea...it's a MUST!
More and more SE's are ignoring the META tags we have all come to know and love
and are concentrating more and more on the actual content of your page. But what
happens if the page is called dynamically? Answer - The bot falls on its bot!
Most SE bots will not follow a URL once it hits the ? or & which indicates the
dynamic content and will grind to a halt. This tutorial will show you a method
to make sure the SE bots really dig into your site and index every possible
page!
The bones of the script below are very simple and lightning fast. In a nutshell
the script interrogates the URL you pass it to determine where the URL ends and
the dynamic content begins. Take this example:
http://www.yourdomain.com/page.cfm?Var1=2&Var2=2&Var3=3
If you rewrote the above to get rid of the ? and & and replace them all with /
to make it look like folders you would get:
http://www.yourdomain.com/page.cfm/Var1/1/Var2/2/Var3/3
Marvellous!! We now have a URL that the SE bot will follow as it looks like a
bunch of folders!
But how do we use this in our query? That?s where the script comes in. On the
page where you are passing this new URL to i.e. page.cfm simply add the script
BEFORE the queries and write your query as normal.
The script will basically scan through the URL looking for the extension, the
.cfm in this case. Once it finds it, it then starts looking for another /. Now
it knows that if it finds another one it?s a dynamic URL and it needs to start
collecting the variables.
Obviously each variable name will have a value so it knows that after the very
next / will be the value for the variable. It will then continue to decipher the
rest of the URL until it runs out of /?s.
Your query will then accept those variables in the normal way and process.
Here?s the script:
<cfscript>
debug = 0;
valid_extensions = "html,htm,cfm,asp,jsp";
url_suffix = ".html";
path_to_parse = replacenocase(cgi.path_info, cgi.script_name, "");
if (listlen(path_to_parse, "/") gte 2) {
var_name = "";
for (x = 1; x lte listlen(path_to_parse, "/"); x
= x + 1) {
if (var_name eq "") {
var_name =
trim(listgetat(path_to_parse, x, "/"));
if (not refind("^[A-Za-z][A-Za-z0-9_]*$",
var_name)) {
var_name =
"";
x = x + 1;
}
}
else {
value_to_set =
listgetat(path_to_parse, x, "/");
if (trim(valid_extensions)
neq "" and x eq listlen(path_to_parse, "/")) {
for (ext = 1; ext lte listlen(valid_extensions); ext = ext + 1) {
extension = "." & listgetat(valid_extensions, ext);
if (right(value_to_set, len(extension)) eq extension) {
value_to_set = left(value_to_set, len(value_to_set) - len(extension));
url_suffix = extension;
break;
}
}
}
setvariable(var_name, value_to_set);
if (isdefined("debug") and debug) {
writeoutput("<!-- " & var_name & " = " & value_to_set &
" -->" & chr(10));
}
var_name = "";
}
}
}
</cfscript>
So where to use it? Well the easiest method is to use it as an include in each
page that has your SE friendly URL passed to it. You could just add it to the
Application.cfm file as well.
The down side is that SE bots can go a bit mental as happened to me last night!
My normal page views are around 2-3k per day. Yesterday it shot up to 11.5K!!
Why? Because the Inkotomi bot indexed every resource on the site (around 900)
and all the associated pages!!
So there you go! SE bots will now trawl your site until they are full to the
brim! For a working example, check out
www.webmasteredge.com and just watch the URL?s as they are passed from page
to page in the resources section.
Amendment: Some users have reported a bug with the above tutorial. If you copy and paste the script direct into your page you may find that the script will not work correctly and comes up with a compilation error.
If this happens, simply strip out the whitespace before each line of code in the script and this should fix it.