Each
of the following components are critical pieces to a site's ability
to be crawled, indexed and ranked by search engine spiders. When
properly used in the construction of a website, these features give a
site/page the best chance of ranking well for targeted keywords.
Accessibility
An
accessible site is one that ensures delivery of its content
successfully as often as possible. The functionality of pages,
validity of HTML elements, uptime of the site's server and working
status of site coding and components all figure into site
accessibility. If these features are ignored or faulty, both search
engines and users will select other sites to visit.
The
biggest problems in accessibility that most sites encounter fit into
the following categories. Addressing these issues satisfactorily will
avoid problems getting search engines and visitors to and through
your site.
-
Broken
Links
- If an HTML link is broken, the contents of the linked-to page may
never be found. In addition, some surmise that search engines
negatively degrade rankings on sites & pages with many broken
links.
-
Valid
HTML & CSS -
Although arguments exist about the necessity for full validation of
HTML and CSS in accordance with W3C
guidelines,
it is generally agreed that code must meet minimum requirements of
functionality and successful display in order to be spidered and
cached properly by the search engines.
-
Functionality
of Forms and Applications
- If form submissions, select boxes, javascript or other
input-required elements block content from being reached via direct
hyperlinks, search engines may never find them. Keep data that you
want accessible to search engines on pages that can be directly
accessed via a link. In a similar vein, the successful functionality
and implementation of any of these pieces is critical to a site's
accessibility for visitors. A non-functioning page, form or code
element is unlikely to receive much attention from visitors.
-
File
Size
- With the exception of a select few documents that search engine
consider to be of exceptional importance, web pages greater than
150K in size are typically not fully cached. This is done to reduce
index size, bandwidth and load on the servers, and is important to
anyone building pages with exceptionally large amounts of content.
If it's important that every word and phrase be spidered and
indexed, keeping file size under 150K is highly recommended. As with
any online endeavor, smaller file size also means faster download
speed for users - a worthy metric in its own right.
URLs,
Title Tags & Meta Data
URLs,
title tags and meta tag components are all information that describe
your site and page to visitors and search engines. Keeping them
relevant, compelling and accurate are key to ranking well. You can
also use these areas as launching points for your keywords, and
indeed, successful rankings require their use.
The
URL of a document should ideally be as descriptive and brief as
possible. If, for example, your site's structure has several levels
of files and navigation, the URL should reflect this with folders and
subfolders. Individual page's URLs should also be descriptive without
being overly lengthy, so that a visitor who sees only the URL could
have a good idea of what to expect on the page. Several examples
follow:
Comparison
of URLs for a Canon Powershot SD400 Camera
Amazon.com
-
http://www.amazon.com/gp/product/B0007TJ5OG/102-8372974-
4064145?v=glance&n=502394&m=ATVPDKIKX0DER&n=3031001&s=photo&v=glance
Canon.com
-
http://consumer.usa.canon.com/ir/controller?
act=ModelDetailAct&fcategoryid=145&modelid=11158
DPReview.com
- http://www.dpreview.com/reviews/canonsd400/
With
both Canon and Amazon, a user has virtually no idea what the URL
might point to. With DPReview's logical URL, however, it is easy to
surmise that a review of a Canon SD400 is the likely topic of the
page.
In
addition to the issues of brevity and clarity, it's also important to
keep URLs limited to as few dynamic parameters as possible. A dynamic
parameter is a part of the URL that provides data to a database so
the proper records can be retrieved, i.e. n=3031001, v=glance,
categoryid=145, etc.
Note
that in both Amazon and Canon's URLs, the dynamic parameters number 3
or more. In an ideal site, there should never be more than two.
Search engineer representatives have confirmed on numerous occasions
that URLs with more than 2 dynamic parameters may not be spidered
unless they are perceived as significantly important (i.e. have many,
many links pointing to them).
Well
written URLs have the additional benefit of serving as their own
anchor text when copied and pasted as links in forums, blogs, or
other online venues. In the DPReview example, a search engine might
see the URL: http://www.dpreview.com/reviews/canonsd400/ and give
ranking credit to the page for terms in the URL like dpreview,
reviews, canon, sd, 400. The parsing and breaking of terms is subject
to the search engine's analysis, but the chance of earning this
additional credit makes writing friendly, usable URLs even more
worthwhile.
Title
tags, in addition to their invaluable use in targeting keyword terms
for rankings, also help drive click-through-rates (CTRs) from the
results pages. Most of the search engines will use a page's title tag
as the blue link text and headline for a result (see image below) and
thus, it is important to make them informative and compelling without
being overly "salesy". The best title tags will make the
targeted keywords prominent, help brand the site, and be as clear and
concise as possible.