SiteMap : Java Glossary

The JDisplay Java Applet displays the large program listings on this web page. JDisplay requires an up-to-date browser and Java version 1.8+, preferably 1.8.0_131. If you can’t see the listings, or if you just want to learn more about JDisplay, click

Use Firefox for best results.

menu
How Do Search Engines find your Sitemap?	Links
SiteMap Utility

How Do Search Engines find your Sitemap?

Links

SiteMap Utility

How Do Search Engines find your Sitemap?

You must register with Google to tell them you have prepared such a file, what you called it and where you put it, usually sitemap.gz in the root directory of your website, e.g. http://mindprod.com/sitemap.gz. You can check it is registered properly in your Google Webmaster Tools ⇒ Dashboard. Not having a standard name makes it harder for hackers to find the sitemap and use it to scope out your site, while leaving you free to tell the name to whichever searchengines you wish. It is not available generally, unless you call it something obvious like sitemap.gz.

You can register your sitemap with other search engines too.

Also help search engines find your sitemap with a line like this is your robots.txt file:

# parts of the mindprod.com website not indexed
user-agent: *
disallow: /include/
disallow: /jgloss/include/
disallow: /image/restricted/
Sitemap: http://mindprod.com/sitemap.gz

Note how Sitemap takes a full URL (Uniform Resource Locator), unlike the others.

It also might help to make put a link to your gz file somewhere on one of your webpages, so that the search engine will stumble on your sitemap in the process of spidering.

SiteMap Utility

I have written a Java program called SiteMap you can

to generate a Google sitemap file for your own website, offline. It can prepare a sitemap for a website of 10,000 files in under 2 seconds. This is fast enough that you can use it before every upload, ensuring your Google sitemap is 100% up to date for when the Google spider of opportunity knocks to spider your site.

You control the utility by composing three files with a text editor. The first siteconfig.properties looks like this:

You also compose a list of entire directories to specially catalog: directories.csv, containing the directory name, frequency of update and relative importance when it comes to keeping it up to date, expressed

You then compose a list of exceptions, the files to specially catalog a different way from the default for their directory : files.csv: It contains directory name, filename, frequency updated and

# individual file exceptions to the directory rules # directory, file, frequency, spidering importance. , whatsnew.html, daily, .9 jgloss, deadpadsites.html, never, 0 project, projects.html, weekly, .7

My utility then scans your disk and prepares a Google sitemap entry for every individual file that meets those criteria and compresses it. You then upload it to your website. The first time, you must also register that file’s name with Google.

You want to regenerate your site map just prior to every upload, otherwise if the Google spider comes, it will miss some of your recently updated files.

You can validate a sitemap file with an sitemap XML schema.

Human Readable Sitemaps

There is another kind of site map, that tries to give your a birds eye view of the entire website, so you can jump directly to the section you need. You can prepare these manually using the usual HTML (Hypertext Markup Language) editors, combined with directory listings and search/replace or you can use a utility to build one for you such as Coffee Cup SiteMapper. I created a primitive one manually for my own website.

standard footer
	This page is posted on the web at:	http://mindprod.com/jgloss/sitemap.html
	Optional Replicator mirror of mindprod.com on local hard disk J:	J:\mindprod\jgloss\sitemap.html
	Please read the feedback from other visitors, or send your own feedback about the site. Contact Roedy. Please feel free to link to this page without explicit permission.
	Canadian Mind Products IP:[65.110.21.43] Your face IP:[216.73.217.43]
Feedback	You are visitor number

This page is posted
on the web at:

http://mindprod.com/jgloss/sitemap.html

Optional Replicator mirror
of mindprod.com
on local hard disk J:

J:\mindprod\jgloss\sitemap.html

Please read the feedback from other visitors, or send your own feedback about the site.
Contact Roedy. Please feel free to link to this page without explicit permission.

Canadian Mind Products
IP:[65.110.21.43]
Your face IP:[216.73.217.43]

Feedback

You are visitor number