London street directory & create a sitemap

Being my normal self, I thought I would have a bit of a change today. So, the london19.com site became important again, it should, what a great name.

I have added the street directories for 1818,1832, 1842, 1843, 1940 and I think that’s all to this site. I have kept all pub stuff on the pub history site, and the 1921 street directory too. I am not sure why I made this decision, but anything can be changed.

The good thing is that the pub history site now has just under 50,000 html pages, which all fit on one sitemap for google. Good old google.

If you are ever interested, i can tell you how to create a site map of even this magnitude in a matter of a minute. Lets do it.

You first need to login via ssh to your linux web site hosting, I use putty which is a free ssh utility. You need to know the correct login details, too.

You then change directory to the top folder for your website, something like :

/var/www/vhosts/p**/httpdocs , e.g. cd /var/www/vhosts/p**/httpdocs

You then run a simple script which finds all files in the folder/s:

find `pwd` > foundthislot.txt

You then need to pick the specifics from this file, i.e. all of the .html pages

cat foundthislot.txt | grep shtml > findjusthtml.txt

Its quite simple.

Download this last file via ftp as you would normally, then do a grand edit of this file so that each line is relevant to the web page you are visiting.

 

Sitemaps are very important, and although this is easy; it took me a couple of years to get this right!

Don’t laugh at me for posting such a ridiculously simple way of creating google sitemaps. It really is that simple, it just took me a long time to realise the potential of linux power against the weakness of running in a Windows environment, which is good for desktop applications only.

You also need to use google webmaster tools to upload the sitemap at some stage, and see how your site is doing in desktop and mobile visibility, but this is another story.

 

 

 

 

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s