Sitemap Protocol
The Sitemap protocol lets you inform search engines about the URLs on your website that are available for crawling. In its simplest form, a Sitemap is an XML file that lists the URLs of a site. The protocol can be used for sites of any size. It also lets administrators attach additional information to each URL (when it was last updated, how often it changes, and how important it is relative to other URLs on the site) so that search engines can crawl the site more intelligently. The protocol is one of the tools used for search engine optimization (SEO). However, using it does not guarantee that your site will be included in search indexes.
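As a rough illustration of what such a file contains, the following minimal sketch builds a Sitemap with Python's standard library. The function name build_sitemap, the example.com URLs, and the dates and values are assumptions made for this example only; the element names (urlset, url, loc, lastmod, changefreq, priority) and the namespace come from the Sitemap protocol itself.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the Sitemap protocol (version 0.9).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Return a Sitemap XML document as a string.

    Each entry is a dict with a required "loc" key and optional
    "lastmod", "changefreq" and "priority" keys.
    (build_sitemap is a hypothetical helper, not part of any tool.)
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for entry in entries:
        url = ET.SubElement(urlset, "url")
        for tag in ("loc", "lastmod", "changefreq", "priority"):
            if tag in entry:
                ET.SubElement(url, tag).text = str(entry[tag])
    return ET.tostring(urlset, encoding="unicode")


# Example data: the URLs and values below are placeholders.
example_entries = [
    {
        "loc": "https://www.example.com/",
        "lastmod": "2024-01-15",
        "changefreq": "weekly",
        "priority": "1.0",
    },
    {"loc": "https://www.example.com/contact.html"},
]

sitemap_xml = build_sitemap(example_entries)
print(sitemap_xml)
```

Only loc is required for each URL; the other three elements are optional hints that a search engine may or may not take into account.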
Although at first glance the Sitemap protocol appears to bring only benefits, we are obliged to point out the risks it carries as well.
Generating the XML file is like handing over a key to your website. Through this file you pass Google information about which files are on the server, including configuration files for a MySQL database, webconfig files, Microsoft Office documents, and so on.
Directory browsing is disabled on our servers. However, if a visitor knows the direct URL of a file, and you have not explicitly denied access to that folder or file (for example, with .htaccess), the file will simply be served. A carelessly generated XML file can therefore expose files that are not intended for the public!
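One way to reduce this risk is to review a generated Sitemap before publishing it. The sketch below scans a Sitemap for URLs whose file extension suggests a non-public file. The function name find_sensitive_urls, the list of suffixes, and the sample XML are all assumptions for illustration; adjust the list to the files that actually exist on your site.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Namespace of the Sitemap protocol, in ElementTree's {uri}tag notation.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

# Hypothetical deny-list of extensions that usually should not be public.
SENSITIVE_SUFFIXES = (
    ".sql", ".config", ".ini", ".bak",
    ".doc", ".docx", ".xls", ".xlsx",
)


def find_sensitive_urls(sitemap_xml):
    """Return the <loc> URLs whose path ends with a deny-listed suffix."""
    root = ET.fromstring(sitemap_xml)
    flagged = []
    for loc in root.iter(SITEMAP_NS + "loc"):
        url = (loc.text or "").strip()
        path = urlparse(url).path.lower()
        if path.endswith(SENSITIVE_SUFFIXES):
            flagged.append(url)
    return flagged


# Sample Sitemap with placeholder URLs, two of which look risky.
sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.example.com/index.html</loc></url>
  <url><loc>https://www.example.com/web.config</loc></url>
  <url><loc>https://www.example.com/backup/db.sql</loc></url>
</urlset>"""

flagged = find_sensitive_urls(sample)
print(flagged)
```

A check like this only removes the hint from the Sitemap. The files themselves stay reachable by direct URL until access to them is denied on the server.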
Google Sitemaps Generator works by indexing the directory structure of the files stored on the server for the site. It is therefore not suitable for indexing dynamic web applications, applications that use ModRewrite to rewrite URLs, or, more generally, applications where the structure of the site as seen from the browser does not match the directory structure of the files on the server.
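To make the limitation concrete, here is a simplified sketch of directory-based URL discovery. It is not the actual implementation of Google Sitemaps Generator; the function urls_from_directory, the file names, and the example.com base URL are assumptions for illustration.

```python
import os
import tempfile


def urls_from_directory(document_root, base_url):
    """Map every file under document_root to a URL under base_url.

    Hypothetical helper: it only sees files on disk, so it knows
    nothing about URLs that are produced by rewrite rules.
    """
    urls = []
    for dirpath, _dirnames, filenames in os.walk(document_root):
        for name in filenames:
            full_path = os.path.join(dirpath, name)
            rel = os.path.relpath(full_path, document_root)
            urls.append(base_url.rstrip("/") + "/" + rel.replace(os.sep, "/"))
    return sorted(urls)


# Build a throwaway document root with two placeholder files.
with tempfile.TemporaryDirectory() as root:
    for name in ("index.php", "db-config.ini"):
        open(os.path.join(root, name), "w").close()
    listed = urls_from_directory(root, "https://www.example.com")

print(listed)
```

In this sketch the configuration file is listed alongside index.php, while a rewritten address such as a hypothetical /products/42 that index.php might serve never appears, because no file with that path exists on disk.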