Current Release
This documentation covers the latest release of Islandora 7.x. For the very latest in Islandora, we recommend Islandora 8.

Overview

The Drupal XML Sitemap module automatically creates and updates an XML sitemap which will show up at http://pathtoyoursite.com/sitemap.xml and can help to guide search engines indexing the website. By default, the sitemap does not include links to Islandora repository objects and you have to configure it. By using the Drupal XML Sitemap Custom module together with the Islandora XML Sitemap module, URLs for Islandora objects will be automatically added to the XML sitemap module database as custom links. Including Islandora objects in the Drupal sitemap makes them more likely to be correctly indexed by search engines.

Dependencies

This module requires the following modules/libraries:

Downloads

Release Notes and Downloads

Latest code on GitHub

Installation

Install as usual, see this for further information.

Configuration

Set 'Last Modified Solr Field' and 'Maximum number of Islandora links to process at once' in Administration » Islandora » XML Sitemap Integration (admin/islandora/xmlsitemap).

Configuration

Notes

Administrators can configure the number of objects to add to the sitemap at once, as well as the Solr field to use to track when objects were last added to the sitemap.

If you change either of the two settings in this menu, be sure to click "Save configuration" before you click either "Generate remaining entries" or "Regenerate all Islandora entries" otherwise those settings will not be saved.

Click "Reset configuration" to return the settings to their original values.

In the administration tools for the Drupal XML sitemap module, the Custom Links tab (admin/config/search/xmlsitemap/custom) provides a list of Islandora links in your sitemap. You can edit or delete Islandora links from your sitemap in this tab.

The following hooks will create an update to the list of Islandora sitemap custom links:

  • hook_islandora_object_purged()
  • hook_islandora_object_ingested()
  • hook_islandora_object_modified()
  • hook_islandora_datastream_purged()
  • hook_islandora_datastream_ingested()
  • hook_islandora_datastream_modified()

Objects must be publicly (anonymously) available to be included in the XML Sitemap. Note that the listing of Custom Links in the Drupal XML Sitemap module does not necessarily equate to inclusion in the sitemap; an Islandora object may be listed in the Custom Links table (admin/config/search/xmlsitemap/custom) and still not be in the sitemap.xml file if the object is not publicly available. If Islandora objects are not appearing in the XML Sitemap, check the following:

  1. Ensure that the Islandora XML Sitemap and the Drupal XML Sitemap Custom Links modules are enabled.
  2. Ensure that the anonymous Drupal user has the "View repository objects" permission (admin/people/permissions).
  3. XACML permissions override the Drupal "View repository objects" permission. Make sure that there are no extra XACML permissions on the object.

Please also note that objects marked as "inactive", whether manually or by using the Simple Workflow module, will still be indexed by default. You can prevent this by enabling the option "Lock down inactive and deleted objects" under admin/islandora/configure.

Larger sites with greater than 100,000 objects may encounter issues during the sitemap building process with the default configuration, such as the process hanging around a specific number indefinitely or exiting the process entirely before completion. These users may want to try unchecking the "Prefetch URL aliases during sitemap generation" option found on the xmlsitemap admin configuration page (/admin/config/search/xmlsitemap/settings) and trying the process again.

Viewing your Sitemap

Once you've configured the sitemap, your site map will show up at http://pathtoyoursite.com/sitemap.xml

You can edit that default sitemap and can create and manage additional sitemaps by visiting admin/config/search/xmlsitemap .



  • No labels

4 Comments

  1. Does the user need to enable the "XML sitemap engines" module in order to submit the xmlsitemap to Bing and Google? or does that happen automatically without actually enabling that module?

    If it is not automatic, then we might want to add to these instructions that the "XML sitemap engines" module should be enabled, where the user can select to submit it to "Bing" and "Google".

  2. What does clicking the "Reset last modified date" button actually do? It doesn't let you then edit that date, so I'm assuming that all it does is change the date to today's date. Perhaps we could add a short note about what that button actually does.


  3. I changed the "Maximum number of Islandora links to process at one" to "10", but can't see how that number makes any difference.

    Am I supposed to then click "Save Setting" so that number is set for future processes?

    I ask because, that limit of 10 certainly did not have any effect when I clicked on either of these buttons "General remaining entries" or "Regenerate all Islandora entries". When I clicked those buttons, the process did all objects in the repository.

    I suggest adding some note on what setting the "Maximum number" will affect.


    1. Peter, I think this is something like a sleep between indexing issues.  We have it set at 1,000 at a time for the Florida Islandora sites, but I don't remember why we did this.  Maybe ask on the Islandora listserv.