<?xml version="1.0" encoding="utf-8"?>
<html>
By Robert Tansley, Google engineer and architect of DSpace 1.0
...
- Upgrade to the latest possible DSpace.
- Ensure that your DSpace is visible to search engines.
- Use the simple HTML sitemap feature – this does not require e.g. registering with Google Webmaster tools.
- Ensure your robots.txt allows access to item "splash" pages and full text.
- Ensure item metadata appears in HTML headers correctly.
- Don't worry about OAI-PMH; it is not particularly useful for indexing. Really.
Panel | ||||||
---|---|---|---|---|---|---|
Contents
|
Upgrade to the latest possible DSpace.
...
First ensure your DSpace instance is visible, e.g. with: https://www.google.com/webmasters/tools/sitestatus
If your site is not indexed at all, all search engines have a way to add your URL, e.g.:
- Google: http://www.google.com/addurl
- Yahoo: http://siteexplorer.search.yahoo.com/submit
- Bing: http://www.bing.com/docs/submit.aspx
Add HTML Sitemap support.
...
Ensure that your robots.txt file is at the top level of your site: i.e. at http://repo.foo.edu/robots.txt, and NOT e.g. http://repo.foo.edu/dspace/robots.txt
. If your DSpace instance is served from e.g. http://repo.foo.edu/dspace/
, you'll need to add /dspace to all the paths in the examples below (e.g. /dspace/browse-subject).
...