TestDataResults

From PreservWiki

(Difference between revisions)
* DROID does not like being fed documents which contain possibly bad URLs. I need to check whether this is a problem with DROID (I suspect not) or with my code.
** File in question: http://dspace.mit.edu/47:14Z%20(GMT).%20No.%20of%20bitstreams:%20263191853.pdf:%207781517%20bytes,%20checksum:%20f3b11c5d00fa6a9a02d4c906c1825757%20(MD5)63191853MIT.pdf:%207786389%20bytes,%20checksum:%2032bfd24738365b6456b4d2a10a961c6b%20(MD5)&.
** FIXED: DROID does not handle shell escaping (in this case bash) when file names are handed to it, so it cannot read the file. The workaround is to hand DROID a "droid list XML file" which contains an XML-encoded link to the file.
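
The workaround above relies on XML-encoding the file name rather than shell-escaping it. A minimal Python sketch of that idea follows. The element names `FileList` and `File` are hypothetical placeholders, not DROID's actual list-file schema, which should be checked against the DROID documentation. The sketch only shows that a name containing characters such as `&` survives a round trip through XML encoding.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def build_file_list_xml(paths):
    """Return an XML document listing each path with XML-special characters escaped.

    NOTE: <FileList> and <File> are illustrative names only (an assumption),
    not the real DROID file-list format.
    """
    lines = ['<?xml version="1.0" encoding="UTF-8"?>', "<FileList>"]
    for path in paths:
        # escape() replaces &, < and > so the name is safe as element text.
        lines.append("  <File>" + escape(path) + "</File>")
    lines.append("</FileList>")
    return "\n".join(lines) + "\n"


def read_file_list_xml(document):
    """Parse the document back and return the original, unescaped paths."""
    root = ET.fromstring(document.encode("utf-8"))
    return [element.text for element in root.findall("File")]
```

Because the file name is written into a file and not passed on a command line, spaces, parentheses and ampersands in the name no longer need bash quoting.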

Revision as of 13:30, 17 February 2009

For now this is a bullet-pointed list which needs bulking out:

  • When harvesting repositories, about 1/3 of downloads fail for various reasons. Further investigation?
  • ROAR identifies a lot of items as HTML, because a request may get redirected to an HTML page in the process of trying to get a resource. This is wrong, as the HTML page is not the resource. We believe the bug is that the repository sends an HTTP 200 and not an HTTP 401 header. This is particularly the case on the ANU repository, where we have only 74 items, as fmt/94 (HTML) is the redirected 401 page. Further investigation?
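
One possible way to catch the redirect problem described above at harvest time is to compare what was requested with what came back. The following is a rough heuristic sketch, not how ROAR or the harvester actually works. The function name, its parameters, and the example URLs in the usage note are all assumptions made for illustration.

```python
from urllib.parse import urlsplit

HTML_TYPES = ("text/html", "application/xhtml+xml")


def looks_like_error_page(requested_url, final_url, status, content_type):
    """Guess whether a download is a login/error page, not the resource itself.

    Heuristic (an assumption, not a standard): anything other than HTTP 200 is
    a failure, and an HTML body reached via a redirect is suspicious unless
    the requested URL itself pointed at an HTML file.
    """
    if status != 200:
        return True
    # Drop parameters such as "; charset=UTF-8" from the Content-Type value.
    media_type = content_type.split(";")[0].strip().lower()
    is_html = media_type in HTML_TYPES
    extension = urlsplit(requested_url).path.lower().rsplit(".", 1)[-1]
    expected_html = extension in ("html", "htm")
    redirected = final_url != requested_url
    return is_html and redirected and not expected_html
```

For example, a request for a hypothetical `http://repo.example/item/1/thesis.pdf` that ends at `http://repo.example/login` with status 200 and `text/html` would be flagged, so it could be excluded before format identification counts it as fmt/94.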