Dealing with large archives
Hi guys,
Firstly:
I have "solved" the problem with the bad characters. The problem is that
the NetCDF reader that thredds uses makes use itself of the "urlPath"
specification when coming back with the DDS and DAS. As such, if use the
"=" character (among others) in the urlPath (even if it's in the path
rather than the simple filename), it gets inserted into the DDS/DAS by
the NetCDF reader, which causes errors down the track in the parser.
I have worked around the problem by having a separate internalService
for each dataset. The "base" section can contain the illegal characters
without polluting the DDS/DAS of files read by the NetCDF reader. For
the moment this is fine, but it is less than ideal; I may return to it
after dealing with more pressing issues. In future I will look at
escaping the illegal characters or encoding them in some other way, but
it's tricky to be sure you've covered all of the cases with those
techniques.
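Something along the lines of standard percent-encoding is what I have in
mind, e.g. in Python (the path below is made up, and whether the parser
is actually happy with percent-escapes is exactly the part I would still
need to check):

    from urllib.parse import quote, unquote

    def escape_url_path(url_path):
        """Percent-encode awkward characters in a urlPath, keeping "/" readable."""
        return quote(url_path, safe="/")

    raw = "models/run=20080101_00/forecast.nc"   # made-up example path
    escaped = escape_url_path(raw)               # models/run%3D20080101_00/forecast.nc
    assert unquote(escaped) == raw               # reversible, so the original path can be recovered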
Maybe once everything goes XML the problem will simply disappear, and I
can just wait it out :)
Secondly:
I am trying to work out how to structure my data by date. I will have a
number of data sets (NWP models) that will be updated daily, or even
multiple times per day. Quite quickly I will reach the point where I
have hundreds of data sets published. Even a week's worth of data at two
runs per day across three sources is 42 data sets.
I have two tasks: one is to automate the updating of the configuration
files so that new data sets are incorporated as they become available,
and the other is to structure the data pages in a sensible way for users
to access.
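For the first task, I am leaning towards something like a cron job that
rescans the data directories and regenerates the dataset listing. A rough
Python sketch of what I mean (the directory layout and names are
placeholders, and this only emits bare <dataset> elements rather than a
complete catalog):

    import os
    from xml.sax.saxutils import quoteattr

    # Hypothetical layout: DATA_ROOT/<model>/<YYYYMMDD_HH>/<file>.nc (placeholder paths)
    DATA_ROOT = "/data/nwp"

    def dataset_entries(root):
        """Yield (model, run, filename) for every NetCDF file under root."""
        for model in sorted(os.listdir(root)):
            model_dir = os.path.join(root, model)
            if not os.path.isdir(model_dir):
                continue
            for run in sorted(os.listdir(model_dir)):
                run_dir = os.path.join(model_dir, run)
                if not os.path.isdir(run_dir):
                    continue
                for name in sorted(os.listdir(run_dir)):
                    if name.endswith(".nc"):
                        yield model, run, name

    def write_dataset_fragment(root, out_path):
        """Write one <dataset> element per file; just the listing, not a full catalog."""
        with open(out_path, "w") as out:
            for model, run, name in dataset_entries(root):
                out.write("<dataset name=%s urlPath=%s/>\n" % (
                    quoteattr("%s %s %s" % (model, run, name)),
                    quoteattr("/".join([model, run, name]))))

    if __name__ == "__main__":
        write_dataset_fragment(DATA_ROOT, "datasets.xml")

That handles getting new runs in as they arrive, but it says nothing
about presenting them sensibly, which brings me to the question below.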
I was wondering what practices people have adopted or found successful
in the past with regard to handling large amounts of data. Have people
typically arranged archive data as aggregations, or linked to archive
catalogs from the top-level catalog? What have people found works best?
Cheers,
-Tennessee