Just to add a few notes. Handling very large and numerous catalogs is one
of the things we want to address in a refactor. While we do need to find
all "data roots", we don't need to cache the catalogs; that's a
performance optimization that should be configurable.
The main problem is the memory used by caching bloated catalog objects.
We have the start of a catalog refactor (thredds.catalog2, if anyone
wants to have a look) in which the catalog objects are much lighter
weight and generally better. We would probably use ehcache for
caching. This is "scheduled" for the 4.3 release.
From another POV, we have always tried to obviate large/many catalogs
with things like datasetScan, and now featureCollection elements. But
there are obviously good reasons for users to generate them.
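For illustration (a sketch only; the names, path, and location here are all invented), a single datasetScan element can stand in for a whole directory tree of datasets, rather than listing each one by hand:

```xml
<!-- Hypothetical example: the name, path, and location values are made up.
     One datasetScan element exposes every matching file under a directory,
     so the config catalog stays small even when the data holdings are large. -->
<datasetScan name="Example model output" path="models/example"
             location="/data/models/example/">
  <filter>
    <include wildcard="*.nc"/>
  </filter>
</datasetScan>
```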
Anyway, I would welcome experience reports and advice.
On 1/4/2011 9:35 AM, Roland Schweitzer wrote:
Thanks John. Among the groups we collaborate with, there are some
folks who are quite concerned about the scaling issue. Personally,
my direct experience at this point indicates that the performance
is just fine (at least so far), even with our largest catalogs.
What's the experience of the list? Are folks seeing unacceptably slow
TDS initialization because of time spent reading catalogs? The thread
from John Maurer about aggregation access issues notwithstanding.
Roland
On 01/03/2011 07:34 PM, John Caron wrote:
On 1/3/2011 10:53 AM, Roland Schweitzer wrote:
Hi,
We're starting to put together some "big" server-side configuration
catalogs (both with "lots" of dataset elements and "lots" of
catalogRef elements). We are wondering about the process TDS goes
through to read the catalogs when it starts. What gets cached? Does
it have a way to know a referenced catalog is unchanged? When do
referenced catalogs get scanned? And so on.
Is there some documentation or a flow chart on how TDS initializes
itself?
Thanks,
Roland
_______________________________________________
thredds mailing list
thredds@xxxxxxxxxxxxxxxx
For list information or to unsubscribe, visit:
http://www.unidata.ucar.edu/mailing_lists/
Hi Roland:
The sad answer is there's not much documentation. We've been on the
verge of redoing the initialization sequence for a few years now, so
we've been waiting so we can document the clean, cool refactor instead
of the crufty, lame current one.
Anyway, the TDS reads in all the config catalogs at startup. It
caches all of them, and uses the "expires" attribute on the catalog
to decide if/when it needs to reread a catalog. It needs to read all
catalogs, including those referenced by catalogRef, because it has to
know what the possible dataset URLs are, and there is no contract that
a client has to read a catalog first.
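To illustrate the "expires" attribute (a sketch only; the catalog name, date, and catalogRef target are invented), it sits on the catalog element itself:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Sketch: name, expires value, and catalogRef target are invented.
     The expires attribute tells the TDS when its cached copy of this
     catalog must be reread from disk. -->
<catalog name="Example catalog" expires="2011-02-01T00:00:00"
         xmlns="http://www.unidata.ucar.edu/namespaces/thredds/InvCatalog/v1.0"
         xmlns:xlink="http://www.w3.org/1999/xlink">
  <catalogRef xlink:href="subcatalog.xml" xlink:title="Sub catalog" name=""/>
</catalog>
```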
Obviously this doesn't scale forever. Ethan can probably fill in some
details.
see:
http://www.unidata.ucar.edu/projects/THREDDS/tech/catalog/v1.0.2/InvCatalogSpec.html#catalog
John