Re: [thredds] Static aggregations

To: David Robertson <robertson@xxxxxxxxxxxxxxxxxx>
Subject: Re: [thredds] Static aggregations
From: John Caron <caron@xxxxxxxxxxxxxxxx>
Date: Fri, 23 Oct 2009 16:20:48 -0600

Hi David:

Im guessing that you have caching disabled, or is ineffective for somereason. Can you send me your threddsConfig.xml file to verify that ? Ifthats true, is that deliberate?

Under those circumstances, any access to an aggregation has to rebuildthe aggregation, no matter what the recheckEvery setting is. TDS 4.1 nowhas a file system cache using ehcache, which will only do an OS filescan when the directory changes.

So bottom line is, re-enable the NetcdfFile object cache and thingsshould work as expected. If thats not the case then we have moreinvestigating to do.


David Robertson wrote:

Hi,

Richard Signell wrote:
I'm pretty sure the caching behavior has changed a lot with different
versions of the THREDDS Data Server -- and I'm pretty sure the latest
4.1 server does not rescan the entire aggregation.
What version are you using?
TDS 4.1 built on October 10th. I'm not positive that it's rescanningthe entire directory but it definitely takes longer (10-60 secondsversus ~1 second) just after I touch a file in the directory. Thiscurrent test was using recheckEvery="-1" but the results are the samewithout recheckEvery and with it set to a normal value like "15 min".
As long as nothing in the directory has a new date, I can access thedataset in ~1 second even hours later.
Dave
-Rich

On Fri, Oct 23, 2009 at 12:27 PM, David Robertson
<robertson@xxxxxxxxxxxxxxxxxx> wrote:
Hi all,
It seems that THREDDS forces a rescan if the time stamp on thedirectory haschanged. Even if I set recheckEvery to -1 or 90 days it stillappears torescan when the modified date on the folder changes. I tested thisusing a
simple "touch junk" command in the directory I'm aggregating.
This makes sense so that files added to the directory can be addedto theaggregation. However, is there a way to tell TDS to skip this stepfor agiven dataset or will I need to put my non-changing datasets insubfolders?If I put them in subfolders will I be able to aggregate the entiredataset
together anymore? Perhaps something like:

<netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
  <aggregation dimName="time" type="joinExisting">
     <scan location="/home/om/dods-data/thredds/cool/avhrr/nc4/2006/"
           regExp="^2006.*\.nc" />
  </aggregation>
</netcdf>

for each year and

<netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
  <aggregation dimName="time" type="joinExisting">
     <scan location="/home/om/dods-data/thredds/cool/avhrr/nc4/"
           regExp=".*/*\.nc" />
  </aggregation>
</netcdf>

to get the aggregation to go through the subfolders and put the years
together?
The solution of getting TDS to skip rescan on specific datasetswould be
preferable to simplify scripts and avoid having to change them each new
year.

Thanks,
Dave

Roy Mendelssohn wrote:
I believe if you don't set rescan it uses the default value - butwouldhave to check on that. I know there is a way to tell it to notrescan.
-Roy

On Oct 22, 2009, at 9:42 AM, David Robertson wrote:
Hi,

Roy Mendelssohn wrote:
What is your rescan set to for that dataset? That is probablywhat is
causing it.
I am not using rescan or recheckEvery so that's probably theproblem the
dataset element I'm using is pasted below:

<dataset name="2006"
       ID="cool-avhrr-bigbight-2006"
       urlPath="cool/avhrr/bigbight/2006" >

 <metadata inherited="true">
    <timeCoverage>
       <start>2006-01-01 03:10:00 UTC</start>
       <end>2006-12-31 22:53:00 UTC</end>
    </timeCoverage>
    <geospatialCoverage>
       <northsouth>
          <start>34.9950981140137</start>
          <size> 11.0090637207031</size>
          <units>degrees_north</units>
       </northsouth>
       <eastwest>
          <start>-77.0059967041016</start>
          <size>  14.0119972229004</size>
          <units>degrees_east</units>
       </eastwest>
    </geospatialCoverage>
 </metadata>
<netcdfxmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
    <aggregation dimName="time" type="joinExisting">
       <scan location="/home/om/dods-data/thredds/cool/avhrr/nc4/"
             regExp="^2006.*\.nc" />
    </aggregation>
 </netcdf>
</dataset>
On Oct 22, 2009, at 8:47 AM, David Robertson wrote:
Hi,
Is there a way to tell the TDS NOT to look for new files to addto anaggregated dataset? I have several aggregations set up that donot change(no added or removed or modified files). Yesterday aftergenerating theaggregation cache, access the dataset was quite quick; ~1 secondto load theData Access Form. However, when I try to access those samedatasets today ittakes just as long as it did to generate the aggregation cachein the first
place (5 minutes).
It should be noted that these aggregations are subsets of filesin adirectory that IS being updated. What I have done used a regExpto separatea very large dataset into years. The 2008 and prior aggregationswill nothave files added so I'm looking for a way to stop the TDS fromsearching for
new files to add to the aggregation cache.

Thanks,
Dave

_______________________________________________
thredds mailing list
thredds@xxxxxxxxxxxxxxxx
For list information or to unsubscribe,  visit:
http://www.unidata.ucar.edu/mailing_lists/
**********************
"The contents of this message do not reflect any position of theU.S.
Government or NOAA."
**********************
Roy Mendelssohn
Supervisory Operations Research Analyst
NOAA/NMFS
Environmental Research Division
Southwest Fisheries Science Center
1352 Lighthouse Avenue
Pacific Grove, CA 93950-2097
e-mail: Roy.Mendelssohn@xxxxxxxx (Note new e-mail address)
voice: (831)-648-9029
fax: (831)-648-8440
www: http://www.pfeg.noaa.gov/
"Old age and treachery will overcome youth and skill."
"From those who have been given much, much will be expected"
**********************
"The contents of this message do not reflect any position of the U.S.
Government or NOAA."
**********************
Roy Mendelssohn
Supervisory Operations Research Analyst
NOAA/NMFS
Environmental Research Division
Southwest Fisheries Science Center
1352 Lighthouse Avenue
Pacific Grove, CA 93950-2097

e-mail: Roy.Mendelssohn@xxxxxxxx (Note new e-mail address)
voice: (831)-648-9029
fax: (831)-648-8440
www: http://www.pfeg.noaa.gov/

"Old age and treachery will overcome youth and skill."
"From those who have been given much, much will be expected"
_______________________________________________
thredds mailing list
thredds@xxxxxxxxxxxxxxxx
For list information or to unsubscribe,  visit:
http://www.unidata.ucar.edu/mailing_lists/
_______________________________________________
thredds mailing list
thredds@xxxxxxxxxxxxxxxx
For list information or to unsubscribe, visit:http://www.unidata.ucar.edu/mailing_lists/

Follow-Ups:
- Re: [thredds] Static aggregations
  - From: David Robertson

References:
- [thredds] Static aggregations
  - From: David Robertson
- Re: [thredds] Static aggregations
  - From: Roy Mendelssohn
- Re: [thredds] Static aggregations
  - From: David Robertson
- Re: [thredds] Static aggregations
  - From: Roy Mendelssohn
- Re: [thredds] Static aggregations
  - From: David Robertson
- Re: [thredds] Static aggregations
  - From: Richard Signell
- Re: [thredds] Static aggregations
  - From: David Robertson

2009 messages navigation, sorted by:
1. Thread
2. Subject
3. Author
4. Date
5. ↑ Table Of Contents
Search the thredds archives: