THREDDS slow with lots of files aggregated?

To: THREDDS <thredds@xxxxxxxxxxxxxxxx>
Subject: THREDDS slow with lots of files aggregated?
From: "Bob Simons" <Bob.Simons@xxxxxxxx>
Date: Fri, 14 Jul 2006 11:52:43 -0700

We have a THREDDS server running on Linux here at ERD serving a lot of data sets (http://oceanwatch.pfeg.noaa.gov:8081/thredds/catalog.html). The data sets are aggregates, created by putting lots of individual files in each data set's directory.

The problem is: I have noticed that the time to get data from 1 data file in a data set is roughly proportional to the number of files in the directory. And access to data in directories with lots of files is very slow.

Here are the results from a test, in the order that the subtests were done (in one run of the test program). The GA ssta hday subtest was added to test the theory that the number of files in the directory was correlated with the THREDDS OPeNDAP response time.


AG ssta 3day: 190 files, 719 ms
CM usfc hday: 2138 files, 13359 ms
GA ssta hday: 1018 files, 7063 ms
MB chla 1day: 185 files, 625 ms
QN curl 8day: 537 files, 3141 ms

That looks like a great correlation to me.
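
To put a number on that correlation, here is a minimal sketch that fits a straight line to the five measurements above. Only the file counts and timings come from the test; the function name `linear_fit` and the variable names are illustrative, and the original test program is not shown here.

```python
import math

# (data set, files in directory, response time in ms), copied from the results above
results = [
    ("AG ssta 3day", 190, 719),
    ("CM usfc hday", 2138, 13359),
    ("GA ssta hday", 1018, 7063),
    ("MB chla 1day", 185, 625),
    ("QN curl 8day", 537, 3141),
]


def linear_fit(xs, ys):
    """Least-squares fit y = slope * x + intercept; also returns Pearson's r."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


files = [f for _, f, _ in results]
times = [t for _, _, t in results]
slope, intercept, r = linear_fit(files, times)

# Per-data-set cost, then the overall fit
for name, f, t in results:
    print(f"{name}: {t / f:.2f} ms per file")
print(f"Pearson r = {r:.3f}, slope = {slope:.2f} ms per additional file")
```

With only five points this quantifies the trend rather than explaining it: Pearson's r comes out above 0.99, and the per-file cost is roughly 3 to 7 ms, which fits the "roughly proportional" observation but does not say where the time is spent.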

Any idea why THREDDS is so slow at opening one file? Linux is slow with so many files, but not close to this slow. Is there anything we can do to Linux or THREDDS to improve this?


Thank you.

Sincerely,

Bob Simons
Satellite Data Product Manager
Environmental Research Division
NOAA Southwest Fisheries Science Center
1352 Lighthouse Ave
Pacific Grove, CA 93950-2079
(831)658-3205
bob.simons@xxxxxxxx
<>< <>< <>< <>< <>< <>< <>< <>< <><


Follow-Ups:
- Re: THREDDS slow with lots of files aggregated?
  - From: John Caron
