Hi Kevin-
Glad to know you still find the GEMPAK IOSP useful. Since I wrote it
over 10 years ago, my guess is that it might be missing some dataset
close method. The netCDF-Java IOSP API has changed in that time, so it
might just be a matter of adding some close methods to the IOSP
classes. Sean would know more.
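If someone wants to poke at it, the fix may be as simple as a close()
override along these lines. This is just a sketch from memory; the
class and field names are illustrative, not the actual netCDF-Java
source:

    // Sketch only: assumes the GEMPAK IOSPs extend
    // AbstractIOServiceProvider and hold an extra GEMPAK-specific
    // handle ("gemRaf" here is hypothetical) that never gets released.
    public class GempakSurfaceIOSP
        extends ucar.nc2.iosp.AbstractIOServiceProvider {

      private ucar.unidata.io.RandomAccessFile gemRaf;

      // ... isValidFile(), open(), readData(), etc. elided ...

      @Override
      public void close() throws java.io.IOException {
        if (gemRaf != null) {
          gemRaf.close();  // release the extra handle
          gemRaf = null;
        }
        super.close();     // base class closes the primary RandomAccessFile
      }
    }

If a handle like that is never released, the FileCache sees the file
as permanently locked and can never scour it, which would be
consistent with the cache.log messages quoted below.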
Don
On 5/6/20 1:10 PM, Tyle, Kevin R wrote:
Hi,
Our THREDDS server (http://thredds.atmos.albany.edu:8080/thredds , still
running 4.6.13 at this time) serves both a current-week and a
longer-term archive of GEMPAK-formatted METAR files as Feature
Collections. Very nicely, THREDDS invokes netCDF-Java to handle the
conversion of GEMPAK to NetCDF. The archive gets especially heavy use
at this time of year, when my co-instructor and I have the students do
a case study of their choice and use MetPy and Siphon to access,
subset, and display surface maps and meteograms for their event of
interest.
Typically, I soon run into issues where the THREDDS server fails with
500 errors when an arbitrary GEMPAK surface file gets accessed via
NCSS. I have traced this to the maximum values on our NetcdfFile and
RandomAccessFile caches being set too low.
I see messages in the content/thredds/logs/cache.log file that look
like this:

    [2020-05-06T00:25:01.089+0000] FileCache NetcdfFileCache cleanup
    couldnt remove enough to keep under the maximum= 150 due to locked
    files; currently at = 905

    [2020-05-06T00:25:44.105+0000] FileCache RandomAccessFile cleanup
    couldnt remove enough to keep under the maximum= 500 due to locked
    files; currently at = 905
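(Those maximums are set in the cache sections of
content/thredds/threddsConfig.xml. The entries look roughly like the
following; the maxFiles values mirror the maximums in the log above,
while the minFiles and scour values are just illustrative:)

    <!-- minFiles and scour values illustrative; maxFiles match the log -->
    <NetcdfFileCache>
      <minFiles>100</minFiles>
      <maxFiles>150</maxFiles>
      <scour>11 min</scour>
    </NetcdfFileCache>
    <RandomAccessFile>
      <minFiles>400</minFiles>
      <maxFiles>500</maxFiles>
      <scour>11 min</scour>
    </RandomAccessFile>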
No problem; I have upped those limits now. But those “locked files”
references made me do some poking around on the machine that is
running THREDDS. I notice that when I run the *lsof* command and grep
for one of the GEMPAK files that has been accessed, I see a really
large number of matches.
For example, just now I picked one particular file, ran my Jupyter
notebook against it (the notebook queries and returns the subsetted
data via Siphon), and then ran *lsof* and grepped specifically for
that one file. Not surprisingly, it was listed in the *lsof* output.
But surprisingly, *lsof* had it listed 89 times! Why might that be the
case?
Multiply this by a dozen or so students and co-instructors, and one to
four individual GEMPAK files per case, and now I’m seeing why I
consistently run into issues, particularly with these types of
datasets. Once the notebook instance is closed, the open files
disappear from *lsof*, but oftentimes students (and even I) forget to
close and halt their Jupyter notebooks.
Curiously, when I look into my content/thredds/cache/ncss directory, I
don’t see anything.
So my two questions are:
1. Why does *lsof* return such a large number of duplicate references
for a single file that’s being accessed via NCSS?
2. Why do I not see files appear in the *cache* directory, even though
there are clearly times when the cache-scouring script detects them?
Thanks,
Kevin
_____________________________________________
Kevin Tyle, M.S.; Manager of Departmental Computing
NSF XSEDE Campus Champion
Dept. of Atmospheric & Environmental Sciences
University at Albany
Earth Science 228, 1400 Washington Avenue
Albany, NY 12222
Email: ktyle@xxxxxxxxxx
Phone: 518-442-4578
_____________________________________________
_______________________________________________
NOTE: All exchanges posted to Unidata maintained email lists are
recorded in the Unidata inquiry tracking system and made publicly
available through the web. Users who post to any of the lists we
maintain are reminded to remove any personal information that they
do not want to be made public.
thredds mailing list
thredds@xxxxxxxxxxxxxxxx
For list information or to unsubscribe, visit:
https://www.unidata.ucar.edu/mailing_lists/
--
Don Murray
NOAA/PSL and CU-CIRES
303-497-3596
https://www.psl.noaa.gov/people/don.murray/