
Re: 20001107: LDM problem at University of Puerto Rico



Karli,

The problem appears to be an LDM queue-management problem that permits
the queue to grow on SGI machines. These problems have been fixed in the
latest release of the LDM, 5.1.2, and binaries are available for your
SGI platform. Installing the new release requires commenting out the
"exec pqexpire" entry in the ldmd.conf file and rebuilding the queue,
because the queue has a new structure. Hopefully this will solve your
problem; if it doesn't, let us know.
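
The ldmd.conf change can be scripted if you prefer not to edit by hand.
Below is a minimal Python sketch, not an official Unidata tool. It assumes
the entry is written as exec followed by an optionally quoted pqexpire, and
that a leading "#" marks a comment in ldmd.conf. The function name
comment_out_pqexpire and the sample entries are hypothetical, chosen only
for illustration. It does not rebuild the queue; that step is still needed
separately.

```python
import re

# Matches an active (uncommented) ldmd.conf entry that starts pqexpire,
# e.g.  exec "pqexpire"  (the exact form in your file is an assumption).
PQEXPIRE_ENTRY = re.compile(r'^\s*exec\s+"?pqexpire\b')


def comment_out_pqexpire(conf_text):
    """Return conf_text with every active 'exec pqexpire' line commented out."""
    out = []
    for line in conf_text.splitlines(keepends=True):
        if PQEXPIRE_ENTRY.match(line):
            # Prefix with '#' so the LDM no longer starts pqexpire.
            out.append("#" + line)
        else:
            # Leave other entries and existing comments untouched.
            out.append(line)
    return "".join(out)


if __name__ == "__main__":
    # Illustrative configuration text, not a real ldmd.conf.
    sample = 'exec\t"pqexpire"\nexec\t"pqact"\n# exec "pqexpire"\n'
    print(comment_out_pqexpire(sample), end="")
```

Keep a copy of the original ldmd.conf before writing the result back, so
the change is easy to undo.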

Robb... 

On Tue, 7 Nov 2000, Unidata Support wrote:

> 
> ------- Forwarded Message
> 
> >From: McIDAS <address@hidden>
> >Organization: McIDAS proyect
> >Keywords: 200011070833.eA78XG409393 LDM mmap IRIX64
> 
> This is a multi-part message in MIME format.
> 
> --------------2781446B794B
> Content-Type: text/plain; charset=us-ascii
> Content-Transfer-Encoding: 7bit
> 
> Sorry guys, it's been so long since I've posted that I sent this to
> the wrong forum. This is really an LDM problem and should've been sent
> to LDM's support. So here it is.
> 
> Karli
> -- 
> 
> ====================================================================
> Amos Winter                                  address@hidden
> Director
> Puerto Rico Climatology Center 
> P.O. Box 9013                           
> Department of Marine Sciences                  phone: (787) 265-5416    
> University of Puerto Rico - Mayaguez             fax: (787) 265-2195
> Mayaguez, PR 00681-9013
> 
> --------------2781446B794B
> Content-Type: message/rfc822
> Content-Transfer-Encoding: 7bit
> Content-Disposition: inline
> 
> Message-ID: <address@hidden>
> Date: Mon, 06 Nov 2000 22:57:11 -0400
> From: McIDAS <address@hidden>
> Organization: McIDAS proyect
> X-Mailer: Mozilla 3.01SGoldC-SGI (X11; I; IRIX64 6.4 IP30)
> MIME-Version: 1.0
> To: Unidata Support <address@hidden>
> Subject: hi again
> References: <address@hidden>
> Content-Type: text/plain; charset=us-ascii
> Content-Transfer-Encoding: 7bit
> 
> Hi Again!
> 
> I know it's been a while, but I've been really busy lately. Anyway, given
> that we now have another massive panic with our systems, I figured I'd
> ask you guys what's going on.
> Right now this is the problem: as usual, we stopped getting any feeds,
> but surprisingly enough LDM was still running, so when I inspected the
> ldmd.log file I found it full of the same type of message as the ones
> below:
> 
> --------------------------
> Nov 07 00:00:17 5Q:breeze sysu1[16313]: Connection reset by peer
> Nov 07 00:00:17 5Q:breeze sysu1[16313]: Exiting
> Nov 07 00:00:23 5Q:breeze sysu1[16296]: Connection from
> sysu1.uni.wsicorp.com
> Nov 07 00:00:24 3Q:breeze sysu1[16296]: mmap: 64000000 0 1069875200: Not
> enough space
> Nov 07 00:00:24 3Q:breeze sysu1[16296]: Remap failed. Abandon all hope.
> Nov 07 00:00:24 3Q:breeze sysu1[16296]: pq_last: seq:Not enough space
> (errno = 12)
> Nov 07 00:00:24 5Q:breeze sysu1[16296]: hiya: 20001106230024.053 TS_ENDT
> {{WSI,  ".*"}}
> Nov 07 00:00:34 5Q:breeze sysu1[16296]: Growing data by 33980416
> Nov 07 00:00:34 3Q:breeze sysu1[16296]: mmap: 40000000 0 1103855616: Not
> enough space
> Nov 07 00:00:34 3Q:breeze sysu1[16296]: comings: pqe_new: Not enough
> space
> Nov 07 00:00:34 3Q:breeze sysu1[16296]:        :
> 7d2c6ca2f497a3b4aa283edaa7c69a45    16086 20001107000034.115     WSI
> 001  NEX/JUA/SRMV1/200011062355
> Nov 07 00:00:44 5Q:breeze sysu1[16296]: Growing data by 33980416
> Nov 07 00:00:44 3Q:breeze sysu1[16296]: mmap: 40000000 0 1103855616: Not
> enough space
> Nov 07 00:00:44 3Q:breeze sysu1[16296]: comings: pqe_new: Not enough
> space
> Nov 07 00:00:44 3Q:breeze sysu1[16296]:        :
> b09fa27bef8865ab10a60eb3a73473bd    14428 20001107000044.632     WSI
> 002  NEX/JUA/VEL1/200011062355
> 
> --------------------------
> and it just keeps going. Surprisingly, the ldm.pq file has grown to a
> whopping 1.069 GB!!! I have absolutely no idea how this could've
> happened (my limit is about 250 MB). I have stopped LDM, and I'm about
> to erase the pq file, recreate it, and start the processes again.
> Still, I assume this problem will come back, and I would like to know what
> it could be and what I can do about it. It looks as if it wasn't
> scouring for files, but this doesn't explain why the pq file overstepped
> its bounds! I also still have 3.0 GB free in the partition where ldm.pq
> is located.
> 
> Thank you guys for your help.
> 
> Karli
> 
> 
> 
> --------------2781446B794B--
> 
> 
> ------- End of Forwarded Message
> 
> 

===============================================================================
Robb Kambic                                Unidata Program Center
Software Engineer III                      Univ. Corp for Atmospheric Research
address@hidden             WWW: http://www.unidata.ucar.edu/
===============================================================================