Jessica,
I use ldmd.conf with a "REQUEST" entry pointing at a local server.
My data is all IDD, but I can now confirm your problem.
a) I updated one of my systems to Red Hat Linux 7.0
b) Updated the LDM and recompiled
c) Configured for a basic WMO ingestion, no decoders
d) Removed the old queue and made a new one (256 MB)
e) Started the LDM
f) It killed the data stream on the upstream machine
This was working before the upgrade to RH7.0. The server
and client were communicating OK. The data stream was OK, at least
for the small WMO stuff. Both machines are Intel.
The server is still running ldm-5.1.2 on RH6.2.
The client is now on RH7.0.
You appear to have trouble with:
FreeBSD 4.2 server
Solaris 8
I am assuming these are both Intel(?).
The errors I get are similar to yours. I am attaching the full logs,
but be aware that the client log had MY mistake in it. I had
forgotten to build a new queue before starting the ldm (dumb).
Anne,
There is a pretty clear relationship between starting the
client and crashing a server process. I did this a couple of times
to be sure. Anyhow, some surface thoughts follow. You are probably
already looking, but:
Have any of the file locking semantics changed?
Or the socket interface? (AF_INET, AF_UNIX, AF_LOCAL)
Have any new functions been added or changed in the queuing?
(search, indexing, qinsert(), qdelete())
Anyway, I uninstalled ldm-5.1.3 and reinstalled ldm-5.1.2, and it is
working again. The server configuration was unchanged, apart from a
stop and restart. I also have ldm-5.1.3 running on a RH6.2 box, and it
is not crashing anything.
jdm
At 02:01 AM 2/14/01 +0000, you wrote:
>Good Evening,
>
>We are requesting a NOAAPort feed using pqing -P <port number> on a ldm
>server that serves this data to downstream ldm hosts.
>
>Today, when the downstream ldm server started requesting a feed from the
>upstream ldm server, one of the pqing processes (importing the NWSTG
>channel) mysteriously died. I attached a copy of the two ldmd log files.
>
>Both machines are running ldm-5.1.3. The upstream server is a FreeBSD
>(version 4.2) box. The downstream server is a Solaris/x86 box (version 8).
>
>There appears to be a problem with the product queue on the upstream ldm
>server.
>
>Has anyone experienced any problems similar to this problem?
>
>Thank you in advance for any assistance,
>
>Jessica
>
>--
>Jessica M. Thomale
>Oklahoma Climatological Survey
>E-mail: jthomale@xxxxxx
>Mail: 100 E. Boyd, Suite 1210 Norman, OK 73019-1012
>Phone: (405) 325-7809
>Fax: (405) 325-2550
>
>
>Attachment Converted: "c:\program files\bear access\winba\eudora\attach\ldmd.log.downstream"
>
>Attachment Converted: "c:\program files\bear access\winba\eudora\attach\ldmd.log.upstream"
>
Feb 14 18:02:44 vorticity rpc.ldmd[31063]: Starting Up (built: Aug 31 2000 11:48:33)
Feb 14 18:02:44 vorticity pqexpire[31064]: Starting Up
Feb 14 18:02:44 vorticity pqact[31067]: Starting Up
Feb 14 18:02:44 vorticity pqbinstats[31066]: Starting Up (31063)
Feb 14 18:02:44 vorticity snow[31069]: run_requester: Starting Up: snow.cit.cornell.edu
Feb 14 18:02:44 vorticity snow[31069]: run_requester: 20010214180217.181 TS_ENDT {{MCIDAS, "^pnga2area Q[01]"},{WMO, ".*"},{UNIDATA, ".*"},{FSL, ".*"},{NLDN, ".*"},{ANY, ".*"}}
Feb 14 18:02:46 vorticity localhost[31116]: Connection from localhost.localdomain
Feb 14 18:02:46 vorticity localhost[31116]: Connection reset by peer
Feb 14 18:02:46 vorticity localhost[31116]: Exiting
Feb 14 18:02:46 vorticity snowstorm[31117]: Connection from snowstorm.cit.cornell.edu
Feb 14 18:02:46 vorticity snowstorm(feed)[31117]: Starting Up: 20010214180217.178 TS_ENDT {{WMO, ".*"},{UNIDATA, ".*"},{WSI, ".*"},{FSL2, ".*"}}
Feb 14 18:02:46 vorticity snowstorm(feed)[31117]: topo: snowstorm.cit.cornell.edu WSI|FSL2|UNIDATA
Feb 14 18:02:46 vorticity visibility[31118]: Connection from visibility.cit.cornell.edu
Feb 14 18:02:46 vorticity visibility(feed)[31118]: Starting Up: 20010214180217.178 TS_ENDT {{MCIDAS, "^pnga2area Q[01]"},{WMO, ".*"},{UNIDATA, ".*"},{FSL, ".*"},{WSI, ".*"}}
Feb 14 18:02:46 vorticity visibility(feed)[31118]: topo: visibility.cit.cornell.edu WSI|FSL|UNIDATA
Feb 14 18:02:46 vorticity cloudcover[31119]: Connection from cloudcover.cit.cornell.edu
Feb 14 18:02:46 vorticity cloudcover(feed)[31119]: Starting Up: 20010214180217.178 TS_ENDT {{UNIDATA, ".*"},{FSL2, ".*"},{MCIDAS, ".*"},{WSI, ".*"}}
Feb 14 18:02:46 vorticity cloudcover(feed)[31119]: topo: cloudcover.cit.cornell.edu WSI|FSL2|UNIDATA
Feb 14 18:02:46 vorticity pqexpire[31064]: > Recycled 55285.072 kb/hr ( 13857.765 prods per hour)
Feb 14 18:02:49 vorticity snow[31069]: FEEDME(snow.cit.cornell.edu): reclass: 20010214180217.181 TS_ENDT {{MCIDAS, "^pnga2area Q[01]"},{WMO, ".*"},{UNIDATA, ".*"},{FSL2, ".*"},{FSL2|UNIDATA, ".*"}}
Feb 14 18:02:49 vorticity snow[31069]: FEEDME(snow.cit.cornell.edu): OK
Feb 14 18:04:16 vorticity sealevel[31991]: Connection from sealevel.cit.cornell.edu
Feb 14 18:04:16 vorticity sealevel(feed)[31991]: Starting Up: 20010214180217.178 TS_ENDT {{WMO, ".*"},{UNIDATA, ".*"},{FSL2, ".*"},{MCIDAS, ".*"},{WSI, ".*"}}
Feb 14 18:04:16 vorticity sealevel(feed)[31991]: topo: sealevel.cit.cornell.edu WSI|FSL2|UNIDATA
Feb 14 18:04:45 vorticity proftomd[32005]: Starting up
Feb 14 18:04:45 vorticity proftomd[32005]: unsetting MCPATH environment variable
Feb 14 18:04:45 vorticity proftomd[32005]: Decoding 2001045.1754 data into data/mcidas/MDXX0095
Feb 14 18:04:46 vorticity proftomd[32005]: Exiting
Feb 14 18:05:03 vorticity sysu1[32123]: Connection from sysu1.uni.wsicorp.com
Feb 14 18:05:04 vorticity sysu1[32123]: hiya: 20010214175958.794 TS_ENDT {{WSI, ".*"}}
Feb 14 18:05:25 vorticity nids2area[32580]: NIDS2AREA -- BEGIN
Feb 14 18:05:25 vorticity nids2area[32580]: PRODUCT CODE=RE 1045 180000
Feb 14 18:05:25 vorticity nids2area[32584]: NIDS2AREA -- BEGIN
Feb 14 18:05:25 vorticity nids2area[32584]: PRODUCT CODE=RJ 1045 180000
Feb 14 18:05:25 vorticity nids2area[32580]: NIDS2AREA -- DONE AREA 826
Feb 14 18:05:25 vorticity nids2area[32584]: NIDS2AREA -- DONE AREA 1039
Feb 14 18:05:38 vorticity temperature[32616]: Connection from temperature.cit.cornell.edu
Feb 14 18:05:38 vorticity temperature(feed)[32616]: Starting Up: 20010214170540.561 TS_ENDT {{WMO, ".*"}}
Feb 14 18:05:38 vorticity temperature(feed)[32616]: topo: temperature.cit.cornell.edu WMO
Feb 14 18:05:56 vorticity nids2area[32646]: NIDS2AREA -- BEGIN
Feb 14 18:05:56 vorticity nids2area[32646]: PRODUCT CODE=R2 1045 180000
Feb 14 18:05:56 vorticity nids2area[32646]: NIDS2AREA -- DONE AREA 340
Feb 14 18:06:34 vorticity pnga2area[471]: Starting Up
Feb 14 18:06:34 vorticity pnga2area[471]: unPNG:: 7382 225088 30.4915
Feb 14 18:06:34 vorticity pnga2area[471]: Exiting
Feb 14 18:06:46 vorticity nids2area[495]: NIDS2AREA -- BEGIN
Feb 14 18:06:46 vorticity nids2area[495]: PRODUCT CODE=RF 1045 180000
Feb 14 18:06:46 vorticity nids2area[495]: NIDS2AREA -- DONE AREA 861
Feb 14 18:06:46 vorticity nids2area[496]: NIDS2AREA -- BEGIN
Feb 14 18:06:46 vorticity nids2area[496]: PRODUCT CODE=RK 1045 180000
Feb 14 18:06:46 vorticity nids2area[496]: NIDS2AREA -- DONE AREA 1073
Feb 14 18:07:07 vorticity nids2area[586]: NIDS2AREA -- BEGIN
Feb 14 18:07:07 vorticity nids2area[586]: PRODUCT CODE=R3 1045 180000
Feb 14 18:07:07 vorticity nids2area[586]: NIDS2AREA -- DONE AREA 413
Feb 14 18:07:17 vorticity nids2area[763]: NIDS2AREA -- BEGIN
Feb 14 18:07:17 vorticity nids2area[763]: PRODUCT CODE=RG 1045 180000
Feb 14 18:07:17 vorticity nids2area[763]: NIDS2AREA -- DONE AREA 929
Feb 14 18:07:21 vorticity temperature(feed)[32616]: SRUS81 KCTP 141807 /pRRACTP: RPC: Unable to receive
Feb 14 18:07:21 vorticity temperature(feed)[32616]: pq_sequence failed: Input/output error (errno = 5)
Feb 14 18:07:21 vorticity temperature(feed)[32616]: Exiting
Feb 14 18:07:27 vorticity rpc.ldmd[31063]: child 32616 exited with status 1
Feb 14 18:07:37 vorticity nids2area[1022]: NIDS2AREA -- BEGIN
Feb 14 18:07:37 vorticity nids2area[1022]: PRODUCT CODE=R4 1045 180000
Feb 14 18:07:37 vorticity nids2area[1022]: NIDS2AREA -- DONE AREA 452
Feb 14 18:07:37 vorticity nids2area[1029]: NIDS2AREA -- BEGIN
Feb 14 18:07:37 vorticity nids2area[1029]: PRODUCT CODE=RH 1045 180000
Feb 14 18:07:38 vorticity nids2area[1029]: NIDS2AREA -- DONE AREA 978
Feb 14 18:07:51 vorticity rpc.ldmd[31063]: Exiting
Feb 14 18:07:51 vorticity rpc.ldmd[31063]: Terminating process group
Feb 14 18:07:51 vorticity pqbinstats[31066]: Exiting
Feb 14 18:07:51 vorticity sysu1[32123]: Exiting
Feb 14 18:07:51 vorticity sealevel(feed)[31991]: Exiting
Feb 14 18:07:51 vorticity cloudcover(feed)[31119]: Exiting
Feb 14 18:07:51 vorticity visibility(feed)[31118]: Exiting
Feb 14 18:07:51 vorticity snowstorm(feed)[31117]: Exiting
Feb 14 18:07:51 vorticity snow[31069]: Exiting
Feb 14 18:07:51 vorticity pqact[31067]: Exiting
Feb 14 18:07:51 vorticity pqexpire[31064]: Exiting
Feb 14 18:07:51 vorticity pqexpire[31064]: > Up since: 20010214180244.161
Feb 14 18:07:51 vorticity pqexpire[31064]: > Queue usage (bytes):256000000
Feb 14 18:07:51 vorticity pqexpire[31064]: > (nregions): 32991
Feb 14 18:07:51 vorticity pqexpire[31064]: > nbytes recycle: 5377144 ( 44366.803 kb/hr)
Feb 14 18:07:51 vorticity pqexpire[31064]: > nprods deleted: 1605 ( 13560.680 per hour)
Feb 14 18:07:51 vorticity pqexpire[31064]: > First deleted: 20010214165538.162
Feb 14 18:07:51 vorticity pqexpire[31064]: > Last deleted: 20010214170244.246
Feb 14 18:07:51 vorticity rpc.ldmd[31063]: child 31065 terminated by signal 6
Feb 14 18:07:51 vorticity rpc.ldmd[31063]: Killing (SIGINT) process group
Feb 14 18:07:51 vorticity rpc.ldmd[31063]: Interrupt
Feb 14 18:05:40 temperature rpc.ldmd[1950]: Starting Up (built: Feb 14 2001 11:34:27)
Feb 14 18:05:40 temperature pqbinstats[1952]: Starting Up (1950)
Feb 14 18:05:40 temperature pqact[1953]: Starting Up
Feb 14 18:05:40 temperature pqact[1953]: Error in pattern file "/usr/local/ldm/etc/pqact.conf"
Feb 14 18:05:40 temperature pqact[1953]: Exiting
Feb 14 18:05:40 temperature vorticity[1954]: run_requester: Starting Up: vorticity.cit.cornell.edu
Feb 14 18:05:40 temperature vorticity[1954]: run_requester: 20010214170540.561 TS_ENDT {{WMO, ".*"}}
Feb 14 18:05:40 temperature vorticity[1954]: FEEDME(vorticity.cit.cornell.edu): OK
Feb 14 18:05:40 temperature pqexpire[1951]: Starting Up
Feb 14 18:05:42 temperature rpc.ldmd[1950]: child 1953 exited with status 1
Feb 14 18:05:42 temperature localhost[1961]: Connection from localhost.localdomain
Feb 14 18:05:42 temperature localhost[1961]: Connection reset by peer
Feb 14 18:05:42 temperature localhost[1961]: Exiting
Feb 14 18:07:22 temperature rpc.ldmd[1950]: Exiting
Feb 14 18:07:22 temperature rpc.ldmd[1950]: Terminating process group
Feb 14 18:07:22 temperature pqexpire[1951]: Exiting
Feb 14 18:07:22 temperature pqexpire[1951]: > Up since: 20010214180540.643
Feb 14 18:07:22 temperature pqexpire[1951]: > Queue usage (bytes):31153432
Feb 14 18:07:22 temperature pqexpire[1951]: > (nregions): 11164
Feb 14 18:07:22 temperature pqexpire[1951]: > nprods deleted 0
Feb 14 18:07:22 temperature pqbinstats[1952]: Exiting
Feb 14 18:07:22 temperature vorticity[1954]: Exiting
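
For anyone lining up the two logs above, the abnormal child exits are the lines that matter (on Linux, signal 6 is SIGABRT). Below is a minimal sketch, assuming syslog-style lines shaped like the ones in these logs. The names LINE_RE, CHILD_RE, abnormal_exits and SAMPLE are hypothetical helpers for illustration and are not part of the LDM; the sample lines are copied from the logs above.

```python
import re

# Syslog-style line: "<Mon> <day> <hh:mm:ss> <host> <proc>[<pid>]: <message>"
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+(?P<host>\S+)\s+"
    r"(?P<proc>[^\[\s]+)\[(?P<pid>\d+)\]:\s+(?P<msg>.*)$"
)
# Messages the parent rpc.ldmd writes when one of its children goes away.
CHILD_RE = re.compile(
    r"child (?P<child>\d+) (?:exited with status (?P<status>\d+)"
    r"|terminated by signal (?P<sig>\d+))"
)


def abnormal_exits(lines):
    """Return (timestamp, host, child_pid, reason) for each abnormal child exit."""
    events = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # blank lines and wrapped continuation lines are skipped
        c = CHILD_RE.search(m.group("msg"))
        if not c:
            continue
        if c.group("sig") is not None:
            reason = "signal " + c.group("sig")
        elif c.group("status") != "0":
            reason = "exit status " + c.group("status")
        else:
            continue  # a clean exit (status 0) is not reported
        events.append(
            (m.group("stamp"), m.group("host"), int(c.group("child")), reason)
        )
    return events


# Sample lines copied from the two logs above.
SAMPLE = [
    "Feb 14 18:07:27 vorticity rpc.ldmd[31063]: child 32616 exited with status 1",
    "Feb 14 18:07:51 vorticity rpc.ldmd[31063]: child 31065 terminated by signal 6",
    "Feb 14 18:05:42 temperature rpc.ldmd[1950]: child 1953 exited with status 1",
    "Feb 14 18:07:21 vorticity temperature(feed)[32616]: Exiting",
]

if __name__ == "__main__":
    for stamp, host, child, reason in abnormal_exits(SAMPLE):
        print(stamp, host, "child", child, reason)
```

Run against a whole log file, the same function can be fed `open(path)` directly, since it only needs an iterable of lines.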