All,
Built, and am running, ldm-6.0.10 on two linux machines (f5.aos.wisc,edu,
and profhorn.aos.wisc.edu, mapmaker will be next), all good on that
front.
However, when I built on our SGI running irix 6.5.15m, using
gcc compilers (freeware version 3.0.1), I get assertion failures
and core dumps.
I'm going to have to fail back to 6.0.2 for now (which, after
core-dumping before I rebuilt the queue file the first time,
had been running ok.)
Unidata support: here are some excerpts from the log files when 6.0.10
crashes under irix.
This time, it just died, but did not dump core:
Apr 08 21:33:43 5Q:sunset zeus(feed)[1782]: topo: zeus.lsc.vsc.edu DIFAX
Apr 08 21:36:54 5Q:sunset kelvin[1991]: ldmprog_4: ldmping from
kelvin.ca.uky.edu
Apr 08 21:43:02 5Q:sunset rpc.ldmd[1697]: child 1710 terminated by signal 9
Apr 08 21:43:15 3Q:sunset DCNLDN[1692]: nldninput(): no data within timeout
period: returning EOF
Apr 08 21:43:15 3Q:sunset DCNLDN[1692]: nldninput(): NLDN read error
Apr 08 21:43:19 5Q:sunset pqact[1699]: child 1692 exited with status 110
Apr 08 21:46:54 5Q:sunset kelvin[2688]: ldmprog_4: ldmping from
kelvin.ca.uky.edu
Apr 08 21:52:53 5Q:sunset rpc.ldmd[1697]: child 1685 terminated by signal 11
Apr 08 21:52:53 5Q:sunset rpc.ldmd[1697]: Killing (SIGINT) process group
Apr 08 21:52:53 5Q:sunset rpc.ldmd[1697]: SIGINT
Apr 08 21:52:53 5Q:sunset mapmaker[1706]: SIGINT
Apr 08 21:52:54 5Q:sunset mapmaker[1713]: SIGINT
Apr 08 21:52:55 3Q:sunset mapmaker[1706]: pmap_unset(LDMPROG 300029, LDMVERS 5)
failed
Apr 08 21:52:55 3Q:sunset mapmaker[1713]: pmap_unset(LDMPROG 300029, LDMVERS 5)
failed
Apr 08 21:52:55 3Q:sunset mapmaker[1706]: pmap_unset(LDMPROG 300029, LDMVERS 6)
failed
Apr 08 21:52:55 3Q:sunset mapmaker[1713]: pmap_unset(LDMPROG 300029, LDMVERS 6)
failed
Apr 08 21:53:15 5Q:sunset rpc.ldmd[1697]: Terminating process group
Apr 08 21:53:15 5Q:sunset mapmaker[1706]: SIGTERM
Apr 08 21:53:15 5Q:sunset mapmaker[1713]: SIGTERM
Apr 08 21:53:15 5Q:sunset pqbinstats[1701]: Interrupt
Apr 08 21:53:15 5Q:sunset io(feed)[1757]: SIGTERM
Apr 08 21:53:15 5Q:sunset pqact[1699]: Interrupt
Apr 08 21:53:15 5Q:sunset f5(feed)[1750]: SIGTERM
Apr 08 21:53:16 5Q:sunset io(feed)[1757]: SIGINT
Apr 08 21:53:15 5Q:sunset zeus(feed)[1782]: SIGTERM
Apr 08 21:53:16 5Q:sunset f5(feed)[1750]: SIGINT
Apr 08 21:53:16 5Q:sunset pqbinstats[1701]: Exiting
Apr 08 21:53:15 5Q:sunset shadow(feed)[1739]: SIGTERM
Apr 08 21:53:15 5Q:sunset storm2(feed)[1743]: SIGTERM
Apr 08 21:53:16 5Q:sunset kelvin(feed)[1763]: SIGTERM
Apr 08 21:53:15 5Q:sunset accas(feed)[1746]: SIGTERM
Apr 08 21:53:16 5Q:sunset zeus(feed)[1769]: SIGTERM
Apr 08 21:53:16 5Q:sunset shadow(feed)[1739]: SIGINT
Apr 08 21:53:16 5Q:sunset storm2(feed)[1743]: SIGINT
Apr 08 21:53:16 5Q:sunset kelvin(feed)[1763]: SIGINT
Apr 08 21:53:16 5Q:sunset accas(feed)[1746]: SIGINT
Apr 08 21:53:16 5Q:sunset zeus(feed)[1769]: SIGINT
Apr 08 21:53:16 5Q:sunset zeus(feed)[1782]: SIGINT
Apr 08 21:53:16 5Q:sunset pqact[1699]: Exiting
Apr 08 21:53:16 3Q:sunset pqact[1699]: mm0_mtof: Couldn't riul_r_find 0
Apr 08 21:53:16 5Q:sunset io(feed)[1759]: SIGTERM
Apr 08 21:53:16 5Q:sunset rtstats[1703]: Interrupt
Apr 08 21:53:16 5Q:sunset io(feed)[1759]: SIGINT
Apr 08 21:53:16 5Q:sunset rtstats[1703]: Exiting
Apr 08 21:53:17 5Q:sunset f5[1711]: SIGTERM
Apr 08 21:53:17 5Q:sunset f5[1711]: SIGINT
Apr 08 21:53:18 5Q:sunset thelma[1707]: SIGTERM
Apr 08 21:53:18 5Q:sunset thelma[1707]: SIGINT
Apr 08 21:53:18 3Q:sunset thelma[1707]: pmap_unset(LDMPROG 300029, LDMVERS 5)
failed
Apr 08 21:53:18 3Q:sunset thelma[1707]: pmap_unset(LDMPROG 300029, LDMVERS 6)
failed
Apr 08 21:53:18 3Q:sunset f5[1711]: pmap_unset(LDMPROG 300029, LDMVERS 5) failed
Apr 08 21:53:18 3Q:sunset f5[1711]: pmap_unset(LDMPROG 300029, LDMVERS 6) failed
This time, it dumped a 400 Mb core file (ok, well I powered down the
machine after about 20 minutes and 200 Mb were written)
Apr 08 20:50:08 5Q:sunset mapmaker[1371139]: Connecting to upstream LDM using
protocol version 6...
Apr 08 20:50:08 5Q:sunset mapmaker[1371139]: Upstream LDM is willing to feed
Apr 08 20:56:57 5Q:sunset kelvin[1369930]: ldmprog_4: ldmping from
kelvin.ca.uky.edu
Apr 08 20:59:36 3Q:sunset DCNLDN[1327079]: nldninput(): no data within timeout
period: returning EOF
Apr 08 20:59:37 3Q:sunset DCNLDN[1327079]: nldninput(): NLDN read error
Apr 08 21:01:33 5Q:sunset pqact[1341124]: child 1327079 exited with status 110
Apr 08 21:01:33 5Q:sunset rpc.ldmd[1370280]: child 1369745 terminated by signal
11
Apr 08 21:01:33 5Q:sunset rpc.ldmd[1370280]: Killing (SIGINT) process group
Apr 08 21:01:33 5Q:sunset rpc.ldmd[1370280]: SIGINT
Apr 08 21:01:34 5Q:sunset zeus[1327918]: SIGINT
Apr 08 21:01:34 5Q:sunset pqact[1341124]: Interrupt
Apr 08 21:01:34 5Q:sunset pqbinstats[1359722]: Interrupt
Apr 08 21:01:34 5Q:sunset io(feed)[1367586]: SIGINT
Apr 08 21:01:34 5Q:sunset io(feed)[1369155]: SIGINT
Apr 08 21:01:34 5Q:sunset accas(feed)[1369325]: SIGINT
Apr 08 21:01:34 5Q:sunset shadow(feed)[1368393]: SIGINT
Apr 08 21:01:34 5Q:sunset kelvin(feed)[1371945]: SIGINT
Apr 08 21:01:34 5Q:sunset storm2(feed)[1367439]: SIGINT
Apr 08 21:01:34 3Q:sunset thelma[1360636]: assertion "sx->nfree + sx->nelems ==
sx->nalloc" failed: file "pq.c", line 2364
Apr 08 21:01:34 5Q:sunset zeus(feed)[1370336]: SIGINT
Apr 08 21:01:34 5Q:sunset f5(feed)[1370687]: SIGINT
Apr 08 21:01:34 5Q:sunset mapmaker[1371139]: SIGINT
Apr 08 21:01:34 5Q:sunset pqbinstats[1359722]: Exiting
Apr 08 21:01:34 5Q:sunset pqact[1341124]: Exiting
Apr 08 21:01:34 5Q:sunset rtstats[1369891]: Interrupt
Apr 08 21:01:34 5Q:sunset mapmaker[1367178]: SIGINT
Apr 08 21:01:34 5Q:sunset zeus(feed)[1367324]: SIGINT
Apr 08 21:01:34 5Q:sunset rtstats[1369891]: Exiting
Apr 08 21:01:35 3Q:sunset mapmaker[1371139]: pmap_unset(LDMPROG 300029, LDMVERS
5) failed
Apr 08 21:01:35 3Q:sunset mapmaker[1371139]: pmap_unset(LDMPROG 300029, LDMVERS
6) failed
Apr 08 21:01:35 5Q:sunset rpc.ldmd[1370280]: Terminating process group
Apr 08 21:01:35 5Q:sunset mapmaker[1367178]: SIGTERM
Apr 08 21:01:35 3Q:sunset mapmaker[1367178]: pmap_unset(LDMPROG 300029, LDMVERS
5) failed
Apr 08 21:01:35 3Q:sunset mapmaker[1367178]: pmap_unset(LDMPROG 300029, LDMVERS
6) failed
Apr 08 21:01:35 5Q:sunset mapmaker[1371139]: SIGTERM
Apr 08 21:10:39 3Q:sunset f5[1370668]: assertion "sx->nfree + sx->nelems ==
sx->nalloc" failed: file "pq.c", line 2364
Apr 08 21:10:41 5Q:sunset rpc.ldmd[1370280]: child 1360636 terminated by signal
6
Apr 08 21:10:41 5Q:sunset rpc.ldmd[1370280]: Killing (SIGINT) process group
Apr 08 21:19:06 3Q:sunset f5[1370033]: assertion "sx->nfree + sx->nelems ==
sx->nalloc" failed: file "pq.c", line 2364
Apr 08 21:19:07 5Q:sunset rpc.ldmd[1370280]: child 1370668 terminated by signal
6
Apr 08 21:19:07 5Q:sunset rpc.ldmd[1370280]: Killing (SIGINT) process group
Pete
--
+>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>+<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<+
^ Pete Pokrandt V 1447 AOSS Bldg 1225 W Dayton St^
^ Systems Programmer V Madison, WI 53706 ^
^ V poker@xxxxxxxxxxxxxxx ^
^ Dept of Atmos & Oceanic Sciences V (608) 262-3086 (Phone/voicemail) ^
^ University of Wisconsin-Madison V 262-0166 (Fax) ^
+<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<+>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>+