Axp-List Archive
Re: 2.2.16 Alpha: "Fix SMP rescheduling with lock held"

Subject: Re: 2.2.16 Alpha: "Fix SMP rescheduling with lock held"
From: Peter Rival (frival@zk3.dec.com)
Date: Wed Jun 14 04:24:11 2000


Greg Lindahl wrote:

> > I take it these
> > messages indicate the hang is around 2 seconds, which makes sense
> > given I send 16MB over 100Mb Enet,
>
> That depends on the detail. Are you running PVM in daemon mode or using
> direct routing? Even if you were sending 16 MB in a single blocking send, it
> should not hold the kernel lock while you're doing it. If so, it would be a
> generic kernel bug...
>
> strace is a very good utility (if it's functioning on alphas these days) to
> figure out what kinds of system calls you're making.
>

Other fun thing to do is what I've done here. Take a look at the addresses in
the "spinlock stuck" message, and either objdump --disassemble-all or take a
look through the System.map file and try to match up the addresses. The only
kicker is when you find a situation like this one - multiple people calling
schedule() and hanging up on the kernel_lock. That's when the strace
_really_ becomes handy, particularly if you can manage to strace both (all?)
the ways it's coming in.

 - Pete

-- 
To unsubscribe: send e-mail to axp-list-request@redhat.com with
'unsubscribe' as the subject.  Do not send it to axp-list@redhat.com



This archive was generated by hypermail version 2a22 on Sat Jul 1 05:31:30 2000 PDT
Send any problems or questions about this archive to webmaster@alphalinux.org.