* Problem with 2.4.24 e1000 and keepalived @ 2004-01-07 19:05 Stephan von Krawczynski 2004-01-07 21:02 ` Willy Tarreau 0 siblings, 1 reply; 8+ messages in thread From: Stephan von Krawczynski @ 2004-01-07 19:05 UTC (permalink / raw) To: linux-kernel; +Cc: netdev, linux-net Hello all, I am looking for confirmation for the following problem. Setup is a simple pair of routers with 2 nics each, all e1000. If you start a vrrp setup with keepalived and interface state is down during keepalived startup, then the failover does not work. If the nics are UP during startup everything works well. Now the kernel part of the story: the exact same setup works with tulip cards. Is there a difference regarding UP/DOWN state handling/events in e1000 and tulip. e100 and eepro100 show the same problem btw. Any hints are welcome Regards, Stephan ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-07 19:05 Problem with 2.4.24 e1000 and keepalived Stephan von Krawczynski @ 2004-01-07 21:02 ` Willy Tarreau 2004-01-08 2:45 ` Ben Greear 0 siblings, 1 reply; 8+ messages in thread From: Willy Tarreau @ 2004-01-07 21:02 UTC (permalink / raw) To: Stephan von Krawczynski; +Cc: linux-kernel, netdev, linux-net Hi Stephan, On Wed, Jan 07, 2004 at 08:05:56PM +0100, Stephan von Krawczynski wrote: > Setup is a simple pair of routers with 2 nics each, all e1000. If you start a > vrrp setup with keepalived and interface state is down during keepalived > startup, then the failover does not work. If the nics are UP during startup > everything works well. Now the kernel part of the story: the exact same setup > works with tulip cards. > Is there a difference regarding UP/DOWN state handling/events in e1000 and > tulip. e100 and eepro100 show the same problem btw. I noticed the exact same problem about 1 year ago with the early 2.4 bonding code and eepro100. At this time, I attributed this to a yet undiscovered but in the bonding state machine, and could not investigate much since it was on a remote production machine. Someone went there and rebooted it and everything went OK. Before the reboot, the switch alredy detected an UP link, while the bonding code saw it down (using MII at this time, not ethtool). I recently read one report (here or on keepalived list) about someone who got the same problem with another eepro100. I wonder whether there would not be a bug either in the driver or in the chip itself. What I noticed is that if you load the driver while the cable is unplugged, and then plug it, the MII status says the link is still down. Unfortunately, the only e100 I have access to are in prod at a customer's and I really cannot make tests there. Cheers, Willy ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-07 21:02 ` Willy Tarreau @ 2004-01-08 2:45 ` Ben Greear 2004-01-08 5:20 ` Willy Tarreau 2004-01-08 8:14 ` Stephan von Krawczynski 0 siblings, 2 replies; 8+ messages in thread From: Ben Greear @ 2004-01-08 2:45 UTC (permalink / raw) To: Willy Tarreau; +Cc: Stephan von Krawczynski, linux-kernel, netdev, linux-net Willy Tarreau wrote: > Hi Stephan, > > On Wed, Jan 07, 2004 at 08:05:56PM +0100, Stephan von Krawczynski wrote: > >>Setup is a simple pair of routers with 2 nics each, all e1000. If you start a >>vrrp setup with keepalived and interface state is down during keepalived >>startup, then the failover does not work. If the nics are UP during startup >>everything works well. Now the kernel part of the story: the exact same setup >>works with tulip cards. >>Is there a difference regarding UP/DOWN state handling/events in e1000 and >>tulip. e100 and eepro100 show the same problem btw. > > > I noticed the exact same problem about 1 year ago with the early 2.4 > bonding code and eepro100. At this time, I attributed this to a yet > undiscovered but in the bonding state machine, and could not investigate > much since it was on a remote production machine. Someone went there and > rebooted it and everything went OK. Before the reboot, the switch alredy > detected an UP link, while the bonding code saw it down (using MII at this > time, not ethtool). I recently read one report (here or on keepalived list) > about someone who got the same problem with another eepro100. I wonder > whether there would not be a bug either in the driver or in the chip itself. > > What I noticed is that if you load the driver while the cable is unplugged, > and then plug it, the MII status says the link is still down. Unfortunately, > the only e100 I have access to are in prod at a customer's and I really > cannot make tests there. You have to bring the interface 'UP' before it will detect link, with something like: ifconfig eth2 up Could that be the problem? Ben > > Cheers, > Willy > > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger•kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > -- Ben Greear <greearb@candelatech•com> Candela Technologies Inc http://www.candelatech.com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-08 2:45 ` Ben Greear @ 2004-01-08 5:20 ` Willy Tarreau 2004-01-08 8:07 ` Ben Greear 2004-01-08 8:14 ` Stephan von Krawczynski 1 sibling, 1 reply; 8+ messages in thread From: Willy Tarreau @ 2004-01-08 5:20 UTC (permalink / raw) To: Ben Greear Cc: Willy Tarreau, Stephan von Krawczynski, linux-kernel, netdev, linux-net Hi Ben, On Wed, Jan 07, 2004 at 06:45:04PM -0800, Ben Greear wrote: > You have to bring the interface 'UP' before it will detect link, > with something like: ifconfig eth2 up Don't you mean "after" instead of "before" here ? Because the case where it doesn't work is when everything is set up while the cable is unplugged, but conversely, if the system goes up with the cable plugged, setting the interface UP detects the link as UP and works. I believe that the problem is related to setting the interface UP with nothing plugged into it. Cheers, Willy ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-08 5:20 ` Willy Tarreau @ 2004-01-08 8:07 ` Ben Greear 2004-01-08 8:46 ` Willy Tarreau 0 siblings, 1 reply; 8+ messages in thread From: Ben Greear @ 2004-01-08 8:07 UTC (permalink / raw) To: Willy Tarreau; +Cc: Stephan von Krawczynski, linux-kernel, netdev, linux-net Willy Tarreau wrote: > Hi Ben, > > On Wed, Jan 07, 2004 at 06:45:04PM -0800, Ben Greear wrote: > > >>You have to bring the interface 'UP' before it will detect link, >>with something like: ifconfig eth2 up > > > Don't you mean "after" instead of "before" here ? Because the case where > it doesn't work is when everything is set up while the cable is unplugged, > but conversely, if the system goes up with the cable plugged, setting the > interface UP detects the link as UP and works. I believe that the problem > is related to setting the interface UP with nothing plugged into it. No, I meant what I said: You have to tell many drivers to bring the interface up before they will attempt (or at least report) link negotiation. You do NOT have to give it an IP address or add any routes to it. But, I don't know about your particular program, I just suspect it is related to detecting link state. I think tg3 detects link when the interface is not UP, if you have some tg3 nics maybe you could try with them? Ben > > Cheers, > Willy > -- Ben Greear <greearb@candelatech•com> Candela Technologies Inc http://www.candelatech.com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-08 8:07 ` Ben Greear @ 2004-01-08 8:46 ` Willy Tarreau 0 siblings, 0 replies; 8+ messages in thread From: Willy Tarreau @ 2004-01-08 8:46 UTC (permalink / raw) To: Ben Greear Cc: Willy Tarreau, Stephan von Krawczynski, linux-kernel, netdev, linux-net On Thu, Jan 08, 2004 at 12:07:10AM -0800, Ben Greear wrote: > No, I meant what I said: You have to tell many drivers to bring the > interface > up before they will attempt (or at least report) link negotiation. > You do NOT have to give it an IP address or add any routes to it. ah, OK. No, anyway, it is just a matter of wrongly detecting link state after the link has been plugged while the interface was already UP, no matter if an IP was set or not. > But, I don't know about your particular program, I just suspect it > is related to detecting link state. I think tg3 detects link when > the interface is not UP, if you have some tg3 nics maybe you could > try with them? As far as I have tested, tg3 are fine WRT this. Willy ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-08 2:45 ` Ben Greear 2004-01-08 5:20 ` Willy Tarreau @ 2004-01-08 8:14 ` Stephan von Krawczynski 2004-01-08 8:47 ` Willy Tarreau 1 sibling, 1 reply; 8+ messages in thread From: Stephan von Krawczynski @ 2004-01-08 8:14 UTC (permalink / raw) To: Ben Greear; +Cc: willy, linux-kernel, netdev, linux-net On Wed, 07 Jan 2004 18:45:04 -0800 Ben Greear <greearb@candelatech•com> wrote: > Willy Tarreau wrote: > > Hi Stephan, > > [...] > > What I noticed is that if you load the driver while the cable is unplugged, > > and then plug it, the MII status says the link is still down. > > Unfortunately, the only e100 I have access to are in prod at a customer's > > and I really cannot make tests there. > > You have to bring the interface 'UP' before it will detect link, > with something like: ifconfig eth2 up > > Could that be the problem? > > Ben Hi Ben, the situation is like this (exactly this works flawlessly with tulip): - unplug all interfaces from the switches - reboot box - plug in _one_ interface - log into the box (yes, network works flawlessly) - start keepalived - now plug in rest of the interfaces - watch keepalived do _nothing_ (seems no UP event shows up) in comparison to: - let all interfaces plugged in - reboot box - log in - start keepalived - watch it work as expected Regards, Stephan ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Problem with 2.4.24 e1000 and keepalived 2004-01-08 8:14 ` Stephan von Krawczynski @ 2004-01-08 8:47 ` Willy Tarreau 0 siblings, 0 replies; 8+ messages in thread From: Willy Tarreau @ 2004-01-08 8:47 UTC (permalink / raw) To: Stephan von Krawczynski; +Cc: Ben Greear, linux-kernel, netdev, linux-net On Thu, Jan 08, 2004 at 09:14:41AM +0100, Stephan von Krawczynski wrote: > the situation is like this (exactly this works flawlessly with tulip): > > - unplug all interfaces from the switches > - reboot box > - plug in _one_ interface > - log into the box (yes, network works flawlessly) > - start keepalived > - now plug in rest of the interfaces > - watch keepalived do _nothing_ (seems no UP event shows up) I agree with this description, and would add : - mii-diag ethX or ethtool ethX report link down Willy ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2004-01-08 8:47 UTC | newest] Thread overview: 8+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2004-01-07 19:05 Problem with 2.4.24 e1000 and keepalived Stephan von Krawczynski 2004-01-07 21:02 ` Willy Tarreau 2004-01-08 2:45 ` Ben Greear 2004-01-08 5:20 ` Willy Tarreau 2004-01-08 8:07 ` Ben Greear 2004-01-08 8:46 ` Willy Tarreau 2004-01-08 8:14 ` Stephan von Krawczynski 2004-01-08 8:47 ` Willy Tarreau
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox