public inbox for netdev@vger.kernel.org 
 help / color / mirror / Atom feed
From: Adam Baker <linux@baker-net•org.uk>
To: "Ezequiel Garcia" <ezequiel@vanguardiasur•com.ar>,
	"Andrew Lunn" <andrew@lunn•ch>,
	"Thomas Schlöter" <thomas@schloeter•net>
Cc: Thomas Petazzoni <thomas.petazzoni@free-electrons•com>,
	Karl Beldan <karl.beldan@rivierawaves•com>,
	netdev@vger•kernel.org, philipp@familie-kirchhofer•de,
	Martin Michlmayr <tbm@cyrius•com>,
	Linux ARM Kernel <linux-arm-kernel@lists•infradead.org>
Subject: Re: net: mv643xx: interface does not transmit after some time
Date: Sat, 27 Feb 2016 20:06:58 +0000	[thread overview]
Message-ID: <56D201E2.4060504@baker-net.org.uk> (raw)
In-Reply-To: <CAAEAJfBFNEPRd+AaKpRAh6sKFJ=trO+mAD_DAAj3dzhAbbreqg@mail.gmail.com>

On 11/02/16 14:38, Ezequiel Garcia wrote:
> (let's expand the Cc a bit)
>
> On 10 February 2016 at 19:57, Andrew Lunn <andrew@lunn•ch> wrote:
>> On Wed, Feb 10, 2016 at 07:40:54PM +0100, Thomas Schlöter wrote:
>>>
>>>> Am 08.02.2016 um 19:49 schrieb Thomas Schlöter <thomas@schloeter•net>:
>>>>
>>>>
>>>>> Am 07.02.2016 um 22:07 schrieb Thomas Schlöter <thomas@schloeter•net>:
>>>>>
>>>>> Am 07.02.2016 um 21:35 schrieb Andrew Lunn <andrew@lunn•ch>:
>>>>>>
>>>>>>>> FWIW, we had a similar bug report in Debian recently:
>>>>>>>> https://lists.debian.org/debian-arm/2016/01/msg00098.html
>>>>>>
>>>>>> Hi Thomas
>>>>>>
>>>>>> I this thread, Ian Campbell mentions a patch. Please could you try
>>>>>> that patch and see if it fixes your problem.
>>>>>>
>>>>>> Thanks
>>>>>>    Andrew
>>>>>
>>>>> Hi Andrew,
>>>>>
>>>>> I just applied the patch and the NAS is now running it. I???ll try to crash it tonight and keep you informed whether it worked.
>>>>>
>>>>> Thanks
>>>>>     Thomas
>>>>
>>>> Hi Andrew,
>>>>
>>>> the patch did not fix the problem. After 1.2 GiB RX and 950 MiB TX, the interface crashed again.
>>>>
>>>> Now I switched off RX/TX offload just to make sure we are talking about the same problem. If we are, the interface should be stable without offload, right?
>>>>
>>>>      Thomas
>>>
>>> Okay, so I have installed ethtool and switched off all offload features available. Now the NAS is running rock solid for two days. I backed up my Mac using Time Machine / netatalk (450 GiB transferred) and some Linux machines via NFS (100 GiB total) without a problem.
>>>
>>> How much code is used for mv643xx offload functionality?
>>> Is it possible to debug things in the driver and figure out what happens during the crash?
>>> Is the hardware offload interface proprietary or reverse engineered or is it a well known API that can be analyzed?
>>
>> Hi Thomas
>>
>> Ezequiel Garcia probably knows this part of the driver and hardware
>> the best...
>>
>
> The TCP segmentation offload (TSO) implemented in this driver is
> mostly a software thing.
>
> I'm CCing Karl and Philipp, who have fixed subtle issues in the TSO
> path, and may be able to help figure this one out.
>

Hi,

Had this issue occur again today. In my case it seems to be triggered by 
large NFSv4 transfers.

I'm running 4.4 plus Nicolas Schichan's patch at
https://patchwork.ozlabs.org/patch/573334/

There is a thread a http://forum.doozan.com/read.php?2,17404 suggesting 
that this has been broken since at least 3.16.

I first spotted the issue when upgrading from 3.11 to 4.4.

Looking at 
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/log/drivers/net/ethernet/marvell/mv643xx_eth.c 
I see 2014-05-22 as the date TSO support was first added which is 
shortly before the merge window opened for 3.16. I'm therefore guessing 
that TSO has been problematic since it's introduction.

Regards

Adam



_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists•infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel

      reply	other threads:[~2016-02-27 20:06 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <E4F9E0D0-9DA8-4E33-9F89-B49E47F11CE0@schloeter.net>
     [not found] ` <20160206183414.GD17218@lunn.ch>
     [not found]   ` <20160206231935.GA30734@jirafa.cyrius.com>
     [not found]     ` <2ACB3A0B-DD51-43C1-A56E-E7C175645554@schloeter.net>
     [not found]       ` <20160207203545.GB29107@lunn.ch>
     [not found]         ` <A21B89D4-5826-4F97-A1C9-E62E5B724082@schloeter.net>
     [not found]           ` <312E318A-CAE1-45B1-AB13-EA147B48E315@schloeter.net>
     [not found]             ` <A6128DBC-59DB-4231-A08D-B84C1D8C26CC@schloeter.net>
     [not found]               ` <20160210225717.GD14610@lunn.ch>
2016-02-11 14:38                 ` net: mv643xx: interface does not transmit after some time Ezequiel Garcia
2016-02-27 20:06                   ` Adam Baker [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=56D201E2.4060504@baker-net.org.uk \
    --to=linux@baker-net$(echo .)org.uk \
    --cc=andrew@lunn$(echo .)ch \
    --cc=ezequiel@vanguardiasur$(echo .)com.ar \
    --cc=karl.beldan@rivierawaves$(echo .)com \
    --cc=linux-arm-kernel@lists$(echo .)infradead.org \
    --cc=netdev@vger$(echo .)kernel.org \
    --cc=philipp@familie-kirchhofer$(echo .)de \
    --cc=tbm@cyrius$(echo .)com \
    --cc=thomas.petazzoni@free-electrons$(echo .)com \
    --cc=thomas@schloeter$(echo .)net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox