From: Dave Martin <Dave.Martin@arm•com>
To: Mark Brown <broonie@kernel•org>
Cc: Julien Grall <julien@xen•org>,
Catalin Marinas <catalin.marinas@arm•com>,
zhang.lei@jp•fujitsu.com, Julien Grall <julien.grall@arm•com>,
Will Deacon <will@kernel•org>,
linux-arm-kernel@lists•infradead.org,
Daniel Kiss <Daniel.Kiss@arm•com>
Subject: Re: [PATCH v3 8/8] arm64/sve: Rework SVE trap access to use TIF_SVE_NEEDS_FLUSH
Date: Wed, 15 Jul 2020 17:52:54 +0100 [thread overview]
Message-ID: <20200715165254.GG30452@arm.com> (raw)
In-Reply-To: <20200629133556.39825-9-broonie@kernel.org>
On Mon, Jun 29, 2020 at 02:35:56PM +0100, Mark Brown wrote:
> From: Julien Grall <julien.grall@arm•com>
>
> SVE state will be flushed on the first SVE access trap. At the moment,
> the SVE state will be generated from the FPSIMD state in software and
> then loaded in memory.
>
> It is possible to use the newly introduce flag TIF_SVE_NEEDS_FLUSH to
> avoid a lot of memory access.
>
> If the FPSIMD state is in memory, the SVE state will be loaded on return
> to userspace from the FPSIMD state.
>
> If the FPSIMD state is loaded, then we need to set the vector-length
> before relying on return to userspace to flush the SVE registers. This
> is because the vector length is only set when loading from memory. We
> also need to rebind the task to the CPU so the newly allocated SVE state
> is used when the task is saved.
Reasonable overall, I think.
A few minor queries below.
> Signed-off-by: Julien Grall <julien.grall@arm•com>
> Signed-off-by: Mark Brown <broonie@kernel•org>
> ---
> arch/arm64/include/asm/fpsimd.h | 2 ++
> arch/arm64/kernel/entry-fpsimd.S | 5 +++++
> arch/arm64/kernel/fpsimd.c | 35 ++++++++++++++++++++++----------
> 3 files changed, 31 insertions(+), 11 deletions(-)
>
> diff --git a/arch/arm64/include/asm/fpsimd.h b/arch/arm64/include/asm/fpsimd.h
> index bec5f14b622a..e60aa4ebb351 100644
> --- a/arch/arm64/include/asm/fpsimd.h
> +++ b/arch/arm64/include/asm/fpsimd.h
> @@ -74,6 +74,8 @@ extern void sve_load_from_fpsimd_state(struct user_fpsimd_state const *state,
> unsigned long vq_minus_1);
> extern unsigned int sve_get_vl(void);
>
> +extern void sve_set_vq(unsigned long vq_minus_1);
> +
> struct arm64_cpu_capabilities;
> extern void sve_kernel_enable(const struct arm64_cpu_capabilities *__unused);
>
> diff --git a/arch/arm64/kernel/entry-fpsimd.S b/arch/arm64/kernel/entry-fpsimd.S
> index 5b1a9adfb00b..476c8837a7e5 100644
> --- a/arch/arm64/kernel/entry-fpsimd.S
> +++ b/arch/arm64/kernel/entry-fpsimd.S
> @@ -48,6 +48,11 @@ SYM_FUNC_START(sve_get_vl)
> ret
> SYM_FUNC_END(sve_get_vl)
>
Might be worth a comment here to remind us that x0 is the vq minus 1.
> +SYM_FUNC_START(sve_set_vq)
> + sve_load_vq x0, x1, x2
> + ret
> +SYM_FUNC_END(sve_set_vq)
> +
> /*
> * Load SVE state from FPSIMD state.
> *
> diff --git a/arch/arm64/kernel/fpsimd.c b/arch/arm64/kernel/fpsimd.c
> index ccbc38b71069..dfe2e19ce591 100644
> --- a/arch/arm64/kernel/fpsimd.c
> +++ b/arch/arm64/kernel/fpsimd.c
> @@ -944,10 +944,10 @@ void fpsimd_release_task(struct task_struct *dead_task)
> /*
> * Trapped SVE access
> *
> - * Storage is allocated for the full SVE state, the current FPSIMD
> - * register contents are migrated across, and TIF_SVE is set so that
> - * the SVE access trap will be disabled the next time this task
> - * reaches ret_to_user.
> + * Storage is allocated for the full SVE state so that the code
> + * running subsequently has somewhere to save the SVE registers to. We
> + * then rely on ret_to_user to actually convert the FPSIMD registers
> + * to SVE state by flushing as required.
> *
> * TIF_SVE should be clear on entry: otherwise, fpsimd_restore_current_state()
> * would have disabled the SVE access trap for userspace during
> @@ -965,14 +965,24 @@ void do_sve_acc(unsigned int esr, struct pt_regs *regs)
>
> get_cpu_fpsimd_context();
>
> - fpsimd_save();
> -
> - /* Force ret_to_user to reload the registers: */
> - fpsimd_flush_task_state(current);
> + set_thread_flag(TIF_SVE_NEEDS_FLUSH);
> + /*
> + * We should not be here with SVE enabled. TIF_SVE will be set
> + * before returning to userspace by fpsimd_restore_current_state().
> + */
> + WARN_ON(test_thread_flag(TIF_SVE));
>
> - fpsimd_to_sve(current);
> - if (test_and_set_thread_flag(TIF_SVE))
> - WARN_ON(1); /* SVE access shouldn't have trapped */
> + /*
> + * When the FPSIMD state is loaded:
> + * - The return path (see fpsimd_restore_current_state) requires
> + * the vector length t be loaded beforehand.
Nit: to
> + * - We need to rebind the task to the CPU so the newly allocated
> + * SVE state is used when the task is saved.
> + */
> + if (!test_thread_flag(TIF_FOREIGN_FPSTATE)) {
> + sve_set_vq(sve_vq_from_vl(current->thread.sve_vl) - 1);
> + fpsimd_bind_task_to_cpu();
Hmm, does this actually to the sve_user_enable(), duplicating the
sve_user_enable() in fpsimd_restore_current_state()?
> + }
>
> put_cpu_fpsimd_context();
> }
> @@ -1189,6 +1199,9 @@ void fpsimd_restore_current_state(void)
> /*
> * The userspace had SVE enabled on entry to the kernel
> * and requires the state to be flushed.
> + *
> + * We rely on the Vector-Length to be set correctly before-hand
Trivial nit: I think we normally just write "vector length".
Could be worth saying where it gets done (i.e., do_sve_acc()).
> + * when converting a loaded FPSIMD state to SVE state.
> */
> sve_flush_live();
> sve_user_enable();
Possibly redundant? See do_sve_acc().
Cheers
---Dave
_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists•infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next prev parent reply other threads:[~2020-07-15 16:54 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-06-29 13:35 [PATCH v3 0/8] arm64/sve: First steps towards optimizing syscalls Mark Brown
2020-06-29 13:35 ` [PATCH v3 1/8] arm64/fpsimd: Update documentation of do_sve_acc Mark Brown
2020-06-29 13:35 ` [PATCH v3 2/8] arm64/signal: Update the comment in preserve_sve_context Mark Brown
2020-06-29 13:35 ` [PATCH v3 3/8] arm64/fpsimdmacros: Allow the macro "for" to be used in more cases Mark Brown
2020-06-29 13:35 ` [PATCH v3 4/8] arm64/fpsimdmacros: Introduce a macro to update ZCR_EL1.LEN Mark Brown
2020-06-29 13:35 ` [PATCH v3 5/8] arm64/sve: Implement a helper to flush SVE registers Mark Brown
2020-07-15 16:52 ` Dave Martin
2020-06-29 13:35 ` [PATCH v3 6/8] arm64/sve: Implement a helper to load SVE registers from FPSIMD state Mark Brown
2020-07-15 16:52 ` Dave Martin
2020-06-29 13:35 ` [PATCH v3 7/8] arm64/sve: Don't disable SVE on syscalls return Mark Brown
2020-07-15 16:52 ` Dave Martin
2020-08-21 21:54 ` Mark Brown
2020-06-29 13:35 ` [PATCH v3 8/8] arm64/sve: Rework SVE trap access to use TIF_SVE_NEEDS_FLUSH Mark Brown
2020-07-15 16:52 ` Dave Martin [this message]
2020-07-15 16:49 ` [PATCH v3 0/8] arm64/sve: First steps towards optimizing syscalls Dave Martin
2020-07-15 17:11 ` Mark Brown
2020-07-20 10:44 ` Dave Martin
2020-07-21 2:43 ` zhang.lei
2020-07-21 22:34 ` Mark Brown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200715165254.GG30452@arm.com \
--to=dave.martin@arm$(echo .)com \
--cc=Daniel.Kiss@arm$(echo .)com \
--cc=broonie@kernel$(echo .)org \
--cc=catalin.marinas@arm$(echo .)com \
--cc=julien.grall@arm$(echo .)com \
--cc=julien@xen$(echo .)org \
--cc=linux-arm-kernel@lists$(echo .)infradead.org \
--cc=will@kernel$(echo .)org \
--cc=zhang.lei@jp$(echo .)fujitsu.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox