linux.git/arch/arm64/kernel/io.c, branch v4.18

arm64: Avoid aligning normal memory pointers in __memcpy_{to,from}io

2017-10-24T15:23:07+00:00

__memcpy_{to,from}io fall back to byte-at-a-time copying if both the
source and destination pointers are not 8-byte aligned. Since one of the
pointers always points at normal memory, this is unnecessary and
detrimental to performance, so only do byte copying until we hit an 8-byte
boundary for the device pointer.

This change was motivated by performance issues in the pstore driver.
On a test platform, measuring probe time for pstore, console buffer
size of 1/4MB and pmsg of 1/2MB, was in the 90-107ms region. Change
managed to reduce it to 10-25ms, an improvement in boot time.

Cc: Kees Cook 
Cc: Anton Vorontsov 
Cc: Tony Luck 
Cc: Catalin Marinas 
Cc: Will Deacon 
Cc: Anton Vorontsov 
Cc: Robin Murphy 
Signed-off-by: Mark Salyzyn 
Signed-off-by: Will Deacon

arm64: optimize memcpy_{from,to}io() and memset_io()

2014-11-06T17:25:27+00:00

Optimize memcpy_{from,to}io() and memset_io() by transferring in 64 bit
as much as possible with minimized barrier usage.  This simplest
optimization brings faster throughput compare to current byte-by-byte read
and write with barrier in the loop.  Code's skeleton is taken from the
powerpc.

Link: http://lkml.kernel.org/p/20141020133304.GH23751@e104818-lin.cambridge.arm.com
Reviewed-by: Catalin Marinas 
Reviewed-by: Trilok Soni 
Signed-off-by: Joonwoo Park 
Signed-off-by: Will Deacon

arm64: Device specific operations

2012-09-17T12:42:04+00:00

This patch adds several definitions for device communication, including
I/O accessors and ioremap(). The __raw_* accessors are implemented as
inline asm to avoid compiler generation of post-indexed accesses (less
efficient to emulate in a virtualised environment).

Signed-off-by: Will Deacon 
Signed-off-by: Catalin Marinas 
Acked-by: Arnd Bergmann 
Acked-by: Tony Lindgren 
Acked-by: Nicolas Pitre 
Acked-by: Olof Johansson 
Acked-by: Santosh Shilimkar