diff mbox

[RFC,2/2] ARM: add soc memory barrier extension

Message ID E1Z07tA-0001TV-As@rmk-PC.arm.linux.org.uk (mailing list archive)
State New, archived
Headers show

Commit Message

Russell King June 3, 2015, 12:35 p.m. UTC
Add an extension to the heavy barrier code to allow a SoC specific
memory barrier function to be provided.  This is needed for platforms
where the interconnect has weak ordering, and thus needs assistance
to ensure that memory writes are properly visible in the correct order
to other parts of the system.

Signed-off-by: Russell King <rmk+kernel@arm.linux.org.uk>
---
 arch/arm/include/asm/barrier.h | 1 +
 arch/arm/mm/flush.c            | 4 ++++
 2 files changed, 5 insertions(+)

Comments

Catalin Marinas June 3, 2015, 1:15 p.m. UTC | #1
On Wed, Jun 03, 2015 at 01:35:20PM +0100, Russell King wrote:
> Add an extension to the heavy barrier code to allow a SoC specific
> memory barrier function to be provided.  This is needed for platforms
> where the interconnect has weak ordering, and thus needs assistance
> to ensure that memory writes are properly visible in the correct order
> to other parts of the system.

Do you have an example of where this is needed? Were they previously
handled by hijacking outer_cache.sync?
Russell King - ARM Linux June 3, 2015, 1:43 p.m. UTC | #2
On Wed, Jun 03, 2015 at 02:15:25PM +0100, Catalin Marinas wrote:
> On Wed, Jun 03, 2015 at 01:35:20PM +0100, Russell King wrote:
> > Add an extension to the heavy barrier code to allow a SoC specific
> > memory barrier function to be provided.  This is needed for platforms
> > where the interconnect has weak ordering, and thus needs assistance
> > to ensure that memory writes are properly visible in the correct order
> > to other parts of the system.
> 
> Do you have an example of where this is needed? Were they previously
> handled by hijacking outer_cache.sync?

Look for omap_bus_sync() - you need to go back through the kernel history
as it has been half-heartedly removed (OMAP's mach/barriers.h still exists
but it isn't used by anything, and omap_bus_sync() is now just an assembly
stub.)

Basically, OMAP used to set CONFIG_ARCH_HAS_BARRIERS, which caused OMAPs
mach/barriers.h to be picked up.  This contained:

#define rmb()		dsb()
#define wmb()		do { dsb(); outer_sync(); omap_bus_sync(); } while (0)
#define mb()		wmb()

and omap_bus_sync() was implemented arch/arm/mach-omap2/omap4-common.c
as:

void __iomem *dram_sync, *sram_sync;

void omap_bus_sync(void)
{
	if (dram_sync && sram_sync) {
		writel_relaxed(readl_relaxed(dram_sync), dram_sync);
		writel_relaxed(readl_relaxed(sram_sync), sram_sync);
		isb();
	}
}
EXPORT_SYMBOL(omap_bus_sync);

with the initialisation:

	dram_io_desc[0].virtual = OMAP4_DRAM_BARRIER_VA;
	dram_io_desc[0].pfn = __phys_to_pfn(paddr);
	dram_io_desc[0].length = size;
	dram_io_desc[0].type = MT_MEMORY_RW_SO;
	iotable_init(dram_io_desc, ARRAY_SIZE(dram_io_desc));
	dram_sync = (void __iomem *) dram_io_desc[0].virtual;

and:

		sram_sync = (void *)gen_pool_alloc(sram_pool, PAGE_SIZE);


which has the effect of issuing a strongly ordered access out to memory,
which cause the OMAP interconnects to be flushed.

The message I'm hearing from TI is that the removal of this from mainline
is a mistake (it sounds to me like it's caused a regression), and it needs
to be restored, otherwise data accesses via various paths to DRAM can end
up being seen out of order.
Tony Lindgren June 3, 2015, 2:54 p.m. UTC | #3
* Russell King - ARM Linux <linux@arm.linux.org.uk> [150603 06:44]:
> On Wed, Jun 03, 2015 at 02:15:25PM +0100, Catalin Marinas wrote:
> > On Wed, Jun 03, 2015 at 01:35:20PM +0100, Russell King wrote:
> > > Add an extension to the heavy barrier code to allow a SoC specific
> > > memory barrier function to be provided.  This is needed for platforms
> > > where the interconnect has weak ordering, and thus needs assistance
> > > to ensure that memory writes are properly visible in the correct order
> > > to other parts of the system.
> > 
> > Do you have an example of where this is needed? Were they previously
> > handled by hijacking outer_cache.sync?
> 
> Look for omap_bus_sync() - you need to go back through the kernel history
> as it has been half-heartedly removed (OMAP's mach/barriers.h still exists
> but it isn't used by anything, and omap_bus_sync() is now just an assembly
> stub.)

With multiarch support, the code for errata i688 became unselectable.
That's mostly commit 137d105d50f6 ("ARM: OMAP4: Fix errata i688 with MPU
interconnect barriers.").
 
> Basically, OMAP used to set CONFIG_ARCH_HAS_BARRIERS, which caused OMAPs
> mach/barriers.h to be picked up.  This contained:
> 
> #define rmb()		dsb()
> #define wmb()		do { dsb(); outer_sync(); omap_bus_sync(); } while (0)
> #define mb()		wmb()
> 
> and omap_bus_sync() was implemented arch/arm/mach-omap2/omap4-common.c
> as:
> 
> void __iomem *dram_sync, *sram_sync;
> 
> void omap_bus_sync(void)
> {
> 	if (dram_sync && sram_sync) {
> 		writel_relaxed(readl_relaxed(dram_sync), dram_sync);
> 		writel_relaxed(readl_relaxed(sram_sync), sram_sync);
> 		isb();
> 	}
> }
> EXPORT_SYMBOL(omap_bus_sync);
> 
> with the initialisation:
> 
> 	dram_io_desc[0].virtual = OMAP4_DRAM_BARRIER_VA;
> 	dram_io_desc[0].pfn = __phys_to_pfn(paddr);
> 	dram_io_desc[0].length = size;
> 	dram_io_desc[0].type = MT_MEMORY_RW_SO;
> 	iotable_init(dram_io_desc, ARRAY_SIZE(dram_io_desc));
> 	dram_sync = (void __iomem *) dram_io_desc[0].virtual;
> 
> and:
> 
> 		sram_sync = (void *)gen_pool_alloc(sram_pool, PAGE_SIZE);
> 
> 
> which has the effect of issuing a strongly ordered access out to memory,
> which cause the OMAP interconnects to be flushed.
> 
> The message I'm hearing from TI is that the removal of this from mainline
> is a mistake (it sounds to me like it's caused a regression), and it needs
> to be restored, otherwise data accesses via various paths to DRAM can end
> up being seen out of order.

Yes we need a multiarch safe way to do the interconnect barriers, it's not
limited to just erratum i688 on omaps.

Regards,

Tony
diff mbox

Patch

diff --git a/arch/arm/include/asm/barrier.h b/arch/arm/include/asm/barrier.h
index 646539a903a2..1676007d9549 100644
--- a/arch/arm/include/asm/barrier.h
+++ b/arch/arm/include/asm/barrier.h
@@ -37,6 +37,7 @@ 
 #endif
 
 #ifdef CONFIG_ARM_HEAVY_MB
+extern void (*soc_mb)(void);
 extern void arm_heavy_mb(void);
 #define __arm_heavy_mb(x...) do { dsb(x); arm_heavy_mb(); } while (0)
 #else
diff --git a/arch/arm/mm/flush.c b/arch/arm/mm/flush.c
index ce6c2960d5ac..1ec8e7590fc6 100644
--- a/arch/arm/mm/flush.c
+++ b/arch/arm/mm/flush.c
@@ -22,12 +22,16 @@ 
 #include "mm.h"
 
 #ifdef CONFIG_ARM_HEAVY_MB
+void (*soc_mb)(void);
+
 void arm_heavy_mb(void)
 {
 #ifdef CONFIG_OUTER_CACHE_SYNC
 	if (outer_cache.sync)
 		outer_cache.sync();
 #endif
+	if (soc_mb)
+		soc_mb();
 }
 EXPORT_SYMBOL(arm_heavy_mb);
 #endif