blob: ab80933be65ff5309de45a94a12903018ac8dcca [file] [log] [blame]
Thomas Gleixnerec8f24b2019-05-19 13:07:45 +01001# SPDX-License-Identifier: GPL-2.0-only
Christoph Hellwig59e0b522018-07-31 13:39:35 +02002
3menu "Memory Management options"
4
Dave Hansene1785e82005-06-23 00:07:49 -07005config SELECT_MEMORY_MODEL
6 def_bool y
Kees Cooka8826ee2013-01-16 18:54:17 -08007 depends on ARCH_SELECT_MEMORY_MODEL
Dave Hansene1785e82005-06-23 00:07:49 -07008
Dave Hansen3a9da762005-06-23 00:07:42 -07009choice
10 prompt "Memory model"
Dave Hansene1785e82005-06-23 00:07:49 -070011 depends on SELECT_MEMORY_MODEL
12 default DISCONTIGMEM_MANUAL if ARCH_DISCONTIGMEM_DEFAULT
Andy Whitcroftd41dee32005-06-23 00:07:54 -070013 default SPARSEMEM_MANUAL if ARCH_SPARSEMEM_DEFAULT
Dave Hansene1785e82005-06-23 00:07:49 -070014 default FLATMEM_MANUAL
Mike Rapoportd66d1092019-05-13 17:23:05 -070015 help
16 This option allows you to change some of the ways that
17 Linux manages its memory internally. Most users will
18 only have one option here selected by the architecture
19 configuration. This is normal.
Dave Hansen3a9da762005-06-23 00:07:42 -070020
Dave Hansene1785e82005-06-23 00:07:49 -070021config FLATMEM_MANUAL
Dave Hansen3a9da762005-06-23 00:07:42 -070022 bool "Flat Memory"
Anton Blanchardc898ec12006-01-06 00:12:07 -080023 depends on !(ARCH_DISCONTIGMEM_ENABLE || ARCH_SPARSEMEM_ENABLE) || ARCH_FLATMEM_ENABLE
Dave Hansen3a9da762005-06-23 00:07:42 -070024 help
Mike Rapoportd66d1092019-05-13 17:23:05 -070025 This option is best suited for non-NUMA systems with
26 flat address space. The FLATMEM is the most efficient
27 system in terms of performance and resource consumption
28 and it is the best option for smaller systems.
Dave Hansen3a9da762005-06-23 00:07:42 -070029
Mike Rapoportd66d1092019-05-13 17:23:05 -070030 For systems that have holes in their physical address
31 spaces and for features like NUMA and memory hotplug,
Randy Dunlapdd33d292019-11-30 17:58:26 -080032 choose "Sparse Memory".
Andy Whitcroftd41dee32005-06-23 00:07:54 -070033
34 If unsure, choose this option (Flat Memory) over any other.
Dave Hansen3a9da762005-06-23 00:07:42 -070035
Dave Hansene1785e82005-06-23 00:07:49 -070036config DISCONTIGMEM_MANUAL
Dave Hansenf3519f92005-09-16 19:27:54 -070037 bool "Discontiguous Memory"
Dave Hansen3a9da762005-06-23 00:07:42 -070038 depends on ARCH_DISCONTIGMEM_ENABLE
39 help
Dave Hansen785dcd42005-06-23 00:07:50 -070040 This option provides enhanced support for discontiguous
41 memory systems, over FLATMEM. These systems have holes
42 in their physical address spaces, and this option provides
Mike Rapoportd66d1092019-05-13 17:23:05 -070043 more efficient handling of these holes.
Dave Hansen785dcd42005-06-23 00:07:50 -070044
Mike Rapoportd66d1092019-05-13 17:23:05 -070045 Although "Discontiguous Memory" is still used by several
46 architectures, it is considered deprecated in favor of
47 "Sparse Memory".
Dave Hansen785dcd42005-06-23 00:07:50 -070048
Mike Rapoportd66d1092019-05-13 17:23:05 -070049 If unsure, choose "Sparse Memory" over this option.
Dave Hansen3a9da762005-06-23 00:07:42 -070050
Andy Whitcroftd41dee32005-06-23 00:07:54 -070051config SPARSEMEM_MANUAL
52 bool "Sparse Memory"
53 depends on ARCH_SPARSEMEM_ENABLE
54 help
55 This will be the only option for some systems, including
Mike Rapoportd66d1092019-05-13 17:23:05 -070056 memory hot-plug systems. This is normal.
Andy Whitcroftd41dee32005-06-23 00:07:54 -070057
Mike Rapoportd66d1092019-05-13 17:23:05 -070058 This option provides efficient support for systems with
59 holes is their physical address space and allows memory
60 hot-plug and hot-remove.
Andy Whitcroftd41dee32005-06-23 00:07:54 -070061
Mike Rapoportd66d1092019-05-13 17:23:05 -070062 If unsure, choose "Flat Memory" over this option.
Andy Whitcroftd41dee32005-06-23 00:07:54 -070063
Dave Hansen3a9da762005-06-23 00:07:42 -070064endchoice
65
Dave Hansene1785e82005-06-23 00:07:49 -070066config DISCONTIGMEM
67 def_bool y
68 depends on (!SELECT_MEMORY_MODEL && ARCH_DISCONTIGMEM_ENABLE) || DISCONTIGMEM_MANUAL
69
Andy Whitcroftd41dee32005-06-23 00:07:54 -070070config SPARSEMEM
71 def_bool y
Russell King1a83e172009-10-26 16:50:12 -070072 depends on (!SELECT_MEMORY_MODEL && ARCH_SPARSEMEM_ENABLE) || SPARSEMEM_MANUAL
Andy Whitcroftd41dee32005-06-23 00:07:54 -070073
Dave Hansene1785e82005-06-23 00:07:49 -070074config FLATMEM
75 def_bool y
Andy Whitcroftd41dee32005-06-23 00:07:54 -070076 depends on (!DISCONTIGMEM && !SPARSEMEM) || FLATMEM_MANUAL
77
78config FLAT_NODE_MEM_MAP
79 def_bool y
80 depends on !SPARSEMEM
Dave Hansene1785e82005-06-23 00:07:49 -070081
Dave Hansen93b75042005-06-23 00:07:47 -070082#
83# Both the NUMA code and DISCONTIGMEM use arrays of pg_data_t's
84# to represent different areas of memory. This variable allows
85# those dependencies to exist individually.
86#
87config NEED_MULTIPLE_NODES
88 def_bool y
89 depends on DISCONTIGMEM || NUMA
Andy Whitcroftaf705362005-06-23 00:07:53 -070090
91config HAVE_MEMORY_PRESENT
92 def_bool y
Andy Whitcroftd41dee32005-06-23 00:07:54 -070093 depends on ARCH_HAVE_MEMORY_PRESENT || SPARSEMEM
Bob Picco802f1922005-09-03 15:54:26 -070094
95#
Bob Picco3e347262005-09-03 15:54:28 -070096# SPARSEMEM_EXTREME (which is the default) does some bootmem
Matt LaPlante84eb8d02006-10-03 22:53:09 +020097# allocations when memory_present() is called. If this cannot
Bob Picco3e347262005-09-03 15:54:28 -070098# be done on your architecture, select this option. However,
99# statically allocating the mem_section[] array can potentially
100# consume vast quantities of .bss, so be careful.
101#
102# This option will also potentially produce smaller runtime code
103# with gcc 3.4 and later.
104#
105config SPARSEMEM_STATIC
Jan Beulich9ba16082008-10-15 22:01:38 -0700106 bool
Bob Picco3e347262005-09-03 15:54:28 -0700107
108#
Matt LaPlante44c09202006-10-03 22:34:14 +0200109# Architecture platforms which require a two level mem_section in SPARSEMEM
Bob Picco802f1922005-09-03 15:54:26 -0700110# must select this option. This is usually for architecture platforms with
111# an extremely sparse physical address space.
112#
Bob Picco3e347262005-09-03 15:54:28 -0700113config SPARSEMEM_EXTREME
114 def_bool y
115 depends on SPARSEMEM && !SPARSEMEM_STATIC
Hugh Dickins4c21e2f2005-10-29 18:16:40 -0700116
Andy Whitcroft29c71112007-10-16 01:24:14 -0700117config SPARSEMEM_VMEMMAP_ENABLE
Jan Beulich9ba16082008-10-15 22:01:38 -0700118 bool
Andy Whitcroft29c71112007-10-16 01:24:14 -0700119
120config SPARSEMEM_VMEMMAP
Geoff Levanda5ee6da2007-12-17 16:19:53 -0800121 bool "Sparse Memory virtual memmap"
122 depends on SPARSEMEM && SPARSEMEM_VMEMMAP_ENABLE
123 default y
124 help
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800125 SPARSEMEM_VMEMMAP uses a virtually mapped memmap to optimise
126 pfn_to_page and page_to_pfn operations. This is the most
127 efficient option when sufficient kernel resources are available.
Andy Whitcroft29c71112007-10-16 01:24:14 -0700128
Tejun Heo7c0caeb2011-07-14 11:43:42 +0200129config HAVE_MEMBLOCK_NODE_MAP
Christoph Jaeger6341e622014-12-20 15:41:11 -0500130 bool
Tejun Heo7c0caeb2011-07-14 11:43:42 +0200131
Philipp Hachtmann70210ed2014-01-29 18:16:01 +0100132config HAVE_MEMBLOCK_PHYS_MAP
Christoph Jaeger6341e622014-12-20 15:41:11 -0500133 bool
Philipp Hachtmann70210ed2014-01-29 18:16:01 +0100134
Christoph Hellwig67a929e2019-07-11 20:57:14 -0700135config HAVE_FAST_GUP
Christoph Hellwig050a9ad2019-07-11 20:57:21 -0700136 depends on MMU
Christoph Jaeger6341e622014-12-20 15:41:11 -0500137 bool
Steve Capper2667f502014-10-09 15:29:14 -0700138
Mike Rapoport350e88b2019-05-13 17:22:59 -0700139config ARCH_KEEP_MEMBLOCK
Christoph Jaeger6341e622014-12-20 15:41:11 -0500140 bool
Tejun Heoc378ddd2011-07-14 11:46:03 +0200141
Minchan Kimee6f5092012-07-31 16:43:50 -0700142config MEMORY_ISOLATION
Christoph Jaeger6341e622014-12-20 15:41:11 -0500143 bool
Minchan Kimee6f5092012-07-31 16:43:50 -0700144
Yasuaki Ishimatsu46723bf2013-02-22 16:33:00 -0800145#
146# Only be set on architectures that have completely implemented memory hotplug
147# feature. If you are not sure, don't touch it.
148#
149config HAVE_BOOTMEM_INFO_NODE
150 def_bool n
151
Dave Hansen3947be12005-10-29 18:16:54 -0700152# eventually, we can have this option just 'select SPARSEMEM'
153config MEMORY_HOTPLUG
154 bool "Allow for memory hot-add"
Keith Manntheyec69acb2006-09-30 23:27:05 -0700155 depends on SPARSEMEM || X86_64_ACPI_NUMA
Stephen Rothwell40b31362013-05-21 13:49:35 +1000156 depends on ARCH_ENABLE_MEMORY_HOTPLUG
Dave Hansen3947be12005-10-29 18:16:54 -0700157
Keith Manntheyec69acb2006-09-30 23:27:05 -0700158config MEMORY_HOTPLUG_SPARSE
159 def_bool y
160 depends on SPARSEMEM && MEMORY_HOTPLUG
161
Vitaly Kuznetsov8604d9e2016-05-19 17:13:03 -0700162config MEMORY_HOTPLUG_DEFAULT_ONLINE
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800163 bool "Online the newly added memory blocks by default"
164 depends on MEMORY_HOTPLUG
165 help
Vitaly Kuznetsov8604d9e2016-05-19 17:13:03 -0700166 This option sets the default policy setting for memory hotplug
167 onlining policy (/sys/devices/system/memory/auto_online_blocks) which
168 determines what happens to newly added memory regions. Policy setting
169 can always be changed at runtime.
Mauro Carvalho Chehabcb1aaeb2019-06-07 15:54:32 -0300170 See Documentation/admin-guide/mm/memory-hotplug.rst for more information.
Vitaly Kuznetsov8604d9e2016-05-19 17:13:03 -0700171
172 Say Y here if you want all hot-plugged memory blocks to appear in
173 'online' state by default.
174 Say N here if you want the default policy to keep all hot-plugged
175 memory blocks in 'offline' state.
176
KAMEZAWA Hiroyuki0c0e6192007-10-16 01:26:12 -0700177config MEMORY_HOTREMOVE
178 bool "Allow for memory hot remove"
Yasuaki Ishimatsu46723bf2013-02-22 16:33:00 -0800179 select MEMORY_ISOLATION
Nathan Fontenotf7e33342013-09-27 10:18:09 -0500180 select HAVE_BOOTMEM_INFO_NODE if (X86_64 || PPC64)
KAMEZAWA Hiroyuki0c0e6192007-10-16 01:26:12 -0700181 depends on MEMORY_HOTPLUG && ARCH_ENABLE_MEMORY_HOTREMOVE
182 depends on MIGRATION
183
Hugh Dickins4c21e2f2005-10-29 18:16:40 -0700184# Heavily threaded applications may benefit from splitting the mm-wide
185# page_table_lock, so that faults on different parts of the user address
186# space can be handled with less contention: split it at this NR_CPUS.
187# Default to 4 for wider testing, though 8 might be more appropriate.
188# ARM's adjust_pte (unused if VIPT) depends on mm-wide page_table_lock.
Hugh Dickins7b6ac9d2005-11-23 13:37:37 -0800189# PA-RISC 7xxx's spinlock_t would enlarge struct page from 32 to 44 bytes.
Hugh Dickinsa70caa82009-12-14 17:59:02 -0800190# DEBUG_SPINLOCK and DEBUG_LOCK_ALLOC spinlock_t also enlarge struct page.
Hugh Dickins4c21e2f2005-10-29 18:16:40 -0700191#
192config SPLIT_PTLOCK_CPUS
193 int
Kirill A. Shutemov91645502014-04-07 15:37:14 -0700194 default "999999" if !MMU
Hugh Dickinsa70caa82009-12-14 17:59:02 -0800195 default "999999" if ARM && !CPU_CACHE_VIPT
196 default "999999" if PARISC && !PA20
Hugh Dickins4c21e2f2005-10-29 18:16:40 -0700197 default "4"
Christoph Lameter7cbe34c2006-01-08 01:00:49 -0800198
Kirill A. Shutemove009bb32013-11-14 14:31:07 -0800199config ARCH_ENABLE_SPLIT_PMD_PTLOCK
Christoph Jaeger6341e622014-12-20 15:41:11 -0500200 bool
Kirill A. Shutemove009bb32013-11-14 14:31:07 -0800201
Christoph Lameter7cbe34c2006-01-08 01:00:49 -0800202#
Konstantin Khlebnikov09316c02014-10-09 15:29:32 -0700203# support for memory balloon
204config MEMORY_BALLOON
Christoph Jaeger6341e622014-12-20 15:41:11 -0500205 bool
Konstantin Khlebnikov09316c02014-10-09 15:29:32 -0700206
207#
Rafael Aquini18468d92012-12-11 16:02:38 -0800208# support for memory balloon compaction
209config BALLOON_COMPACTION
210 bool "Allow for balloon memory compaction/migration"
211 def_bool y
Konstantin Khlebnikov09316c02014-10-09 15:29:32 -0700212 depends on COMPACTION && MEMORY_BALLOON
Rafael Aquini18468d92012-12-11 16:02:38 -0800213 help
214 Memory fragmentation introduced by ballooning might reduce
215 significantly the number of 2MB contiguous memory blocks that can be
216 used within a guest, thus imposing performance penalties associated
217 with the reduced number of transparent huge pages that could be used
218 by the guest workload. Allowing the compaction & migration for memory
219 pages enlisted as being part of memory balloon devices avoids the
220 scenario aforementioned and helps improving memory defragmentation.
221
222#
Mel Gormane9e96b32010-05-24 14:32:21 -0700223# support for memory compaction
224config COMPACTION
225 bool "Allow for memory compaction"
Rik van Riel05106e6a2012-10-08 16:33:03 -0700226 def_bool y
Mel Gormane9e96b32010-05-24 14:32:21 -0700227 select MIGRATION
Andrea Arcangeli33a93872011-01-25 15:07:25 -0800228 depends on MMU
Mel Gormane9e96b32010-05-24 14:32:21 -0700229 help
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800230 Compaction is the only memory management component to form
231 high order (larger physically contiguous) memory blocks
232 reliably. The page allocator relies on compaction heavily and
233 the lack of the feature can lead to unexpected OOM killer
234 invocations for high order memory requests. You shouldn't
235 disable this option unless there really is a strong reason for
236 it and then we would be really interested to hear about that at
237 linux-mm@kvack.org.
Mel Gormane9e96b32010-05-24 14:32:21 -0700238
239#
Christoph Lameter7cbe34c2006-01-08 01:00:49 -0800240# support for page migration
241#
242config MIGRATION
Christoph Lameterb20a3502006-03-22 00:09:12 -0800243 bool "Page migration"
Christoph Lameter6c5240a2006-06-23 02:03:37 -0700244 def_bool y
Chen Gangde32a812013-09-12 15:14:08 -0700245 depends on (NUMA || ARCH_ENABLE_MEMORY_HOTREMOVE || COMPACTION || CMA) && MMU
Christoph Lameterb20a3502006-03-22 00:09:12 -0800246 help
247 Allows the migration of the physical location of pages of processes
Mel Gormane9e96b32010-05-24 14:32:21 -0700248 while the virtual addresses are not changed. This is useful in
249 two situations. The first is on NUMA systems to put pages nearer
250 to the processors accessing. The second is when allocating huge
251 pages as migration can relocate pages to satisfy a huge page
252 allocation instead of reclaiming.
Greg Kroah-Hartman6550e072006-06-12 17:11:31 -0700253
Naoya Horiguchic177c812014-06-04 16:05:35 -0700254config ARCH_ENABLE_HUGEPAGE_MIGRATION
Christoph Jaeger6341e622014-12-20 15:41:11 -0500255 bool
Naoya Horiguchic177c812014-06-04 16:05:35 -0700256
Naoya Horiguchi9c670ea2017-09-08 16:10:53 -0700257config ARCH_ENABLE_THP_MIGRATION
258 bool
259
Alexandre Ghiti8df995f2019-05-13 17:19:00 -0700260config CONTIG_ALLOC
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800261 def_bool (MEMORY_ISOLATION && COMPACTION) || CMA
Alexandre Ghiti8df995f2019-05-13 17:19:00 -0700262
Jeremy Fitzhardinge600715d2008-09-11 01:31:45 -0700263config PHYS_ADDR_T_64BIT
Christoph Hellwigd4a451d2018-04-03 16:24:20 +0200264 def_bool 64BIT
Jeremy Fitzhardinge600715d2008-09-11 01:31:45 -0700265
Christoph Lameter2a7326b2007-07-17 04:03:37 -0700266config BOUNCE
Vinayak Menon9ca24e22013-04-29 15:08:55 -0700267 bool "Enable bounce buffers"
268 default y
Christoph Lameter2a7326b2007-07-17 04:03:37 -0700269 depends on BLOCK && MMU && (ZONE_DMA || HIGHMEM)
Vinayak Menon9ca24e22013-04-29 15:08:55 -0700270 help
271 Enable bounce buffers for devices that cannot access
272 the full range of memory available to the CPU. Enabled
273 by default when ZONE_DMA or HIGHMEM is selected, but you
274 may say n to override this.
Christoph Lameter2a7326b2007-07-17 04:03:37 -0700275
Stephen Rothwellf057eac2007-07-15 23:40:05 -0700276config VIRT_TO_BUS
Stephen Rothwell4febd952013-03-07 15:48:16 +1100277 bool
278 help
279 An architecture should select this if it implements the
280 deprecated interface virt_to_bus(). All new architectures
281 should probably not select this.
282
Andrea Arcangelicddb8a52008-07-28 15:46:29 -0700283
284config MMU_NOTIFIER
285 bool
Pranith Kumar83fe27e2014-12-05 11:24:45 -0500286 select SRCU
Jason Gunthorpe99cb2522019-11-12 16:22:19 -0400287 select INTERVAL_TREE
David Howellsfc4d5c22009-05-06 16:03:05 -0700288
Hugh Dickinsf8af4da2009-09-21 17:01:57 -0700289config KSM
290 bool "Enable KSM for page merging"
291 depends on MMU
Timofey Titovets59e1a2f42018-12-28 00:34:05 -0800292 select XXHASH
Hugh Dickinsf8af4da2009-09-21 17:01:57 -0700293 help
294 Enable Kernel Samepage Merging: KSM periodically scans those areas
295 of an application's address space that an app has advised may be
296 mergeable. When it finds pages of identical content, it replaces
Hugh Dickinsd0f209f2009-12-14 17:59:34 -0800297 the many instances by a single page with that content, so
Hugh Dickinsf8af4da2009-09-21 17:01:57 -0700298 saving memory until one or another app needs to modify the content.
299 Recommended for use with KVM, or with other duplicative applications.
Mike Rapoportad56b732018-03-21 21:22:47 +0200300 See Documentation/vm/ksm.rst for more information: KSM is inactive
Hugh Dickinsc73602a2009-10-07 16:32:22 -0700301 until a program has madvised that an area is MADV_MERGEABLE, and
302 root has set /sys/kernel/mm/ksm/run to 1 (if CONFIG_SYSFS is set).
Hugh Dickinsf8af4da2009-09-21 17:01:57 -0700303
Christoph Lametere0a94c22009-06-03 16:04:31 -0400304config DEFAULT_MMAP_MIN_ADDR
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800305 int "Low address space to protect from user allocation"
David Howells6e141542009-12-15 19:27:45 +0000306 depends on MMU
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800307 default 4096
308 help
Christoph Lametere0a94c22009-06-03 16:04:31 -0400309 This is the portion of low virtual memory which should be protected
310 from userspace allocation. Keeping a user from writing to low pages
311 can help reduce the impact of kernel NULL pointer bugs.
312
313 For most ia64, ppc64 and x86 users with lots of address space
314 a value of 65536 is reasonable and should cause no problems.
315 On arm and other archs it should not be higher than 32768.
Eric Paris788084a2009-07-31 12:54:11 -0400316 Programs which use vm86 functionality or have some need to map
317 this low address space will need CAP_SYS_RAWIO or disable this
318 protection by setting the value to 0.
Christoph Lametere0a94c22009-06-03 16:04:31 -0400319
320 This value can be changed after boot using the
321 /proc/sys/vm/mmap_min_addr tunable.
322
Linus Torvaldsd949f362009-09-26 09:35:07 -0700323config ARCH_SUPPORTS_MEMORY_FAILURE
324 bool
Christoph Lametere0a94c22009-06-03 16:04:31 -0400325
Andi Kleen6a460792009-09-16 11:50:15 +0200326config MEMORY_FAILURE
327 depends on MMU
Linus Torvaldsd949f362009-09-26 09:35:07 -0700328 depends on ARCH_SUPPORTS_MEMORY_FAILURE
Andi Kleen6a460792009-09-16 11:50:15 +0200329 bool "Enable recovery from hardware memory errors"
Minchan Kimee6f5092012-07-31 16:43:50 -0700330 select MEMORY_ISOLATION
Xie XiuQi97f0b132015-06-24 16:57:36 -0700331 select RAS
Andi Kleen6a460792009-09-16 11:50:15 +0200332 help
333 Enables code to recover from some memory failures on systems
334 with MCA recovery. This allows a system to continue running
335 even when some of its memory has uncorrected errors. This requires
336 special hardware support and typically ECC memory.
337
Andi Kleencae681f2009-09-16 11:50:17 +0200338config HWPOISON_INJECT
Andi Kleen413f9ef2009-12-16 12:20:00 +0100339 tristate "HWPoison pages injector"
Andi Kleen27df5062009-12-21 19:56:42 +0100340 depends on MEMORY_FAILURE && DEBUG_KERNEL && PROC_FS
Wu Fengguang478c5ff2009-12-16 12:19:59 +0100341 select PROC_PAGE_MONITOR
Andi Kleencae681f2009-09-16 11:50:17 +0200342
David Howellsfc4d5c22009-05-06 16:03:05 -0700343config NOMMU_INITIAL_TRIM_EXCESS
344 int "Turn on mmap() excess space trimming before booting"
345 depends on !MMU
346 default 1
347 help
348 The NOMMU mmap() frequently needs to allocate large contiguous chunks
349 of memory on which to store mappings, but it can only ask the system
350 allocator for chunks in 2^N*PAGE_SIZE amounts - which is frequently
351 more than it requires. To deal with this, mmap() is able to trim off
352 the excess and return it to the allocator.
353
354 If trimming is enabled, the excess is trimmed off and returned to the
355 system allocator, which can cause extra fragmentation, particularly
356 if there are a lot of transient processes.
357
358 If trimming is disabled, the excess is kept, but not used, which for
359 long-term mappings means that the space is wasted.
360
361 Trimming can be dynamically controlled through a sysctl option
362 (/proc/sys/vm/nr_trim_pages) which specifies the minimum number of
363 excess pages there must be before trimming should occur, or zero if
364 no trimming is to occur.
365
366 This option specifies the initial value of this option. The default
367 of 1 says that all excess pages should be trimmed.
368
369 See Documentation/nommu-mmap.txt for more information.
Tejun Heobbddff02010-09-03 18:22:48 +0200370
Andrea Arcangeli4c76d9d2011-01-13 15:46:39 -0800371config TRANSPARENT_HUGEPAGE
Andrea Arcangeli13ece882011-01-13 15:47:07 -0800372 bool "Transparent Hugepage Support"
Gerald Schaefer15626062012-10-08 16:30:04 -0700373 depends on HAVE_ARCH_TRANSPARENT_HUGEPAGE
Andrea Arcangeli5d689242011-01-13 15:47:07 -0800374 select COMPACTION
Matthew Wilcox3a08cd52018-09-22 16:14:30 -0400375 select XARRAY_MULTI
Andrea Arcangeli4c76d9d2011-01-13 15:46:39 -0800376 help
377 Transparent Hugepages allows the kernel to use huge pages and
378 huge tlb transparently to the applications whenever possible.
379 This feature can improve computing performance to certain
380 applications by speeding up page faults during memory
381 allocation, by reducing the number of tlb misses and by speeding
382 up the pagetable walking.
383
384 If memory constrained on embedded, you may want to say N.
385
Andrea Arcangeli13ece882011-01-13 15:47:07 -0800386choice
387 prompt "Transparent Hugepage Support sysfs defaults"
388 depends on TRANSPARENT_HUGEPAGE
389 default TRANSPARENT_HUGEPAGE_ALWAYS
390 help
391 Selects the sysfs defaults for Transparent Hugepage Support.
392
393 config TRANSPARENT_HUGEPAGE_ALWAYS
394 bool "always"
395 help
396 Enabling Transparent Hugepage always, can increase the
397 memory footprint of applications without a guaranteed
398 benefit but it will work automatically for all applications.
399
400 config TRANSPARENT_HUGEPAGE_MADVISE
401 bool "madvise"
402 help
403 Enabling Transparent Hugepage madvise, will only provide a
404 performance improvement benefit to the applications using
405 madvise(MADV_HUGEPAGE) but it won't risk to increase the
406 memory footprint of applications without a guaranteed
407 benefit.
408endchoice
409
Huang Ying38d8b4e2017-07-06 15:37:18 -0700410config ARCH_WANTS_THP_SWAP
Krzysztof Kozlowski19fa40a2019-11-30 17:58:23 -0800411 def_bool n
Huang Ying38d8b4e2017-07-06 15:37:18 -0700412
413config THP_SWAP
414 def_bool y
Huang Ying14fef282018-08-17 15:49:41 -0700415 depends on TRANSPARENT_HUGEPAGE && ARCH_WANTS_THP_SWAP && SWAP
Huang Ying38d8b4e2017-07-06 15:37:18 -0700416 help
417 Swap transparent huge pages in one piece, without splitting.
Huang Ying14fef282018-08-17 15:49:41 -0700418 XXX: For now, swap cluster backing transparent huge page
419 will be split after swapout.
Huang Ying38d8b4e2017-07-06 15:37:18 -0700420
421 For selection by architectures with reasonable THP sizes.
422
Kirill A. Shutemove496cf32016-07-26 15:26:35 -0700423config TRANSPARENT_HUGE_PAGECACHE
424 def_bool y
Aneesh Kumar K.V953c66c2016-12-12 16:44:32 -0800425 depends on TRANSPARENT_HUGEPAGE
Kirill A. Shutemove496cf32016-07-26 15:26:35 -0700426
427#
Tejun Heobbddff02010-09-03 18:22:48 +0200428# UP and nommu archs use km based percpu allocator
429#
430config NEED_PER_CPU_KM
431 depends on !SMP
432 bool
433 default y
Dan Magenheimer077b1f82011-05-26 10:01:36 -0600434
435config CLEANCACHE
436 bool "Enable cleancache driver to cache clean pages if tmem is present"
Dan Magenheimer077b1f82011-05-26 10:01:36 -0600437 help
438 Cleancache can be thought of as a page-granularity victim cache
439 for clean pages that the kernel's pageframe replacement algorithm
440 (PFRA) would like to keep around, but can't since there isn't enough
441 memory. So when the PFRA "evicts" a page, it first attempts to use
Michael Witten140a1ef2011-06-10 03:57:26 +0000442 cleancache code to put the data contained in that page into
Dan Magenheimer077b1f82011-05-26 10:01:36 -0600443 "transcendent memory", memory that is not directly accessible or
444 addressable by the kernel and is of unknown and possibly
445 time-varying size. And when a cleancache-enabled
446 filesystem wishes to access a page in a file on disk, it first
447 checks cleancache to see if it already contains it; if it does,
448 the page is copied into the kernel and a disk access is avoided.
449 When a transcendent memory driver is available (such as zcache or
450 Xen transcendent memory), a significant I/O reduction
451 may be achieved. When none is available, all cleancache calls
452 are reduced to a single pointer-compare-against-NULL resulting
453 in a negligible performance hit.
454
455 If unsure, say Y to enable cleancache
Dan Magenheimer27c6aec22012-04-09 17:10:34 -0600456
457config FRONTSWAP
458 bool "Enable frontswap to cache swap pages if tmem is present"
459 depends on SWAP
Dan Magenheimer27c6aec22012-04-09 17:10:34 -0600460 help
461 Frontswap is so named because it can be thought of as the opposite
462 of a "backing" store for a swap device. The data is stored into
463 "transcendent memory", memory that is not directly accessible or
464 addressable by the kernel and is of unknown and possibly
465 time-varying size. When space in transcendent memory is available,
466 a significant swap I/O reduction may be achieved. When none is
467 available, all frontswap calls are reduced to a single pointer-
468 compare-against-NULL resulting in a negligible performance hit
469 and swap data is stored as normal on the matching swap device.
470
471 If unsure, say Y to enable frontswap.
Aneesh Kumar K.Vf825c732013-07-02 11:15:15 +0530472
473config CMA
474 bool "Contiguous Memory Allocator"
Mike Rapoportaca52c32018-10-30 15:07:44 -0700475 depends on MMU
Aneesh Kumar K.Vf825c732013-07-02 11:15:15 +0530476 select MIGRATION
477 select MEMORY_ISOLATION
478 help
479 This enables the Contiguous Memory Allocator which allows other
480 subsystems to allocate big physically-contiguous blocks of memory.
481 CMA reserves a region of memory and allows only movable pages to
482 be allocated from it. This way, the kernel can use the memory for
483 pagecache and when a subsystem requests for contiguous area, the
484 allocated pages are migrated away to serve the contiguous request.
485
486 If unsure, say "n".
487
488config CMA_DEBUG
489 bool "CMA debug messages (DEVELOPMENT)"
490 depends on DEBUG_KERNEL && CMA
491 help
492 Turns on debug messages in CMA. This produces KERN_DEBUG
493 messages for every CMA call as well as various messages while
494 processing calls such as dma_alloc_from_contiguous().
495 This option does not affect warning and error messages.
Alexander Grafbf550fc2013-08-29 00:41:59 +0200496
Sasha Levin28b24c12015-04-14 15:44:57 -0700497config CMA_DEBUGFS
498 bool "CMA debugfs interface"
499 depends on CMA && DEBUG_FS
500 help
501 Turns on the DebugFS interface for CMA.
502
Joonsoo Kima2541292014-08-06 16:05:25 -0700503config CMA_AREAS
504 int "Maximum count of the CMA areas"
505 depends on CMA
506 default 7
507 help
508 CMA allows to create CMA areas for particular purpose, mainly,
509 used as device private area. This parameter sets the maximum
510 number of CMA area in the system.
511
512 If unsure, leave the default value "7".
513
Dan Streetmanaf8d4172014-08-06 16:08:36 -0700514config MEM_SOFT_DIRTY
515 bool "Track memory changes"
516 depends on CHECKPOINT_RESTORE && HAVE_ARCH_SOFT_DIRTY && PROC_FS
517 select PROC_PAGE_MONITOR
Seth Jennings4e2e2772013-07-10 16:04:55 -0700518 help
Dan Streetmanaf8d4172014-08-06 16:08:36 -0700519 This option enables memory changes tracking by introducing a
520 soft-dirty bit on pte-s. This bit it set when someone writes
521 into a page just as regular dirty bit, but unlike the latter
522 it can be cleared by hands.
523
Mike Rapoport1ad13352018-04-18 11:07:49 +0300524 See Documentation/admin-guide/mm/soft-dirty.rst for more details.
Seth Jennings4e2e2772013-07-10 16:04:55 -0700525
Seth Jennings2b281112013-07-10 16:05:03 -0700526config ZSWAP
527 bool "Compressed cache for swap pages (EXPERIMENTAL)"
528 depends on FRONTSWAP && CRYPTO=y
529 select CRYPTO_LZO
Dan Streetman12d79d62014-08-06 16:08:40 -0700530 select ZPOOL
Seth Jennings2b281112013-07-10 16:05:03 -0700531 help
532 A lightweight compressed cache for swap pages. It takes
533 pages that are in the process of being swapped out and attempts to
534 compress them into a dynamically allocated RAM-based memory pool.
535 This can result in a significant I/O reduction on swap device and,
536 in the case where decompressing from RAM is faster that swap device
537 reads, can also improve workload performance.
538
539 This is marked experimental because it is a new feature (as of
540 v3.11) that interacts heavily with memory reclaim. While these
541 interactions don't cause any known issues on simple memory setups,
542 they have not be fully explored on the large set of potential
543 configurations and workloads that exist.
544
Dan Streetmanaf8d4172014-08-06 16:08:36 -0700545config ZPOOL
546 tristate "Common API for compressed memory storage"
Pavel Emelyanov0f8975e2013-07-03 15:01:20 -0700547 help
Dan Streetmanaf8d4172014-08-06 16:08:36 -0700548 Compressed memory storage API. This allows using either zbud or
549 zsmalloc.
Pavel Emelyanov0f8975e2013-07-03 15:01:20 -0700550
Dan Streetmanaf8d4172014-08-06 16:08:36 -0700551config ZBUD
Vitaly Wool9a001fc2016-05-20 16:58:30 -0700552 tristate "Low (Up to 2x) density storage for compressed pages"
Dan Streetmanaf8d4172014-08-06 16:08:36 -0700553 help
554 A special purpose allocator for storing compressed pages.
555 It is designed to store up to two compressed pages per physical
556 page. While this design limits storage density, it has simple and
557 deterministic reclaim properties that make it preferable to a higher
558 density approach when reclaim will be used.
Minchan Kimbcf16472014-01-30 15:45:50 -0800559
Vitaly Wool9a001fc2016-05-20 16:58:30 -0700560config Z3FOLD
561 tristate "Up to 3x density storage for compressed pages"
562 depends on ZPOOL
Vitaly Wool9a001fc2016-05-20 16:58:30 -0700563 help
564 A special purpose allocator for storing compressed pages.
565 It is designed to store up to three compressed pages per physical
566 page. It is a ZBUD derivative so the simplicity and determinism are
567 still there.
568
Minchan Kimbcf16472014-01-30 15:45:50 -0800569config ZSMALLOC
Minchan Kimd867f202014-06-04 16:11:10 -0700570 tristate "Memory allocator for compressed pages"
Minchan Kimbcf16472014-01-30 15:45:50 -0800571 depends on MMU
Minchan Kimbcf16472014-01-30 15:45:50 -0800572 help
573 zsmalloc is a slab-based memory allocator designed to store
574 compressed RAM pages. zsmalloc uses virtual memory mapping
575 in order to reduce fragmentation. However, this results in a
576 non-standard allocator interface where a handle, not a pointer, is
577 returned by an alloc(). This handle must be mapped in order to
578 access the allocated space.
579
580config PGTABLE_MAPPING
581 bool "Use page table mapping to access object in zsmalloc"
582 depends on ZSMALLOC
583 help
584 By default, zsmalloc uses a copy-based object mapping method to
585 access allocations that span two pages. However, if a particular
586 architecture (ex, ARM) performs VM mapping faster than copying,
587 then you should select this. This causes zsmalloc to use page table
588 mapping rather than copying for object mapping.
589
Ben Hutchings2216ee82014-03-10 15:49:46 -0700590 You can check speed with zsmalloc benchmark:
591 https://github.com/spartacus06/zsmapbench
Mark Salter9e5c33d2014-04-07 15:39:48 -0700592
Ganesh Mahendran0f050d92015-02-12 15:00:54 -0800593config ZSMALLOC_STAT
594 bool "Export zsmalloc statistics"
595 depends on ZSMALLOC
596 select DEBUG_FS
597 help
598 This option enables code in the zsmalloc to collect various
599 statistics about whats happening in zsmalloc and exports that
600 information to userspace via debugfs.
601 If unsure, say N.
602
Mark Salter9e5c33d2014-04-07 15:39:48 -0700603config GENERIC_EARLY_IOREMAP
604 bool
Helge Deller042d27a2014-04-30 23:26:02 +0200605
606config MAX_STACK_SIZE_MB
607 int "Maximum user stack size for 32-bit processes (MB)"
608 default 80
Helge Deller042d27a2014-04-30 23:26:02 +0200609 range 8 2048
610 depends on STACK_GROWSUP && (!64BIT || COMPAT)
611 help
612 This is the maximum stack size in Megabytes in the VM layout of 32-bit
613 user processes when the stack grows upwards (currently only on parisc
James Hogan5f171572017-10-24 16:52:32 +0100614 arch). The stack will be located at the highest memory address minus
615 the given value, unless the RLIMIT_STACK hard limit is changed to a
616 smaller value in which case that is used.
Helge Deller042d27a2014-04-30 23:26:02 +0200617
618 A sane initial value is 80 MB.
Mel Gorman3a80a7f2015-06-30 14:57:02 -0700619
Mel Gorman3a80a7f2015-06-30 14:57:02 -0700620config DEFERRED_STRUCT_PAGE_INIT
Vlastimil Babka1ce22102016-02-05 15:36:21 -0800621 bool "Defer initialisation of struct pages to kthreads"
Mike Rapoportd39f8fb2018-08-17 15:47:07 -0700622 depends on SPARSEMEM
Pavel Tatashinab1e8d82018-05-18 16:09:13 -0700623 depends on !NEED_PER_CPU_KM
Pasha Tatashin889c6952018-09-20 12:22:30 -0700624 depends on 64BIT
Mel Gorman3a80a7f2015-06-30 14:57:02 -0700625 help
626 Ordinarily all struct pages are initialised during early boot in a
627 single thread. On very large machines this can take a considerable
628 amount of time. If this option is set, large machines will bring up
629 a subset of memmap at boot and then initialise the rest in parallel
Vlastimil Babka1ce22102016-02-05 15:36:21 -0800630 by starting one-off "pgdatinitX" kernel thread for each node X. This
631 has a potential performance impact on processes running early in the
632 lifetime of the system until these kthreads finish the
633 initialisation.
Dan Williams033fbae2015-08-09 15:29:06 -0400634
Vladimir Davydov33c3fc72015-09-09 15:35:45 -0700635config IDLE_PAGE_TRACKING
636 bool "Enable idle page tracking"
637 depends on SYSFS && MMU
638 select PAGE_EXTENSION if !64BIT
639 help
640 This feature allows to estimate the amount of user pages that have
641 not been touched during a given period of time. This information can
642 be useful to tune memory cgroup limits and/or for job placement
643 within a compute cluster.
644
Mike Rapoport1ad13352018-04-18 11:07:49 +0300645 See Documentation/admin-guide/mm/idle_page_tracking.rst for
646 more details.
Vladimir Davydov33c3fc72015-09-09 15:35:45 -0700647
Robin Murphy17596732019-07-16 16:30:47 -0700648config ARCH_HAS_PTE_DEVMAP
Oliver O'Halloran65f7d042017-06-28 11:32:31 +1000649 bool
650
Dan Williams033fbae2015-08-09 15:29:06 -0400651config ZONE_DEVICE
Jérôme Glisse5042db42017-09-08 16:11:43 -0700652 bool "Device memory (pmem, HMM, etc...) hotplug support"
Dan Williams033fbae2015-08-09 15:29:06 -0400653 depends on MEMORY_HOTPLUG
654 depends on MEMORY_HOTREMOVE
Dan Williams99490f12016-03-17 14:19:58 -0700655 depends on SPARSEMEM_VMEMMAP
Robin Murphy17596732019-07-16 16:30:47 -0700656 depends on ARCH_HAS_PTE_DEVMAP
Matthew Wilcox3a08cd52018-09-22 16:14:30 -0400657 select XARRAY_MULTI
Dan Williams033fbae2015-08-09 15:29:06 -0400658
659 help
660 Device memory hotplug support allows for establishing pmem,
661 or other device driver discovered memory regions, in the
662 memmap. This allows pfn_to_page() lookups of otherwise
663 "device-physical" addresses which is needed for using a DAX
664 mapping in an O_DIRECT operation, among other things.
665
666 If FS_DAX is enabled, then say Y.
Linus Torvalds06a660a2015-09-11 16:42:39 -0700667
Dan Williamse76384882018-05-16 11:46:08 -0700668config DEV_PAGEMAP_OPS
669 bool
670
Christoph Hellwig9c240a72019-08-06 19:05:52 +0300671#
672# Helpers to mirror range of the CPU page tables of a process into device page
673# tables.
674#
Jérôme Glissec0b12402017-09-08 16:11:27 -0700675config HMM_MIRROR
Christoph Hellwig9c240a72019-08-06 19:05:52 +0300676 bool
Christoph Hellwigf442c282019-08-06 19:05:51 +0300677 depends on MMU
Jérôme Glissec0b12402017-09-08 16:11:27 -0700678
Jérôme Glisse5042db42017-09-08 16:11:43 -0700679config DEVICE_PRIVATE
680 bool "Unaddressable device memory (GPU memory, ...)"
Christoph Hellwig7328d9c2019-06-26 14:27:22 +0200681 depends on ZONE_DEVICE
Dan Williamse76384882018-05-16 11:46:08 -0700682 select DEV_PAGEMAP_OPS
Jérôme Glisse5042db42017-09-08 16:11:43 -0700683
684 help
685 Allows creation of struct pages to represent unaddressable device
686 memory; i.e., memory that is only accessible from the device (or
687 group of devices). You likely also want to select HMM_MIRROR.
688
Jan Kara8025e5d2015-07-13 11:55:44 -0300689config FRAME_VECTOR
690 bool
Dave Hansen63c17fb2016-02-12 13:02:08 -0800691
692config ARCH_USES_HIGH_VMA_FLAGS
693 bool
Dave Hansen66d37572016-02-12 13:02:32 -0800694config ARCH_HAS_PKEYS
695 bool
Dennis Zhou30a5b532017-06-19 19:28:31 -0400696
697config PERCPU_STATS
698 bool "Collect percpu memory statistics"
Dennis Zhou30a5b532017-06-19 19:28:31 -0400699 help
700 This feature collects and exposes statistics via debugfs. The
701 information includes global and per chunk statistics, which can
702 be used to help understand percpu memory usage.
Kirill A. Shutemov64c349f42017-11-17 15:31:22 -0800703
704config GUP_BENCHMARK
705 bool "Enable infrastructure for get_user_pages_fast() benchmarking"
Kirill A. Shutemov64c349f42017-11-17 15:31:22 -0800706 help
707 Provides /sys/kernel/debug/gup_benchmark that helps with testing
708 performance of get_user_pages_fast().
709
710 See tools/testing/selftests/vm/gup_benchmark.c
Laurent Dufour3010a5e2018-06-07 17:06:08 -0700711
Christoph Hellwig39656e82019-07-11 20:56:49 -0700712config GUP_GET_PTE_LOW_HIGH
713 bool
714
Song Liu99cb0db2019-09-23 15:38:00 -0700715config READ_ONLY_THP_FOR_FS
716 bool "Read-only THP for filesystems (EXPERIMENTAL)"
717 depends on TRANSPARENT_HUGE_PAGECACHE && SHMEM
718
719 help
720 Allow khugepaged to put read-only file-backed pages in THP.
721
722 This is marked experimental because it is a new feature. Write
723 support of file THPs will be developed in the next few release
724 cycles.
725
Laurent Dufour3010a5e2018-06-07 17:06:08 -0700726config ARCH_HAS_PTE_SPECIAL
727 bool
Christoph Hellwig59e0b522018-07-31 13:39:35 +0200728
Christoph Hellwigcbd34da2019-07-11 20:57:28 -0700729#
730# Some architectures require a special hugepage directory format that is
731# required to support multiple hugepage sizes. For example a4fe3ce76
732# "powerpc/mm: Allow more flexible layouts for hugepage pagetables"
733# introduced it on powerpc. This allows for a more flexible hugepage
734# pagetable layouts.
735#
736config ARCH_HAS_HUGEPD
737 bool
738
Thomas Hellstromc5acad82019-03-19 13:12:30 +0100739config MAPPING_DIRTY_HELPERS
740 bool
741
Christoph Hellwig59e0b522018-07-31 13:39:35 +0200742endmenu