page tables and you « Perpendiculous

2011.12.21

page tables and you

Filed under: Uncategorized — cwright @ 11:44 pm

Any comp-sci grad worth his or her student loan debt can tell you about virtual memory. And many can tell you the intricacies of dealing with unix-style VM. But for the developers working with Mac OS X (and iOS), there’s a deeper layer hidden underneath that is seldom expressed (and often for good reason) – Mach. Mach, not Unix, is the true undercarriage of OS X and its ilk. And since it’s basically the only wide-spread Mach-based OS on the planet, there’s a relatively large void when it comes to dealing with this layer. There aren’t many reasons to drop down to this layer, but sometimes, for fun or for profit, the need may arise.

For this first post, let’s explore allocating, deallocating, and protecting memory pages.

The functions we’re interested in here are vm_allocate, vm_deallocate, and vm_protect. You can find them in /usr/include/mach/vm_map.h.

vm_allocate, unsurprisingly, allocates memory pages into a task (process). It takes a target task (usually mach_task_self()), an address pointer, a size, and some flags.

The address pointer is often best left unspecified. This allows the kernel to map in pages wherever there’s room. on 64 bit systems this isn’t particularly interesting, but for 32 bit processes address space can be fairly constrained.

Size is often a multiple of a page size (4k, or 4096 bytes). It defines how large of an area you want to map.

Flags is where things get interesting. There are several to choose from, tersely documented in vm_statistics.h. A couple notes on them:

VM_FLAGS_FIXED – This controls whether or not it should try to map the pages at the address you specified. It’s actually implied by the absense of VM_FLAGS_ANYWHERE.
VM_FLAGS_ANYWHERE – This indicates that it’s ok to ignore the input address, and simply allocate the pages wherever they’d fit (and return the base address in address).
VM_FLAGS_PURGABLE – This indicates that it’s ok for the pages to get purged (that is, discarded without first being written to disk). This can be useful for caches and other data that can be recomputed. I’ll discuss purging pages in a future update.
VM_FLAGS_NO_CACHE – This controls how the pager deals with prioritizing pages in low-memory situations.

vm_deallocate is used to undo an allocation. Or in other words, free the memory vm_allocate makes use of. It takes a task (again, often mach_task_self()), a base pointer, and a size. It behaves largely how you might expect.

vm_protect is interesting (and potentially useful). With it you can mark pages are readable (VM_PROT_READ), writable (VM_PROT_WRITE), and executable (VM_PROT_EXECUTE). These are defined in /usr/include/mach/vm_prot.h. Newly created pages (via vm_allocate) are readable and writable, but not executable. There’s also a copy protection used for copy-on-write behavior, but that’s also for a future post.

Now for some sample code!

#include <mach/mach_interface.h>
#include <stdlib.h>
#include <stdio.h>
#include <stdbool.h>
 
int main()
{
	unsigned char *base;
	kern_return_t ret;
 
	ret = vm_allocate(mach_task_self(), (vm_address_t*)&amp;base, 1024*1024,
					  VM_FLAGS_ANYWHERE);
	if (ret == KERN_SUCCESS) {
		printf("got allocation: %p\n", base);
		base[0] = 0x42;
	} else
		return -1;

this will allocate 1MB (256 pages) anywhere they’ll fit in the process. They’ll be read/write by default.

	ret = vm_protect(mach_task_self(), (vm_address_t)base, 4096, true, VM_PROT_NONE);
	if (ret == KERN_SUCCESS) {
		printf("access is now None!\n");
	} else {
		printf("setting permission to none failed!\n");
	}

This will set the pages to no access – no reads, no writes, no execution. This is pretty worthless, but it demonstrates something coming up

ret = vm_protect(mach_task_self(), (vm_address_t)base, 4096, true, VM_PROT_READ);
	if (ret == KERN_SUCCESS) {
		printf("access is now Read Only!\n");
	} else {
		printf("resetting protections failed\n");
	}

This would restore read permission to the pages, but the above call set the maximum protections to none. That will make this call fail (the pages will remain unreadable).

and finally, cleaning up

	ret = vm_deallocate(mach_task_self(), (vm_address_t)base, 1024*1024);
	if (ret == KERN_SUCCESS)
		printf("deallocated!\n");
	else
		printf("error deallocating!\n");
 
	return 0;
}

which will deallocate the pages.

And there you have it. Basic kernel-level memory allocation for your process. Now you can get page-aligned allocations of large slabs of memory (sometimes useful for storing GL textures with GL_APPLE_client_storage, among other things). Note that vm_allocate is _not_ faster than malloc/free. It is not wise to use these functions as a replacement for general allocation functions. Standard debugging tools will also not deal with these allocations (guard malloc won’t be able to help you, leaks won’t be able to help you, Instruments probably won’t be able to help you – vmmap will show them though, and you can tag them so that they stand out, using VM_MAKE_TAG as one of the vm_allocate flags).

It can also be useful for initializing some large set of data, and then marking it as read-only to ensure you don’t accidentally mutate it.

Comments (0)

No Comments »

No comments yet.

RSS feed for comments on this post. TrackBack URL

Perpendiculous Programming, Personal Finance, and Personal musings

2011.12.21

page tables and you

No Comments »

Leave a comment