Memory Allocators 101 - Write a simple memory allocator

news2024/10/7 12:24:52

Memory Allocators 101 - Write a simple memory allocator - Arjun Sreedharan

  • Blog
  • About
  • Contact
  • Posts

Google+LinkedInGithubFacebookTwitterUMass Amherst

1:11 AM 9th 八月 20160 notes

  1. Memory Allocators 101 - Write a simple memory allocator

     

    Code related to this article: github.com/arjun024/memalloc

    This article is about writing a simple memory allocator in C.
    We will implement malloc(), calloc(), realloc() and free().

    This is a beginner level article, so I will not spell out every detail.
    This memory allocator will not be fast and efficient, we will not adjust allocated memory to align to a page boundary, but we will build a memory allocator that works. That’s it.

    If you want to take a look at the code in full, take a look at my github repo memalloc.

    Before we get into building the memory allocator, you need to be familiar with the memory layout of a program. A process runs within its own virtual address space that’s distinct from the virtual address spaces of other processes. This virtual address space typically comprises of 5 sections:

    Text section: The part that contains the binary instructions to be executed by the processor.
    Data section: Contains non-zero initialized static data.
    BSS (Block Started by Symbol) : Contains zero-initialized static data. Static data uninitialized in program is initialized 0 and goes here.
    Heap: Contains the dynamically allocated data.
    Stack: Contains your automatic variables, function arguments, copy of base pointer etc. Memory layout

    As you can see in the image, the stack and the heap grow in the opposite directions.
    Sometimes the data, bss and heap sections are collectively referred to as the “data segment”,
    the end of which is demarcated by a pointer named program break or brk.
    That is, brk points to the end of the heap.

    Now if we want to allocate more memory in the heap, we need to request the system to increment brk. Similarly, to release memory we need to request the system to decrement brk.

    Assuming we run Linux (or a Unix-like system), we can make use of sbrk() system call that lets us manipulate the program break.

    Calling sbrk(0) gives the current address of program break.
    Calling sbrk(x) with a positive value increments brk by x bytes, as a result allocating memory.
    Calling sbrk(-x) with a negative value decrements brk by x bytes, as a result releasing memory.
    On failure, sbrk() returns (void*) -1.

    To be honest, sbrk() is not our best buddy in 2015. There are better alternatives like mmap() available today. sbrk() is not really thread safe. It can can only grow or shrink in LIFO order.
    If you do a man 2 sbrk on your macbook, it will tell you:

    The brk and sbrk functions are historical curiosities left over from earlier days before the advent of virtual memory management.

    However, the glibc implementation of malloc still uses sbrk() for allocating memory that’s not too big in size.[1]
    So, we will go ahead with sbrk() for our simple memory allocator.

    malloc()

    The malloc(size) function allocates size bytes of memory and returns a pointer to the allocated memory.
    Our simple malloc will look like:

    void *malloc(size_t size)
    {
    	void *block;
    	block = sbrk(size);
    	if (block == (void*) -1)
    		return NULL;
    	return block;
    }
    

    In the above code, we call sbrk() with the given size.
    On success, size bytes are allocated on the heap.
    That was easy. Wasn’t it?

    The tricky part is freeing this memory.
    The free(ptr) function frees the memory block pointed to by ptr, which must have been returned by a previous call to malloc(), calloc() or realloc().
    But to free a block of memory, the first order of business is to know the size of the memory block to be freed. In the current scheme of things, this is not possible as the size information is not stored anywhere. So, we will have to find a way to store the size of an allocated block somewhere.

    Moreover, we need to understand that the heap memory the operating system has provided is contiguous. So we can only release memory which is at the end of the heap. We can’t release a block of memory in the middle to the OS. Imagine your heap to be something like a long loaf of bread that you can stretch and shrink at one end, but you have to keep it in one piece.
    To address this issue of not being able to release memory that’s not at the end of the heap, we will make a distinction between freeing memory and releasing memory.
    From now on, freeing a block of memory does not necessarily mean we release memory back to OS. It just means that we keep the block marked as free. This block marked as free may be reused on a later malloc() call. Since memory not at the end of the heap can’t be released, this is the only way ahead for us.

    So now, we have two things to store for every block of allocated memory:
        1. size
        2. Whether a block is free or not-free?

    To store this information, we will add a header to every newly allocated memory block.
    The header will look something like this:

    struct header_t {
    	size_t size;
    	unsigned is_free;
    };
    

    The idea is simple. When a program requests for size bytes of memory, we calculate total_size = header_size + size, and call sbrk(total_size). We use this memory space returned by sbrk() to fit in both the header and the actual memory block. The header is internally managed, and is kept completely hidden from the calling program.

    Now, each one of our memory blocks will look like:
    memory block with header

    We can’t be completely sure the blocks of memory allocated by our malloc is contiguous. Imagine the calling program has a foreign sbrk(), or there’s a section of memory mmap()ed in between our memory blocks. We also need a way to traverse through our blocks for memory (why traverse? we will get to know when we look at the implementation of free()). So to keep track of the memory allocated by our malloc, we will put them in a linked list. Our header will now look like:

    struct header_t {
    	size_t size;
    	unsigned is_free;
    	struct header_t *next;
    };
    

    and the linked list of memory blocks like this:

    linked list of memory blocks

    Now, let’s wrap the entire header struct in a union along with a stub variable of size 16 bytes. This makes the header end up on a memory address aligned to 16 bytes. Recall that the size of a union is the larger size of its members. So the union guarantees that the end of the header is memory aligned. The end of the header is where the actual memory block begins and therefore the memory provided to the caller by the allocator will be aligned to 16 bytes.

    typedef char ALIGN[16];
    
    union header {
    	struct {
    		size_t size;
    		unsigned is_free;
    		union header *next;
    	} s;
    	ALIGN stub;
    };
    typedef union header header_t;
    

    We will have a head and tail pointer to keep track of the list.

    header_t *head, *tail;
    

    To prevent two or more threads from concurrently accessing memory, we will put a basic locking mechanism in place.

    We’ll have a global lock, and before every action on memory you have to acquire the lock, and once you are done you have to release the lock.

    pthread_mutex_t global_malloc_lock;
    

    Our malloc is now modified to:

    void *malloc(size_t size)
    {
    	size_t total_size;
    	void *block;
    	header_t *header;
    	if (!size)
    		return NULL;
    	pthread_mutex_lock(&global_malloc_lock);
    	header = get_free_block(size);
    	if (header) {
    		header->s.is_free = 0;
    		pthread_mutex_unlock(&global_malloc_lock);
    		return (void*)(header + 1);
    	}
    	total_size = sizeof(header_t) + size;
    	block = sbrk(total_size);
    	if (block == (void*) -1) {
    		pthread_mutex_unlock(&global_malloc_lock);
    		return NULL;
    	}
    	header = block;
    	header->s.size = size;
    	header->s.is_free = 0;
    	header->s.next = NULL;
    	if (!head)
    		head = header;
    	if (tail)
    		tail->s.next = header;
    	tail = header;
    	pthread_mutex_unlock(&global_malloc_lock);
    	return (void*)(header + 1);
    }
    
    header_t *get_free_block(size_t size)
    {
    	header_t *curr = head;
    	while(curr) {
    		if (curr->s.is_free && curr->s.size >= size)
    			return curr;
    		curr = curr->s.next;
    	}
    	return NULL;
    }
    

    Let me explain the code:

    We check if the requested size is zero. If it is, then we return NULL.
    For a valid size, we first acquire the lock. The we call get_free_block() - it traverses the linked list and see if there already exist a block of memory that is marked as free and can accomodate the given size. Here, we take a first-fit approach in searching the linked list.

    If a sufficiently large free block is found, we will simply mark that block as not-free, release the global lock, and then return a pointer to that block. In such a case, the header pointer will refer to the header part of the block of memory we just found by traversing the list. Remember, we have to hide the very existence of the header to an outside party. When we do (header + 1), it points to the byte right after the end of the header. This is incidentally also the first byte of the actual memory block, the one the caller is interested in. This is cast to (void*) and returned.

    If we have not found a sufficiently large free block, then we have to extend the heap by calling sbrk(). The heap has to be extended by a size that fits the requested size as well a header. For that, we first compute the total size: total_size = sizeof(header_t) + size;. Now, we request the OS to increment the program break: sbrk(total_size).

    In the memory thus obtained from the OS, we first make space for the header. In C, there is no need to cast a void* to any other pointer type, it is always safely promoted. That’s why we don’t explicitly do: header = (header_t *)block;
    We fill this header with the requested size (not the total size) and mark it as not-free. We update the next pointer, head and tail so to reflect the new state of the linked list. As explained earlier, we hide the header from the caller and hence return (void*)(header + 1). We make sure we release the global lock as well.

    free()

    Now, we will look at what free() should do. free() has to first deterimine if the block-to-be-freed is at the end of the heap. If it is, we can release it to the OS. Otherwise, all we do is mark it ‘free’, hoping to reuse it later.

    void free(void *block)
    {
    	header_t *header, *tmp;
    	void *programbreak;
    
    	if (!block)
    		return;
    	pthread_mutex_lock(&global_malloc_lock);
    	header = (header_t*)block - 1;
    
    	programbreak = sbrk(0);
    	if ((char*)block + header->s.size == programbreak) {
    		if (head == tail) {
    			head = tail = NULL;
    		} else {
    			tmp = head;
    			while (tmp) {
    				if(tmp->s.next == tail) {
    					tmp->s.next = NULL;
    					tail = tmp;
    				}
    				tmp = tmp->s.next;
    			}
    		}
    		sbrk(0 - sizeof(header_t) - header->s.size);
    		pthread_mutex_unlock(&global_malloc_lock);
    		return;
    	}
    	header->s.is_free = 1;
    	pthread_mutex_unlock(&global_malloc_lock);
    }
    

    Here, first we get the header of the block we want to free. All we need to do is get a pointer that is behind the block by a distance equalling the size of the header. So, we cast block to a header pointer type and move it behind by 1 unit.
    header = (header_t*)block - 1;

    sbrk(0) gives the current value of program break. To check if the block to be freed is at the end of the heap, we first find the end of the current block. The end can be computed as (char*)block + header->s.size. This is then compared with the program break.

    If it is in fact at the end, then we could shrink the size of the heap and release memory to OS. We first reset our head and tail pointers to reflect the loss of the last block. Then the amount of memory to be released is calculated. This the sum of sizes of the header and the acutal block: sizeof(header_t) + header->s.size. To release this much amount of memory, we call sbrk() with the negative of this value.

    In the case the block is not the last one in the linked list, we simply set the is_free field of its header. This is the field checked by get_free_block() before actually calling sbrk() on a malloc().

    calloc()

    The calloc(num, nsize) function allocates memory for an array of num elements of nsize bytes each and returns a pointer to the allocated memory. Additionally, the memory is all set to zeroes.
    void *calloc(size_t num, size_t nsize)
    {
    	size_t size;
    	void *block;
    	if (!num || !nsize)
    		return NULL;
    	size = num * nsize;
    	/* check mul overflow */
    	if (nsize != size / num)
    		return NULL;
    	block = malloc(size);
    	if (!block)
    		return NULL;
    	memset(block, 0, size);
    	return block;
    }
    

    Here, we do a quick check for multiplicative overflow, then call our malloc(),
    and clears the allocated memory to all zeroes using memset().

    realloc()

    realloc() changes the size of the given memory block to the size given.

    void *realloc(void *block, size_t size)
    {
    	header_t *header;
    	void *ret;
    	if (!block || !size)
    		return malloc(size);
    	header = (header_t*)block - 1;
    	if (header->s.size >= size)
    		return block;
    	ret = malloc(size);
    	if (ret) {
    		
    		memcpy(ret, block, header->s.size);
    		free(block);
    	}
    	return ret;
    }
    

    Here, we first get the block’s header and see if the block already has the size to accomodate the requested size. If it does, there’s nothing to be done.

    If the current block does not have the requested size, then we call malloc() to get a block of the request size, and relocate contents to the new bigger block using memcpy(). The old memory block is then freed.

    Compiling and using our memory allocator.

    You can get the code from my github repository - memalloc.
    We’ll compile our memory allocator and then run a utility like ls using our memory allocator.

    To do that, we will first compile it as a library file.

    $ gcc -o memalloc.so -fPIC -shared memalloc.c
    

    The -fPIC and -shared options makes sure the compiled output has position-independent code and tells the linker to produce a shared object suitable for dynamic linking.

    On Linux, if you set the enivornment variable LD_PRELOAD to the path of a shared object, that file will be loaded before any other library. We could use this trick to load our compiled library file first, so that the later commands run in the shell will use our malloc(), free(), calloc() and realloc().

    $ export LD_PRELOAD=$PWD/memalloc.so
    

    Now,

    $ ls
    memalloc.c		memalloc.so
    

    Voila! That’s our memory allocator serving ls.
    Print some debug message in malloc() and see it for yourself if you don’t believe me.

    Thank you for reading. All comments are welcome. Please report bugs if you find any.

    Footnotes, References

    See a list of memory allocators:
    liballoc
    Doug Lea’s Memory Allocator.
    TCMalloc
    ptmalloc

    [1] The GNU C Library: Malloc Tunable Parameters
    OSDev - Memory allocation
    Memory Allocators 101 - James Golick

  2. cmemoryMemory Allocationoperating system






     

    Disclaimer: The views expressed here are solely those of the author in his private capacity and do not in any way represent the views of the author's employer or any organization associated with the author.

    points






















 



















































 

Check this out:

Wikicoding, the wikipedia of code

Recent Posts:

Simplicity is the ultimate sophistication.

©

Arjun Sreedharan 2013 - 2023

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.coloradmin.cn/o/882389.html

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈,一经查实,立即删除!

相关文章

数据结构——B-树、B+树、B*树

一、B-树 1. B-树概念 B树是一种适合外查找的、平衡的多叉树。一棵m阶(m>2)的B树,是一棵平衡的M路平衡搜索树,它可以是空树或满足以下性质: (1)根节点至少有两个孩子。 (2&#…

Integer中缓存池讲解

文章目录 一、简介二、实现原理三、修改缓存范围 一、简介 Integer缓存池是一种优化技术,用于提高整数对象的重用和性能。在Java中,对于整数值在 -128 到 127 之间的整数对象,会被放入缓存池中,以便重复使用。这是因为在这个范围…

UDP 的报文结构以及注意事项

UDP协议 1.UDP协议端格式 1.图中的16位UDP长度,表示整个数据报(UDP首部UDP数据)的最大长度 2.若校验和出错,会直接丢弃 2.UDP的报文结构 UDP报文主体分为两个部分:UDP报头(占8个字节)UDP载荷/UDP数据 1.源端口号 16位,2个字节 2.目的端口号 16位,2个字节 3.包长度 指示了…

laravel框架中批量更新数据

在php框架中 tp中就有批量更新封装好的 SaveAll 在laravel中有批量插入没有批量更新操作;因此我们可以自己去封装一个 然后批量进行更新操作 封装参考代码: /*** 批量更新** param $tableName 表名称* param string $pk 更新的字段* param array $multipleData 要更新的数据*…

免费SAFe敏捷工具,SAFe框架执行

Leangoo领歌覆盖了敏捷项目研发全流程,包括小型团队敏捷开发,Scrum of Scrums大规模敏捷。 Leangoo领歌是ScrumCN(scrum.cn)旗下的一款永久免费的敏捷研发管理工具。 Leangoo领歌覆盖了敏捷研发全流程,包括小型团队敏…

七、Linux操作系统下,whichfind如何使用?

1、which命令 (1)语法:which 参数 (2)参数:要查找的命令 (3)示例: 2、find命令 (1)find 起始路径 -name “被查找的文件名” 注意&#xff1…

多环境_部署项目

多环境: 指同一套项目代码在不同的阶段需要根据实际情况来调整配置并且部署到不同的机器上。 为什么需要? 1. 每个环境互不影响 2. 区分不同的阶段:开发 / 测试 / 生产 3. 对项目进行优化: 1. 本地日志级别 2. 精简依赖&a…

虫情测报灯——监测预警分析

KH-CQPest虫情测报灯是专为田间虫害统计、农林虫情测报而研制的设备,利用光、电、数控等技术实现自动诱虫、杀虫、虫体分散、拍照、运输、收集、排水等系统作业等功能,当有害虫出现时,会受到诱集光源的影响,自动飞扑撞向撞击屏&am…

网络基础——网络协议是什么?

作者:Insist-- 个人主页:insist--个人主页 作者会持续更新网络知识和python基础知识,期待你的关注 目录 一、理解网络协议 1、网络协议是什么? 2、网络协议的三要素 二、常见的网络协议 1、TCP/IP协议 2、HTTP协议 3、FTP协…

AD8302 - 信号幅度相位检测

AD8302 - 信号幅度相位检测 AD8302模块AD8302简介芯片特点模块引脚AD8302工作原理内部框图工作原理 实验接线鉴相鉴幅 实验总结 AD8302模块 AD8302简介 AD8302是ADI公司推出的一个款用于测量RF/IF信号幅度和相位的单片集成电路,它能同时测量从低频到 2.7GHz 频率范…

uniapp的uview-plus组件库的导入

uniapp的vue3中使用uview-plus组件库。在插件市场中找到该组件并点击如下所示绿色按钮,弹出弹窗选择要导入的项目后,就会在uni_modules文件中生成如下文件内容 关于插件的下载区别,可参考:https://uniapp.dcloud.net.cn/compone…

互联网发展历程:跨越远方,路由器的启示

互联网的蓬勃发展,一直在追求更广阔的连接,更遥远的距离。然而,在早期的网络中,人们面临着连接距离有限的问题。一项重要的技术应运而生,那就是“路由器”。 连接受限的问题:距离有限 早期的网络受限于直接…

计网第三章(数据链路层)(一)

一.数据链路层概述 数据链路层还没有牵扯到多个网络互连的问题,第三章主要研究的是在同一个局域网中,分组怎样从一个主机传送到另一个主机,中间并没有路由器的转发。 1.信道分类 点对点信道: 即一对一的通信方式。 广播信道&…

西瓜书南瓜书第一、二章

(存在疑问的地方使用红色字体进行了标注) 第一章 什么是机器学习 人工智能:研究如何让机器变得像人一样拥有智能的学科 机器学习:让计算机像人一样能从数据中学习出规律的一类算法 深度学习:神经网络类的机器学习算…

科大讯飞星火模型申请与chatgpt 3.5模型以及new bing的对比

科大讯飞星火模型 申请科大讯飞星火认知大模型账号科大讯飞星火认知大模型使用1.界面介绍2. 在编程能力上与chatgpt 3.5对比科大讯飞星火模型chatgpt 3.5模型 3. 在图片生成能力上与new bing对比 总结 申请科大讯飞星火认知大模型账号 注册网址: 科大讯飞星火认知大…

Python基础知识:列表推导式详解

前言 嗨喽,大家好呀~这里是爱看美女的茜茜呐 我们经常需要这样处理一个列表: 把一个列表里面的每个元素, 经过相同的处理 ,生成另一个列表。 👇 👇 👇 更多精彩机密、教程,尽在下方…

如何给 Keycloak 用户加上“部门”、“电话”等自定义属性

Keycloak 是一款开源的用户认证和授权软件。在默认安装情况下,它只给新创建的用户提供了 email 属性,但是在许多应用场景中,客户都会要求给新创建的用户增加诸如“部门”、“电话”等自定义属性。 本文会介绍如何给 keycloak 中新创建的用户…

Android上架商城 隐私政策需要网页 没有怎么办

Android开发的项目上架商城的时候会需要你填写url,但其实并不需要真的去发布一个网站 使用腾讯文档新建文档 填写隐私政策 点击生成网页 再将网址填写即可 下面我找到的一个隐私政策文档供大家参考 将XXXX应用一键替换为自己的应用 将XXXXXX公司一键替换为公司 …

【Linux】多线程1——线程概念与线程控制

文章目录 1. 线程概念什么是线程Linux中的线程线程的优点线程的缺点线程的独立资源和共享资源 2. 线程控制Linux的pthread库用户级线程 📝 个人主页 :超人不会飞)📑 本文收录专栏:《Linux》💭 如果本文对您有帮助&…

记一次触发器拦截更新操作

1、背景 业务上有一张表记录仓库和经纬度的,正常情况不怎么做变更;业务反馈经常出现经纬度被更新的情况,操作人都是接口或者admin,人工运维后又会被接口/admin覆盖更新掉 2、过程 遇到这种情况,我的第一反应是定位代…