Resources and Memory

[TOC]

内容

Creating a buffer
Allocating and binding a memory object for a buffer
Setting a buffer memory barrier
Creating a buffer view
Creating an image
Allocating and binding a memory object to an image
Setting an image memory barrier
Creating an image view
Creating a 2D image and view
Creating a layered 2D image with a CUBEMAP view
Mapping, updating, and unmapping host-visible memory
Copying data between buffers
Copying data from a buffer to an image
Copying data from an image to a buffer
Using a staging buffer to update a buffer with a device-local memory bound
Using a staging buffer to update an image with a device-local memory bound
Destroying an image view
Destroying an image
Destroying a buffer view
Freeing a memory object
Destroying a buffer

Vulkan里非常重要的存储数据的资源时buffers和images.buffers存储linear数组数据.Images和OpenGL的textures类似,有1D,2D,3D.Buffers和Images可以用于很多目的:shaders里可以read或者sample数据,或者存储数据.Images可以用于color或者depth/stencil绑定(RT),也就是说可以渲染到其上.Buffer还可以存储顶点数据、indices,parameters(indiret drawing).

重要的是提及的所有usages需要在创建资源时指明(可以一次提供很多).

Vulkan里buffers和images没有自己的storage,需要创建和绑定memory objects.

本节介绍如何使用这些资源、如何申请缓存和绑定、如何上传数据到GPU、如何在资源见进行拷贝.

buffer

创建buffer

buffers能用于很多目的.它们能通过descriptor sets在pipelines中统一uniform buffers、storage buffers、texel buffer等的后备缓冲.它们能作为vertex indices或者attributes的数据源,或者暂存从CPU到GPU移动数据的staging resources.为了这些目的,我们需要创建buffer和指定它的usage.

buffers只能用于创建时指定的usages.

buffers支持的使用方式列表

flag	description
VK_BUFFER_USAGE_TRANSFER_SRC_BIT	specifies that the buffer can be a source of data for copy operations
VK_BUFFER_USAGE_TRANSFER_DST_BIT	specifies that we can copy data to the buffer
VK_BUFFER_USAGE_UNIFORM_TEXEL_BUFFER_BIT	indicates that the buffer can be used in shaders as a uniform texel buffer
VK_BUFFER_USAGE_STORAGE_TEXEL_BUFFER_BIT	specifies that the buffer can be used in shaders as a storage texel buffer
VK_BUFFER_USAGE_UNIFORM_BUFFER_BIT	indicates that the buffer can be used in shaders as a source of values for uniform variables
VK_BUFFER_USAGE_STORAGE_BUFFER_BIT	indicates that we can store data in the buffer from within shaders
VK_BUFFER_USAGE_INDEX_BUFFER_BIT	specifies that the buffer can be used as a source of vertex indices during drawing
VK_BUFFER_USAGE_VERTEX_BUFFER_BIT	indicates that the buffer can be a source of data for vertex attributes specified during drawing
VK_BUFFER_USAGE_INDIRECT_BUFFER_BIT	indicates that the buffer can contain data that will be used during indirect drawing

VkBufferCreateInfo buffer_create_info = {
    VK_STRUCTURE_TYPE_BUFFER_CREATE_INFO,
    nullptr,
    0,
    size,
    usage,
    VK_SHARING_MODE_EXCLUSIVE,
    0,
    nullptr
};

之前的VK_SHARING_MODE_EXCLUSIVE(sharingMode)是一个非常重要的参数.通过它我们能指定多families里的queues能否同时访问buffer.Exclusive shaing mode(独占)告诉驱动程序缓冲区只能由一个系列中的队列一次引用.如果想从另一个family提交的commands使用buffer,必须在所有者改变时明确告诉驱动(从一个family改为另一个).这样性能更好但也更麻烦.

我们也可以指明VK_SHARING_MODE_CONCURRENT,这样多families多queues能够同时访问一个buffer，并且不用进行所有者转换,但并发性能可能很低.

创建buffer

VkResult result = vkCreateBuffer( logical_device, &buffer_create_info,
nullptr, &buffer );
if( VK_SUCCESS != result ) {
    std::cout << "Could not create a buffer." << std::endl;
    return false;
}
return true;

allocating and binding a memory object for a buffer

Vulkan里buffers和images没有自己的memroy,需要allocate memory object并绑定.

关于内存管理

https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator

https://www.youtube.com/watch?v=rXSdDE7NWmA

过程

1.take VkPhysicalDevice physical_device

2.create VkPhysicalDeviceMemoryProperties physical_device_memory_properties

3.clall vkGetPhysicalDeviceMemoryProperties( physical_device,&physical_device_memory_properties )会存储内存相关参数(heaps的数量,大小,types)

4.take VkDevice logical_device

5.take VkBuffer buffer

6.create VkMemoryRequirements memory_requirements.

7.call vkGetBufferMemoryRequirements(logical_device, buffer,&memory_requirements);

8.create VkDeviceMemory memory_object = VK_NULL_HANDLE;

9.create VkMemoryPropertyFlagBits memory_properties

10.遍历physical device的physical_device_memory_properties的内存types,每次循环进行如下操作:

1.确保memory_requirements.memoryTypeBits 设置了

2.确保memory_properties变量的位设置与memory type的propertyflags成员相同,该成员位于物理_device_memory_properties变量中memorytypes数组的索引类型.

3.如果1、2为false,continue

4.创建VkMemoryAllocateInfo buffer_memory_allocate_info

.allocationSize = memory_requirements.size,

.memoryTypeIndex = type

5.vkAllocateMemory( logical_device,&buffer_memory_allocate_info, nullptr, &memory_object)

6.确认结果为VK_SUCCESS

11.确保allocate的memory object成功

12.绑定,call vkBindBufferMemory(logical_device, buffer, memory_object, 0 )

13.确保call成功

为了给buffer allocate a memory,需要知道可用的memory types(physical device)以及有多少.

1
2
3

VkPhysicalDeviceMemoryProperties physical_device_memory_properties;
vkGetPhysicalDeviceMemoryProperties( physical_device,
&physical_device_memory_properties );

接下来,我们需要知道给定缓冲区需要多少存储(缓冲区的内存可能需要大于缓冲区的大小),以及与之兼容的内存类型.这些信息存储在VkMemoryRequirements:

1 2	VkMemoryRequirements memory_requirements; vkGetBufferMemoryRequirements(logical_device, buffer, &memory_requirements);

接下来,我们需要检查哪个内存类型对应于缓冲区的内存需求:

memory_object = VK_NULL_HANDLE;
for( uint32_t type = 0; type < physical_device_memory_properties.memoryTypeCount; ++type ) 
{
    if( (memory_requirements.memoryTypeBits & (1 << type)) &&
    ((physical_device_memory_properties.memoryTypes[type].propertyFlags &
    memory_properties) == memory_properties) ) 
    {
        VkMemoryAllocateInfo buffer_memory_allocate_info = 
        {
            VK_STRUCTURE_TYPE_MEMORY_ALLOCATE_INFO,
            nullptr,
            memory_requirements.size,
            type
        };
        VkResult result = vkAllocateMemory( logical_device,
        &buffer_memory_allocate_info, nullptr, &memory_object );
        if( VK_SUCCESS == result ) 
        {
      	  break;
        }
    }
}

这里遍历所有可用memory types以检查是否能够用于我们的buffer,我们也能检查一些额外的属性,比如如果想直接从CPU上传数据到GPU，memory mapping必须支持.这种情况下,我们需要使用的memory type为host-visible.

当我们找到合适的memory type后,我们能用它allocate memory object并break,然后可以绑定了:

if( VK_NULL_HANDLE == memory_object ) {
    std::cout << "Could not allocate memory for a buffer." << std::endl;
    return false;
}
VkResult result = vkBindBufferMemory( logical_device, buffer,
memory_object, 0 );
if( VK_SUCCESS != result ) {
    std::cout << "Could not bind memory object to a buffer." << std::endl;
    return false;
}
return true;

绑定时,指定了offset等其他参数.这个参数在内存管理里很有用.

通常来说,不会为每个buffer使用一个分开的memory object.而是allocate很大的memory objects,多个buffers各使用其一部分.在这种方式中,我们通过call vkGetPhysicalDeviceMemoryProperties来获取物理设备的可用内存类型.但通常来说,为了提高app的性能,不会每次需要allocate memory object的时候都调用它.我们只需要调用一次,在我们选择一个physical device后可以使用存储的参数.

set a buffer memory barrier

我们必须通知一个驱动程序每一个这样的使用,不仅在缓冲区创建期间,而且在预期的使用之前.当我们出于一个目的使用缓冲区,并且从现在开始我们想以不同的方式使用它时,我们必须告诉驱动程序缓冲区的使用发生了变化.这是通过缓冲存储器屏障(barrier)实现的.在cb record时,它们作为pipeline barriers的一部分.

自定义一个结构体

struct BufferTransition {
    VkBuffer Buffer;
    //VkAccessFlags buffer如何使用
    VkAccessFlags CurrentAccess;
    VkAccessFlags NewAccess;
    //用于想将buffer用于不同families的queue时(exclusive sharing mode时用到)
    uint32_t CurrentQueueFamily;
    uint32_t NewQueueFamily;
};

在Vulkan里,提交给queues的opoerations是按顺序执行的,但也是独立的.有时有些操作需要等待其他操作结束后才能执行,这时候memory barriers就有用了.

memory barriers用于定义命令缓冲区执行中的时刻,在这些时刻中,后面的命令应该等待前面的命令完成它们的工作.它们还使这些操作的结果对其他操作可见。

image

使用目的

type	description
VK_IMAGE_USAGE_TRANSFER_SRC_BIT	specifies that the image can be used as a source of data for copy operations
VK_IMAGE_USAGE_TRANSFER_DST_BIT	specifies that we can copy data to the image
VK_IMAGE_USAGE_SAMPLED_BIT	indicates that we can sample data from the image inside shaders
VK_IMAGE_USAGE_STORAGE_BIT	specifies that the image can be used as a storage image inside shaders
VK_IMAGE_USAGE_COLOR_ATTACHMENT_BIT	specifies that we can render into an image (use it as a color render target/attachment in a framebuffer)
VK_IMAGE_USAGE_DEPTH_STENCIL_ATTACHMENT_BIT	indicates that the image can be used as a depth and/or stencil buffer (as a depth render target/attachment

in a framebuffer) |
| VK_IMAGE_USAGE_TRANSIENT_ATTACHMENT_BIT | indicates that the memory bound to the image will be allocated lazily (on demand) |
| VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT | specifies that the image can be used as an input attachment inside shaders |

不同的usage情景要求使用不同的image layout,通过使用image memory barriers进行改变,我们需需要指明VK_IMAGE_LAYOUT_UNDEFINED,如果不在意初始化内容,或者VK_IMAGE_LAYOUT_PREINITIALIZED如果想通过mapping host-visible memory上传数据.在使用前总需要transition to another layout.

VkImageCreateInfo image_create_info = {
    VK_STRUCTURE_TYPE_IMAGE_CREATE_INFO,
    nullptr,
    cubemap ? VK_IMAGE_CREATE_CUBE_COMPATIBLE_BIT : 0u,
    type,
    format,
    size,
    num_mipmaps,
    cubemap ? 6 * num_layers : num_layers,
    samples,
    VK_IMAGE_TILING_OPTIMAL,
    usage_scenarios,
    VK_SHARING_MODE_EXCLUSIVE,
    0,
    nullptr,
    VK_IMAGE_LAYOUT_UNDEFINED
};

创建Image时需要指明tiling

linear tiling:在memory中线性存储,这允许我们映射图像的内存并直接从应用程序读取或初始化它,因为我们知道内存是如何组织的.但对使用目的有严格显示,比如不能作为depth texture或cubemap.会降低性能

optimal tiling:能用于所有目的,性能更好,作为代价不知道image得memory如何组织.

不同硬件存储image数据方式不同.所以不能在app中直接mapping、初始化、读image的内存.此时,可以使用staging resources.

VkResult result = vkCreateImage( logical_device, &image_create_info,
nullptr, &image );
if( VK_SUCCESS != result ) {
    std::cout << "Could not create an image." << std::endl;
    return false;
}
return true;

allocating and binding a memory object to an image

vkGetImageMemoryRequirements

vkBindImageMemory

先检查可用的memory types.

1
2
3

VkPhysicalDeviceMemoryProperties physical_device_memory_properties;
vkGetPhysicalDeviceMemoryProperties( physical_device,
&physical_device_memory_properties );

然后给image请求指定的memory requirememts.每个image可能不同,与format,size,mipmaps和layers的数量和其他属性有关.

1 2	VkMemoryRequirements memory_requirements; vkGetImageMemoryRequirements( logical_device, image, &memory_requirements);

下一步是找到与image的memory requirements适配的memory type

memory_object = VK_NULL_HANDLE;
for( uint32_t type = 0; type <physical_device_memory_properties.memoryTypeCount; ++type ) {
    if( (memory_requirements.memoryTypeBits & (1 << type)) &&
    ((physical_device_memory_properties.memoryTypes[type].propertyFlags &
    memory_properties) == memory_properties) ) {
        VkMemoryAllocateInfo image_memory_allocate_info = {
        VK_STRUCTURE_TYPE_MEMORY_ALLOCATE_INFO,
        nullptr,
        memory_requirements.size,
        type
    };
    VkResult result = vkAllocateMemory( logical_device,
    &image_memory_allocate_info, nullptr, &memory_object );
    if( VK_SUCCESS == result ) {
    break;
		}
	}
}

if( VK_NULL_HANDLE == memory_object ) {
	std::cout << "Could not allocate memory for an image." << std::endl;
	return false;
}
VkResult result = vkBindImageMemory( logical_device, image, memory_object,
0 );
if( VK_SUCCESS != result ) {
        std::cout << "Could not bind memory object to an image." << std::endl;
        return false;
}
return true;

申请大内存共享,可以提高性能,可以减少memory浪费.

setting an image memory barrier

image 用于texture,RT(通过descriptor sets),swapchain的images.拷贝(目标或源)

定义一个结构体

struct ImageTransition {
    VkImage Image;
    //
    VkAccessFlags CurrentAccess;
    VkAccessFlags NewAccess;
    //不用image usages有不用的layout,所以改变目的时需要确保
    VkImageLayout CurrentLayout;
    VkImageLayout NewLayout;
    uint32_t CurrentQueueFamily;
    uint32_t NewQueueFamily;
    VkImageAspectFlags Aspect;//usage.color,depth,stencil
};

在我们不想改变ownerships时我们能使用VK_QUEUE_FAMILY_IGNORED.

内存屏障用于定义命令缓冲区执行中的时刻,在这些时刻中,后面的命令应该等待前面的命令完成其任务.它们还使这些操作的结果对其他操作可见.

Barriers用于让后续commands 的memory操作可见

为了性能,最好给特定的usages用image memory layout,尽管需要注意频繁转换用处的layout.

VkImageMemoryBarrier

std::vector<VkImageMemoryBarrier> image_memory_barriers;
for( auto & image_transition : image_transitions ) {
    image_memory_barriers.push_back( {
        VK_STRUCTURE_TYPE_IMAGE_MEMORY_BARRIER,
        nullptr,
        image_transition.CurrentAccess,
        image_transition.NewAccess,
        image_transition.CurrentLayout,
        image_transition.NewLayout,
        image_transition.CurrentQueueFamily,
        image_transition.NewQueueFamily,
        image_transition.Image,
        {
        image_transition.Aspect,
        0,
        VK_REMAINING_MIP_LEVELS,
        0,
        VK_REMAINING_ARRAY_LAYERS
        }
    } );
}

需要定义pipeline stages

左侧vertex等待fragment结束

右侧fragment等待vertex结束,减少barrier数量很重要,如果需要以正确设置绘图命令并为屏障选择适当的pipeline stages

if( image_memory_barriers.size() > 0 ) {
    vkCmdPipelineBarrier( command_buffer, generating_stages,
    consuming_stages, 0, 0, nullptr, 0, nullptr,
    static_cast<uint32_t>(image_memory_barriers.size()),
    &image_memory_barriers[0] );
}

如果多次使用于同样的目的,不需要重复设置barrier.这个设置是在改变时设置,而非usage.

create an image view

Images在Vulkan Commands中被直接使用,Framebuffer和shaders(通过descriptor sets)通过image views访问images.Images view定义了image的内存的选定部分和指明了读取image数据的额外的信息.

VkImageViewCreateInfo
VkImageView

image view定义了用于访问image的额外的数据,通过它我们能指明commands能够访问的image的部分.比如,如果渲染到image,可以指明就一个mipmap level需要更新.

Image view也定义了image内存如何解释.一个好的例子是multiple layers.对于它,我们可以定义一个image view来解释image,作为一个layered image,或者我们可以使用image view从中创建一个cubemap映射。

VkImageViewCreateInfo image_view_create_info = {
    VK_STRUCTURE_TYPE_IMAGE_VIEW_CREATE_INFO,
    nullptr,
    0,
    image,
    view_type,
    format,
    {
        VK_COMPONENT_SWIZZLE_IDENTITY,
        VK_COMPONENT_SWIZZLE_IDENTITY,
        VK_COMPONENT_SWIZZLE_IDENTITY,
        VK_COMPONENT_SWIZZLE_IDENTITY
        },
        {
        aspect,
        0,
        VK_REMAINING_MIP_LEVELS,
        0,
        VK_REMAINING_ARRAY_LAYERS
    }
};

vkCreateImageView

VkResult result = vkCreateImageView( logical_device,
&image_view_create_info, nullptr, &image_view );
if( VK_SUCCESS != result ) {
    std::cout << "Could not create an image view." << std::endl;
    return false;
}
return true;

create a 2D image and view

RGBA 32bits 2D texture最常用.

分三步

创建一个image

创建一个memory object(或者使用以存在的)绑定到image

创建一个image view

需要创建type为 VK_IMAGE_TYPE_2D,format为VK_FORMAT_R8G8B8A8_UNORM的image.其他image的属性依赖于image的size(换句话说,我们从一个已经存在的image文件创建一个textur,需要匹配它的dimensions)、filtering类型(如果想用mipmapping的话)、samples的数量(如果需要multisampled的话)、期望的usage.

将image创建过程封装在CreateImage里

if( !CreateImage( logical_device, VK_IMAGE_TYPE_2D, format, { size.width,
size.height, 1 }, num_mipmaps, num_layers, samples, usage, false, image ) )
{
	return false;
}

将allocate和bind一个memory object封装在AllocateAndBindMemoryObjectToImage

if( !AllocateAndBindMemoryObjectToImage( physical_device, logical_device,
image, VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT, memory_object ) ) {
	return false;
}

也能使用一个已经创建的memory

然后创建image view.

if( !CreateImageView( logical_device, image, VK_IMAGE_VIEW_TYPE_2D, format,
aspect, image_view ) ) {
	return false;
}

create a layered 2D image with a CUEMAP view

app常用于模拟物体反射环境的CUBEMAPs,不需要创建一个CUBEMAP image,只需要创建一个layered image和通过image view告诉硬件将之视为6个CUBEMAP表面.

与创建普通image一样,不同之处是CUBEMAP需要6个layres,同时不能每个texel不能使用超过一个sample.

if( !CreateImage( logical_device, VK_IMAGE_TYPE_2D,
    VK_FORMAT_R8G8B8A8_UNORM, { size, size, 1 }, num_mipmaps, 6,
    VK_SAMPLE_COUNT_1_BIT, usage, true, image ) ) {
    	return false;
}

allocate和bind a memory object

if( !AllocateAndBindMemoryObjectToImage( physical_device, logical_device,
image, VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT, memory_object ) ) {
	return false;
}

创建image view,指明CUBEMAP view type

if( !CreateImageView( logical_device, image, VK_IMAGE_VIEW_TYPE_CUBE,
VK_FORMAT_R8G8B8A8_UNORM, aspect, image_view ) ) {
	return false;
}

faces order +X,-X,+Y,-Y,+Z,-Z

data

mapping,updating and unmapping host-visible memory

images和buffers绑定的memory位于显卡硬件(device-local memory),高性能,但不能直接使用,我们需要使用中间的(staging)资源作为GPU(device)-CPU(host)中转.

staging resources需要host-visible,为了上传数据或者读取数据,需要map it.

mapping memory是最简单的用于upload data的方式.需要指明需要map的memory的内容(offset,size).

VkResult result;
void * local_pointer;
result = vkMapMemory( logical_device, memory_object, offset, data_size, 0,
&local_pointer );
if( VK_SUCCESS != result ) {
    std::cout << "Could not map memory object." << std::endl;
    return false;
}

mapping给我们一个指针,与标准c++一样使用,不限制是读还是写.

1	std::memcpy( local_pointer, data, data_size );

当更新了mapped memory range.需要告诉驱动,内存的内容被修改了否则更新的数据不会立刻被其他提交给queues的操作访问.这个过程称为flush.

std::vector<VkMappedMemoryRange> memory_ranges = {
    {
    VK_STRUCTURE_TYPE_MAPPED_MEMORY_RANGE,
    nullptr,
    memory_object,
    offset,
    data_size
    }
};
vkFlushMappedMemoryRanges( logical_device,
static_cast<uint32_t>(memory_ranges.size()), &memory_ranges[0] );
if( VK_SUCCESS != result ) {
    std::cout << "Could not flush mapped memory." << std::endl;
    return false;
}

在使用完映射的memory后,可以unmap.memory mapping不会影响app运行效率.但关闭程序前必须unmap.

if( unmap ) {
	vkUnmapMemory( logical_device, memory_object );
} else if( nullptr != pointer ) {
	*pointer = local_pointer;
}
return true;

copy data between buffers

除了mapping,vulkan还支持memory间(包括不同types)进行内存拷贝.

这类操作需要在command buffer中record.

if( regions.size() > 0 ) {
    vkCmdCopyBuffer( command_buffer, source_buffer, destination_buffer,
    static_cast<uint32_t>(regions.size()), &regions[0] );
}

为了最好的性能,渲染阶段用到的资源,需要绑定device-local memory.但我们不能map这类memory.使用vkCmdCopyBuffer,能拷贝数据到另一个host-visible内存.这种memory能直接被app给mapped和updated.

能被拷贝的memory创建时要有VK_BUFFER_USAGE_TRANSFER_SRC_BIT usage.
能拷贝数据的memory创建时需要有VK_BUFFER_USAGE_TRANSFER_DST_BIT usage.

当想拷贝buffer到另一个buffer,我们需要设置一个memory barrier,告诉硬件从现在开始该buffer的操作需要按照VK_ACCESS_TRANSFER_WRITE_BIT来.当拷贝完后,我们想将之用于特定目的,需要设置另一个memory barrier.

copy data from a buffer to an image

对于images,能帮到不同memory types的memory objects.但只有host-visible memory能被app直接mapped或updated.当想更新一个device-local memory的image需要从一个buffer进行拷贝.

拷贝buffer的数据到image通过comman buffer完成.

1
2
3

if( regions.size() > 0 ) {
    vkCmdCopyBufferToImage( command_buffer, source_buffer, destination_image,image_layout, static_cast<uint32_t>(regions.size()), &regions[0] );
}

需要知道image data是如何组织在buffer里的.包括memory offset,length of the data row,height of data in a buffer.可以给row length和height设置为0,表明是紧密的数据,并与image的尺寸一致.

image from:VK_BUFFER_USAGE_TRANSFER_SRC_BIT,在transfer之前,image layout需要为VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL

buffer to:VK_BUFFER_USAGE_TRANSFER_DST_BIT

在从image拷贝数据前,需要设置一个memory barrier,改变image得layout VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL.内存方位类型变为VK_ACCESS_TRANSFER_READ_BIT.如果拷贝完后要用于其他目的,需要设置另一个barrier..

staging buffer

使用staging buffer更新device-local memory

staging resources用于更新not host-visible的memory的内容.这种memory不能mapped.需要一个中间buffer(可以mapped和更新),通过它传输数据.

需要一个能mapped的buffer,可以用池.

VkBuffer staging_buffer;
if( !CreateBuffer( logical_device, data_size,
VK_BUFFER_USAGE_TRANSFER_SRC_BIT, staging_buffer ) ) {
	return false;
}
VkDeviceMemory memory_object;
if( !AllocateAndBindMemoryObjectToBuffer( physical_device, logical_device,
staging_buffer, VK_MEMORY_PROPERTY_HOST_VISIBLE_BIT, memory_object ) ) {
	return false;
}

然后map it和更新内容

if( !MapUpdateAndUnmapHostVisibleMemory( logical_device, memory_object, 0,
data_size, data, true, nullptr ) ) {
	return false;
}

然后开始record command buffer.先给目标buffer设置一个memory barrier,改变它的usage为copy操作的target.staging buffer不需要barrier.当我们map和update buffer的memory,它的内容对其他commands也是可见的.因为导尿管我们开始command buffer recording一个隐性的barrier为host写操作而设置了.

if( !BeginCommandBufferRecordingOperation( command_buffer,
VK_COMMAND_BUFFER_USAGE_ONE_TIME_SUBMIT_BIT, nullptr ) ) {
	return false;
}
SetBufferMemoryBarrier( command_buffer,
destination_buffer_generating_stages, VK_PIPELINE_STAGE_TRANSFER_BIT, { {
destination_buffer, destination_buffer_current_access,
VK_ACCESS_TRANSFER_WRITE_BIT, VK_QUEUE_FAMILY_IGNORED,
VK_QUEUE_FAMILY_IGNORED } } );

然后我们能record拷贝操作

1 2	CopyDataBetweenBuffers( command_buffer, staging_buffer, destination_buffer, { { 0, destination_offset, data_size } } );

之后,需要为target buffer设置另一个barrier.将他的usage改为使用时期望的

SetBufferMemoryBarrier( command_buffer, VK_PIPELINE_STAGE_TRANSFER_BIT,
destination_buffer_consuming_stages, { { destination_buffer,
VK_ACCESS_TRANSFER_WRITE_BIT, destination_buffer_new_access,
VK_QUEUE_FAMILY_IGNORED, VK_QUEUE_FAMILY_IGNORED } } );
if( !EndCommandBufferRecordingOperation( command_buffer ) ) {
	return false;
}

然后创建一个fence,并在submit cb到queue时使用

VkFence fence;
if( !CreateFence( logical_device, false, fence ) ) {
	return false;
}
if( !SubmitCommandBuffersToQueue( queue, {}, { command_buffer },
signal_semaphores, fence ) ) {
	return false;
}

如果不再想使用staging buffer,销毁它.但必须在使用完成之后才能进行.(fence).

if( !WaitForFences( logical_device, { fence }, VK_FALSE, 500000000 ) ) {
	return false;
}
DestroyBuffer( logical_device, staging_buffer );
FreeMemoryObject( logical_device, memory_object );
return true;

真实应用中,通常使用一个池,复用它,而不是动态创建.这样能避免wait fence的时间,也能提高效率

使用staging buffer更新device-local memory image

与上面的类似.

VkBuffer staging_buffer;
if( !CreateBuffer( logical_device, data_size,
VK_BUFFER_USAGE_TRANSFER_SRC_BIT, staging_buffer ) ) {
	return false;
}
VkDeviceMemory memory_object;
if( !AllocateAndBindMemoryObjectToBuffer( physical_device, logical_device,
staging_buffer, VK_MEMORY_PROPERTY_HOST_VISIBLE_BIT, memory_object ) ) {
	return false;
}
if( !MapUpdateAndUnmapHostVisibleMemory( logical_device, memory_object, 0,
data_size, data, true, nullptr ) ) {
	return false;
}

设置barrier

if( !BeginCommandBufferRecordingOperation( command_buffer,
VK_COMMAND_BUFFER_USAGE_ONE_TIME_SUBMIT_BIT, nullptr ) ) {
	return false;
}
SetImageMemoryBarrier( command_buffer, destination_image_generating_stages,
VK_PIPELINE_STAGE_TRANSFER_BIT,
{
	{
    destination_image,
    destination_image_current_access,
    VK_ACCESS_TRANSFER_WRITE_BIT,
    destination_image_current_layout,
    VK_IMAGE_LAYOUT_TRANSFER_DST_OPTIMAL,
    VK_QUEUE_FAMILY_IGNORED,
    VK_QUEUE_FAMILY_IGNORED,
    destination_image_aspect
	} } );
CopyDataFromBufferToImage( command_buffer, staging_buffer,
destination_image, VK_IMAGE_LAYOUT_TRANSFER_DST_OPTIMAL,
{
    {
        0,
        0,
        0,
        destination_image_subresource,
        destination_image_offset,
        destination_image_size,
    } } );

再次修改image的usag

SetImageMemoryBarrier( command_buffer, VK_PIPELINE_STAGE_TRANSFER_BIT,
destination_image_consuming_stages,
{
	{
    destination_image,
    VK_ACCESS_TRANSFER_WRITE_BIT,
    destination_image_new_access,
    VK_IMAGE_LAYOUT_TRANSFER_DST_OPTIMAL,
    destination_image_new_layout,
    VK_QUEUE_FAMILY_IGNORED,
    VK_QUEUE_FAMILY_IGNORED,
    destination_image_aspect
    } } );
    if( !EndCommandBufferRecordingOperation( command_buffer ) ) {
    	return false;
	}

fence

VkFence fence;
if( !CreateFence( logical_device, false, fence ) ) {
	return false;
}
if( !SubmitCommandBuffersToQueue( queue, {}, { command_buffer },
signal_semaphores, fence ) ) {
	return false;
}
if( !WaitForFences( logical_device, { fence }, VK_FALSE, 500000000 ) ) {
	return false;
}
DestroyBuffer( logical_device, staging_buffer );
FreeMemoryObject( logical_device, memory_object );
return true;

如果是用池的话就不需要fence.

destroy

销毁iamge view

if( VK_NULL_HANDLE != buffer_view ) {
vkDestroyBufferView( logical_device, buffer_view, nullptr );
	buffer_view = VK_NULL_HANDLE;
}

memory object

if( VK_NULL_HANDLE != memory_object ) {
vkFreeMemory( logical_device, memory_object, nullptr );
	memory_object = VK_NULL_HANDLE;
}

buffer

if( VK_NULL_HANDLE != buffer ) {
vkDestroyBuffer( logical_device, buffer, nullptr );
	buffer = VK_NULL_HANDLE;
}