CUDA Memory Operators
Tensor new_managed_tensor(const Tensor &self, const std::vector<std::int64_t> &sizes)

Allocate an at::Tensor with unified managed memory (UVM), then set its preferred storage location to CPU (host memory) and establish mappings on the CUDA device to the host memory.

Parameters:
    self – The input tensor
    sizes – The target tensor dimensions

Returns:
    A new tensor backed by UVM
Tensor new_managed_tensor_meta(const Tensor &self, const std::vector<std::int64_t> &sizes)

Placeholder operator for the Meta dispatch key.

Parameters:
    self – The input tensor
    sizes – The target tensor dimensions

Returns:
    A new empty tensor
Tensor new_host_mapped_tensor(const Tensor &self, const std::vector<std::int64_t> &sizes)

Allocate an at::Tensor with host-mapped memory.

Parameters:
    self – The input tensor
    sizes – The target tensor dimensions

Returns:
    A new tensor backed by host-mapped memory
Tensor new_unified_tensor(const Tensor &self, const std::vector<std::int64_t> &sizes, bool is_host_mapped)

Allocate an at::Tensor with either unified managed memory (UVM) or host-mapped memory.

Parameters:
    self – The input tensor
    sizes – The target tensor dimensions
    is_host_mapped – If true, allocate host-mapped memory; otherwise, allocate UVM

Returns:
    A new tensor backed by either UVM or host-mapped memory, depending on the value of is_host_mapped
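Since these allocators are registered as PyTorch operators, they can be invoked from Python through torch.ops.fbgemm. The sketch below assumes fbgemm_gpu is installed and a CUDA device is present; the fallback branch (a plain CPU allocation, which is neither UVM nor host-mapped) exists only to keep the example runnable elsewhere:

```python
import torch

def alloc_unified(proto: torch.Tensor, sizes, host_mapped: bool) -> torch.Tensor:
    # Sketch: dispatch to new_unified_tensor when the operator is
    # registered and a CUDA device is available.
    try:
        op = torch.ops.fbgemm.new_unified_tensor
    except (AttributeError, RuntimeError):
        op = None  # fbgemm_gpu not installed
    if op is not None and torch.cuda.is_available():
        return op(proto, sizes, host_mapped)
    # Fallback for illustration only: ordinary pageable CPU memory.
    return proto.new_empty(sizes)

proto = torch.empty(0)
t = alloc_unified(proto, [4, 8], host_mapped=True)
print(tuple(t.shape))  # (4, 8)
```

The prototype tensor supplies the dtype and other tensor options for the new allocation, which is why every allocator in this family takes a self argument.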
Tensor new_vanilla_managed_tensor(const Tensor &self, const std::vector<std::int64_t> &sizes)

Allocate an at::Tensor with unified managed memory (UVM), but allow its preferred storage location to be automatically managed.

Parameters:
    self – The input tensor
    sizes – The target tensor dimensions

Returns:
    A new tensor backed by UVM
bool uvm_storage(const Tensor &self)

Check if a tensor is allocated with UVM (either a CPU or a GPU tensor).

Parameters:
    self – The input tensor

Returns:
    true if the tensor is allocated with UVM, otherwise false
bool is_uvm_tensor(const Tensor &self)

Check if a tensor is allocated with UVM, but is not a CPU tensor.

Parameters:
    self – The input tensor

Returns:
    true if the tensor is a non-CPU tensor allocated with UVM, otherwise false
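The difference between the two checks can be shown with a small helper. This is a sketch using the operator names from this page, guarded so it degrades gracefully when fbgemm_gpu is not installed:

```python
import torch

def _op(name):
    # Look up an fbgemm operator if registered; None otherwise (sketch).
    try:
        return getattr(torch.ops.fbgemm, name)
    except (AttributeError, RuntimeError):
        return None

def describe_storage(t: torch.Tensor) -> str:
    uvm_storage = _op("uvm_storage")
    is_uvm_tensor = _op("is_uvm_tensor")
    if uvm_storage is None:
        return "not-uvm"       # fbgemm_gpu not available
    if is_uvm_tensor(t):
        return "uvm-gpu"       # UVM-backed, and not a CPU tensor
    if uvm_storage(t):
        return "uvm-cpu"       # UVM-backed CPU tensor
    return "not-uvm"           # ordinary (non-UVM) allocation

print(describe_storage(torch.zeros(2)))
```

In other words, is_uvm_tensor(t) implies uvm_storage(t), but not the other way around.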
Tensor uvm_to_cpu(const Tensor &self)

Convert a UVM tensor to a CPU tensor.

Parameters:
    self – The input tensor

Returns:
    A new tensor that is effectively the input moved from UVM to CPU
Tensor uvm_to_device(const Tensor &self, const Tensor &prototype)

Create a new UVM tensor that shares the same device and UVM storage as prototype.

Parameters:
    self – The input tensor
    prototype – The target tensor whose device and UVM storage will be shared with the new tensor

Returns:
    A new tensor that shares the same device and UVM storage as prototype
void uvm_cuda_mem_advise(const Tensor &self, int64_t cuda_memory_advise)

Call cudaMemAdvise() on a UVM tensor's storage. The cudaMemoryAdvise enum is exposed on the Python side in the fbgemm_gpu.uvm namespace; see its documentation for valid values.

See also: The CUDA runtime documentation for the cudaMemoryAdvise enum.

Parameters:
    self – The input tensor
    cuda_memory_advise – The cudaMemoryAdvise enum value, as an integer
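For example, marking a UVM tensor's pages as read-mostly might look like the sketch below. The integer 1 corresponds to cudaMemAdviseSetReadMostly in the CUDA runtime's cudaMemoryAdvise enum; in real code, prefer the constants exported by fbgemm_gpu.uvm over a hard-coded value:

```python
import torch

# cudaMemAdviseSetReadMostly == 1 in the CUDA runtime enum (hard-coded
# here for illustration; use the fbgemm_gpu.uvm constants in practice).
CUDA_MEM_ADVISE_SET_READ_MOSTLY = 1

def advise_read_mostly(t: torch.Tensor) -> None:
    # Sketch: no-op when fbgemm_gpu is not installed. Note the operator
    # expects a UVM-backed tensor (uvm_storage(t) == true).
    try:
        op = torch.ops.fbgemm.uvm_cuda_mem_advise
    except (AttributeError, RuntimeError):
        return
    op(t, CUDA_MEM_ADVISE_SET_READ_MOSTLY)

advise_read_mostly(torch.zeros(4))
```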
void uvm_cuda_mem_prefetch_async(const Tensor &self, std::optional<Tensor> device_t)

Call cudaMemPrefetchAsync() on a UVM tensor's storage to prefetch memory to a destination device.

See also: The CUDA runtime documentation for cudaMemPrefetchAsync().

Parameters:
    self – The input tensor
    device_t – [OPTIONAL] The tensor whose device will be the prefetch destination
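A typical pattern is to advise and then prefetch ahead of a kernel launch. A guarded sketch, where the destination device is taken from the optional target tensor:

```python
from typing import Optional
import torch

def prefetch_to(t: torch.Tensor, target: Optional[torch.Tensor] = None) -> None:
    # Sketch: prefetch a UVM tensor's pages toward target's device.
    # Silently a no-op when fbgemm_gpu is not installed.
    try:
        op = torch.ops.fbgemm.uvm_cuda_mem_prefetch_async
    except (AttributeError, RuntimeError):
        return
    op(t, target)

prefetch_to(torch.zeros(4))
```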
void uvm_mem_advice_dont_fork(const Tensor &self)

Call madvise(..., MADV_DONTFORK) on a UVM tensor's storage. This is a workaround for an issue where the UVM kernel driver unmaps UVM storage pages from the page table on fork, causing a slowdown on the next CPU access.

See also: The madvise() man page.

Parameters:
    self – The input tensor
Tensor uvm_to_cpu_clone(const Tensor &self)

Copy a UVM tensor's contiguous storage (i.e., uvm_storage(self) must return true) into a new CPU tensor. The copy uses a single-threaded memcpy().

Parameters:
    self – The input tensor

Returns:
    A new CPU tensor containing the data copied from the UVM tensor
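For ordinary (non-UVM) tensors, Tensor.cpu().clone() produces an equivalent result, which suggests the following guarded sketch. The fallback path is an assumption for illustration, not part of the FBGEMM API:

```python
import torch

def cpu_copy(t: torch.Tensor) -> torch.Tensor:
    # Sketch: use the FBGEMM single-threaded memcpy clone for UVM tensors,
    # falling back to a regular .cpu().clone() otherwise.
    try:
        fb_uvm_storage = torch.ops.fbgemm.uvm_storage
        fb_clone = torch.ops.fbgemm.uvm_to_cpu_clone
    except (AttributeError, RuntimeError):
        return t.cpu().clone()  # fbgemm_gpu not installed
    if fb_uvm_storage(t):
        return fb_clone(t)
    return t.cpu().clone()

src = torch.arange(6).reshape(2, 3)
dst = cpu_copy(src)
print(dst.device.type, bool(torch.equal(src, dst)))  # cpu True
```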