vickieGPT - Page 14 - vickieGPT’s blog

August 5, 2022September 14, 2022

Resource-Centric Serverless Computing

FAAS对resource(CPU/Memory) disaggregation的map-reduce化?这显然是一个完全不考虑NUMA/NIC latency的考虑，就是在说dynamic memory allocation/placement大过latency的tradeoff,重点落在了怎么藏RDMA/NUMA的时延.但是为什么要套一个FAAS呢?然后就和MemTrade没啥大区别,只是多家了一套用户配置.

Reference

Resource-Centric Serverless Computing

August 3, 2022September 17, 2022

TMO: Transparent Memory Offloading in datacenters

Both of the papers are from Dimitrios

Memory offloading

Because the memory occupation on a single node is huge, we are required to offload them into far memory.

They have to model what the memory footprint is like. And what's shown in the previous work zswap, it only has a single slow memory tier with compressed memory and they only have offline application profiling, which the metric is merely page-promotion rate.

Transparent memory offloading

Memory Tax comes can be triggered by infrastructure-level functions like packaging, logging, and profiling and microservices like routing and proxy. The primary target of offloading is memory tax SLA.

TMO basically sees through the resulting performance info like pressure stall info to predict how much memory to offload.

Then they use the PSI tracking to limit the memory/IO/CPU using cgroup, which they called Senpai.

IOCost reclaims not frequently used pages to SSD.

Reference

Jing Liu's blog
Software-Defined Far Memory in Warehouse-Scale Computers
Cerebros: Evading the RPC Tax in Datacenters
Beyond malloc efficiency to fleet efficiency: a hugepage-aware memory allocator

July 30, 2022August 10, 2022

TPP: Transparent Page Multi-tier CXL-Memory

CXL在这里的作用可以理解为External cache coherent agent，一个带CC的iMC，带宽肯定比NUMA高，但是这个工作只提升了18%，他们的策略是用PMU collect data，然后冷热load 和flush，给个temperature的grading，个人觉得肯定有page coloring那种优秀的算法搞定这件事，比这个数高。

July 30, 2022July 7, 2023

MQSim

最近看的一个高精度multi-queue SSD模拟。

https://blog.csdn.net/Alieon/article/details/109920819

July 29, 2022August 9, 2022

Application-Informed Kernel Synchronization Primitives

一个runtime dynamic profiling/loading guided lock primitive，可以同时兼顾到high/no lock contention的情况。

Continue reading "Application-Informed Kernel Synchronization Primitives"

July 18, 2022September 14, 2022

IOring Windows at first sight and migration to `monoio`

最近在和LemonHX一起写个跨平台下载器，想要的是个延时确定的协程调度器，然后我们就看上了字节开源的monoio，准备贡献一波Windows部分。

主要需要跨平台抽象的部分已经写好了, GAT 刚进主线, 其实感觉贡献这个更经济一点. 字节内好像也没有开始用这个, 只是做了点测试.

July 18, 2022August 5, 2022

Twizzler: a Data-Centric OS for Non-Volatile Memory

A Distributed NVM modeled OS runtime for the arrangement of NVM data structure that

Does not require explicit (un)loading
Less serialization (Context available persistent ptr to reduce memcpy).
Has basic support for security/ share/ crash atomicity

Continue reading "Twizzler: a Data-Centric OS for Non-Volatile Memory"

July 15, 2022July 21, 2022

The A-Z of my Ph.D. Trajectory

On Arrival

Familiarizing Yourself with Your New Workplace and City. - Move in and buy stuff
1. Search for gym/canteen
2. good food for celebration/lowering weight food

Continue reading "The A-Z of my Ph.D. Trajectory"

July 15, 2022July 23, 2022