image adaptive 3dlut based on deep learning

news2025/7/4 13:50:25

文章目录

- image adaptive 3dlut based on deep learning
- - 1. Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time
  - 2. CLUT-Net: Learning Adaptively Compressed Representations of 3DLUTs for Lightweight Image Enhancement
  - - 2.1 3dlut分析
    - 2.2 具体方法
    - 2.3 主要原理
    - 2.4 实验结果
  - 3. 4D LUT: Learnable Context-Aware 4D Lookup Table for Image Enhancement
  - 4. RSFNet A White-Box Image Retouching Approach using Region-Specific Color Filters
  - - 4.1 选择10个图像处理方法（或者叫做filter）
    - 4.2 预测每个filter的参数
  - 5. Flexible Piecewise Curves Estimation for Photo Enhancement
  - - 5.1 什么是PNG curve
    - 5.2 网络结构
    - 5.3 Spatial-Adaptive Confidence Map Fusion
  - 6. Neural Color Operators for Sequential Image Retouching
  - - 6.1 NOP (neural color operators)
    - 6.2 strength predictor就是一个小网络预测三个 NOP 的强度。
  - 7. AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-time Image Enhancement
  - 8. SepLUT: Separable Image-adaptive Lookup Tables for Real-time Image Enhancement

image adaptive 3dlut based on deep learning

1. Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time

在这里插入图片描述

图像输入一个卷积网络输出3个weight,
初始化3个3dlut

weight 和 3dlut 合成为一个，然后三线性插值得到 pred, 与target建立损失。

2. CLUT-Net: Learning Adaptively Compressed Representations of 3DLUTs for Lightweight Image Enhancement

2.1 3dlut分析

Given a specific color channel 𝑐 where 𝑐 ∈ {𝑟, 𝑔, 𝑏} and the other two channels denoted by 𝑥
and 𝑦, we find that the output value 3Dlut(𝑐) is strongly correlated to the input value of channel 𝑐 while weakly correlated to the input values 𝑥𝑖𝑛, 𝑦𝑖𝑛 of channel 𝑥 and 𝑦, respectively.

意思是R 通道的3Dlut 与R相关性更大，与GB通道相关性小
G 通道的3Dlut 与R相关性更大，与RB通道相关性小
B 通道的3Dlut 与R相关性更大，与RG通道相关性小

因此，对于R通道的3Dlut, 原本是 17 * 17 * 17 个节点，作者替换为 S * W
请添加图片描述

2.2 具体方法

主要是矩阵分解的思想，然后再重建
重建：
在这里插入图片描述

由两个矩阵 Ms, Mw, 压缩后的Clut 重建为原始 3dlut

2.3 主要原理

在这里插入图片描述

主要是对3dlut进行压缩处理，降低参数量，提高效率。

首先同样是学习得到 weight 和 basis Cluts。
然后还有两个矩阵需要学习得到。一共这三个模块

其中bisis Cluts和两个矩阵 Ms, Mw在推理阶段是不变化的。

2.4 实验结果

FiveK: PSNR, SSIM， deltaE 三种评价标准
在这里插入图片描述

3. 4D LUT: Learnable Context-Aware 4D Lookup Table for Image Enhancement

在这里插入图片描述

4D lut：输入r,g,b,context 输出r,g,b
增加一个图像内容context map : achieve content-dependent image enhancement
在这里插入图片描述

原理和3dlut类似, 框架如下很容易明白：

学习 weight, bisis 4dluts, context map

本来生成的3dlut就是image-adaptive，因为weight是每个图像都不同的。这篇论文又多一个维度说是content map, 这样效果就有提升？

作者实验确实有提升，而且context map越大的地方相比3dlut提升越好：
在这里插入图片描述

4. RSFNet A White-Box Image Retouching Approach using Region-Specific Color Filters

4.1 选择10个图像处理方法（或者叫做filter）

We select 10 commonly used retouching filters from traditional tools(e.g., Davinci Resolve)
to represent adjustment manipulations, including contrast,
saturation, hue, temperature, shadows, midtones, highlights
and shift.