Editor's Note:
In this installment of "Selected Papers from Operations Research," we present a themed selection of interesting articles from Operations Research, not only summarizing and commenting on each paper's content but also outlining its structure, with the aim of sparking readers' interest and curiosity to explore further. The theme of this issue is "pricing problems," touching on dynamic pricing, Markov equilibria, unknown nonparametric models, machine learning, iterated greedy policies, and related topics.
Recommended Article 1
● Title: Strategic Pricing in Volatile Markets
● Journal: Operations Research
● Link: https://doi.org/10.1287/opre.2021.0550
● Authors: Sebastian Gryglewicz, Aaron Kolb
● Keywords: limit pricing • market entry • signaling • optimal stopping • stochastic games
● Abstract:
We study dynamic entry deterrence through limit pricing in markets subject to persistent demand shocks. An incumbent is privately informed about its costs, high or low, and can deter a Bayesian potential entrant by setting its prices strategically. The entrant can irreversibly enter the market at any time for a fixed cost, earning a payoff that depends on the market conditions and the incumbent’s unobserved type. Market demand evolves as a geometric Brownian motion. When market demand is low, entry becomes a distant threat, so there is little benefit to further deterrence, and, in equilibrium, a weak incumbent becomes tempted to reveal itself by raising its prices. We characterize a unique equilibrium in which the entrant enters when market demand is sufficiently high (relative to the incumbent’s current reputation), and the weak incumbent mixes over revealing itself when market demand is sufficiently low. In this equilibrium, pricing and entry decisions exhibit path dependence, depending not only on the market’s current size, but also its historical minimum.
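To make the model's moving parts concrete, here is a minimal Python sketch that simulates the market-demand process (a geometric Brownian motion) together with a stylized, path-dependent entry rule. The `entry_barrier` function, the drift and volatility values, and the fixed reputation level are all hypothetical illustrations, not the equilibrium objects derived in the paper.

```python
import numpy as np

# A minimal simulation sketch: market demand follows a geometric Brownian
# motion, and the entrant enters once demand is high enough relative to the
# incumbent's reputation. `entry_barrier` and all parameter values are
# illustrative assumptions, not the paper's equilibrium.

rng = np.random.default_rng(0)

mu, sigma = 0.02, 0.3          # assumed GBM drift and volatility
dt, horizon = 1 / 252, 10.0    # daily steps over a 10-year horizon
n_steps = int(horizon / dt)

def entry_barrier(reputation):
    # Hypothetical threshold: entry is triggered at higher demand levels
    # when the incumbent looks more likely to be the low-cost type.
    return 1.5 + reputation

x = 1.0            # current market demand
x_min = x          # running historical minimum of the demand path
reputation = 0.5   # entrant's belief that the incumbent is low-cost (held fixed here)
entry_time = None

for t in range(1, n_steps + 1):
    # Exact GBM update over one time step.
    x *= np.exp((mu - 0.5 * sigma**2) * dt + sigma * np.sqrt(dt) * rng.standard_normal())
    x_min = min(x_min, x)  # the equilibrium is path dependent via this running minimum
    if x >= entry_barrier(reputation):
        entry_time = t * dt
        break

print(f"running minimum of demand: {x_min:.3f}; entry time (years): {entry_time}")
```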
● Structure:
● Comments:
This article studies the dynamic equilibrium between a market incumbent and a potential entrant. It proposes a Markov-equilibrium-based (U, L) model and briefly surveys extensions of the model. The results show that entry is delayed after a downturn, leading to slower recoveries, and that incumbents are more likely to survive recessions. The paper opens a new line of research on deterring market entry.
Recommended Article 2
● Title: Dynamic Pricing with Unknown Nonparametric Demand and Limited Price Changes
● Journal: Operations Research
● Link: https://doi.org/10.1287/opre.2020.0445
● Authors: Georgia Perakis, Divya Singhvi
● Keywords: learning • dynamic pricing • nonparametric models • limited price changes
● Abstract:
We consider the dynamic pricing problem of a retailer who does not have any information on the underlying demand for a product. The retailer aims to maximize cumulative revenue collected over a finite time horizon by balancing two objectives: learning demand and maximizing revenue. The retailer also seeks to reduce the amount of price experimentation because of the potential costs associated with price changes. Existing literature solves this problem in cases where the unknown demand is parametric. We consider the pricing problem when demand is nonparametric. We construct a pricing algorithm that uses second order approximations of the unknown demand function and establish when the proposed policy achieves a near-optimal rate of regret, $\widetilde{O}(\sqrt{T})$, while making $O(\log\log T)$ price changes. Hence, we show considerable reduction in price changes from the previously known $o(\log T)$ rate of price change guarantee in the literature. We also perform extensive numerical experiments to show that the algorithm substantially improves over existing methods in terms of the total price changes, with comparable performance on the cumulative regret metric.
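The $O(\log\log T)$ guarantee can be understood through a phase-counting argument: if the price is held fixed within phases whose lengths grow roughly like $T^{1-2^{-k}}$, the horizon is exhausted after doubly-logarithmically many phases. The Python sketch below illustrates only this counting argument under that assumed schedule; it is not the authors' pricing algorithm, which additionally fits second-order approximations of demand to choose the price for each phase.

```python
import numpy as np

# Counting sketch only: hold the price fixed within phases whose lengths
# grow like T**(1 - 2**(-k)). The horizon T is then exhausted after
# O(log log T) phases, i.e., O(log log T) price changes. The schedule is
# an assumed illustration, not the authors' algorithm.

def phase_lengths(T):
    lengths, used, k = [], 0, 1
    while used < T:
        # Phase k lasts about T**(1 - 2**(-k)) periods (capped at the horizon).
        L = min(int(np.ceil(T ** (1 - 2.0 ** (-k)))), T - used)
        lengths.append(L)
        used += L
        k += 1
    return lengths

for T in (10**3, 10**5, 10**7):
    print(f"T = {T:>8}: {len(phase_lengths(T))} phases")
```

Running this, the phase count barely moves as $T$ grows by four orders of magnitude, which is exactly the doubly-logarithmic behavior behind the $O(\log\log T)$ bound on price changes.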
● Structure:
● Comments:
This article studies the dynamic pricing problem of a retailer selling a single product when the underlying demand is unknown and nonparametric, with the goal of setting prices well while reducing the number of pricing experiments. The authors construct a dynamic pricing policy that uses a second-order approximation of the nonparametric demand function to generate future prices. The proposed policy performs well both analytically and numerically, offering retailers a practical and effective approach to pricing new products that reduces costs and maximizes revenue.
Recommended Article 3
● Title: Dynamic Pricing and Learning with Discounting
● Journal: Operations Research
● Link: https://doi.org/10.1287/opre.2023.2477
● Authors: Zhichao Feng, Milind Dawande, Ganesh Janakiraman, Anyan Qi
● Keywords: dynamic pricing • learning • discounting • regret minimization
● Abstract:
In many practical settings, learning algorithms can take a substantial amount of time to converge, thereby raising the need to understand the role of discounting in learning. We illustrate the impact of discounting on the performance of learning algorithms by examining two classic and representative dynamic-pricing and learning problems studied in Broder and Rusmevichientong (BR) [Broder J, Rusmevichientong P (2012) Dynamic pricing under a general parametric choice model. Oper. Res. 60(4):965–980] and Keskin and Zeevi (KZ) [Keskin NB, Zeevi A (2014) Dynamic pricing with an unknown demand model: Asymptotically optimal semi-myopic policies. Oper. Res. 62(5):1142–1167]. In both settings, a seller sells a product with unlimited inventory over $T$ periods. The seller initially does not know the parameters of the general choice model in BR (respectively, the linear demand curve in KZ). Given a discount factor $\rho$, the retailer's objective is to determine a pricing policy to maximize the expected discounted revenue over $T$ periods. In both settings, we establish lower bounds on the regret under any policy and show limiting bounds of $\Omega(\sqrt{1/(1-\rho)})$ and $\Omega(\sqrt{T})$ when $T \to \infty$ and $\rho \to 1$, respectively. In the model of BR with discounting, we propose an asymptotically tight learning policy and show that the regret under our policy, as well as that under the MLE-CYCLE policy in BR, is $O(\sqrt{1/(1-\rho)})$ (respectively, $O(\sqrt{T})$) when $T \to \infty$ (respectively, $\rho \to 1$). In the model of KZ with discounting, we present sufficient conditions for a learning policy to guarantee asymptotic optimality and show that the regret under any policy satisfying these conditions is $O(\log(1/(1-\rho))\sqrt{1/(1-\rho)})$ (respectively, $O(\log T\,\sqrt{T})$) when $T \to \infty$ (respectively, $\rho \to 1$). We show that three different policies, namely the two variants of the greedy iterated least squares policy in KZ and a different policy that we propose, achieve this upper bound on the regret. We numerically examine the behavior of the regret under our policies, as well as those in BR and KZ, in the presence of discounting. We also analyze a setting in which the discount factor per period is a function of the number of decision periods in the planning horizon.
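As a rough illustration of the kind of policy discussed above, the following Python sketch runs a greedy iterated-least-squares pricing loop on a linear demand model and accumulates discounted revenue and discounted regret. It is a toy in the spirit of (but not identical to) KZ's iterated least squares policies; the demand parameters, noise level, discount factor, and the $t^{-1/4}$ forced-exploration magnitude are all illustrative assumptions.

```python
import numpy as np

# Toy greedy iterated-least-squares pricing loop with discounted revenue
# accounting. True demand is d_t = a - b*p_t + noise; a, b, sigma, rho, and
# the t**(-1/4) perturbation size are illustrative assumptions.

rng = np.random.default_rng(1)
a, b, sigma = 10.0, 2.0, 0.5      # true (unknown to the seller) demand parameters
rho, T = 0.999, 2000              # discount factor and number of periods
p_star = a / (2 * b)              # revenue-maximizing price under the true model

prices = [1.0, 4.0]               # two initial exploratory prices
demands = [a - b * p + sigma * rng.standard_normal() for p in prices]

disc_revenue = disc_optimal = 0.0
for t in range(2, T):
    # Least-squares estimate of (a, b) from the price/demand history.
    X = np.column_stack([np.ones(len(prices)), -np.array(prices)])
    a_hat, b_hat = np.linalg.lstsq(X, np.array(demands), rcond=None)[0]
    # Greedy price for the estimated model, plus a small forced perturbation
    # so the estimates stay identifiable (pure greedy pricing can stall).
    p_greedy = a_hat / (2 * max(b_hat, 1e-3))
    p_t = max(p_greedy + t ** (-0.25) * rng.choice([-1.0, 1.0]), 0.05)
    d_t = a - b * p_t + sigma * rng.standard_normal()
    prices.append(p_t)
    demands.append(d_t)
    disc_revenue += rho ** t * p_t * d_t
    disc_optimal += rho ** t * p_star * (a - b * p_star)

print(f"discounted regret of the toy policy: {disc_optimal - disc_revenue:.2f}")
```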
● Structure:
● Comments:
This article studies the impact of discounting on the performance of learning algorithms through the two classic and representative dynamic-pricing and learning problems of BR and KZ, and examines how discounting can be incorporated into algorithms such as CILS that explore and exploit simultaneously. The analysis makes a substantial contribution to understanding how nonstationary demand affects dynamic pricing, and to understanding demand learning in the presence of discounting.