Quadratic optimal transportation problem with a positive semi definite structure on the cost function
1 introduction, 1.1 introduction.
Optimal Transportation problem (OT) has been an active research area in last decades, and many questions in the field are studied a lot. In addition, as OT gives a way to metrize a probability space, OT has a wide range of applications. While OT was used in wide application area, there were some application situations where a few aspects of OT needed to be modified to fit in the situation. As such, there has been a lot of versions of OT such as multi-marginal OT ( [ 23 ] , [ 16 ] , [ 22 ] , [ 11 ] ), optimal partial transportation problem( [ 9 ] , [ 17 ] , [ 8 ] ), congested transportation problem( [ 6 ] ), martingale transportation problem( [ 3 ] , [ 13 ] , [ 2 ] ), co-OT( [ 24 ] ), etc. In this paper, we introduce another version of OT.
Problem (0-1) .
Problem LABEL:Prbm_0-1 can be regarded as a quadratic version of OT, written in Mongeβs formulation. The problem that corresponds Katorovichβs relaxed formation of OT is the following.
Problem (0-2) .
Some readers may familiar with this problem already due to the studies about Gromov-Wasserstein distance.
This type of quantity can also be found in [ 25 ] , where the author defines L p subscript πΏ π L_{p} italic_L start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT distortion distance between two metric measure spaces in the form of ( 1 ) and construct the space of metric measure spaces. We would like to comment that there is another result [ 4 ] by R. Cabrera about where the author studies Problem LABEL:Prbm_0-1 with dynamic setting.
The motivation of Problem LABEL:Prbm_0-1 in this paper, however, arises from a data dimension reduction algorithms such as t-SNE [ 26 ] and UMAP [ 21 ] . In data science, it is an important problem to design an algorithm which let us save the data points from very high dimension in low dimension such as β 2 superscript β 2 \mathbb{R}^{2} blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT or β 3 superscript β 3 \mathbb{R}^{3} blackboard_R start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT preserving some structures of the point clouds. For instance, in t-SNE algorithm which is designed in [ 26 ] , the authors define affinities p i β’ j subscript π π π p_{ij} italic_p start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT and q i β’ j subscript π π π q_{ij} italic_q start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT which quantifies similarity of two points x i subscript π₯ π x_{i} italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and x j subscript π₯ π x_{j} italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in high dimension and y i subscript π¦ π y_{i} italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and y j subscript π¦ π y_{j} italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in low dimension respectively, and use KL-divergence β p β’ log β‘ p q π π π \sum p\log\frac{p}{q} β italic_p roman_log divide start_ARG italic_p end_ARG start_ARG italic_q end_ARG to measure the difference between p i β’ j subscript π π π p_{ij} italic_p start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT and q i β’ j subscript π π π q_{ij} italic_q start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT . Then t-SNE algorithm seeks a set of points in the low dimension which minimizes the KL-divergence. Existence of such set of points { y i } subscript π¦ π \{y_{i}\} { italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } in the low dimension was studied in [ 1 ] , and [ 14 ] . Then, letting T π T italic_T be a function such that T β’ ( x i ) = y i π subscript π₯ π subscript π¦ π T(x_{i})=y_{i} italic_T ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , we see that T π T italic_T solves Problem LABEL:Prbm_0-1 with c = p β’ log β‘ p q π π π π c=p\log\frac{p}{q} italic_c = italic_p roman_log divide start_ARG italic_p end_ARG start_ARG italic_q end_ARG . Hence, to study the structure of the projection T π T italic_T , it is necessary to study Problem LABEL:Prbm_0-1 .
The goal of this paper is to introduce the quadratic optimal transportation problem . We explore some basic properties of the quadratic optimal transportation problem analogous to OT, and discuss extra structures of the quadratic optimal transportation problem. The main theorem of this paper is the duality for the quadratic transportation problem with a structural assumption on the cost function.
Theorem (Duality, rough statement) .
For squared cost function c π c italic_c , we have the following identity
where the infimum and supreme are taken over some admissible sets.
The squared cost function is defined in Section 3 and the admissible sets are described in Section 2 and 4 .
This paper consists as follows. In section 2 , we introduce the quadratic optimal transportation problem and compare the problem with other versions of OT. In section 3 , we explore some basic properties of the quadratic optimal transportation problem. In 4 , we provide a duality formula for a certain type of cost function. In the appendices, we derive a related partial differential equation formally and provide some background theory for the classical optimal transportation problem which are used in this paper.
1.2 Notations
2 quadratic transportation problem, 2.1 quadratic transportation problem.
We restate the problem that we consider in this paper in detail. Let c : Z Γ Z β β : π β π π β c:Z\times Z\to\mathbb{R} italic_c : italic_Z Γ italic_Z β blackboard_R be a lower semi-continuous function. Then the problem states as follows.
Problem 1 .
Find a measurable function T : X β Y : π β π π T:X\to Y italic_T : italic_X β italic_Y which achieves the following infimum
among the measurable functions that satisfies T β― β’ ΞΌ = Ξ½ subscript π β― π π T_{\sharp}\mu=\nu italic_T start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_ΞΌ = italic_Ξ½ , where T β― β’ ΞΌ subscript π β― π T_{\sharp}\mu italic_T start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_ΞΌ is the push-forward measure.
We call the above problem Monge formulation of the quadratic optimal transportation problem, or simply Monge QOT. We call a solution to Monge QOT a Monge solution. Monge QOT is highly non-linear with respect to T π T italic_T . However, Using Kantorovichβs relaxation as in the classical optimal transportation theory, we can reduce the non-linearity to the quadratic dependency. We first define
Then the Kantorovich formulation of the quadratic optimal transportation problem (Kantorovich QOT) is the following.
Problem 2 .
Find Ο 0 β Ξ β’ ( ΞΌ , Ξ½ ) subscript π 0 Ξ π π \pi_{0}\in\Pi(\mu,\nu) italic_Ο start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT β roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) that achieves the following infimum.
We call a solution to the Kantorovich QOT a Kantorovich solution. We will often use quadratic optimal transportation problem or QOT to refer one of Monge QOT or Kantorovich QOT, and we will often say solution when there is no need for distinguishing the Monge solutions and the Kantorovich solutions.
The Kantorovich formulation of the classical optimal transportation problem allow us to view the problem in the perspective of linear porgramming. In the quadratic optimal transportation case, the relaxed problem 2 can be considered as a quadratic programming.
2.2 Comparison with other variations of OT
In this subsection, we claim that the solutions of the quadratic optimal transportation are different from the solutions to other versions of the optimal transportation problem with naive modifications.
2.2.1 Classical optimal transportation problems
We start with the classical optimal transportation problems. In this problem, we consider a cost function c : X Γ Y β β : π β π π β c:X\times Y\to\mathbb{R} italic_c : italic_X Γ italic_Y β blackboard_R and measures Ο π \pi italic_Ο which have given marginals ΞΌ β π« β’ ( X ) π π« π \mu\in\mathcal{P}(X) italic_ΞΌ β caligraphic_P ( italic_X ) and Ξ½ β π« β’ ( Y ) π π« π \nu\in\mathcal{P}(Y) italic_Ξ½ β caligraphic_P ( italic_Y ) . We define total cost π : π« β’ ( X Γ Y ) β β βͺ { Β± β } : π β π« π π β plus-or-minus \mathcal{C}:\mathcal{P}(X\times Y)\to\mathbb{R}\cup\{\pm\infty\} caligraphic_C : caligraphic_P ( italic_X Γ italic_Y ) β blackboard_R βͺ { Β± β } by π β’ ( Ο ) = β« c β’ π Ο π π π differential-d π \mathcal{C}(\pi)=\int cd\pi caligraphic_C ( italic_Ο ) = β« italic_c italic_d italic_Ο . Then we seek a probability measure in Ξ β’ ( ΞΌ , Ξ½ ) Ξ π π \Pi(\mu,\nu) roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) which minimize the total cost.
Problem 3 .
In the quadratic optimal transportation problem, we consider probability measures with the same marginal conditions in Ξ β’ ( ΞΌ , Ξ½ ) Ξ π π \Pi(\mu,\nu) roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) , but we integrate the cost function against the measure twice. Hence, one may consider Problem 3 with Ο β Ο tensor-product π π \pi\otimes\pi italic_Ο β italic_Ο as a measure in π« β’ ( X 2 Γ Y 2 ) π« superscript π 2 superscript π 2 \mathcal{P}(X^{2}\times Y^{2}) caligraphic_P ( italic_X start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT Γ italic_Y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) which is in Ξ β’ ( ΞΌ β ΞΌ , Ξ½ β Ξ½ ) Ξ tensor-product π π tensor-product π π \Pi(\mu\otimes\mu,\nu\otimes\nu) roman_Ξ ( italic_ΞΌ β italic_ΞΌ , italic_Ξ½ β italic_Ξ½ ) .
We claim that the solution to this problem is different from the solutions to the QOT in general with an example. Let us describe the idea before we give an example. The idea is to see the structure of the solutions to the two problems. If QOT has a Monge solution T π T italic_T , then Ο = ( I β’ d Γ T ) β― β’ ΞΌ π subscript πΌ π π β― π \pi=(Id\times T)_{\sharp}\mu italic_Ο = ( italic_I italic_d Γ italic_T ) start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_ΞΌ and Ο β Ο = ( I β’ d Γ T Γ I β’ d Γ T ) β― β’ ΞΌ tensor-product π π subscript πΌ π π πΌ π π β― π \pi\otimes\pi=(Id\times T\times Id\times T)_{\sharp}\mu italic_Ο β italic_Ο = ( italic_I italic_d Γ italic_T Γ italic_I italic_d Γ italic_T ) start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_ΞΌ . In particular, if Ο β Ο tensor-product π π \pi\otimes\pi italic_Ο β italic_Ο is a Kantorovich solution of Problem LABEL:prbm:_OT2 , then ( T , T ) π π (T,T) ( italic_T , italic_T ) will be the Monge solution to the Problem LABEL:prbm:_OT2 . Then the horizontal and vertical slices of X Γ X π π X\times X italic_X Γ italic_X will be mapped to the horizontal and vertical slices of Y Γ Y π π Y\times Y italic_Y Γ italic_Y respectively. This is not true for the solutions to the classical optimal transportation problem in general.
Example 1 .
π¦ Β― π¦ c(x,y,\overline{x},\overline{y})=-(x+\overline{x})(y+\overline{y}) italic_c ( italic_x , italic_y , overΒ― start_ARG italic_x end_ARG , overΒ― start_ARG italic_y end_ARG ) = - ( italic_x + overΒ― start_ARG italic_x end_ARG ) ( italic_y + overΒ― start_ARG italic_y end_ARG ) . We let ΞΌ π \mu italic_ΞΌ and Ξ½ π \nu italic_Ξ½ be the probability measures on [ 0 , 1 ] 0 1 [0,1] [ 0 , 1 ] defined by
Lemma 2.1 .
Define D Β― k subscript Β― π· π \overline{D}_{k} overΒ― start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and D Β― k subscript Β― π· π \underline{D}_{k} underΒ― start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT by
subscript π¦ 0 subscript Β― π¦ 0 l=y_{0}+\overline{y}_{0} italic_l = italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + overΒ― start_ARG italic_y end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT . Then
Corollary 2.2 .
we first note that
as Ο β Ο β Ξ β’ ( ΞΌ β ΞΌ , Ξ½ β Ξ½ ) tensor-product π π Ξ tensor-product π π tensor-product π π \pi\otimes\pi\in\Pi(\mu\otimes\mu,\nu\otimes\nu) italic_Ο β italic_Ο β roman_Ξ ( italic_ΞΌ β italic_ΞΌ , italic_Ξ½ β italic_Ξ½ ) for any Ο β Ξ β’ ( ΞΌ , Ξ½ ) π Ξ π π \pi\in\Pi(\mu,\nu) italic_Ο β roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) . Therefore, we only need to show that the equality is not attained. Suppose we have the equality
Note that by existence of the solution of QOT (Proposition 3.2 ), β Ο β β Ξ β’ ( ΞΌ , Ξ½ ) superscript π Ξ π π \exists\pi^{*}\in\Pi(\mu,\nu) β italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT β roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) which is attains the infimum. Then Ο β β Ο β tensor-product superscript π superscript π \pi^{*}\otimes\pi^{*} italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT β italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT is also a solution to Problem LABEL:prbm:_OT2 . Let ( x , y ) β spt β’ ( Ο β ) π₯ π¦ spt superscript π (x,y)\in\mathrm{spt}(\pi^{*}) ( italic_x , italic_y ) β roman_spt ( italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT ) , then ( x , y , x , y ) β spt β’ ( Ο β β Ο β ) π₯ π¦ π₯ π¦ spt tensor-product superscript π superscript π (x,y,x,y)\in\mathrm{spt}(\pi^{*}\otimes\pi^{*}) ( italic_x , italic_y , italic_x , italic_y ) β roman_spt ( italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT β italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT ) and by Lemma 2.1 , we observe
Explicit computation with the definitions of ΞΌ π \mu italic_ΞΌ and Ξ½ π \nu italic_Ξ½ , one can observe that y π¦ y italic_y is given by a function of x π₯ x italic_x . In particular, if y = T β’ ( x ) π¦ π π₯ y=T(x) italic_y = italic_T ( italic_x ) , one can compute
On the other hand, we also have Ξ½ = T β― β’ ΞΌ π subscript π β― π \nu=T_{\sharp}\mu italic_Ξ½ = italic_T start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_ΞΌ as spt β’ ( Ο β ) spt superscript π \mathrm{spt}(\pi^{*}) roman_spt ( italic_Ο start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT ) is on the graph of T π T italic_T . Then we can also compute
which is a contradiction. Therefore the equality ( 3 ) cannot be attained. β
Remark 2.3 .
By solving the formula of the cost function in Example 1 , one can find that the quadratic optimal transportation problem in Example 1 is in fact equivalent to a classical optimal transportation problem with c β’ ( x , y ) = β x β’ y π π₯ π¦ π₯ π¦ c(x,y)=-xy italic_c ( italic_x , italic_y ) = - italic_x italic_y .
This shows that for some special cases, the quadratic optimal transportation problem can still be regarded as a classical optimal transportation problem, but with appropriate adjustments.
2.2.2 Multi-marginal problems
In multi-marginal optimal transport problems, we consider probability measures ΞΌ i β π« β’ ( X i ) subscript π π π« subscript π π \mu_{i}\in\mathcal{P}(X_{i}) italic_ΞΌ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT β caligraphic_P ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and a cost function c : β i = 1 n X i β β : π β superscript subscript product π 1 π subscript π π β c:\prod_{i=1}^{n}X_{i}\to\mathbb{R} italic_c : β start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT β blackboard_R , and we seek a probability measure Ξ³ πΎ \gamma italic_Ξ³ that minimizes the total cost β« c β’ π Ξ³ π differential-d πΎ \int cd\gamma β« italic_c italic_d italic_Ξ³ among all the probability measures Ξ³ πΎ \gamma italic_Ξ³ that satisfy the marginal conditions Proj X i β― β’ Ξ³ = ΞΌ i subscript subscript Proj subscript π π β― πΎ subscript π π {\mathrm{Proj}_{X_{i}}}_{\sharp}\gamma=\mu_{i} roman_Proj start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_Ξ³ = italic_ΞΌ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT .
Problem 4 .
Then one can consider the multi-marginal problem with n = 4 π 4 n=4 italic_n = 4 , X 1 = X 3 = X subscript π 1 subscript π 3 π X_{1}=X_{3}=X italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = italic_X , X 2 = X 4 = Y subscript π 2 subscript π 4 π X_{2}=X_{4}=Y italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = italic_Y , ΞΌ 1 = ΞΌ 3 = ΞΌ subscript π 1 subscript π 3 π \mu_{1}=\mu_{3}=\mu italic_ΞΌ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_ΞΌ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = italic_ΞΌ and ΞΌ 2 = ΞΌ 4 = Ξ½ subscript π 2 subscript π 4 π \mu_{2}=\mu_{4}=\nu italic_ΞΌ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_ΞΌ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = italic_Ξ½ . We claim that this modification of multi-marginal problem does not give a solution to QOT in general. To see this, we consider the dimension of the supports of the solutions. In [ 23 ] , [ 16 ] , it is shown that under suitable condition, the solution of Problem 4 is given by a function, i.e. there exists F i : X 1 β X i : subscript πΉ π β subscript π 1 subscript π π F_{i}:X_{1}\to X_{i} italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT β italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 2 β€ i β€ n 2 π π 2\leq i\leq n 2 β€ italic_i β€ italic_n such that
is a solution to Problem 4 . Then if we consider X i = X subscript π π π X_{i}=X italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_X for all i π i italic_i where dim X = d dimension π π \dim X=d roman_dim italic_X = italic_d , we can observe that the dimension of the support of the solution will be d π d italic_d as it lies on the graph of a function whose domain has dimension d π d italic_d . On the other hand, if a solution Ο π \pi italic_Ο of QOT is given by a Monge solution T π T italic_T , then the support of the measure Ο β Ο tensor-product π π \pi\otimes\pi italic_Ο β italic_Ο lies in the graph of the function ( T , T ) π π (T,T) ( italic_T , italic_T ) which has 2 β’ d 2 π 2d 2 italic_d -dimensional domain.
Example 2 .
Let X i = [ 0 , 1 ] subscript π π 0 1 X_{i}=[0,1] italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ 0 , 1 ] for 1 β€ i β€ 4 1 π 4 1\leq i\leq 4 1 β€ italic_i β€ 4 and we consider
Let ΞΌ i = d x β [ 0 , 1 ] \mu_{i}=dx\lfloor_{[0,1]} italic_ΞΌ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_d italic_x β start_POSTSUBSCRIPT [ 0 , 1 ] end_POSTSUBSCRIPT for 1 β€ i β€ 4 1 π 4 1\leq i\leq 4 1 β€ italic_i β€ 4 . Then it is obvious that the infimum value of Problem 4 is 0 0 (Consider Ξ³ = ( I d Γ I d Γ I d Γ I d ) β― d x β [ 0 , 1 ] \gamma=(Id\times Id\times Id\times Id)_{\sharp}dx\lfloor_{[0,1]} italic_Ξ³ = ( italic_I italic_d Γ italic_I italic_d Γ italic_I italic_d Γ italic_I italic_d ) start_POSTSUBSCRIPT β― end_POSTSUBSCRIPT italic_d italic_x β start_POSTSUBSCRIPT [ 0 , 1 ] end_POSTSUBSCRIPT ). On the other hand, QOT with X = Y = [ 0 , 1 ] π π 0 1 X=Y=[0,1] italic_X = italic_Y = [ 0 , 1 ] , ΞΌ = Ξ½ = d x β [ 0 , 1 ] \mu=\nu=dx\lfloor_{[0,1]} italic_ΞΌ = italic_Ξ½ = italic_d italic_x β start_POSTSUBSCRIPT [ 0 , 1 ] end_POSTSUBSCRIPT and c β’ ( x , y , x Β― , y Β― ) π π₯ π¦ Β― π₯ Β― π¦ c(x,y,\overline{x},\overline{y}) italic_c ( italic_x , italic_y , overΒ― start_ARG italic_x end_ARG , overΒ― start_ARG italic_y end_ARG ) where c π c italic_c is defined above does not have the infimum value 0.
Therefore, solutions of multi-marginal problem does not directly implies solutions of QOT as
2.2.3 Co-optimal transportation problems
Co-optimal transportation problem was introduced in [ 24 ] .
Problem 5 .
This problem have the most similar look with Problem 2 as we get Problem 2 by setting Ο = Ο π π \pi=\rho italic_Ο = italic_Ο in Problem 5 . Still, if we consider the Problem 5 with ΞΌ 1 = ΞΌ 2 = ΞΌ subscript π 1 subscript π 2 π \mu_{1}=\mu_{2}=\mu italic_ΞΌ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_ΞΌ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_ΞΌ and Ξ½ 1 = Ξ½ 2 = Ξ½ subscript π 1 subscript π 2 π \nu_{1}=\nu_{2}=\nu italic_Ξ½ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_Ξ½ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_Ξ½ , Solutions to Problem 5 does not directly give solutions of QOT. Here, the idea of the example is look at the difference of the supports of Ο β Ο tensor-product π π \pi\otimes\rho italic_Ο β italic_Ο and Ο β Ο tensor-product π π \pi\otimes\pi italic_Ο β italic_Ο . Support of Ο β Ο tensor-product π π \pi\otimes\pi italic_Ο β italic_Ο always contains some part of the diagonal of ( X Γ Y ) 2 superscript π π 2 (X\times Y)^{2} ( italic_X Γ italic_Y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , while Ο β Ο tensor-product π π \pi\otimes\rho italic_Ο β italic_Ο does not have to contain any part of the diagonal of ( X Γ Y ) 2 superscript π π 2 (X\times Y)^{2} ( italic_X Γ italic_Y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . Hence, we can build an example by letting the cost function to have very high value on the diagonal of ( X Γ Y ) 2 superscript π π 2 (X\times Y)^{2} ( italic_X Γ italic_Y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .
Example 3 .
We give a discrete example. We let X = Y = { 0 , 1 } π π 0 1 X=Y=\{0,1\} italic_X = italic_Y = { 0 , 1 } and define the cost function c π c italic_c by
1 2 subscript πΏ 0 1 2 subscript πΏ 1 \mu=\nu=\frac{1}{2}\delta_{0}+\frac{1}{2}\delta_{1} italic_ΞΌ = italic_Ξ½ = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ΄ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_Ξ΄ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT . We first consider Problem 5 . Like stated above, we construct Ο π \pi italic_Ο and Ο π \rho italic_Ο so that spt β’ ( Ο β Ο ) spt tensor-product π π \mathrm{spt}(\pi\otimes\rho) roman_spt ( italic_Ο β italic_Ο ) does not contain any part of the diagonal of ( X Γ Y ) 2 superscript π π 2 (X\times Y)^{2} ( italic_X Γ italic_Y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . We let
Therefore, infimum of Problem 5 is 0 in this example. On the other hand, for any Ξ³ β Ξ β’ ( ΞΌ , Ξ½ ) πΎ Ξ π π \gamma\in\Pi(\mu,\nu) italic_Ξ³ β roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) , we have
Then we can compute
Therefore, we obtain
2.2.4 Other transportation problems
There are other variations of the optimal transportation problems such as weak optimal transportation problem, martingale optimal transportation problem, optimal transportation with congestion. Among these variations, optimal transportation with congestion problem considers interaction that occurs along the transportation. This problem was considered by [ 6 ] , but most of the study consider the equations describing the congestion. In [ 4 ] , however, the author adds a quadratic transportation term to describe the interaction.
3 Elementary properties
3.1 existence, non-uniqueness, and non-locality.
Existence of a Kantorovich solution to the Kantorovich QOT can be proved similarly to the existence proof of classical optimal transportation problem. Before we prove the existence, we show a short lemma about a tensor product of weakly converging sequences of probability measures.
Lemma 3.1 .
Suppose { ΞΌ k } k = 1 β superscript subscript subscript π π π 1 \{\mu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT and { Ξ½ k } k = 1 β superscript subscript subscript π π π 1 \{\nu_{k}\}_{k=1}^{\infty} { italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT be sequences of probability measures on Polish spaces X π X italic_X and Y π Y italic_Y that converge weakly to ΞΌ 0 subscript π 0 \mu_{0} italic_ΞΌ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and Ξ½ 0 subscript π 0 \nu_{0} italic_Ξ½ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT respectively. Then the sequence of probability measures { ΞΌ k β Ξ½ k } k = 1 β superscript subscript tensor-product subscript π π subscript π π π 1 \{\mu_{k}\otimes\nu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT weakly converges to ΞΌ 0 β Ξ½ 0 tensor-product subscript π 0 subscript π 0 \mu_{0}\otimes\nu_{0} italic_ΞΌ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .
By Prokhorovβs Theorem, { ΞΌ k } k = 1 β superscript subscript subscript π π π 1 \{\mu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT and { Ξ½ k } k = 1 β superscript subscript subscript π π π 1 \{\nu_{k}\}_{k=1}^{\infty} { italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT are tight. We claim that { ΞΌ k β Ξ½ k } k = 1 β superscript subscript tensor-product subscript π π subscript π π π 1 \{\mu_{k}\otimes\nu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT is also tight. Indeed, for any Ο΅ > 0 italic-Ο΅ 0 \epsilon>0 italic_Ο΅ > 0 , there exist compact sets K β X πΎ π K\subset X italic_K β italic_X and L β Y πΏ π L\subset Y italic_L β italic_Y such that
for any k π k italic_k . Then
for any k π k italic_k . Noting that K Γ L πΎ πΏ K\times L italic_K Γ italic_L is also compact, { ΞΌ k β Ξ½ k } k = 1 β superscript subscript tensor-product subscript π π subscript π π π 1 \{\mu_{k}\otimes\nu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT is also tight. Then, invoking Prokhorov again, any subsequence of { ΞΌ k β Ξ½ k } k = 1 β superscript subscript tensor-product subscript π π subscript π π π 1 \{\mu_{k}\otimes\nu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT has a subsequence { ΞΌ k i β Ξ½ k i } i = 1 β superscript subscript tensor-product subscript π subscript π π subscript π subscript π π π 1 \{\mu_{k_{i}}\otimes\nu_{k_{i}}\}_{i=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT which converges weakly to a probability measure Ξ³ πΎ \gamma italic_Ξ³ . Then, we observe
Therefore we obtain that Ξ³ = ΞΌ 0 β Ξ½ 0 πΎ tensor-product subscript π 0 subscript π 0 \gamma=\mu_{0}\otimes\nu_{0} italic_Ξ³ = italic_ΞΌ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT . We just proved that any subsequence of { ΞΌ k β Ξ½ k } k = 1 β superscript subscript tensor-product subscript π π subscript π π π 1 \{\mu_{k}\otimes\nu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT has a subsequence that converges to ΞΌ 0 β Ξ½ 0 tensor-product subscript π 0 subscript π 0 \mu_{0}\otimes\nu_{0} italic_ΞΌ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT weakly, and this implies that the whole sequence { ΞΌ k β Ξ½ k } k = 1 β superscript subscript tensor-product subscript π π subscript π π π 1 \{\mu_{k}\otimes\nu_{k}\}_{k=1}^{\infty} { italic_ΞΌ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT converges to ΞΌ 0 β Ξ½ 0 tensor-product subscript π 0 subscript π 0 \mu_{0}\otimes\nu_{0} italic_ΞΌ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT β italic_Ξ½ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT weakly. β
Proposition 3.2 .
Problem 2 admits a solution.
Let { Ο k } k = 1 β β Ξ β’ ( ΞΌ , Ξ½ ) superscript subscript subscript π π π 1 Ξ π π \{\pi_{k}\}_{k=1}^{\infty}\subset\Pi(\mu,\nu) { italic_Ο start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT β end_POSTSUPERSCRIPT β roman_Ξ ( italic_ΞΌ , italic_Ξ½ ) be a minimizing sequence.
Tabu Search Techniques for the Quadratic Semi-Assignment Problem
- Conference paper
- Cite this conference paper
- Wolfgang Domschke 4 ,
- Peter Forst 4 &
- Stefan VoΓ 4 Β
195 Accesses
20 Citations
Assigning items to sets such that a quadratic function is minimized may be referred to as the quadratic semi-assignment problem. This problem indeed is a relaxed version of the well known quadratic assignment problem. Applications may be found in floor layout planning, in certain median problems with mutual communication, and in the problem of schedule synchronization in public transit networks. The same problem also arises in certain scheduling problems where the deviation from due dates is penalized by a quadratic function.
In this paper we compare different improvement procedures for the quadratic semi-assignment problem. The main contribution will be to explore detailed tabu search techniques with respect to this specific problem and, in addition, to compare them with simulated annealing.
This is a preview of subscription content, log in via an institution to check access.
Access this chapter
Subscribe and save.
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime
- Available as PDF
- Read on any device
- Instant download
- Own it forever
- Compact, lightweight edition
- Dispatched in 3 to 5 business days
- Free shipping worldwide - see info
Tax calculation will be finalised at checkout
Purchases are for personal use only
Institutional subscriptions
Unable to display preview. Download preview PDF.
Similar content being viewed by others
A Tabu Search Matheuristic forΒ theΒ Generalized Quadratic Assignment Problem
Tabu Search and Solution Space Analyses. TheΒ Job Shop Case
Simplified Tabu Search with Random-Based Searches for Bound Constrained Global Optimization
Chhajed, D. and T.J. Lowe (1990). M-median and m-center problems with mutual communication: solvable special cases. Working paper, University of Illinois Urbana-Champaign.
Google Scholar Β
Dammeyer, F., P. Forst and S. VoΓ (1991). On the cancellation sequence method of tabu search. ORSA Journal on Commuting 3, 262β265.
Article Β Google Scholar Β
Domschke, W. (1989). Schedule synchronization for public transit networks. OR Spektrum 11, 17β24.
Dutta, A., G. Koehler and A. Whinston (1982). On optimal allocation in a distributed processing environment. Management Science 28, 839β853.
Glover, F. (1989). Tabu search β part I. ORSA Journal on Computing 1, 190β206.
Glover, F. (1990a). Tabu search β part II. ORSA Journal on Computing 2, 4β32.
Glover, F. (1990b). Tabu search: a tutorial. Interfaces 20, 74β94.
Moretti Tomasin, E., P. Pianca and A. Sorato (1988). Heuristic algorithms for the quadratic semi-assignment problem. Ricerca Operativa 18, 65β89.
Simeone, B. (1986). An asymptotically exact algorithm for equipartition problems. Discrete Applied Mathematics 14, 283β293.
Skorin-Kapov, J. (1990). Tabu search applied to the quadratic assignment problem. ORSA Journal on Computing 2, 33β45.
VoΓ, S. (1990). Network design formulations in schedule synchronization. Paper presented at the 5th Workshop on Computer-Aided Scheduling of Public Transport, Montreal.
Download references
Author information
Authors and affiliations.
FB 1 / FG Operations Research, Technische Hochschule Darmstadt, HochschulstraΓe 1, D - 6100, Darmstadt, Germany
Wolfgang Domschke,Β Peter ForstΒ &Β Stefan VoΓ
You can also search for this author in PubMed Β Google Scholar
Editor information
Editors and affiliations.
Institute of Production Management, FernuniversitΓ€t Hagen, D-5800, Hagen 1, Germany
GΓΌnter Fandel
The Institute of Public Policy, George Mason University, Fairfax, VA, 22030-444, USA
Thomas Gulledge
NIST B 124 Metrology, AMFR, Gaithersburg, MD, 20899, USA
Albert Jones
Rights and permissions
Reprints and permissions
Copyright information
Β© 1992 Springer-Verlag Berlin Β· Heidelberg
About this paper
Cite this paper.
Domschke, W., Forst, P., VoΓ, S. (1992). Tabu Search Techniques for the Quadratic Semi-Assignment Problem. In: Fandel, G., Gulledge, T., Jones, A. (eds) New Directions for Operations Research in Manufacturing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-77537-6_23
Download citation
DOI : https://doi.org/10.1007/978-3-642-77537-6_23
Publisher Name : Springer, Berlin, Heidelberg
Print ISBN : 978-3-642-77539-0
Online ISBN : 978-3-642-77537-6
eBook Packages : Springer Book Archive
Share this paper
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
- Publish with us
Policies and ethics
- Find a journal
- Track your research