Appendix D
Computation of antenna based complex gains

The normalized cross-correlation function (the correlator output), measured by an interferometer using two antennas labeled by i and j, in the limit I Tis∕ηi, can be written as:
                        +∫∞∫        ∘ ------
 Obs   Obs                           -ηiηj-  2πι(uijl+vijm+wij√1-l2-m2+ϕi-ϕj)
ρij  = ρ   (uij,vij,wij) =     I(l,m )  TisTsj e
                        -∞
                             ----dl-dm------
                             ∘ (1---l2---m2)-+ ϵij
(D.1)

where I(l,m) is the sky surface brightness, ηi is the sensitivity and Tis is the system temperature of the antenna i in units of Kelvin/Jy and Kelvin respectively, ϵij is the additive noise on the baseline i-j, and ϕi is the antenna based phase of the signal. The rest of the symbols have the usual meaning.

In practice, however, the antenna based amplitude (∘ ------
  ηi∕T si) and phase (ϕi) are potentially time varying quantities. This could be due to changes in the ionosphere, temperature variations, ground pick up, antenna blockage, noise pick up by various electronic components, background temperature, etc. Treating the quantities under the square root in the above equation as the antenna dependent amplitude gains, these can be written as complex gains gi = aie-ιϕi where ai = ∘ ------
  ηi∕Tis. For an unresolved source at the phase tracking center, variations in this amplitude will be indistinguishable from a variations in the ratio of η and Ts.

In terms of gis, we can write Equation D.1 as

 Obs      ⋆ ∘
ρij  = gigjρij + ϵij
(D.2)

where

      +∫∞ +∫∞                        √-------
ρ∘ij =        I(l,m ) e2πι(uijl+vijm+wij 1-l2-m2 )∘---dl dm------
     -∞ -∞                                    (1- l2 - m2 )
(D.3)

The use of the word “antenna based gains” for gis result in confusion for many and needs some clarifications. gis are called complex “gains” since they multiply with the complex quantity ρij. For an unresolved source, |gi| represents the fraction of correlated signal and arg(gi) represents the phase of the correlated part of the signal from the antenna with respect to the phase reference (usually the reference antenna). It is in this sense that it is referred to as “antenna based” gains. However, as defined here, they include Ts which in turn includes the sky background temperature. They are therefore a function of direction in the sky. However, here we assume that the angular scale over which gis vary is larger than the antenna primary beam (isoplanatic case).

For an unresolved source at the phase tracking center, all terms in the exponent of ρij are exactly zero. ρij in this case would be proportional to the flux density of the source.

Assuming that the antenna dependent complex gains are independent, with a gaussian probability density function (this implies that the real and imaginary parts are independently gaussian random processes), one can estimate gis by minimizing, with respect to gis, the function S given by

     ∑  |              |2
S =     ||ρOijbs- gi gj⋆ρ∘ij|| wij
     i,j
     i⁄=j
(D.4)

where wij = 1∕σij2, σij being the variance on the measurement of ρijObs

Dividing the above equation by ρij (the source model, which is presumed to be known – it is trivially known for an unresolved source), and writing ρijObs∕ρij = Xij, we get

     ∑  |          |
S =     |Xij - gi g⋆j|2 wij
     i,j
     i⁄=j
(D.5)

If ρij represents the structure of the source accurately, Xij will have no source dependent terms and is purely a product of the two antenna dependent complex gains.

Expanding Equation D.5, we get

       [                                   ]
S = ∑     |X  |2 - g⋆g X  - gg⋆X ⋆ + g g⋆gg ⋆ w
     i,j     ij     i j  ij    ij  ij   i i jj    ij
     i⁄=j
(D.6)

Evaluation ∂S _ ∂gi and equating it to zero 1 , we get

∂S--    ∑  [                ⋆   ]
∂g⋆i =       - gjXijwij + gigjgjwij   =  0
        jj⁄=i
(D.7)

or

      ∑   Xijgjwij
       j
gi =  j⁄=∑i----2----
           |gj| wij
       jj⁄=i
(D.8)

This can also be derived by equating the partial derivatives of S with respect to real and imaginary parts of gi as shown in Section D.3.

Since the antenna dependent complex gains also appear on the right-hand side of Equation D.8, it has to be solved iteratively starting with some initial guess for gjs or initializing them all to 1.

Equation D.8 can be written in the iterative form as:

               ⌊                     ⌋
                       ∑   X  gn-1w
               |        j   ij j   ij|
gn =  gn-1 + α ||gn-1 - j⁄=i-|----|----||
 i     i       |⌈ i     ∑   ||gn-1||2w  |⌉
                        j    j     ij
                       j⁄=i
(D.9)

where n is the iteration number and 0 < α < 1. Convergence would be defined by the constraint

|Sn - Sn -1| < δ
(D.10)

(the change in S from one iteration to another) where δ is the tolerance limit.

D.1 Interpretation of the equation

Equation D.8 offers itself for some intuitive understanding in the following way.

Xij is a product of two complex numbers, namely gi and gj, which we wish to determine. Xij is itself derived from the measured quantity V ijObs. Numerically speaking, each term in the summation of the numerator of Equation D.8 will involve gi (via Xij) and the multiplication of Xij with gjwij would give gi an effective weight of |gj|2wij. Since the denominator is the sum of this effective weight, the right-hand side of Equation D.8 can be interpreted as the weighted average of gi over all correlations with antenna i.

In the very first iteration, when gj = (1,0), the normalization would be incorrect since the numeric value of gj, as it appears inside Xij would be different from that used in the denominator of Equation D.8. However, as the estimates of gjs improve with iterations, the equation would progressively approach a true weighted average equation. The speed of convergence will depend upon the value of α and the convergence would be defined by the constraint in Equation D.10. In the ideal case when the true value of all gis is known, right hand side of Equation D.8 also reduces of gi.

Estimating gi for an antenna, by averaging over the measurements from all baselines in which it participates (for a unresolved source) makes sense since for an N element array, gi would be present in N-1 measurements (all the Xij|j=1,N;ji) and the best estimate of gi would be the weighted average of all these measurements. Proper weight for gi, buried in each of the products Xij, can be found heuristically as follows. gi, estimated from the measurements of a given baseline, must obviously be weighted by the signal-to-noise ratio on that baseline. This is wij in the above equations. It must also be weighted by the amplitude gain of the other antenna making the baseline, namely gj, to account for variation in antenna sensitivities and Ts. The total weight for gi would then be |gj|2wij, the sum of which appears in the denominator of Equation D.8. Knowing that ideally Xij = gigj, each of the Xij|j=1,N must be multiplied by gjwij (to apply the the above mentioned weights to gi), before being summed for all values of j and normalized by the sum of weights to form the weighted average of gi. One thus arrives at Equation D.8 using these heuristic arguments.

D.2 Estimation of the system temperature (Ts)

For an unresolved source of known brightness I, in the limit Ta Ts, ρij = I and Equation D.1 can be written as

                   ∘ ------
ρObs  =  Igig⋆  ≈ I   ηiηj-
  ij         j        Tsi Tjs
(D.11)

where ηi = Ae2kb, Ae is the effective area of the dish, kb is the Boltzman’s constant and

       ∘ ---
|gi| =     ηi-
          Tsi
(D.12)

Hence, knowing ηi, Tis can be estimated from the amplitude of the antenna dependent complex gains.

All contributions to ρijObs, which cannot be factored into antenna dependent gains, will result in the reduction of |g|. η remaining constant, this will be indistinguishable from an increase in the effective system temperature. Since majority of later processing of interferometry data for mapping (primary calibration, bandpass calibration, SelfCal, etc.) is done by treating the visibility as a product of two antenna based numbers, this is the effective system temperature which will determine the noise in the final map (though, as a final step in the mapping process, baseline based calibration can possibly improve the noise in the map).

In the normal case of no significant baseline based terms (ϵij) in Xij, the system temperature as measured by the above method will be equivalent to any other determination of Tis.

Ts can also be determined by recording interferometric data for a strong point source with and without an independent noise source of known temperature at each antenna. In this case

               ON 2
Tsi  = T ni (---gi--------)
          gOiF F2 - gOiN 2
(D.13)

where giON and giOFF are the antenna dependent gains with and without the noise source of temperature Tn. Note that ηi does not enter this equation. Also, Tn should be such that ∘ --a---n----s-
  T  ∕(T  + T ) 0.1 to ensure that the correlated signal is measured with sufficient signal-to-noise ratio. For example, for P-band, a calibrator with P-band flux density > 5 Jy must be used.

D.3 Derivation of gi using real and imaginary parts

gis are complex functions. One can therefore write S in terms of giI and gip, the real and imaginary parts of gi and minimize with respect to giI and gip separately. It is shown here that the complex arithmetic achieves exactly this and the results are same as that given by complex calculus. The superscripts I and R in the following are used to represent the real and imaginary parts of complex quantities.

Expanding Equation D.5, ignoring wijs and writing it in terms of real and imaginary parts we get

∑  ||        ⋆||2   ∑  [        ⋆][  ⋆    ⋆  ]
    Xij - gigj   =     Xij - gigj  X ij - gigj
ii,⁄=jj               ii,j⁄=j
            ∑  [(  R      I)  ( p     I)( p    I) ]
          =      X ij + ιXij -  gi + ιgi  gj - ιgj
            ii,⁄=jj
               [(          )  (        )(       ) ]
                 XRij - ιXIij -  gpi - ιgIi  gpj + ιgIj
            ∑  [(                 )    (                )]
          =       XRij - gpigpj - gIigIj + ι XIij + gpigIj - gIigpj
             i,j
            i⁄=j[(                 )    (                )]
                  XRij - gpgp - gIigIj - ι XIij + gpgIj - gIigp
            ∑           i j                    i       j
          =    S0S ⋆
             i,j   0
            i⁄=j
(D.14)

where

     [                ]   [                 ]
S0 =  XRij - gpigpj - gIigIj + ι XIij + gpigIj - gIigpj
(D.15)

Taking partial derivative of S with respect to gip and reintroducing wij, we get

         { [        ]        [       ]}
∂S--= ∑     - gp + ιgI S⋆ - S  gp + ιgI   w
∂gpi    j      j    j   0    0  j    j     ij
      j⁄=i
        ∑  [        ⋆ ⋆]
    = -     S0gj + gjS0 wij
        jj⁄=i
         ∑
    = - 2    Re (S0gjwij)
          j⁄=ji
         ∑   [(                )     (                 )  ]
    = - 2      XRij - gpigpj - gIigIj gpj +  XIij + gpigIj - gIigpj gIj wij
           j
         ∑j⁄=i [                           ]
    = - 2     XRijgpj - XIijgIj - gpigIj2 - gpigpj2 wij
           j
          j⁄=i
(D.16)

Therefore,

             [                  ]
-∂S-=  - 2 ∑  Re(X  g⋆) - |g |2gp w
∂gpi       j       ijj     j   i   ij
          j⁄=i
(D.17)

Equating ∂S _ gip to zero, we get

     ∑
        Re (Xijgj⋆wij)
 p   jj⁄=i
gi = ---∑-|g-|2-w-----
        j   j   ij
        j⁄=i
(D.18)

Similarly

∂S       ∑   [                  ]
--I-= - 2     Im (Xijg ⋆j)- |gj|2gIi wij
∂gi        j
          j⁄=i
(D.19)

Therefore the equivalent imaginary part of Equation D.18 is

     ∑
        Im (Xijg ⋆jwij)
     jj⁄=i
gIi = ---∑----2-------
        j |gj| wij
        j⁄=i
(D.20)

writing gi = gip + ιgiI and substituting for gip and giI from Equation D.18 and D.20 respectively, we get

     ∑  Xijg⋆wij
      j     j
gi = j⁄=∑i---2-----
        |gj| wij
     jj⁄=i
(D.21)

This is same as Equation D.8, which was arrived at by evaluating a complex derivative of Equation D.5 as ∂S∕∂gi, treating gi and gI as independent variables. Evaluating ∂S ∂gi = 0 would give the complex conjugate of Equation D.21. Hence, ∂S∕∂gi gives no independent information not present in ∂S∕∂gi.