1 INTRODUCTION
The Square Kilometre Array (SKA) is being developed as a radio telescope with a combination of unprecedented sensitivity, resolution, and field of view (FoV) over a 70-MHz to >10-GHz frequency range (Dewdney et al. Reference Dewdney, Hall, Schilizzi and Lazio2009). The key scientific objectives of this instrument include mapping the distribution of neutral hydrogen gas in the universe, studying the origin of magnetic fields, and measuring the dynamics of pulsars in the galaxy. To realise these objectives, a very large collecting area is needed (typically one square kilometre) using new antenna technology that can be mass produced with minimum cost.
The SKA design consists of three frequency bands, the lowest of which, termed SKA-low (Garrett et al. Reference Garrett, Cordes, de Boer, Jonas, Rawlings and Schilizzi2010), has been selected to be built in Australia and New Zealand.Footnote 1 SKA-low will use aperture phased arrays of antennas operating in the 70–450 MHz range and extending across a distance of several hundred kilometres. The wideband antennas convert cosmic radio waves from physical processes in the universe to RF signals, which are digitised, cross-correlated, and averaged. Post-correlation aperture synthesis is used to reconstruct the brightness distribution, to achieve high-dynamic range radio pictures of the sky (Hall et al. Reference Hall, Schilizzi, Dewdney and Lazio2008; Garrett et al. Reference Garrett, Cordes, de Boer, Jonas, Rawlings and Schilizzi2010).
Cross-correlating the several million SKA-low antenna elements signals to create these radio pictures would require an extremely large signal processing system. Instead, the signal processing load is reduced by grouping geographically adjacent antenna elements into digitally beamformed stations; this limits the number of signals to be transported and cross-correlated. Additional cost efficiencies can be gained by grouping individual antenna elements into beamformed tiles prior to the station beamformer (Faulkner et al. Reference Faulkner2010). Figure 1 shows a schematic of this two-stage beamforming approach for SKA-low. The cost efficiencies arise because the tile beamformer limits the number of independent signals (analogue or digital) early in the signal path. A first-order approximation of this effect is described in Appendix 5.
The actual reduction in station hardware cost due to two-stage beamforming depends on the technologies used. When analogue first-stage and digital second-stage beamforming is implemented in place of two-stage digital beamforming, the cost reduction is larger. This is because the analogue first-stage beamforming reduces, by an order of magnitude, the digitisation, intra-station signal transmission, and station beamformer costs. For example, an analysis of SKA-low station hardware costs shows a factor between 1.5 and 5 reduction in station hardware cost compared with all-digital beamforming (Colegate, Hall, & Gunst Reference Colegate, Hall and Gunst2012), across various architectural implementations. With cost estimates in Faulkner, Bij de Vaate, & van Ardenne (Reference Faulkner, Bij de Vaate and van Ardenne2011) of € 1.8–2.8 millions per station, this is a potentially significant cost reduction.
Present-day telescopes have various approaches to tile or station beamforming. The high band antenna of the Low Frequency Array telescope (de Vos, Gunst, & Nijboer Reference de Vos, Gunst and Nijboer2009) uses a two-stage beamforming station, but both stages are performed on regular gridded antenna elements, making it an example of a regular tile in a regular station. The low band antenna of the same telescope uses a single stage of beamforming on antenna elements in an irregular station layout. The Long Wavelength Array Station 1 telescope (Ellingson et al. Reference Ellingson2012) is an irregular array with a single-stage of beamforming, and is the first station of an envisioned larger correlation array. Instead of a station approach, the Murchison Widefield Array telescope (Lonsdale et al. Reference Lonsdale2009; Tingay et al. Reference Tingay2012) feeds a many-input (large-N) correlator with beamformed 16-element regular tiles.
None of these telescopes use beamformed tiles within an irregular station layout. However, stations composed of an irregular array of elements are under active consideration for the SKA (Bij de Vaate et al. Reference Bij de Vaate2011). Because these stations will have several orders of magnitude more elements per station than these other telescopes, two-stage beamforming is an attractive option to reduce cost. Although antenna elements arranged into beamformed tiles can change key performance characteristics of the station, the effects of two-stage beamforming on the beam pattern and effective area of an irregular station have not been considered previously in the SKA context. This paper examines the implications of placing beamformed tiles within an irregular station grid layout, and compares it with a single-stage irregular station. The aim in doing so is to assess the magnitude of performance degradation which accompanies the potentially lower cost two-stage layouts. Even though the intra-station layouts considered in this paper are not optimised for performance, they illustrate the distinguishing features of each layout and provide a reference point for future studies of SKA station layout and system design.
The paper is arranged as follows. Section 2 sets out the intra-station geometry and the analysis method used. Results are presented in Section 3 and discussed in Section 4.
2 THEORETICAL BACKGROUND
Aperture arrays (AAs) must be exquisitely well calibrated to meet the high imaging dynamic range requirements for the SKA (Dewdney et al. Reference Dewdney, Hall, Schilizzi and Lazio2009); one facet of calibration is correcting for a non-ideal synthesised beams, formed from the correlation of signals from AA stations. To do this, the instrumental response needs to be accurately characterised via an astronomical calibration procedure (Wijnholds et al. Reference Wijnholds, Van Der Tol, Nijboer and van der Veen2010).
The difficulty in calibrating the station beam depends on the station size and the nature of its sidelobes (Wijnholds, Bregman, & van Ardenne Reference Wijnholds, Bregman and van Ardenne2011), and is influenced by the element configuration; a regular array produces strong, predictable sidelobes while an irregular array produces weaker, more diffuse sidelobes. An irregular array smears out in spatial extent the sidelobes that are present at frequencies where a regular array becomes sparse. These sidelobes, or grating lobes in sparse regular arrays, cause strong frequency- and direction-dependent variations in the main lobe gain (van Cappellen, Wijnholds, & Bregman Reference van Cappellen, Wijnholds and Bregman2006).
For this analysis, the effects of mutual coupling are ignored as they tend to average out for large irregular arrays (or stations) (Gonzalez-Ovejero et al. Reference Gonzalez-Ovejero, de Lera Acedo, Razavi-Ghods, Craeye and Garcia2011). There will be non-trivial mutual coupling effects on a tile level but from a station perspective, these effects will not fundamentally change features such as sidelobe levels and effective area, which we analyse for the two-stage beamforming approach. Future work will incorporate the second-order effects of mutual coupling and simulated (or measured) element patterns.
Three representative configurations for station layout are considered in this paper: (i) single-stage irregular array (single-irregular), (ii) regular gridded tile in an irregular array (regular–irregular), and (iii) irregular gridded tile in an irregular array (irregular–irregular). The irregular–irregular configuration uses the same randomised tile layout for all tiles in the station. Both two-stage stations use a single tile layout to enable easily replicated beamforming hardware with a fixed number of inputs. This is likely to reduce the beamformer hardware cost. We note that future work, based in part on the present study, could involve non-tile intra-station configurations, wherein signals from free-form antenna groups are beamformed.
Specifications for the representative configurations are given in Table 1 and the layouts for the two-stage stations are shown in Figures 2 and 3. For comparison, the current description specifies 11 200 elements within a 180-m diameter SKA-low station (Dewdney et al. Reference Dewdney, Bij de Vaate, Cloete, Gunst, Hall, McCool, Roddis and Turner2010). We note that emerging thinking about the SKA-low configuration tends to now favour a somewhat larger number of smaller stations (e.g. Mellema et al. Reference Mellema2012).
The number of elements, station diameter, and element layout within the station are key parameters in determining station effective area and FoV. We choose to maintain a fixed number of elements in order to characterise the station effective area in the sparse regime of operation where the station sensitivity depends directly on the number of elements. The number of elements per station (N e) is held approximately constant and a 1.3-m minimum inter-element spacing (λ/2 at 115 MHz) is enforced. For the two-stage layouts, each tile consists of 16 elements. Due to the randomisation, the irregular tile is necessarily larger than the regular tile; hence, a larger station diameter is required for the irregular–irregular layout. An alternative approach is to control station diameter and vary the number of elements per station. While this approach ensures a similar FoV between station layouts, station costs are not directly comparable because the number of signal chains varies. Furthermore, such an approach for the irregular–irregular layout would result in fewer elements; hence, the station high-frequency sensitivity would suffer. Holding N e constant ensures that both two-stage beamforming stations have the same number of digitised signal chains.
The simulations are carried out in two stages, using Xarray,Footnote 2 a tool developed for computing radiation patterns together with effective area and other aperture array parameters. We generate the tile beam pattern using cos θ3.47 as a single element pattern which approximates one of the moderately directive candidate SKA-low antenna radiation patterns (de Lera Acedo et al. Reference de Lera Acedo2012). The station beam pattern uses this tile beam as the sub-array pattern for the two-stage beamforming. For single-stage beamforming, we simply use the single element pattern in generating the station beam. The beams are calculated with an antenna radiation efficiency of 90%, which is the minimum requirement for the SKA-low station (Dewdney et al. Reference Dewdney, Bij de Vaate, Cloete, Gunst, Hall, McCool, Roddis and Turner2010).
3 RESULTS
Conceptually, the SKA-low stations replace the function of dishes in a radio telescope array; each station beam is an input to the correlator. As for dishes, two important metrics describing telescope performance are the station beam sidelobe levels and station effective area (A eff).
Unlike dishes, the effective area of a station is strongly dependent on frequency. This occurs because the aperture is not fully sampled (or not a dense, highly coupled array). Effective area also changes as a function of the zenith angle due to the element pattern and geometric effects. Figure 4 shows A eff of the three station layouts as a function of frequency, for a zenith angle of 0°.
The sidelobe levels are evident in the station beam pattern. The broadside station beam pattern, at 70 and 300 MHz, for all three cases is shown in the u–v plane in Figures 6 and 8. The same beam at the v = 0 plane for the two frequencies is plotted in Figures 7 and 9. The u–v coordinates (equivalent to l, m coordinates) are given by u = sin θcos φ and v = sin θsin φ, where θ and φ are the zenith angle and the azimuth angle, respectively.
Statistics on inter-element spacing can provide further insight into the performance of stations of equal diameter (the single-stage irregular and regular–irregular layouts). The minimum separation between elements in the regular–irregular layout is defined by the regular tile layout: 1.3 m for every element. The single-stage irregular station has a range of minimum inter-element spacings, defined by physical constraints and randomisation of the elements within these constraints. The 1.3-m minimum spacing between elements is imposed by the element’s footprint, while the 189-m diameter station boundary limits how far apart the elements can be spaced. Figure 5 shows a frequency count of the minimum distance from each element to all other elements. The mean of these minimum inter-element spacings is 1.38 m and the maximum is 2.21 m. For comparison, the minimum inter-element spacing of 11 200 elements within a 189-m diameter single-stage regular gridded station (not shown) is approximately 1.6 m.
4 DISCUSSION
4.1 Station effective area
The station effective area A eff is an important metric for SKA-low because, for a constant number of stations, telescope sensitivity is linearly proportional to A eff. A related factor is whether the array is ‘dense’ or ‘sparse’ at the frequency in question. Both A eff and the transition from dense and sparse vary with frequency and the inter-element spacing.
There is no single definition for when an array is dense or sparse. The broad definition used in this paper is that an array is dense when the aperture is fully sampled with λ/2 or closer element packing, and inter-element mutual coupling is significant. Effective area is then approximately equal to the physical (geometric) area of the station
As Figure 4 shows, A eff for all layouts is similar at the higher frequencies, where the elements are in the sparse regime and A eff∝λ2. The dense–sparse transition is evident at lower frequencies. Here, the effective area is no longer proportional to λ2 as mutual coupling effects become active.
The general trend in Figure 4 is that the regular–irregular station exhibits the strongest departure from λ2 at lower frequencies, and the two-stage irregular station the weakest. This is to be expected, as the minimum inter-element spacing of the regular–irregular layout is 1.3 m for every element. Meanwhile, the single-stage irregular station has some elements of larger minimum spacing, but most are still less than 1.4 m (see Figure 5). For the larger diameter two-stage irregular station, the irregular tile is larger; hence, more of the elements remain sparse at lower frequencies.
The regular–irregular and single-stage irregular stations show a significant divergence in the effective area at lower frequencies, despite having the same 189-m diameter. The greatest percentage difference in the effective area occurs at ≈150 MHz, where A eff of the regular–irregular station is 74% of the single-stage irregular station, equating to a similar loss in telescope sensitivity. An inter-element spacing greater than 1.3 m for the regular tiles is a potential solution, but the tile geometry puts limitations on such an increase because it restricts the randomisation of the station layout. A conclusive analysis of the potential loss of the effective area due to regular tiles requires a full mutual coupling analysis and optimisation of the layouts, which is beyond the scope of this paper.
4.2 Sidelobes
As mentioned, over the full frequency range, the stations (for all layouts) operate as both dense and sparse array. The sidelobe characteristics, for the stations, will change as the array moves from the dense to sparse region of operation. To observe these different characteristics, we analyse the sidelobes at two frequencies which are representative of the two regimes.
4.2.1 Dense region (70 MHz)
The primary sidelobe (sidelobe closest to main lobe) is independent of the intra-station layout. Figures 6 and 7 show a primary sidelobe level of approximately −18 dB for all three stations; this level is consistent with that of a uniformly distributed circular aperture (Mailloux Reference Mailloux1995).
At the lower frequencies (e.g. 70 MHz), the element separation is less than half wavelength. Thus, although the element (or tile) placement is random, the station aperture is ‘filled’. This ‘filled’ aperture generates a station beam which contains an annulus of primary and secondary sidelobes. These sidelobes will not be smeared or suppressed by techniques such as station configuration rotation, which has been proposed for sparse arrays (van Cappellen et al. Reference van Cappellen, Wijnholds and Bregman2006). All three layouts exhibit these annuli. However, compared with the single-stage beamforming station, the sidelobes for the two-stage beamforming stations are spread over a wider area.
4.2.2 Sparse region (300 MHz)
At higher frequencies (e.g. 300 MHz), for all stations shown in Figures 8 and 9, the sidelobes are suppressed due to the random placement of elements. However, the sidelobes for the single-stage beamforming station are concentrated towards the horizon, away from the main lobe.
At the higher frequency of 300 MHz, the first-stage beamforming of tiles with relatively few elemental antenna inputs influences how the sidelobes are distributed as a function of scan angle and frequency. As Figure 8 shows, the irregular tiles smear out the strong, predictable sidelobes of the regular tiles. However, the general structure of the tile beam pattern is still visible in the station beam, especially for the regular tile.
The level of the secondary station sidelobes visible in Figure 9 is affected by the first-stage beamforming. The maximum secondary sidelobe level increases from −35 dB for the single-stage station to −27 dB for the two-stage regular–irregular station. In comparison, the classical 1/N result for distant sidelobes of a random distribution of 11 200 isotropic antenna elements is approximately −40 dB. The irregular station does not reach this level because the randomisation is restricted to a relatively small range of element spacings as shown in Figure 5, resulting in the annulus of secondary sidelobes visible in Figure 8.
The use of predictable sidelobes in calibration is under investigation (Wijnholds et al. Reference Wijnholds, Bregman and van Ardenne2011; Braun & van Cappellen Reference Braun and van Cappellen2006); if it is useful, then the regular tile beam structure may assist the calibration while maintaining a low sidelobe level. If more randomised sidelobe structure is beneficial, or even acceptable, significant gain in the effective area is possible (see Figure 4).
4.3 Station diameter and cost
Although results will depend on the exact station layout, these examples illustrate the impact of the two-stage beamforming and inter-element spacing on effective area. These trends are important, as the current SKA system description proposes stations of 180-m diameter and 11 200 elements (Dewdney et al. Reference Dewdney, Bij de Vaate, Cloete, Gunst, Hall, McCool, Roddis and Turner2010). The tight constraints on the minimum spacing due to the antenna footprint, and on the maximum spacing from the station diameter, limit the scope of layout optimisation.
Given these constraints, then why not increase the diameter? An obvious feature of Figure 4 is that the irregular–irregular station achieves a larger effective area at lower frequencies, for the same number of elements. Its larger diameter arises from the larger tile area required to achieve some degree of randomisation in the irregular tile, where the larger tiles simply do not fit in an irregular pattern within the 189-m diameter station.
The reasons not to place the same number of elements within a larger station diameter are twofold: cost and calibratability. At higher frequencies, a larger station diameter reduces the station beam FoV with no commensurate increase in the effective area. Therefore, fewer calibration sources are visible within the FoV, making the station calibration more difficult (Wijnholds et al. Reference Wijnholds, Bregman and van Ardenne2011).
The cost perspective is explored in Colegate et al. (Reference Colegate, Hall and Gunst2012). The larger physical area leads to increased costs related to on-site infrastructure. There are also increased signal processing costs to counteract the smaller station beam FoV. The number of these beams required to cover a given area of sky (the processed FoV) scales as D 2 st. The digital station beamformer processing costs are approximately proportional to the number of inputs and the number of beams formed (see Appendix 5). The cost of data transmission from the station to the correlator and the correlator processing cost itself are also approximately linearly proportional to the number of station beams (Colegate et al. Reference Colegate, Hall and Gunst2012). As an example, a 248-m diameter station requires 1.7 times more beams to achieve the same processed FoV as the 189-m diameter stations, resulting in roughly the same increase in signal processing costs.
4.4 Future work
Further to the illustrative examples in this paper, more complete optimisations will result in station layouts with slightly improved performance. The representative layouts in this paper maintain a constant number of elements per station, resulting in a constant A eff when the station is sparse at the higher frequencies. An alternative approach is to change the number of elements or station diameter, to ensure that all layouts instead maintain a relatively constant A eff at the lower frequencies. This will have cost implications, and the preferred optimisation approach depends on the SKA station requirements.
Regardless of the optimisation approach, there is scope for further characterisation of station performance, incorporating mutual coupling between the elements and the overall telescope response. For example, rotating the intra-station configuration between stations of the SKA-low telescope helps to reduce sidelobes in the correlated beam (van Cappellen et al. Reference van Cappellen, Wijnholds and Bregman2006). Similarly, rotating tiles within the station would smear out the station sidelobes arising from the regular tile layout. Increased randomisation of elements or tiles in the station layout can reduce sidelobes, but requires more station area, resulting in a smaller station FoV. Down-weighting the edges of the aperture distribution through a spatial taper of the elements or tiles would reduce sidelobes (Willey Reference Willey1962), again at the cost of the increased station area.
5 CONCLUSIONS
We have quantified the first-order effect of two-stage beamforming on the station beam and effective area (A eff) of the SKA-low. We used both regular and irregular tiles within an irregular station layout and compared these with a single-stage irregular station. At higher frequencies, the station A eff is same irrespective of the inner station layout. At lower frequencies, regular tiles can significantly reduce the station A eff compared with the other two configurations. Across all frequencies, the primary sidelobes are not affected by the station configuration, being largely a function of the station diameter. However, the secondary sidelobes are dependent on the input beam (tile or single element). While randomised two-stage configurations have the potential for considerable cost savings, the increase in far sidelobe levels by 10 dB or so will need to be assessed in the context of the SKA-low weighted science case. For example, low sidelobe levels are required to characterise the ionosphere at low frequencies (Wijnholds et al. Reference Wijnholds, Bregman and van Ardenne2011) and enable precision imaging in e.g. the Epoch of Reionisation domain. Conversely, time-domain studies at higher frequencies do not require precise secondary sidelobe control (Stappers et al. Reference Stappers2011). In combination with science priority setting, further station configuration studies which include e.g. mutual coupling will be useful to refine studies of two-stage and other hierarchical beamforming architectures.
ACKNOWLEDGMENTS
We thank Dr. Adrian Sutinjo for the discussions surrounding this topic and the reviewer for helpful suggestions in finalising our manuscript.
The International Centre for Radio Astronomy Research is a joint venture between Curtin University and the University of Western Australia, funded by the State Government of Western Australia and the joint venture partners. A. Jiwani is a recipient of a Curtin Strategic International Research Scholarship. T. M. Colegate is a recipient of an Australian Postgraduate Award and a Curtin Research Scholarship.
APPENDIX A. TILE BEAMFORMING COST REDUCTION
We consider here how tile beamforming can reduce station hardware costs. The number of independent signals in a station can be determined by considering the elemental signal path. The signal path in the station begins at the active antenna element, culminating in one or more station beams as input to the correlator. In between, key actions are to digitise the signal, perform beamforming, and to transport the signal, in analogue or digital form, between the antenna element, digitiser, and one or more beamformers. The intra-station signal transport and processing architecture defines the actions taken on the signal and the order of these actions.
Figure A1 shows three such intra-station architectures, indicating the order of the actions and the number of independent signals at each point of the signal path. These are related to, but do not directly correspond with, the intra-station layouts in this paper. Figure A1(a), which shows a single-stage of digital beamforming, applies to the single-irregular layout. Figures A1(b) and (c) respectively show a first stage of analogue and digital tile beamforming, where the tile beamformer is a signal aggregation point. Figures A1(b) and (c) apply to both regular–irregular and irregular–irregular layouts; we consider changes to the station beam pattern due to the different tile beamforming technology to be second-order effects. As shown in Figure A1, an intra-station architecture with tile beamforming reduces the number of independent signals subsequent to the tile beamformer. Assuming only a single tile beam is formed, the reduction is the number of elements per tile, in our case it is 16.
We estimate the reduction in the station hardware cost from tile beamforming by considering the number and cost of components in the elemental signal path. To a first-order approximation, the total cost for each of these components in a station is linearly proportional to the number of independent signals. For the digital beamformers, the processing cost depends on the architecture, but is approximately linearly proportional to the number of input signals and the number of output beams (see Appendix 5). As mentioned, only a single output beam is assumed for the tile beamformer.
The reduction in cost due to tile beamforming therefore depends on the cost of those components which are before and after the tile beamformer, and the fractional cost of these groupings. The total cost of components subsequent to the tile beamformer in Figures A1(b) and (c), such as signal transport to station beamformer and the station beamformer itself, decreases by a factor of 16. The cost of station hardware prior to the tile beamformer, such as that of the active antenna element, does not change with intra-station architecture. The total cost of digitisers in a station will be a factor of 16 larger when there is only digital beamforming (Figures A1 a and c) than for analogue tile beamforming (Figure A1 b).
An illustrative example using the fractional cost of the groups of components located before and after the tile beamformer shows that tile beamforming has the potential to significantly reduce station hardware costs. For example, consider a single-stage beamformed station. If the cost of the components prior to the tile beamformer represents half of the station hardware cost and those components after the tile beamformer represent the other half, then the two-stage beamforming cost is 53% of the single-stage station hardware cost. Because an analogue tile beamformer (Figure A1 b) aggregates the signals earlier on in the signal path than the digital tile beamformer (Figure A1 c), more of the cost will occur after the tile beamformer, thereby further reducing the station hardware cost.
APPENDIX B. BEAMFORMING COMPUTATIONAL COST
Digital beamforming can be done in the time domain or in the frequency domain on channelised signals. The computational cost of the frequency and time domain beamforming approaches are discussed in Barott et al. (Reference Barott, Milgrome, Wright, MacMahon, Kilsdonk, Backus and Dexter2011) and Khlebnikov et al. (Reference Khlebnikov, Zarb Adami, Armstrong and Jones2010), where computation cost is expressed as a function of the number of output beams, input antennas, and frequency channels, with additional costs to implement a time delay (where necessary) and the Fast Fourier Transform (FFT). Only the costs that scale with output beams and input antennas are relevant here; the other costs are specific to the signal processing architecture.
From Barott et al. (Reference Barott, Milgrome, Wright, MacMahon, Kilsdonk, Backus and Dexter2011) and Khlebnikov et al. (Reference Khlebnikov, Zarb Adami, Armstrong and Jones2010), for a given architecture and number of channels, the frequency and time domain station beamformer processing load can be respectively simplified to
If the channelisation cost does not dominate (i.e. N beam≫K ch/K BF[ν] and N input≫K ch/K BF[t]), the cost scaling is the same for both frequency and time domain beamforming. The processing cost of the station beamformer is thus approximated by