1 Introduction
Wide-area radio sky surveys enable a range of science across the Universe, from the most nearby scales, for example, measuring the solar wind (Morgan et al. Reference Morgan, McCauley, Waszewski, Ekers and Chhetri2023), to our own Galaxy, for example, finding unexpected types of transient radio sources (Hurley-Walker et al. Reference Hurley-Walker2022b), to the extragalactic sky (White et al. Reference White2020a,b), such as measurements and variability of radio galaxies (Ross et al. Reference Ross, Hurley-Walker, Seymour, Callingham, Galvin and Johnston-Hollitt2022) and galaxy clusters (Duchesne et al. Reference Duchesne, Johnston-Hollitt, Zhu, Wayth and Line2020), the detection of cosmic magnetism (Vernstrom et al. Reference Vernstrom, Heald, Vazza, Galvin, West, Locatelli, Fornengo and Pinetti2021), precision magnetism studies (Riseley et al. Reference Riseley2018, Reference Riseley2020), and ultimately cosmology (Hale et al. Reference Hale2024).
Building towards the Square Kilometre Array (SKA), a recent resurgence in low-frequency radio astronomy has seen a plethora of new wide-area radio sky surveys. Hurley-Walker et al. (Reference Hurley-Walker2022a) (hereafter referred to as HW22) introduced the GaLactic and Extragalactic All-Sky MWA – eXtended (GLEAM-X) survey, using the Murchison Widefield Array (MWA; Tingay et al. Reference Tingay2013) in its ‘extended’ Phase-ii configuration (Wayth et al. Reference Wayth2018; Beardsley et al. Reference Beardsley2019) to survey the sky south of Declination $+30^\circ$ over 72–231 MHz, with a source finding image formed at $\sim56^{\prime\prime}$ resolution. The authors also introduced the many other wide-area surveys that formed the foundation for, were contemporaneous with, or preceded GLEAM-X. Adding to the recent plethora of data releases, this paper presents the second data release to the GLEAM-X survey, henceforth called GLEAM-X DRII, which will add to the scientific resources available to the astronomical community.
Since GLEAM-X was described, a multitude of new radio surveys of the sky have become available or have had new data releases. The LOw-Frequency Array (LOFAR; van Haarlem et al. Reference van Haarlem2013) has produced a second data release from the LOFAR Two-metre Sky Survey at 120–180 MHz (LoTSS; Shimwell et al. Reference Shimwell2017, Reference Shimwell2022), a first data release from the LOFAR Low-Band Array Sky Survey at 41–66 MHz (LoLSS; de Gasperin et al. Reference de Gasperin2021, Reference de Gasperin2023), and deep imaging towards the LOFAR Deep Fields (Best et al. Reference Best2023). Dish interferometers with phased-array feeds have enabled wide-area higher-frequency surveys, such as the Apertif imaging survey (Adams et al. Reference Adams2022), and the Rapid Australian SKA Pathfinder Continuum Surveys (RACS), in both unpolarised and circular polarisation (McConnell et al. Reference McConnell2020; Duchesne et al. Reference Duchesne2023), as well as early results from measuring linear polarisation (Thomson et al. Reference Thomson2023). MeerKAT has also produced L-band surveys over smaller areas at greater sensitivities, such as the MeerKAT Absorption Line Survey (MALS) data release i (MALS; Deka et al. Reference Deka2023) and MIGHTEE (Heywood et al. Reference Heywood2022). The first data release from the MWA interlpanetary scintillation (IPS) survey has also been released (Morgan, Chhetri, & Ekers Reference Morgan, Chhetri and Ekers2022) which provides information on the sub-arcsecond structure of over 40,000 GLEAM sources.
While GLEAM-X is one of the several ongoing low-frequency surveys, it continues to play a unique role combining wide fractional bandwidth, low frequencies, and large sky coverage, particularly in the Southern sky. Covering Declinations up to $+30^\circ$ , GLEA M-X is complimentary to Northern sky surveys, like LoTSS, and bridges the gap as a Southern sky survey until SKA-LOW. Likewise, the synergies between other higher-frequency Southern sky surveys like RACS will prove a powerful source for discovery and characterisation of spectral properties for large populations of radio sources. The increased sensitivity and resolution of GLEAM-X compared to the predecessor, GLEAM, establish GLEAM-X as an avenue for novel scientific outcomes that were not available with GLEAM alone.
This paper is presented with the following layout. In Section 2, we describe the observational strategy and specific observations processed for this data release. In Section 3, we outline the changes and improvements made to the GLEAM-X processing pipeline since the initial data release and survey description (Hurley-Walker et al. Reference Hurley-Walker2022a). The properties of the final images are outlined in Section 4. In Section 5, we describe the compact source catalogue produced for this work and quality of this catalogue. All the results are summarised in Section 6.
All positions given in this paper are in J2000 equatorial coordinates.
2 Observations
GLEAM-X utilised the drift scan observing technique, outlined by Wayth et al. (Reference Wayth2015), but adapted to iterate over three hour angles (HA), $HA=0$ h, $\pm1$ h, proven to increase sensitivity (Franzen et al. Reference Franzen, Hurley-Walker, White, Hancock, Seymour, Kapińska, Staveley-Smith and Wayth2021). GLEAM-X DRI comprised four drift scans at Declinations centred on $-26^\circ$ and covering $4\mathrm{h}\leq \mathrm{RA}\leq13$ h with different HAs (Hurley-Walker et al. Reference Hurley-Walker2022a). Observations for GLEAM-X DRII adopt the same observing strategy and cover the South Galactic Pole (SGP) region, spanning $\sim 20$ h $\leq\mathrm{RA}\leq \sim 6$ h and Dec $\leq+30$ . GLEAM-X DRII consists of 28 drift scans, with seven Declination pointings ( $+20^\circ$ , $+1^\circ$ , $-12^\circ$ , $-26^\circ$ , $-40^\circ$ , $-55^\circ$ , and $-71^\circ$ ) and $\mathrm{HAs}=0$ h, $\pm1$ h. Each Declination pointing had four dedicated observing nights separated by one week and were taken from 2020-09-28 to 2020-10-25. A summary of the nights that were included in this data release is presented in Table A1.
As outlined by HW22, observations were taken with an instantaneous bandwidth of $30.72$ -MHz and cycled through five frequency ranges, 72–103, 103–134, 139–170, 170–200, and 200–231 MHz, every two minutes. This data release is the combination of over 1 000 h of observations taken with the MWA and covers a sky area of $\approx$ 12 892 sq.deg. The region of sky covered in this data release and in GLEAM-X DRI is presented in Fig. 1
3 Continuum pipeline
The data reduction process used for both this data release and GLEAM-X DRI is described in full by HW22. However, with the increased sky coverage of this data release, several improvements to the data processing were introduced to improve image quality for low elevation pointings and minimise contamination from bright radio sources. The updated GLEAM-X pipeline is publicly available on the GLEAM-X organisation GitHubFootnote a and as a containerised pipeline that can be run on any platform with Singularity installed (Kurtzer, Sochat, & Bauer Reference Kurtzer, Sochat and Bauer2017). Here we outline the changes to the reduction steps introduced in this data release. As the changes introduced in this data release are typically due to the increased Declination coverage (compared to GLEAM-X DRI where only zenith pointings were processed), the changes have been integrated into the GLEAM-X pipeline and will be implemented in future GLEAM-X data releases.
3.1 Calibration and improved sky model
We perform the same calibration approach as GLEAM-X DRI, by calibrating separately on individual 2-min snapshot observations in a direction-independent manner. A sky model is used that is primarily derived from GLEAM with additional models for complex sources. In this data release, we updated the model for Pictor-A with an improvement on the amplitude calibration, particularly for short baselines. The model for Pictor-A was derived from a combination of the Very Large Array (VLA) Sky Survey (VLASS; Lacy et al. Reference Lacy2020) and observations taken with MWA Phase-II as part of the GLEAM-X observing.
Calibration solutions from GLEAM-X observations that were near in time and pointing were applied to the observations of Pictor-A and then imaged using WSClean (Offringa et al. Reference Offringa2014). A multi-component model of Pictor-A was derived with spectral indices calculated using all channels of the MWA and VLASS for each component. The final model comprised four components: two extended lobes with steep spectral indices and two hot spots with flat spectral indices.
3.2 Sidelobe subtraction
For observations taken at a low elevation, the sidelobe of the MWA primary beam has significant sensitivity that can cause imaging artefacts including alias sources from the sidelobe reflected into the main lobe. In this release, we introduce a sidelobe subtraction for observations where the sidelobe was more sensitive than 0.1% of the primary beam.Footnote b
To reduce the sidelobe power, the visibilities are phase rotated to the sidelobe and then a shallow image is formed to create a model of the sidelobe. The shallow image is generated using WSCean, using the following settings:
-
• imaging the XX, YY, XY, and YX instrumental polarisation with the cleaning performed on peaks calculated from the sum of the squares but subtracted from individual images using the -join-polarizations option;
-
• four 7.68 MHz channels are jointly cleaned for each polarisation product;
-
• a Briggs robust parameter of $0.5$ (Briggs Reference Briggs1995);
-
• only one major clean cycle, where the images are inverse Fourier transformed back to visibilities and subtracted from the data;
-
• up to $4\times10^{4}$ minor cleaning cycles where subtractions are performed in the image plane rather than the visibilities;
-
• cleaning was stopped when negative components were reached
The ‘MODEL_DATA’ column of the observation measurement set was updated during this imaging, and subtracted from the visibilities after completion. The visibilities were then phase-rotated back to the mainlobe. This procedure is similar to the ‘demixing’ often used by LOFAR (van der Tol, Jeffs, & van der Veen Reference van der Tol, Jeffs and van der Veen2007; Morabito et al. Reference Morabito2022), which also subtracts a bright off-axis nearby source from the data. However, demixing involves calibrating using a model for the source, which is then used to subtract the source from the data. In the sidelobe subtraction procedure introduced here, no calibration is derived or applied, a model of the sensitive sidelobe is produced via imaging and subtracted directly. This procedure dramatically reduced the overall power of the sidelobe sources in the visibilities, particularly for aliased sources (described further in Section 5.5), and ultimately resulted in a slight decrease to the final image root-mean-squared (RMS) noise level. Self-calibration was also applied to any observations that were identified as having significant sidelobes, this is described further in Section 3.3.
3.3 Self-calibration
For observations that were identified as having a significant sidelobe, a round of amplitude and phase self-calibration was applied. There was no noticeable improvement on image quality for observations that had no significant sidelobe contributions, thus self-calibration was not performed on these observations. Visibilities were phase-rotated back to the mainlobe pointing and another shallow imaging procedure was performed using WSClean with the same imaging parameters as the shallow imaging for sidelobe subtraction outlined in Section 3.2. The ‘MODEL_DATA’ column of the observation measurement set was updated as part of the WSClean imaging procedure. The ‘MODEL_DATA’ column was then used to calibrate using mitchcal (Offringa et al. Reference Offringa2016). These calibration solutions were only applied if less than 20% of the solutions were flagged.
3.4 Ionospheric assessment
A final check of image quality was performed to identify any potential ionospheric effects resulting in significant blurring of sources in the image before combining snapshot images into the final mosaic. The blurring of sources from the ionosphere has a larger impact on the lowest frequencies (i.e. 72–103 MHz), but the quality check was performed on all frequencies for consistency.
For each snapshot image, a quality control catalogue of sparse (no nearby sources within 1′), unresolved in NRAO VLA Sky Syrvey (NVSS; Condon et al. Reference Condon, Cotton, Greisen, Yin, Perley, Taylor and Broderick1998) and/or Sydney University Molonglo Sky Survey (SUMSS; Mauch et al. Reference Mauch, Murphy, Buttery, Curran, Hunstead, Piestrzynski, Robertson and Sadler2003), and high signal-to-noise (SNR $\gt =$ 50) sources was produced and used for all further calculations. A source finding procedure is performed on each snapshot image using Aegean Footnote c (Hancock et al. Reference Hancock, Murphy, Gaensler, Hopkins and Curran2012; Hancock, Trott, & Hurley-Walker Reference Hancock, Trott and Hurley-Walker2018), and a cross-match with the quality control catalogue was performed with 1′ separation. Observations were flagged if there was fewer than 100 sources in the cross-match catalogue, or if the RMS noise level of the image were significantly higher ( $3\sigma$ ) than the average RMS noise level for GLEAM-X images at the corresponding frequency band. The average and standard deviation of the ratio of integrated flux density to peak flux density ( ${S_{\mathrm{int}}/{S_{\mathrm{peak}}}}$ ) of sources in the image were calculated for every observation for each night. Observations were flagged as having significant blurring if ${S_{\mathrm{int}}/{S_{\mathrm{peak}}}}$ had a mean $\geq1.1$ or a standard deviation $\geq0.125$ .
The mean and standard deviation of ${S_{\mathrm{int}}/{S_{\mathrm{peak}}}}$ were inspected for each observation over each observing night to identify any periods of high ionospheric activity or areas with poor image quality. The lowest frequency band (72–103 MHz) typically saw the largest number of observations flagged, which is expected since this frequency shows the greatest ionospheric blurring. However, the larger field of view also results in greater overlap between snapshot observations and thus minimal reduction in sensitivity in the final mosaic. This triage typically saw 10–15% of observations being flagged in the lowest frequency band in a night of observations and <5% for the higher-frequency bands, although up to 50% of observations were flagged in some nights with particularly large levels of ionospheric activity. Fig. 2 presents the mean ${S_{\mathrm{int}}/{S_{\mathrm{peak}}}}$ over a night of good ionospheric conditions and poor ionospheric conditions.
3.5 Combined multi-night mosaicking
The mosaicking procedure for individual nights followed the same routine that was outlined in GLEAM-X DRI. Individual snapshots were weighted according to the square of the primary beam model and the inverse square of the RMS of image. Snapshots were then co-added using swarp (Bertin et al. Reference Bertin, Mellier, Radovich, Missonnier, Didelon, Morin, Bohlender, Durand and Handley2002) and the point spread function (PSF) of the combined mosaic image was measured using the same approach as that outlined in GLEAM-X DRI. This resulted in a total of 26 mosaic images (20 $\times$ 7.68-MHz frequency channel mosaic images, five $\times$ 30.72-MHz mosaic images and a wideband 60-MHz image over 170–231 MHz) for each night of observations. However, for the Dec $-71^\circ$ drift scan, multiple nights had at least one frequency band that was of poor quality. Consequently, for the Dec $-71^\circ$ drift scan observations, instead of four nights each with 26 mosaics, observations from each night were combined in one step into 26 combined four-night mosaics.
The combined final mosaics for this data release were created by combining all 25 declination strips for each of the 20 7.68 MHz subband images and five 30.72 MHz images using swarp. To optimise the SNR and ensure a smooth co-addition of the declination strips, weight maps were derived from the inverse square of the RMS maps as measured by BANE, a companion tool of Aegean. Weight maps were also calculated based on a declination-dependent sigmoid function to downweight the edges of the mosaics and minimise artefacts between the different declination strips. The 170–200 MHz and 200–231 MHz images were re-gridded and convolved to a common resolution and combined using swarp to create the deep 60 MHz wideband image used for source finding.
3.6 Noise analysis
Here we analyse the noise properties of the source finding 170–231 MHz image and assess the impacts of confusion. We follow the noise analyses of GLEAM-X DRI and Hurley-Walker et al. (Reference Hurley-Walker2017) on a 25 deg $^2$ region centred on RA $2^\mathrm{h}30^\mathrm{m}$ Dec $-40^\circ00'$ as a representative region with typical source distribution and noise properties. The background and RMS maps are measured using BANE and the background is subtracted from the image. An initial source finding routine using Aegean, is used to detect sources down to $0.2\times$ the local RMS noise level. These sources are either masked or subtracted from the background-subtracted image using aeres from the Aegean package. In Fig. 3, we present the histogram of the pixel distribution for the background subtracted, masked, and source subtracted images.
As discussed in HW22, a survey approaching the confusion limit will skew towards a positive distribution. However, for the wideband source finding image, the distribution is almost entirely symmetrical. The higher resolution of GLEAM-X compared to GLEAM means confusion contributes a smaller fraction to the noise (Franzen et al. Reference Franzen, Vernstrom, Jackson, Hurley-Walker, Ekers, Heald, Seymour and White2019). However, at the lowest frequency band of GLEAM-X, 72–103 MHz, the lower resolution means confusion is contributing a significant fraction to the noise levels. Consequently, BANE is unable to accurately measure the noise levels across the entire mosaic in the lowest frequency band. While the lowest frequency images of GLEAM-X are close to the confusion limit, the higher resolution of the higher-frequency images provides sufficient information to reduce the contribution of confusion to the final noise maps. The contribution of confusion to the noise levels at the wideband source finding image is minimal, so we use the source positions from the wideband catalogue for the priorised fitting routine of Aegean.
For the 72–103 MHz image and the four 7.68 MHz subband images, an initial round of source finding using Aegean with the wideband catalogue as a prior for source positions was conducted. This catalogue of sources was subtracted using aeres and the noise and background maps were measured again using BANE. The noise analysis was then performed as described above, to produce a background subtracted image and masked and source subtracted images, using the updated noise and background maps. In Fig. 4, we present the histograms of the pixel distributions for the 72–103 MHz images both before and after the improved background and noise estimates. Using the new background maps, the noise distribution becomes almost completely symmetric and follows the values measured by BANE.
The increasing resolution with the higher frequencies means only the lowest frequency band, 72–103 MHz, is significantly impacted by confusion, thus improved noise and background maps were only generated for the 72–103 MHz image and corresponding 7.68 MHz subband images. All noise and background maps are made available as part of the survey data release.
4 Final images
The 26 mosaics produced at the end of the combined multi-night mosaicking described in Section 3.5 are the combination of 28 nights of observing in 2020 and are Stokes I images across 72–231 MHz in five 30.72 MHz and 20 ‘subband’ 7.68 MHz bands as well as one deep 60 MHz band image across 170–231 MHz. There is decreasing sensitivity towards the edges of the mosaic, we therefore select a region with roughly consistent sensitivity covering 20 h40 m $\leq$ RA $\leq$ 6 h40 m, -90 $^\circ$ $\leq$ Dec $\leq$ +30 $^\circ$ (12,892-deg $^2$ ) for this release. Postage stamps of all the images of this work are available on both the GLEAM-X websiteFootnote d and AAO Data Central.
As described in Section 3.6, we calculate the noise and background maps for each mosaic using BANE with re-calculated background and RMS maps for the the lowest frequency band and corresponding subband images (i.e. 72–103 MHz). As with GLEAM-X DRI, we also performed 10 loops of 3-sigma-clipping to exclude the source-filled pixels in the background estimations. We present an example of 10 sq. deg of the 170–231 MHz wideband mosaic with its associated background and RMS noise maps and the same region in GLEAM ExGal (Hurley-Walker et al. Reference Hurley-Walker2017) in Fig. 5.
The RMS noise of the wideband 170–231 MHz image has a median of 1.5 mJy beam $^{-1}$ , within the expectation of $\sim$ 1.2 mJy beam $^{-1}$ calculated in GLEAM-X DRI. Likewise, in the 30 MHz images we find RMS values ranging from 7–2 mJy beam $^{-1}$ , which is also comparable to values reported in GLEAM-X DRI. While values are slightly higher than those reported in GLEAM-X DRI, we attribute this to the larger Declination coverage and bright A-team sources included in this data release. Large areas of this data release are found to have RMS noise levels <1 mJy; we attribute the slight decrease in these areas to the improvements to the data reduction introduced in this release, for example, the sidelobe subtraction and self-calibration.
5 Compact source catalogue
As with GLEAM-X DRI, we present a source catalogue derived from the images in this work alongside with the mosaics. The details of the source detection and error derivation of this catalogue are described in full in GLEAM-X DRI, however, we summarise the detection strategy here. We use the algorithm Aegean to find sources in the most sensitive 170–231 MHz image that are S/N $\gt 4\sigma$ . The positions for all sources in this catalogue are then used for ‘priorised’ fitting on the other images to measure the flux densities using the local PSF of the relevant narrow-band images. During the priorised fitting stage, if Aegean is unable to determine the error for a given parameter, it sets the error to a value of $-1$ Footnote e , for these reason we use $-1$ as a flag value for the catalogue.
The catalogue is filtered to contain only sources within the defined region of this survey (i.e. 20 h40 m $\leq$ RA $\leq$ 6 h40 m, $-90^{\circ}\leq$ Dec $\leq+30^{\circ}$ ) with flux densities $\geq 5\sigma$ , where $\sigma$ denotes to local RMS noise. Simple spectral models are fit to all sources in the catalogue and sources that are fit well (described further in Section 5.2) have parameters reported in the final catalogue. The spectral modelling procedure is described further in Section 5.2.
The final catalogue, after filtering, consists of 624 866 radio sources detected over 12 892 deg $^{2}$ with 562 302 sources with reported fits for either a power-law or curved spectrum. The increase in source density from GLEAM ExGal to this data release is likely due to a combination of increased resolution and sensitivity. A detailed analysis of source counts of this data release is presented in Venville et al. (Reference Venville2024). We present the main statistics of the compact source catalogue for this data release in Table 1 as well as a comparison to relevant surveys. The catalogue has 388 columns (described in Appendix C).
5.1 Comparison with DRI and GLEAM
Both GLEAM-X DRI and this data release use GLEAM as the basis for flux density calibration. Here we compare the flux densities measured in this work with GLEAM ExGal. A catalogue of sources that are compact in both catalogues ( $S_{\mathrm{int}}/S_{\mathrm{peak}} \lt 2$ ), cross-match within a 15” radius, and have a confident power-law spectral index fit (reduced $\chi ^2 \lt 1.93$ , corresponding to a 99% confidence level) is used for all the following analysis.
We follow the same analysis outlined in GLEAM-X DRI and compare the ratio integrated flux densities as a function of SNR for both GLEAM-X DRII and GLEAM ExGal. The comparison of these ratios is presented in Fig. 6. As with GLEAM-X DRI, there is a trend towards 1.05 at higher SNR sources, however, as the effect is minimal we do not correct for it here. We advise an 8% error for the flux density scale of GLEAM-X DRII when comparing to other surveys, based on the 8% relative error of GLEAM for which our flux density scale is based.
We also compare the fitted spectral indices for components best fit by a power-law spectral model ( $\alpha_{\mathrm{fitted}}$ ), presented in Fig. 7, and find no clear trends. The increase in SNR of GLEAM-X does result in consistently smaller error bars for $\alpha_{\mathrm{fitted}}$ .
5.2 Spectral fitting
We fit two spectral models to the 20 narrow-band flux density measurements for all detected sources in the filtered compact source catalogue described in Section 5, we define the spectral index, $\alpha$ , as $S\propto \nu^{\alpha}$ , such that a negative $\alpha$ describes a negative slope in logarithmic space.
Sources are first fit with a simple power-law model parameterised as:
where $S_{\nu_0}$ , is the flux density in Jy at the reference frequency, $\nu_0$ . The large fractional bandwidth of MWA allows for curvature to be detected and characterised within the MWA bandwidth. We therefore also fit a modified power-law model that parameterises the spectral curvature:
where increasing $|q|$ describes increasing curvature, $q \lt 0$ corresponds to a concave curve and $q\gt 0$ corresponds to a convex curve (Duffy & Blundell Reference Duffy and Blundell2012). Equation (2) has no physical motivation and is used as an initial identification of potential peaked-spectrum sources (PSS). A comprehensive catalogue of PSS in GLEAM-X will be given in a separate publication (Ross et al. in prep). In Fig. 8, shows example SEDs for three sources in this release: a source best fit with a typical, linear power-law, and two best fit with a curved power-laws.
Both the simple power-law and curved power-law models are fit using the SciPy python module that applies a Levenberg–Marquardt non-linear least-squares regression algorithm (Virtanen et al. Reference Virtanen2020). Narrow-band measurements that had a negative integrated flux density were excluded from fitting and fits are reported for sources if both of the following are true:
-
• there were at least 15 integrated flux density measurements;
-
• the $\chi^2$ goodness-of-fit was above 99% likelihood confidence.
Similarly, a curved model is reported instead of the power-law model if the following criteria were all met:
-
• $q/\Delta q \geq 3$ ;
-
• $|q| \gt 0.2 $ ;
-
• the reduced $\chi^2$ for a power-law model was higher than the reduced $\chi^2$ for a curved power-law model.
By adding the criteria $|q|\gt 0.2$ , we avoid the potential for favouring a curved model over a power-law model for spectra that show only a small level of curvature (Callingham et al. Reference Callingham2017).
As with GLEAM-X DRI, the internal flux density scaling consistency of the catalogue is verified using the reduced $\chi^2$ of the model fitting. We adopt 2% as the flux density scaling error as this produces a consistent median reduced $\chi^2$ of unity as a function of SNR.
The distributions of $\alpha$ as a function of flux densities are presented in Fig. 9. The reported spectral indices of this data release are consistent with both GLEAM and GLEAM-X DRI, with a median $\alpha$ in the brightest flux density bin of $-0.84$ .
5.3 Astrometry
Following Hurley-Walker et al. (Reference Hurley-Walker2017) and HW22, the astrometry is calculated using the wideband 170–231 MHz reference catalogue. Only GLEAM-X sources with a high SNR ( $\geq50\sigma$ ) were used to calculate offsets, corresponding to a total of 107 323 sources used for the astrometric analysis. A reference catalogue combining both SUMSS and NVSS was generated by filtering to include only sparse (no internal cross matches within 3’) and unresolved ( $S_{\mathrm{int}}/S_{\mathrm{peak}}\lt 1.2$ ) sources. For astrometric calculations, the reported positions in the reference catalogue were assumed correct and offsets calculated relative to those positions. We find an average RA astrometric offset of $ -7\pm800$ mas and Declination astrometric offset of $ +4\pm800$ mas, where errors on the astrometry are calculated from 1 standard deviation. In the final source catalogue presented in this work, we also report fitting errors on the positions for all sources, which typically are larger than the average astrometric offsets calculated here, we therefore do not correct for these offsets.
It is worth noting, during the processing of individual snapshot images, we conduct an astrometric offset correction based on positions reported in the same reference catalogue based on SUMSS and NVSS. For this reason, while we report sub-arcsecond astrometric offsets in the final mosaic, this is not necessarily a true representation of the astrometric properties of this catalogue. With more widefield low-frequency surveys of the Southern hemisphere being released, future analyses comparing the astrometric positions reported in this catalogue with an independent radio catalogue may be possible. We suggest care is taken when cross-matching this catalogue with other surveys, particularly at higher frequencies. We present the density distribution of the astrometric offsets in Fig. 10.
5.4 Completeness
We follow the same procedure as Hurley-Walker et al. (Reference Hurley-Walker2017) and HW22, and use the wideband source finding mosaic to simulate completeness of the source catalogue. We inject 130 000 simulated point sources evenly distributed across the region of this release, into the 170–231 MHz wideband mosaic. The simulated sources are injected with 26 realisations for different flux density increments spanning $10^{-3}$ to ${10^{-0.5}}$ Jy. Positions for all 90 000 simulated sources remain constant for each realisation, and to avoid confusion, simulated sources are separated by at least 5’. The shape of the simulated sources is simulated based on the local major and minor axis of the PSF, and injected into the mosaic using aeres.
The same source finding procedure as described in Section 5 is performed on the mosaic with injected simulated sources for each of the 26 realisations. We then calculate the fraction of simulated sources that are recovered to estimate the completeness. For any simulated source that was detected but was near a real source, it was only included as ‘recovered’ if the recovered source position was closer to the simulated source rather than the real source.
In GLEAM-X DRI, the completeness was found to be around 50% at $\sim$ 5.6 mJy and 90% at $\sim$ 10 mJy. In this release, we find completeness that is overall consistent with expectations. The completeness for this work is estimated to be 50% at $\sim$ 5.8 mJy and 90% at $\sim$ 10.2 mJy. Fig. 11 presents the spatial distribution of fraction of simulated sources recovered. The smooth mosaicking of multiple drift scans in this release has produced a near uniform sensitivity and completeness across the region in this release, as expected. However, there is some dependence on RA due to bright A-team sources (most notably, Cygnus A and the Crab nebula) at RA $\approx$ 5 h Dec $\approx$ +20 d and RA $\approx$ 21 h Dec $\approx$ +20 d. There is a small Declination dependence for high and low Declinations. For high Declination (Dec > 0), observations are taken at low elevation, looking through a larger amount of ionosphere, and typically have more data flagged for poor data quality issues. Consequently, there is a roll off in completeness at high Declination (Dec >0). However, the roll off in completeness at low Declination (Dec <-80) is more likely due to issues with recovering the simulated sources and differentiating from real sources due to the small sky area around the South Galactic Pole. Fig. 12 shows the fraction of simulated sources recovered as a function of the flux density at 200 MHz. The variations in the completeness results in the large error bars in Fig. 12.
5.5 Reliability and known issues
Following the reliability analysis of GLEAM-X DRI to check how many false detections may be present, we perform a source finding procedure to identify only negative peaks. Initially, we identify 10 305 negative detections, with 1 316 detections with $S_{\mathrm{peak}}\gt 5\sigma$ . As identified in GLEAM-X DRI, there is a tendency for artefacts around bright positive sources producing both negative and positive detections. The filter identified in GLEAM-X DRI was applied to exclude both the positive and negative detections that were near these bright sources. Furthermore, a second filter was applied to the negative sources to exclude any that were within 2’ of a positive source, as these are likely a result of faint sidelobes of sources that were not properly cleaned.
We compare the negative and positive detections after applying the first filter and the second filter as a function of SNR. The reliability for each significance bin is presented in Fig. 13. For a conservative lower limit on the reliability for SNR at 5 $\sigma$ , we do not apply the second filter. We find the number of false detections to be 1.3% which falls quickly to under 1% at 7 $\sigma$ and a plateau for SNR $\gt 7\sigma$ .
In this analysis, a region was identified as containing a higher density of high SNR negative detections relative to the rest of the image. This region, covering roughly 23 h $\leq$ RA $\leq$ 1 h, +15 $^\circ$ $\leq$ Dec $\leq$ +30 $^\circ$ , largely overlaps with a region excluded from GLEAM ExGal due to poor ionospheric conditions during observations. It is possible the combination of the low elevation pointing with sparse to no coverage of GLEAM used for calibration in this processing is contributing to the slightly lower reliability in this region. Furthermore, due to a bright source in the sidelobe (PKS 2365–61), for a subset of the observations taken for the +20 Declination drift scans, there is an artefact that appears as a bright source repeating at regular intervals across the high-frequency mosaics (170–200 MHz, 200–231 MHz, and the subbands for each wide images as well as the deep source finding mosaic 170–231 MHz). The sidelobe subtraction procedure, described in Section 3.2, reduces the flux density of this phantom source, but does not remove the artefact entirely. The repeating artefact in this region is presented in Fig. 14.
6 Summary and outlook
In this work, we present the second data release of the GLEAM-X survey, comprising of images and an extragalactic source catalogue. This data release covers 12 892-deg $^2$ covering 20 h40 m $\leq$ RA $\leq$ 6 h40 m, -90 $^\circ$ $\leq$ Dec $\leq$ +30 $^\circ$ over 72–231 MHz in the form of 26 mosaics with bandwidths 60, 30 and 8 MHz. These mosaics have typical RMS noise levels ranging from $\approx1$ mJy beam $^{-1}$ in the 60 MHz bandwidth mosaic used for initial source finding to $\approx$ 10 mJy beam $^{-1}$ in the 8 MHz mosaic at the lowest frequency, 72–80 MHz. The RMS noise levels of this region are an order of magnitude lower than that reported in GLEAM ExGal (Hurley-Walker et al. Reference Hurley-Walker2017) and within expectations for the overall survey as reported by HW22.
We also present a catalogue of 624 866 components, 562 302 of which are well fit by either a power-law or curved power-law spectral model. This results in a source density of 48 deg $^{-2}$ . We estimate this catalogue is 98% complete at 50 mJy for the entire region with near uniform sensitivity and completeness across the region covered in this release, as expected. We estimate a reliability of 98.7% at a 5 $\sigma$ level. We identify 18 869 sources as being better fit by a curved power-law, suggesting an order of magnitude increase in the PSS sources from those identified using GLEAM (Callingham et al. Reference Callingham2017), and probing a population of fainter PSS. A comprehensive catalogue of PSS and population analysis will be described in an upcoming paper by Ross et al. (in preparation).
The data reduction of this release largely follows that of GLEAM-X DRI (Hurley-Walker et al. Reference Hurley-Walker2022a), but with some noticeable improvements that are now incorporated to future GLEAM-X processing. The subtraction of a model of sensitive sidelobes for observations taken at low elevation reduced the occurrence and/or S/N of alias sources appearing in the main lobe. The use of self-calibration for fields with bright and complex sources in the main lobe or sensitive sidelobes was also introduced and resulted in an improvement in the RMS noise levels of snapshot images. Lastly, a stricter quality assessment of blurring due to the ionosphere resulted in fewer poor quality snapshot observations being included in the final mosaics of this release, resulting in an overall improvement in the mosaic quality.
The uniform and high source density provided by this data release has enabled a study of the source count statistics and a measurement of the angular correlation function, which are presented by Venville et al. (Reference Venville2024). A search for transients on seconds to hours timescales has also been carried out and will be described in full by Horvath et al. (in preparation).
We aim to release the remainder of GLEAM-X via a series of releases. The POlarised GLEAM-X Survey (POGS-X) will be described fully by Zhang et al. (in prep); a Galactic plane release combining both GLEAM and GLEAM-X observations using joint deconvolution, which will be described in Mantovanini et al. (in prep); finally, we aim to produce a contiguous all-sky coverage and release the remainder of GLEAM-X (Ross et al. in preparation).
The images and source catalogue of the GLEAM-X DRII data release are publicly available at AAO Data Central (https://datacentral.org.au/services/cutout/).
Acknowledgements
NHW is supported by an Australian Research Council Future Fellowship (project number FT190100231) funded by the Australian Government. This scientific work uses data obtained from Inyarrimanha Ilgari Bundara/the Murchison Radio-astronomy Observatory. We acknowledge the Wajarri Yamaji People as the Traditional Owners and native title holders of the Observatory site. Establishment of CSIRO’s Murchison Radio-astronomy Observatory is an initiative of the Australian Government, with support from the Government of Western Australia and the Science and Industry Endowment Fund. Support for the operation of the MWA is provided by the Australian Government (NCRIS), under a contract to Curtin University administered by Astronomy Australia Limited. This work was supported by resources provided by the Pawsey Supercomputing Research Centre with funding from the Australian Government and the Government of Western Australia. Access to Pawsey Data Storage Services is governed by a Data Storage and Management Policy (DSMP). ASVO has received funding from the Australian Commonwealth Government through the National eResearch Collaboration Tools and Resources (NeCTAR) Project, the Australian National Data Service (ANDS), and the National Collaborative Research Infrastructure Strategy. This paper makes use of services or code that have been provided by AAO Data Central (datacentral.org.au). This research has made use of NASA’s Astrophysics Data System Bibliographic Services. The following software was used in this work: aoflagger and cotter (Offringa, van de Gronde, & Roerdink Reference Offringa, van de Gronde and Roerdink2012); WSClean (Offringa et al. Reference Offringa2014; Offringa & Smirnov Reference Offringa and Smirnov2017); Aegean (Hancock et al. Reference Hancock, Trott and Hurley-Walker2018); miriad (Sault, Teuben, & Wright Reference Sault, Teuben, Wright, Shaw, Payne and Hayes1995) NumPy (Dubois, Hinsen, & Hugunin Reference Dubois, Hinsen and Hugunin1996; Harris et al. Reference Harris2020); AstroPy (Astropy Collaboration et al. 2013); SciPy (Oliphant Reference Oliphant2007); Matplotlib (Hunter Reference Hunter2007). We also made extensive use of the visualisation and analysis packages DS9Footnote f and Topcat (Taylor Reference Taylor, Shopbell, Britton and Ebert2005). This work was compiled in the very useful online editor Overleaf.
Data availability
The catalogue associated with this paper is available at CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via https://cdsarc.cds.unistra.fr/viz-bin/cat/VIII/113. Mosaics associated with this data release are made available via AAO Data Central and SkyView.
A Observations
B Noise Analysis
Given the large sky coverage of this data release, we repeat the noise analysis outlined in Section 3.6 for a region of lower quality and lower elevations for the MWA. In Fig. B1, we present the pixel distributions for a 25 square degrees region in the lowest wideband image covering 72–103 MHz centred at RA $2^\mathrm{h}30^\mathrm{m}$ Dec $+15^\circ00'$ . The lower elevations of this region can elongate the PSF, however, in GLEAM-X DRI, we outlined a strategy for ensuring the PSF is well defined over the entire region. Consequently, we find the well defined PSF, even in areas of low elevation, ensures we are able to accurately detect and subtract sources from the lowest band in order to reduce the impact of confusion. Comparing the distributions for the optimal scenario (presented in Fig. 4), to that of the ‘worst case’ scenario in Fig. B1, we can see the strategy for estimating the noise and RMS maps is still appropriate and we are not significantly impacted by confusion.
C Catalogue Column Names