1 INTRODUCTION
The largest astronomical catalogues are the all-sky optical ones that reach the 20th magnitude. The only such catalogues are the United States Naval Observatory USNO-B1.0 (Monet et al. Reference Monet2003) and its USNO-A (Monet Reference Monet1996) antecedents, the SuperCOSMOS Sky Surveys,Footnote 1 and now also the Gaia Footnote 2 DR1 $\textsl {Gaia}$ Collaboration (Brown et al. Reference Brown2016) that gives single-band coverage of unprecedented astrometric accuracy. Also notable are the Automated Plate Measuring (APM) machine catalogue (Irwin & McMahon Reference Irwin and McMahon1992) that covers the sky excluding the Galactic plane, and of course the Sloan Digital Sky Survey (SDSS)Footnote 3 that gives unprecedentedly deep coverage to a third of the sky. Those catalogues are so large that downloading is foreclosed for most, and distribution usually consists of getting a copy from someone who already has it.
The need is to transcend this barrier and make these data available via standard internet download. Herewith, I present a concise compilation of USNO-B1.0/A1.0, APM, and SDSS data that endeavours to do that. This ‘All-Sky Portable’ catalogue (hereinafter: ASP) amounts to 1 163 237 190 optical sources over the whole sky, presented in a minimized format of tenth-arcsecond precision astrometry, hundredth-of-magnitude precision red–blue photometry with stellar-fuzzy point spread functions (PSFs), and flags as to proper motion, epoch, variability, and provenance for each of photometry and astrometry. Total size is 11 Gb, zipped to 9 Gb. No Gaia data is used, but photometric comparison is given below in Section 5.
The concise downloadable size is not ASP’s only deliverable. Also, the presented data is otherwise difficult to obtain in bulk form. ASP includes all well-detected red/blue sources from the USNO-B1.0/A1.0, APM, and SDSS catalogues. These four catalogues have problematic access in bulk:
-
• The full USNO-B1.0 is 80 Gb in size and is not available for download on-line. Its data is packed into a proprietary format, hence is readable by dedicated software only. The USNO website does not provide the bulk catalogue on-line for reasons of bandwidth and readability, so it is accessible only via on-line queries.
-
• The APM catalogue is 8 Gb in size and was the premier scanned photometric product of its day (turn of the century), but was distributed on tape by individual request only. Its data are in binary SUN formats with structured headers giving global parameters. There is no place to download it; it is accessible only via on-line queries.
-
• The SDSS photometric catalogue is 70 TB large but has been made available in a condensed form as the ‘datasweep’ editionFootnote 4 of 300 Gb size, beyond the reach of those without large bandwidth.
-
• The USNO-A1.0 is 6 Gb in size and was distributed on 10 CDs in the late 1990’s; it presented 1955-epoch POSS-I astrometry not available in its successor USNO-B1.0. It was twice-superseded and is today not obtainable; it is accessible only via on-line query.Footnote 5
-
• The SuperCOSMOS catalogue (not included in ASP) is 4 TB large and is not available for bulk download.
Figure 1 shows where ASP presents these catalogues’ data on the all-sky map, and the ratios used. The ASP optical data is useful for efficient all-sky matchings and searches, and for data-driven analysis requiring continual access and which can be used to formulate queries onto more comprehensive on-line catalogues. Interested users would include those who need rapid and repeated all-sky optical processing, those who require an optical database for a client application, and those with limited bandwidth. A discussion of the assembly of this data, along with a brief round-up of the original surveys and the USNO-B1.0/A1.0, APM and SDSS catalogues, and details of this catalogue’s structure, follows.
2 SOURCE DATA AND SELECTION
Large sky photographic surveys were done throughout the 2nd half of the 20th century, generally using plates of 6.4 deg2 coverage. Northern sky surveys consisted of the first-epoch National Geographic Society-Palomar Observatory Sky Survey (POSS-I) (Abell Reference Abell1959) that took red and blue plates (spaced at 6° intervals) on the same night, thus ensuring true red–blue colour even for variable objects, and the second-epoch POSS-II (Reid et al. Reference Reid1991) that took deeper plates (spaced at 5° intervals) albeit in different epochs. Southern sky plates were photometrically equivalent to POSS-II; these were taken by United Kingdom Schmidt Telescope (UKST) surveys, encompassing SERC/ESO-R/AAO-R projects that covered the entire Southern sky.
The POSS-I survey used violet O (4 050 Å centred passband) for blue, while the POSS-II and UKST surveys used blue–green Bj 4 850 Å; red is centred on 6 400 Å in all those surveys. In calculating blue–red colour, the POSS-I O-E spread is thus about 1.5 times that of the other surveys’ Bj-R. This greater spread, plus that the POSS-I plates showing accurate blue–red colour (due to both plates being taken on the same night, thus minimizing variability issues), makes the POSS-I photometry especially desirable. Thus, in ASP, it is used as the first choice wherever available, which means down to declination −33°. The POSS-I and UKST surveys were included in the APM and USNO-B1.0/A1.0 catalogues, while the POSS-II survey appears only in the USNO-B1.0.Footnote 6
The APM catalogue covers sky away from the Galactic plane, see Figure 2 for its coverage. It consists of 270 M optical sources in 959 data files each of which presents one red–blue plate pairing; 498 files present POSS-I data that cover sky north of declination −3°, and 461 UKST files cover sky south of declination +3°; thus, the equator was doubly covered. The scanned plates were glass copies that were contrast-enhanced to reveal the faintest sources; this removed any dynamic range but simplified the magnitude calculations. The APM plate depths were calibrated against modelled sky values, except that the POSS-I E (red) plate depths were defined as equal to 20.0. Where red and blue sources matched together within 2 arcsec, they were reported as a single two-band source. There were four PSF classes: stellar, fuzzy, blended, and noise. APM data accepted into ASP were those with at least one band having a stellar or fuzzy PSF, or with both bands showing blended sources that are reported by ASP as either fuzzy or no-PSF. Single-band POSS-I data can show faint sources but were unfortunately flooded by spurious exposure artefacts (Flesch Reference Flesch2016). It was not possible to distinguish between the valid and invalid single-band POSS-I data, so with much regret I dropped all those unless matched elsewhere; most valid ones are fortunately recovered by the SDSS and USNO-B1.0 POSS-II data. The APM provides plate astrometry, which means 1950’s epoch for its POSS-I data.
The USNO-B1.0 (hereinafter: USNO-B) catalogue is all-sky and so covers the Galactic plane exclusively in ASP (except for small numbers from USNO-A1.0). It presents 1 042 M optical sources in a processed all-sky map partitioned into 1 800 files. Each record presents red–blue photometry from surveys of two epochs, plus near infrared photometry for the 2nd epoch, as available. It includes 7 435 scanned plates that consist of 937 POSS-I red–blue plate pairings covering sky of (−33° < δ < +90°), 897 POSS-II red–blue–infrared plate combos covering sky of (−3° < δ < +90°), and 822 UKST red–blue–infrared combos covering sky of (−90° < δ < −3°). USNO-B identified matching sources within a 3-arcsec matching radius, but also proper-motion evaluations were done within a 30-arcsec radius, hence enabling the recognition of moving stars as a single object. Whole astrometry was accordingly calculated into epoch 2000 wherever two epochs of source astrometry were available. There are three PSF classes: stellar, fuzzy, and no-PSF. USNO-B required two detections (out of three bands and two epochs) to accept any source; sources detected on only one plate were dropped.
The USNO-A1.0/A2.0 catalogues presented astrometry at the plate epoch, whereas their successor USNO-B presented astrometry projected to epoch 2000. The USNO-A1.0 presented epoch-1955 POSS-I data for all (δ > −33°), whereas the later USNO-A2.0 presented POSS-I data only for (δ > −17°), replacing the difference with UKST data. Therefore, the USNO-A1.0 (hereinafter: USNO-A) is used by ASP as the complete source for epoch-1955 POSS-I data not already provided by APM, in particular over (−33° < δ < −3°) and on the Galactic plane; the use of epoch-1955 data is discussed at the end of Section 4. The USNO-A included only sources detected in both red and blue bands within a 2-arcsec matching radius, and presented magnitudes of tenth-magnitude precision without any PSF information. ASP uses USNO-A UKST data (δ < −33°) only where unmatched to any other.
The SDSS data is taken from their DR9 datasweeps catalogue with 361M optical sources that covers ≈35% of the sky. The SDSS photometry is 5-band ugriz and the datasweeps acceptance criterion is that each source should have a ‘reasonable detection’ in at least one band, i.e., has u<22.5 or g<22.5 or r<22.5 or i<22.0 or z<21.5 (extinction corrected) (Aihara et al. Reference Aihara2011, Section 5, footnote 84). Only their red (r: 6 200 Å) and green (g: 4 750 Å) bands are presented in ASP; they are AB magnitudes that are not extinction-corrected so can show ⩾0.3 mag fainter than elsewhere presented. Many rows (25M) were missing photometry for one or both r and g bands, and some magnitudes were unexpectedly faint, even into the 30’s, which could not be useful, especially given that r = 24.80 and g = 25.11 are the zero-flux magnitudes (Stoughton et al. Reference Stoughton2002, Table 21). I wished to include only objects with useful r and g magnitudes into ASP, so my simple acceptance criterion is that both bands be brighter than their zero-flux magnitudes, given that the ‘reasonable detection’ criterion was already met. As a supplement, 9M additional SDSS DR8 objects were taken from the 160M-object SDSS-XDQSO quasar targeting catalogue of Bovy et al. (Reference Bovy2011); they show extinction-corrected asinh magnitudes (‘luptitudes’ in the SDSS lexicon), which met the ‘reasonable detection’ standard and which, for these 9M objects, met my acceptance criterion where the datasweeps AB magnitudes did not. These look valid on spot checks of SDSS finding charts, and so they are added for completeness. SDSS photometry provides two PSF classes: stellar and galaxy (i.e., fuzzy), which are decided over the whole source bandwidth, and hence are written in ASP as the same for both bands.
3 PROCESSING AND ASSEMBLY OF THE ASP CATALOGUE
Processing commenced with the APM data that present 959 complete red–blue plates that overlap neighbouring plates, thus allowing the use of shared stars to recalibrate the plate depths.Footnote 7 The POSS-I (Northern sky) and UKST (Southern sky) plates overlap each other on the equator, and their red bands are both commensurate with Cousins R; thus, they could be calibrated against each other in bulk. This comparison showed that, consistent with an earlier exercise (Flesch & Hardcastle Reference Flesch and Hardcastle2004), the POSS-I E plate depths were listed at 0.16 magnitude too bright as a group. Thus, all APM POSS-I E plate depths and magnitudes were made 0.16 magnitude fainter, and the same was done for the POSS-I O magnitudes to preserve the internal APM (O–E) colour.
Next, red–blue photometry was selected from the USNO-B data, preferring two-band POSS-I magnitudes where available. The USNO-B data identifies the source survey and plate for each record; these were used to parcel out its data into 2 656 files corresponding to the original plates. These plate data are incomplete because USNO-B already de-duplicated them across overlapping plates within a 3-arcsec matching radius, but were needed here for calibration purposes. Tycho-2 stars and other unallocated objects were set aside to be added back in later. Spurious sources with magnitudes fainter than 23.0 were identified and removed. USNO-B has 910 plates in common with APM; their photometry were now normalized (calibrated) plate-by-plate7 onto their APM equivalents, using shared stars. The bulk result showed that, in Galactic places not covered by APM, POSS-I magnitudes were to be adjusted by +0.09 in E and by +0.30 in O, and UKST magnitudes were perfectly aligned in R and to be adjusted by +0.40 in Bj; this was done. USNO-A magnitudes were similarly calibratedFootnote 8 : the bulk Galactic adjustments for POSS-I magnitudes were +0.1 in both E and O, and for UKST magnitudes were −0.5 in R and −1.1 in Bj.
The 910 plates with both APM and USNO-B coverage were now individually combined, matching objects one-to-one across catalogues within a 5.35-arcsec radius; it was important not to generate spurious duplicates, thus the large radius.Footnote 9 The USNO-A POSS-I plate data were now added into the USNO-B/APM plate data in the same way. Two polar POSS-I plate files were combined into a single polar file; similarly, four POSS-II polar plate files were thus combined. The outcome was 2 700 red–blue plate files corresponding to 936 POSS-I plates, 894 POSS-II plates, and 870 UKST plates.
De-duplication was now done on overlapping same-survey plates, one-to-one to a matching radius of 5.35 arcsec; approx. 50 million duplicates were removed. Numerous spot checks on well-separated duplicates confirmed that duplicates were being accurately targeted; the large radius was needed because astrometric offsets at plate margins can be large, up to 3 arcsec, and the overlapping plates are offset opposite to each other, thus doubling the total offset. Most duplications came from the APM files that contained full plate data including overlapping areas; also, USNO-B had performed de-duplication of overlapping plates to a 3-arcsec radius; thus, many more were still eligible for de-duplication, similarly for USNO-A. All these de-duplications were across plate files, and never internally to any file, so that close doublets were preserved. ‘Best’ photometry was kept from matching objects, preferring two-band photometry to single-band, and preferring photometry with PSFs to that without. A total of 3 364 overlapping de-duplication exercises were done on the POSS-I plate files, 3 269 on the POSS-II plate files, and 3 171 on the UKST plate files.
Each of the three surveys (POSS-I, POSS-II, and UKST), having been fully calibrated and de-duplicated, was now assembled into whole sky data and partitioned into computing-friendly ‘tiles’ of sky (of 10° RA × 9° DEC) with arcminute-wide margins added onto their edges to prevent edge effects. Matching tiles across surveys were now overlain and their objects de-duplicated one-to-one within a 4.75-arcsec radius (found via annulus counts), keeping ‘best’ photometry as before. Two-band POSS-I photometry was always retained as the first choice, followed by any other two-band photometry, else single-band. Tycho-2 stars from the USNO-B were also added back in, matched within a 5.35-arcsec radius. USNO-A UKST data was added where unmatched to any other. The resultant APM/USNO-A/B data number 957 205 473 sources and represent an all-sky map of relatively uniform photometric depth, although less deep near the Galactic plane where high stellar density challenges instrument design limits.
The SDSS DR9 sweeps data was now merged onto this, identifying shared objects one-to-one within a 3.35-arcsec matching radius (found via annuli counts, refer Figure A2); the closer radius is because of the excellent astrometric accuracy of the SDSS data, so that the astrometric errors are on the APM/USNO-A/B side only. Last, the SDSS data from XDQSO (Bovy et al. Reference Bovy2011) was added where not already present, for completeness. Because the uniformity of the APM/USNO-A/B photometry is potentially useful in research, the SDSS photometry overwrites it only where two-band APM/USNO-A/B photometry does not already exist. However, the superior SDSS astrometry is always retained wherever matched. This policy means that ASP presents 323 126 904 objects with SDSS astrometry, but of those only 206 031 717 bear SDSS photometry.
With merging completed, the sky tiles were cropped to their fiducial dimensions (thus freed of edge effects) and re-assembled into the 100 files presented on the web site. The completed ASP catalogue has 1 163 237 190 sources, each of which is flagged as to the provenance of its photometry and, separately, its astrometry.
4 METHODOLOGY OF THE ASP CATALOGUE’S PRESENTATION
Each ASP object is reported simply with its astrometry, magnitudes, PSFs, and flags, but those data can be sourced from up to six places from four input catalogues. To manage this, I endeavoured to select the best photometry and most recent-epoch astrometry for each object, via the following hierarchy of usage:
-
• For photometry, two-band photometry is always preferred to single-band. Within that constraint, POSS-I is the top choice because of the wide spread between its red 6 400 Å E and violet 4 050 Å O bands, plus that its (O–E) colour is reliable due to both plates being taken on the same night. Second preference goes to POSS-II and UKST (which both feature blue–green 4 850 Å Bj) because they comprise a relatively uniform all-sky map that can be useful for research. As a tie-breaker, APM photometry is preferred to USNO-A/B because of its recalibrated accuracy. Last, the deeper SDSS photometry is ‘clipped on’ where two-band photometry was not already present in the APM/USNO-A/B all-sky map. In this way, researchers can retrieve the uniform APM/USNO-A/B sky map shown in Figure 3 by simply disregarding the SDSS photometric data.
-
• For astrometry, the superior SDSS astrometry is always kept; its mean epoch is 2005 (observed 2000–2009). Next, USNO-B projected their astrometry to epoch 2000 wherever two survey epochs were available, and ASP flags those accordingly. Remaining objects have the mean epoch of the POSS-II, UKST, or POSS-I astrometry with which they are flagged, being 1993 for POSS-II (observed 1985–2000), 1985 for UKST (observed 1978–1990), and 1955 for POSS-I (observed 1949–1958). Note that the ‘mean epochs’ used here have a typical uncertainty of ±5 years to the true observational epoch of any datum. Counts by catalogue and survey are given in Table 1.
The presented PSFs are usually from the photometry provenance but can come from either. In very rare cases the red and blue magnitudes are from different provenances because only single-band source data were available; in such cases the cited photometry provenance pertains to the blue magnitude.
The USNO-B gives a proper motion flag where stars were identified as moving; this is included in ASP where USNO-B’s confidence of the proper motion is >90%. The nominal count is 157 204 744 objects so flagged, but USNO-B over-reported proper motion incidence by ≈50 × in a bid for completeness, so that flag should always be taken as indicative only and needing confirmation in individual cases.
ASP provides 25 621 092 objects with 1955-epoch POSS-I astrometry, which can be moving stars matching to nearby later-epoch signatures, but are present for other reasons also: (1) objects close to bright stars or on galaxy disks that were not identified by later survey reductions, see Figure 4 for an example, (2) moving asteroids that are unmatchable, (3) variable objects that have faded, (4) POSS-I plate artefacts that would be invalid objects, and (5) POSS-II/UKST plate artefacts and edge effects that lose 2nd-epoch stars, thus leaving those POSS-I objects unmatched. Due to different epochs, moving stars can manifest two or even three times in ASP, each flagged with its own epoch, but I estimate that the incidence of this is just ≈0.1% of all objects and usually one of those matching sources will have the proper motion flag, see Figure 5 for an example.
5 PHOTOMETRIC COMPARISON OF CATALOGUES
The POSS-I and POSS-II/UKST surveys used red plates that replicated the standard Cousins R band, but their blue plates differed from the standard Johnson B (4 400 Å) band and from each other. The SDSS r and g bands differ again. ASP reports its magnitudes in those original survey photometries, but with just one survey per each object, identified with a flag. The user may wish to standardize to R and B and to Johnson V for purposes of comparison, so some convenient literature-based transformations follow. First, the standard colour transform from Minkowski & Abell (Reference Minkowski and Abell1963) is
Solving for V
For POSS-I E and O (4 050 Å) photometry, McMahon et al. (Reference McMahon, White, Helfand and Becker2002) give this transform to Johnson B:
which, combining with (1) and taking E = R, gives
For POSS-II and UKST photometry, Blair & Gilmore (Reference Blair and Gilmore1982) give this transform of Bj (4 850 Å) with Johnson B and V:
Similarly, Irwin, Demers, & Kunkel (Demers) give
Those two transforms agree within their error margins, so taking their mean and combining with (1) gives
For SDSS photometry, Lupton (Reference Lupton2005) gives
All these transformations enable comparison of POSS-I, POSS-II/UKST, and SDSS magnitudes by converting each to Johnson V. The following transformations to Johnson B and Cousins R enables comparison of (B − R) colour:
For POSS-I data, R = E, and combining with (2):
For UKST and POSS-II data, R equates to Cousins R:
For SDSS data, Lupton (Reference Lupton2005) gives
The recent Gaia DR1 $\textsl {Gaia}$ Collaboration (Brown et al. Reference Brown2016) presents optical data in a single broad (3 300 Å–10 500 Å) ‘G’ (for ‘Gaia’) band to their limit of G = 20.7. It is interesting to compare the ASP optical depths to that limit, so I use the transformation from V and R to G given by Jordi et al. (Reference Jordi2010):
This transform enables mapping to Gaia G for all ASP data since we already have transforms to V and R.
These transforms enable us to compare the plate depths of the catalogue/surveys included in ASP. APM gives R/Bj or E/O plate depths for its 959 plates, and their mean depths are shown in Table 2 along with USNO-A/B mean plate depths derived from magnitude profiles. SDSS ‘depth’ is a nebulous concept, but I include it in Table 2 by taking the SDSS data frequency maxima of r = 22.50 and g = 22.98 in the SDSS data as surrogates for depth; perhaps deeper values would be justified. The mappings to V and G show:
-
1. SDSS coverage is ≈2 magnitudes deeper than other coverage. Runner-up is the APM Southern UKST coverage, which at average plate depth of V = 21.68 is also impressively deep.
-
2. APM data is deeper than USNO-A/B because of its deep blue band coverages ranging 0.6–1.0 magnitude deeper than those of USNO-A/B. This may have been consequential to APM’s use of contrast-enhanced glass plate copies that removed all dynamic range but revealed the faintest objects, which appears to have been particularly effective with blue plates.
-
3. USNO-A/B plate depths away from the Galactic plane (i.e., onto the APM footprint shown in Figure 2) reach about a magnitude deeper than those onto the Galactic plane. Also, very dense places such as the Galactic Bulge and the LMC and SMC have shallower (and unreliable) coverage, due to operational limits of the plate measuring machine.
-
4. SDSS coverage is ≈2 magnitudes deeper than the Gaia limit of G = 20.7. South sky UKST coverage away from the Galactic plane reaches G = 21.0, also deeper than the Gaia limit.
Note: Only two-band stellar objects were used in this analysis.
Calibration of these catalogue–surveys is discussed in Section 3.
6 THE SKY AS PRESENTED BY ASP
The ASP data is found to have astrometric accuracy onto the SDSS epoch of 90.96% within 1 arcsec and 96.65% within 2 arcsec. The excellent SDSS astrometry is used as the benchmark for this calculation. In ASP, 117 095 187 APM/USNO-A/B objects match to SDSS objects within the matching radius of 3.35 arcsec. Of those, 87.49% match within 1 arcsec and 95.37% match within 2 arcsec; this is taken as representative of all 840 110 286 ASP objects without SDSS astrometry. The 323 126 904 ASP objects with SDSS astrometry are taken as perfectly positioned. Combining these two groups, the overall astrometric accuracy works out to that stated above. For stellar-PSF objects only, the match rates over 47 830 533 shared objects are 81.64% matching within 0.5 arcsec, 92.99% within 1 arcsec, and 97.48% within 2 arcsec, which can be taken as the practical astrometric accuracy of ASP sky away from the SDSS footprint.
The ASP data is a useful listing of the photographed sky, but its survey-based coverage is inhomogeneous on both large and small scales, and shows processing artefacts. To confer some overview, some ASP sky tiles are shown as density maps, refer Figures 6–10 with discussions thereon. Individual photographic plates are clearly seen because plate overlaps are denser due to unique data contributed by each plate, plus some well-offset duplicates and artefacts.
Along those lines, a few caveats to keep in mind:
-
1. Fuzzy (galaxy-like) objects from the APM or USNO-B usually display magnitudes which are too bright. This is because the magnitudes were calculated from the plate image size using a stellar (not fuzzy) PSF model. For R>16, the mean conversion to true magnitude is R true = $\frac{2}{3}$ (11 +R), but the scatter is too great to apply this individually. Divergence increases non-linearly for R<16 and can be many magnitudes too bright for R<10.
-
2. USNO-A/B had difficulty setting sky levels in high-density places like the Magellanic Clouds and Galactic bulge, so the outcome there was spurious data interwoven with valid data. Use with great caution in those places.
-
3. USNO-A/B plates show label-caused artefacts. Their POSS-I plates show a small rectangular label in the upper left corner that manifests as spurious r≈20.5 b≈21.1 data there. The USNO-B POSS-II plates have an opaque label on the lower right corner that is without data but is fully covered by neighbouring plates; however, an edge-shaped artefact resulted because objects did not get de-duplicated across the edge there. They can be seen in Figures 6 and 7.
-
4. USNO-B included objects that appeared at least twice in their five bands of data (being first-epoch red and blue and second-epoch red, blue, and infrared). Thus, since ASP reports just two bands of data, many USNO-B objects appear as just single-band and those are found to have reduced reliability. Single-band objects on the SDSS footprint are not matched to SDSS data (else they would be presented with two-band SDSS photometry) and so are often false. Single-band objects comprise 12.89% of all objects in ASP, and when selecting one it would be best to confirm its existence on a DSSFootnote 10 image or SDSS finding chart.
-
5. Some well-offset duplicates remain in the data, including cases where a fuzzy-PSF object conceals a point source that was nonetheless contributed by another source catalogue. De-duplication processing can also (rarely) copy photometric attributes across close neighbours, and duplicates from different epochs can be present, refer Figure 5. The user should be alert to the possible presence of such duplicates, and check with DSS or SDSS as necessary.
7 CATALOGUE LAYOUT
The ASP catalogue uses an efficient data design to minimize its size. Part of this design is that RA and DEC are presented as offsets from stated zero points. The RA zero point is given by a 1-byte field that identifies the tenth of a RA hour, e.g., ‘117’ ⇒ 11.7 RA hours = 175.5 RA degrees. The DEC zero point is given by the file name × 1.8°, e.g., ‘N17’ ⇒ 17 × 1.8° = 30.6°. This design allows RA and DEC to be presented as simple 2-byte fields which give the offset from its zero point in units of 0.1 arcsec.
Accordingly, this ASP catalogue presents each record as a 10-byte binary row, with 1 byte for the RA zero point, 2 bytes each for the RA and DEC offsets, 2 bytes each for red and blue photometry, and 1 byte for flags. RA and DEC fields use an unsigned integer binary little-endian format as are used in PCs nowadays; alternatively they can be input as signed integers, and where the resultant value is negative, add 65 536.
The ASP catalogue comes in 100 files, each of which holds all RA for a 1.8°-wide (in DEC) ring of sky. These 100 rings thus stack the sky from pole to pole. The Southern files (S00.asp–S49.asp) are written just like Northern files (N00.asp–N49.asp) with all declinations written as positive numbers only. The user should process them identically to the Northern files, treating South as North, and convert the declinations to negative at the conclusion by simply affixing the ‘-’.
The ReadMe gives a thorough discussion of all the fields, and includes some BASIC and Python code that performs the processing. It is available on the file download web page.
The ASP catalogue is available for download as 15 zipped files of 5+ ASP files each,Footnote 11 with total size of 9 Gb zipped, 11 Gb unzipped. No FITS files are provided due to the large size of the data. There is no on-line query client; ASP is provided only as a bulk catalogue for download.
8 MISCELLANEOUS NOTES
There are 102 048 ‘inferred’ optical magnitudes in the ASP data. They are legacy data from Flesch & Hardcastle (Reference Flesch and Hardcastle2004), which joined USNO-A2.0 point sources to APM isophotal ellipses to find cases where two point sources were required to generate the APM ellipse but only one was reported—therefore, the missing point source could be astrometrically calculated, i.e., inferred. Since then, the addition of USNO-B data has filled in most of those missing data. In ASP, they are used only to fill out the photometry for single-band sources, thus showing that the object is seen in the other band also. Inferred magnitudes have an expected error of 1 magnitude and can be greater.
Variability greater than 1 magnitude was evaluated across overlapping plates throughout the assembly of this catalogue. Such variability is flagged in both red and blue bands for 3.28% of all objects. However, it should be treated as indicative only and needing confirmation in individual cases.
9 CONCLUSION
This ASP optical catalogue is presented with 1 163 237 190 optical sources over the whole sky, taken from earlier optical catalogues produced in the years 1996–2003 and which are no longer available in bulk, plus SDSS data. The data is in a binary format of tenth-of-arcsecond precision astrometry, hundredth-of-magnitude precision red–blue photometry with stellar-fuzzy PSFs, and flags for proper motion, variability, epoch, and provenance of each of photometry and astrometry. The catalogue is downloadable at a total zipped size of 9 Gb.
ACKNOWLEDGEMENTS
Great thanks to Dave Monet (USNO) and Mike Irwin (APM) for permission to use their optical catalogues in this work, and for helpful advice. Also, much gratitude to Adam Myers for provision of the SDSS sweeps data which has made ASP complete. An extra shout-out to Dave Monet for heavy lifting: 10 000 survey plates scanned, amazing.
This work was not funded.
APPENDIX
A USE OF ANNULI TO DETERMINE MATCHING RULES
A substantive task in producing ASP was to avoid the inclusion of duplicate objects. The three main sources of duplicates were input catalogues presenting the same sky from the same survey, or the same sky from different surveys, or adjacent survey plates with margins overlapping each other. These were matched using a one-to-one matching algorithm that matches astrometrically closest objects and removes those objects from their respective pools, then repeats iteratively to completion. But the overriding question is what maximum matching radius (i.e., offset) to use to identify those duplicates, because matches at large offsets are more likely to be discrete objects. Such true objects constitute a background that we wish to preserve, so for large processing we need a statistical approach to discern the duplicates from the true background. This involves counting the number of matches for each binned matching radius and using those counts to understand the data. However, doing so baldly is not good enough, as is shown in Figure A1.
Figure A1 is not helpful: For large matching offsets, the count profile is non-linear and not even monotonic, so the background of true objects is not seen. The solution is two-fold: (1) use many-to-one matching instead of one-to-one, and (2) instead of counts of matches, present the density of matches, where the area used is that of the 0.1-arcsec-wide annulus on the sky for each matching offset (i.e., radius) bin which works out simply to:
Using this, the background of true objects for the SDSS-to-APM/USNO-B matching exercise is revealed as shown in Figure A2 (zoom in to see well).
SDSS photometry is deeper than APM/USNO-B with twice the optical density on the sky; because of this, true SDSS doublets are plentiful. The priority here is to preserve true SDSS doublets equally with removing the duplicates, so the crossover point is where their frequency is equal on the sky, i.e., at y = 200 in Figure A2. This line is crossed between the bins of x = 3.3 and 3.4 (marked with the arrow), so the matching radius used in ASP processing was 3.35 arcsec, although the final matching processing was one-to-one.
Therefore, for each matching task, it was first necessary to do a many-to-one matching exercise to determine the true background and decide the limiting matching radius, and then to apply that matching radius in a one-to-one matching process which performed the actual de-duplication. The one-to-one matching preserves close doublets better than many-to-one does, thus enabling farther out matching, but I could not know how much farther, so I left it where it was.
Additional bar charts below show, for some of the matching tasks encountered in ASP processing, the count density profiles, the matching radius decided on, and any additional considerations described in the text thereon. The charts are best viewed at a high zoom.