1 Introduction

Retrieval of tropospheric ozone (O3) concentrations from satellites can be challenging given optical interferences such as clouds, aerosols and the stratospheric O3 layer. State-of-the-art spectrometers and interferometers on board current satellites ranging from the microwave to the ultraviolet bands of the electromagnetic spectrum are used to retrieve total column O3 (TCO) and vertically-resolved O3 mixing ratios (Beer et al. 2001; Goldberg et al. 2003; Levelt et al. 2006; Waters et al. 2006; Clerbaux et al. 2009; Liu et al. 2010). The signal collected by satellites from the lower-free troposphere and the boundary layer is small, resulting in coarse vertical resolution (~5 km) at these lower layers and increased uncertainty of the retrieved values (Bowman et al. 2002, 2006; Worden et al. 2004). Much work has been conducted to extract the tropospheric column O3 (TPCO) from the raw irradiances and/or TCO measurements (Fishman and Larsen 1987; Ziemke et al. 1998; 2006; Liu et al. 2006; Schoeberl et al. 2007). The validation of both TPCO and TCO amounts against independent datasets and/or model simulations have been extensive (Balis et al. 2007; Worden et al. 2007a; Kroon et al. 2008; Nassar et al. 2008; Osterman et al. 2008; Richards et al. 2008; Antón et al. 2009; Liu et al. 2010; Kroon et al. 2011). TES retrievals of vertically resolved O3 are generally biased high between 0 and 20 % in the troposphere when compared to ozonesonde measurements (Nassar et al. 2008; Osterman et al. 2008; Boxe et al. 2010), and Ozone Monitoring Instrument (OMI; also on board Aura) ozone profile retrievals are biased high between 20 and 60 % in the troposphere compared to ozonesonde measurements (Kroon et al. 2011). By combining satellite measurements of different spectral ranges (infrared and ultraviolet), improvements by a factor of two in the sensitivity to the lower tropospheric and boundary layer O3 have been observed and simulated (Worden et al. 2007b; Landgraf and Hasekamp 2007; Sellitto et al. 2011; Natraj et al. 2011; Zoogman et al. 2011). The inability of the current sun-synchronous, low-earth orbit satellite measurements to resolve O3 concentrations within the boundary layer has limited the application of satellite O3 observations for control and regulation purposes, the study of chemical and physical processes leading to the formation and transport of O3, and investigations of regional impacts of O3 to the health of humans and the biosphere.

Instrumentation, retrieval algorithms and data processing have become increasingly sophisticated, resulting in detection of plumes within the troposphere (Kar et al. 2010; Jourdain et al. 2007) and observations of impacts of O3 on the biosphere (Fishman et al. 2010). The previous studies have relied on either enhanced signals within plumes and/or temporal averaging to detect O3 mixing ratios within the troposphere, limiting the applications to higher frequency (i.e. diurnal) processes within the boundary layer.

Work has been conducted to quantify the correlation between O3 mixing ratios at the surface and partial column O3 amounts in the boundary layer (0–3 km) using ozonesonde datasets across North America (Chatfield and Esswein 2012). Correlation coefficients between 0.7 and 0.95 were observed. The applicability of these correlations to satellite retrievals assumes that satellite retrievals are sensitive to boundary layer atmospheric pollutants. The calculated averaging kernels used in the satellite retrievals within the boundary layer suggest that satellites are not sensitive to boundary layer pollutants (e.g. Bowman et al. 2006). Similar investigations within a more defined region over a shorter timescale can potentially reveal more information on the extent and processes behind vertical O3 correlations.

At present, we have only polar-orbiting ozone satellites with limited lower tropospheric detection. Thus, techniques that can extract near-surface information from partial columns within the mid-to-upper troposphere are highly desirable. In this work, we present a promising technique to expand TES capabilities using campaign data from a prototypical pollution regime. We present an investigation of O3 variability and vertical correlations of O3 mixing ratios and partial columns at two sites in the mid-Atlantic United States. Detailed comparisons of ozonesondes with in situ aircraft and satellite measurements are provided. The results from this study are then used to derive near-surface O3 amounts using existing satellite measurements from the Tropospheric Emission Spectrometer (TES). This work utilizes the extensive set of observations collected as part of NASA’s Deriving Information on Surface Conditions from COlumn and VERtically Resolved Observations Relevant to Air Quality (DISCOVER-AQ) campaign during July 2011 in the Baltimore, MD/Washington D.C. metropolitan area (Fig. 1). The campaign consisted of measurements from three aircraft, twelve ground stations, two free-flying and tether balloon systems, and a ship in the Chesapeake Bay (data publicly available at http://www-air.larc.nasa.gov/cgi-bin/arcstat-d). A subset of these measurements (described below), specifically over Edgewood, MD (EW; Lat: 39.4100, Lon: −76.2967) approximately 30 km northeast of Baltimore, MD and Beltsville, MD (BV; Lat: 39.0540, Lon: −76.8773) approximately 22 km northeast of Washington, D.C. provide a description of the composition and dynamics of the atmosphere. The respective proximities of these sites to the Chesapeake Bay (~3 km for EW and 38 km for BV) allow the investigation of marine layers on upper tropospheric and near-surface O3 correlations. The objective of this study is to use correlation statistics to improve methods of obtaining near-surface O3 from satellite retrievals. The approaches taken within this manuscript apply to an operational satellite, but with further refinements with more campaign statistics, can also enhance retrievals from planned geostationary satellites (NASA’s Geostationary Coastal and Air Pollution Events, GEO-CAPE; National Research Council 2007; Europe, Sentinel-4; Committee on Earth Observation Satellites 2009; Ingmann et al. 2012; Asia, Korean Geostationary Environment Monitoring Spectrometer, GEMS and Geostationary Mission for Meteorology and Air Pollution, GMAP-Asia; Akimoto et al. 2008; TEMPO, http://science.nasa.gov/missions/tempo/).

Fig. 1
figure 1

Site map of the NASA DISCOVER-AQ July 2011 campaign in the Baltimore, MD/Washington D.C. metropolitan region. Ground measurement sites of interest are noted by the yellow stars. An example flight track of the NASA P3-B aircraft is indicated by the white line, with circles representing locations of vertical profiles

2 Measurements

A total of 39 ozonesondes were launched at EW and 25 at BV during the campaign. The O3 instrument at EW comprised of an electrochemical concentration cell (ECC; Droplet Measurement Technologies, DMT) containing 1 % solution by mass of potassium iodide (KI) in deionized water and fully-buffered (0.625 % sodium phosphate and 2.5 % potassium bromide by mass; Komhyr 1986). The DMT sonde is manufactured to the same specifications as the ENSCI-2Z (Environmental Science Corporation) sondes flown in prior Maryland soundings (Thompson et al. 2007; Yorks et al. 2009). At BV, a DMT ozonesonde and a 0.5 % KI, half-buffered solution was used. The nominal uncertainty of O3 measurements in the troposphere is 5 % (e.g. Deshler et al. 2008), but the uncertainty is dependent on the electrochemical solution strength and ambient pressure. Data were collected at 1 Hz as the balloon ascends and the data were transmitted in real time to a ground station using an attached radiosonde (International Met Systems, iMet-1 at EW and Vaisala, RS-92 at BV). The radiosonde provides pressure, temperature, humidity, wind speed and direction, and georeference information. The pressure, temperature and humidity on the radiosonde are accurate to within 0.5 hPa, 0.2 °C and 5 %, respectively. The almost daily ozonesonde launches coincided with aircraft vertical profiles (discussed below) during flight days and the approximate overpass of the Aura satellite (~13:45 LT) on non-flight days. The mean height at apogee was 31.7 km with an average ascent time of 99 min. Maximum distances of the balloon to the respective ground station ranged from 19 to 88 km with a median of 47 km. The distance between the site and the balloon typically peaked at the top of the troposphere.

Ozonesonde data were validated against coincident aircraft profiles within the boundary layer and the lower free troposphere. The NASA P3-B, a four-turboprop engine aircraft, conducted 45 profiles over EW and 43 profiles over BV during the DISCOVER-AQ July 2011 campaign. The average minimum and maximum altitudes over which the aircraft flew over EW were 0.3 km and 3.6 km, respectively. Profiles over BV were between 0.3 km and 1.7 km. The aircraft profiling depth over BV was limited by airspace regulations. A typical spiral of the aircraft was 5 km in diameter and each profile took on average 14 min. On the P3-B, O3 is measured in-situ via the chemiluminescent reaction it undergoes with excess nitric oxide (NO; Ridley et al. 1992). A 2 standard cubic centimeters per minute (sccm) flow of pure NO is added to the 480 sccm flow of ambient air, and the photons produced are counted using a photomultiplier tube cooled with dry ice. The approximate response time is 1 s and the data are reported at 1 Hz. Over the course of the DISCOVER-AQ 2011 campaign, the instrument was calibrated on the ground three times against a Teco Model 49PS calibrator, and the calibration was stable to a nominal 3 %. Given this, and the intrinsic uncertainty in the calibrator, we estimate an overall uncertainty of 5 % in the chemiluminescent measurement of O3. For the levels of O3 typical of the DISCOVER-AQ campaign, the precision in the 1-s values is in the range 0.2–0.4 ppbv, increasing with mixing ratio.

Ozonesonde measurements of total column, partial column and mixing ratio profiles at EW were compared with nadir-viewing TES observations on board the Aura satellite (Beer et al. 2001). The level 2 (version 5) O3 products, which consist of global survey and special observation swaths of O3 profiles and total and tropospheric O3 column amounts, were used for these analyses. A total of 11 global surveys were conducted during the campaign. Three special observations occurred within the region during the campaign, focusing on BV, EW and Fair Hill, MD (another DISCOVER-AQ ground site ~50 km northeast of EW, Lat: 39.7014; Lon: −75.8600). The swaths have a horizontal resolution at the surface of 5.3 km × 8.4 km. Ozone mixing ratios are retrieved for 88 pressure levels including the surface with retrievals approximately every 600–800 m. The higher number of tropospheric retrievals from TES and higher sensitivity of the instrument to O3 within the troposphere compared to similar satellite products from OMI favors using TES data for these analyses. Pixels from TES that were within 150 km and 3 h of ozonesonde launch locations were defined as coincident and used for the comparison. These coincidence criteria are stricter than those of previous studies (e.g. Worden et al. 2007a). A total of 3 TES overpasses were defined as coincident and used in this analysis: one global survey and two special observations (stares) over EW and BV. These satellite overpasses were coincident with three ozonesondes launched at EW and two ozonesondes at BV. A total of 28 TES scans were analyzed since the special observations contained between 13 and 14 scans per overpass. The scans were filtered using the reported retrieval master quality flag in addition to the “Ccurve” flag, which represents cases of unrealistic retrievals. Three scans on each edge of the special observation overpasses were not included in the analysis (J. Worden, personal communication). The filters reduced the number of scans analyzed to 12. The average reported precision of the total and tropospheric column O3 amount from TES for the data used are 1.1 % and 5.4 %, respectively, with biases in the tropospheric mid-latitudes between 10 and 20 % (Nassar et al. 2008; Boxe et al. 2010) .

3 Ozonesonde corrections

Uncertainties in the O3 measurement from the sondes are a function of altitude typically ranging from 5 to 10 %, however, previous studies have shown a high bias when using the 1 % KI, fully-buffered solution with ENSCI-2Z ECCs when compared to a UV photometer during controlled experiments (Barnes et al. 1985; Johnson et al. 2002; Smit et al. 2007; Deshler et al. 2008). Sensitivity tests of the O3 response show that the buffer concentration is the major contributing factor towards the high bias (Johnson et al. 2002), most likely due to side reactions involving hypoiodite and perphospate leading to an additional source of iodine (Saltzman and Gilbert 1959). Using a 1 % solution strength at Edgewood, MD allowed the opportunity to conduct field observations with coincident in situ measurements to identify the magnitude of the previously identified artifacts. Thus, O3 partial pressure data that were collected during DISCOVER-AQ 2011 at Edgewood, MD were corrected for this high bias by applying a correction based on the controlled experiments conducted as part of the Jülich Ozone Sonde Intercomparison Experiment (JOSIE) 1998 simulation (Smit et al. 2007). The correction presented below is independent of the background corrections and pump flow corrections summarized in Johnson et al. (2002).

The correction was calculated as follows: First, the relative bias profiles observed during JOSIE-1998 were averaged into 5 km vertical bins. The smoothed data were spatially interpolated corresponding to the measured altitudes of each ozonesonde using a cubic spline. The interpolated relative biases were multiplied by each O3 partial pressure measurement and subtracted from the actual measurement. Finally, the corrected partial pressures of O3 were converted to a volume mixing ratio using the in situ temperature and pressure measurements.

Differences between the coincident O3 measurements collected from ozonesondes and in situ measurements from the P3-B using EW and BV profiles were observed (Fig. 2). The corrected sonde and uncorrected sonde measurements from EW are both presented. The data are 100 m bin averages with uncertainties presented every 500 m for clarity. Uncertainties are calculated as ±1σ using bootstrap sampling of 10,000 points. The mean bias (MB) and coefficient of variation of the root-mean-squared error (RMSE cv ) averaged over the vertical range of coincident measurements of the uncorrected ozonesonde data from EW referenced to the aircraft measurements are 13.3 % and 13.6 %, respectively. With the correction, the MB and RMSE cv become 0.4 % and 9.6 %, within the combined uncertainty (~11 %) of the respective measurements. The BV site used 0.5 % KI, half-buffered solutions. The MB and RMSE cv for Beltsville ozonesonde profiles are 6.1 % and 6.9 %, respectively, in agreement with previous controlled simulations using 0.5 % KI, half-buffered solutions (Smit et al. 2007). Since the biases were less than the estimated combined uncertainty of the P3-B and sonde O3 measurements and since this correction is strictly to compensate for the previously observed artifacts associated with using a 1 % KI solution, the correction was not applied to the Beltsville sonde data. The response time of the ozonesondes are between 20 and 30 s, whereas the response time of the P3-B chemiluminescence ozone instrument is on the order of 1 s. Thus, a spatial lag between the ozonesonde and aircraft measurements based on the ascent rate of the balloon is expected. The spatial lag was calculated as the maximum of the un-scaled cross-covariance between the ozonesonde and aircraft measurements. Each ozonesonde flight was treated separately. The altitude of the ozonesonde measurement was adjusted downward using the mean ascent rate between the time of measurement (tm) and tm minus the lag time. When the lag of the ozonesonde measurements is taken into account and compared with the aircraft measurements, the MB and RMSE cv is reduced to 0.1 % and 8.3 %. Although the corrections increase the average uncertainty of O3 mixing ratios from ozonesondes throughout the entire ozonesonde profile from 8 % to 9 %, these results show that this correction, based on independent controlled simulations, is sensible for these measurements. The bias correction was applied to all EW ozonesonde measurements hereafter and the response time correction was applied to both EW and BV sondes.

Fig. 2
figure 2

Comparison of percent error between the coincident ozonesondes and P3-B ozone profiles at Edgewood, MD (black) during July 2011 using uncorrected (dashed) and corrected (for chemical artifacts, solid) data. Uncorrected data from Beltsville, MD is shown (gray). The Beltsville profile comparisons do not exceed 2 km due to the airspace limitations placed on the P3-B aircraft. Profile data are 100 m bin-averages. The error bars (presented every 500 m for clarity) represent ±1 standard deviations calculated from bootstrap statistics

There is potential for this correction to be applied in the post-processing of past ozonesonde measurements using ECC O3-detection methods and a 1 % KI solution, fully buffered. The challenges faced by long-term O3-monitoring networks such as the Global Atmospheric Watch (GAW), the Network for the Detection of Atmospheric Composition Change (NDACC) and the Southern Hemisphere Additional Ozonesondes (SHADOZ; e.g. Thompson et al. (2003) require a standardized approach to O3 measurement so that long-term trends can be accurately deduced (Smit et al. 2011). This correction provides the steps for one method of standardizing ECC-sonde types using the fully-buffered 1 % KI solution. It should be noted that the results of this correction are from one inter-comparison experiment extending only up to 5 km and further inter-comparisons, especially in the mid-to-upper troposphere are needed to determine if these methods can generally be extended to other locations and altitudes. Although the corrections as described here are used for subsequent analyses in this manuscript, we note that other correction algorithms have been proposed (e.g. Deshler et al. 2012).

4 Ozone profiles

Ozone profile shapes are similar over EW and BW (Fig. 3). A local maximum of O3 mixing ratios in the mid-to-top of the boundary layer (~2 km) is observed at both sites, presumably due to a combination of precursor availability, reaction time and surface loss processes. Above the boundary layer, O3 decreases due to the general lack of emission and transport of precursors into the lower free troposphere as inferred by steep gradients of species such as formaldehyde, carbon monoxide and nitrogen dioxide measured by the P3-B aircraft above both sites during the campaign (not shown). Ozone mixing ratios are relatively constant in the lower free troposphere up to 5 km. Above 5 km, the O3 mixing ratios increase with height, presumably due to vertical mixing between the troposphere and the high O3 concentrations of the stratosphere or contributions from O3 production chemistry stemming from upstream North American pollution.

Fig. 3
figure 3

Ozone mixing ratio vertical profiles from all ozonesondes (gray) launched during the DISCOVER-AQ July 2011 campaign over (a) Edgewood, MD and (b) Beltsville, MD. The mean vertical profile (black line) was calculated using 100 m bin averages

TCO was calculated using O3 partial pressure measurements from ozonesondes and corrected for the amount between the burst altitude and the top of the atmosphere using a combination of Microwave Limb Sounder and ozonesonde climatological data (McPeters and Labow 2012). The observed mean TCO during the campaign was 337 DU with a day-to-day variability of 6 %. The TPCO, with the troposphere defined as the lowest altitude of a 2 km thick layer having a lapse rate of 2 K km−1, on average composes 21 % (69 DU) ± 3 % (1σ) of the TCO and has a day-to-day variability of 21 %. The TPCO is significantly and positively correlated with TCO (r = 0.70, p-val < 0.01). The variability of partial column O3 amounts within the 1–3 km, 4–6 km, and 7–10 km sub-regions are 27 %, 23 %, and 32 % (2σ) over EW and 36 %, 26 % and 38 % over BV, respectively. As expected from the profile analysis, the variability is large within the boundary layer, decreases in the lower free-troposphere, and increases again in the mid-troposphere. Throughout the profile, the variability of the partial column O3 amounts is greater over BV compared to EW. The large variabilities in O3 partial columns during this one summer month, especially within the boundary layer, make forecasting this sub-region challenging, the implications of which are discussed below.

Five ozonesondes during the campaign coincided with three NASA Aura satellite overpasses. The TCO and TPCO measurements from the ozonesondes were compared to the same measurements retrieved from the TES (Table 1). For these analyses, the top of the troposphere was defined using the reported tropopause pressure from the Aura TES retrievals. The mean biases (ozonesondes as reference) of the TES TCO and TPCO are −3.2 DU (−1.0 %) and 0.2 DU (0.4 %), respectively. The coefficient of variations (RMSE cv ) of the TES TCO and TPCO are 5.2 % and 9.2 %, respectively. TES retrievals of TCO and TPCO over EW are generally lower than ozonesonde observations, while TES retrievals of TCO and TPCO over BV are higher than ozonesonde observations. This pattern over BV is also observed in comparisons of ozonesondes with TCO and trajectory-enhanced tropospheric ozone residual (TTOR) products from OMI. The results over EW contrast with previous validations of TES (Osterman et al. 2008; Nassar et al. 2008; Richards et al. 2008; Boxe et al. 2010), which find that TES column and profile retrievals, particularly in the troposphere, have a high bias between 0 and 20 %. These differences may be attributed, in part, to increased surface pollution at EW in July, to which satellite measurements are not sensitive and highlights the differences in observations between an intensive campaign such as DISCOVER-AQ and a more seasonal study. A comparison of TES O3 profile retrievals to coincident ozonesondes show that, qualitatively, the two types of measurements agree well (Fig. 4). The differences between the TES ozone profile retrievals to the a priori show that TES is responding to changes in radiances throughout most of the troposphere, although the responses vary vertically. The differences between the TES profile and the a priori profile are larger in the mid-troposphere (7–10 km) compared to the boundary layer (0–3 km), which is expected based on the limited information the satellite receives from within the boundary layer. In the case of July 14, the a priori seems to do a better job than the TES profile retrieval when compared to the ozonesondes.

Table 1 Comparison of total and tropospheric column amounts of ozone over Edgewood and Beltsville, MD using ozonesonde and TES observations
Fig. 4
figure 4

Coincident TES (gray line) and ozonesonde O3 mixing ratio profiles over EW (black line) and BV (green line) during DISCOVER-AQ in July 2011. The TES a priori O3 mixing ratio profile (black dashed) is shown for each overpass. The times (UTC) of the TES overpasses and the launch times of the ozonesondes are shown. Error bars on TES mixing ratio profiles represent 1σ of multiple scans during the stare observations. There were no coincident ozonesonde launches over BV on July 7

5 Upper air-surface comparisons

Using data from ozonesondes, correlations between upper air O3 and the near-surface O3 from EW and BV were calculated (Fig. 5). Data were averaged into 100 m vertical bins, the near-surface representing 0–100 m AGL and each layer was correlated with the near-surface. The correlation coefficient profiles show 3 laminae. Within the boundary layer (generically as <3 km), the correlation between layers aloft and the surface decreases approximately linearly with height at both EW and BV. The decrease of the correlation coefficient in the boundary layer at EW is greater than BV. In the free troposphere (3–10 km), the correlation coefficients are approximately constant with height, with an average correlation coefficient of 0.16 ± 0.09 (1σ) over EW and 0.50 ± 0.07 (1σ) over BV, the correlation over BV being significant at the 95 % confidence level (p-val = 0.02). Above 10 km, there is a decrease in the correlation coefficients and the coefficients become negative in the stratosphere. In the BV ozone correlation profile, O3 mixing ratios in layers aloft significantly (>95 % confidence) correlate with the O3 mixing ratios within the near-surface layer up to 10 km (Fig. 5 bold lines). In contrast, EW O3 mixing ratios in layers aloft significantly correlate with the O3 mixing ratios within the near-surface layer up to 3 km.

Fig. 5
figure 5

Vertical profile of correlation coefficients between layers aloft and the surface (0–100 m) over Edgewood (black) and Beltsville (green). Thicker lines indicate areas where correlations are significant based on 95 % confidence intervals (p-val < 0.05)

The steeper gradients and the lower values of the correlation coefficients over EW reflect the dynamic complexity of the site due to its proximity to the Chesapeake Bay, in agreement with past observations (Loughner et al. 2011; Stauffer et al. 2013). To illustrate, boundary layer heights were calculated for each ozonesonde over EW and BV using a modified lapse rate method (Heffter 1980). Potential temperature profiles were smoothed by calculating 50 m vertical bin averages. The boundary layer height was defined as the lowest layer to exceed a lapse rate of 7 K km−1 and the layer must be above 200 m. Ozonesondes that were launched during cloudy or rainy conditions, as determined by radiation measurements at each site and defined as having incoming total irradiance of less than 50 % of the maximum observed daily mean irradiance, were excluded from the analysis. Based on this criterion, one BV ozonesonde and five EW ozonesondes were excluded for the boundary layer height analysis. Boundary layer heights calculated from ozonesonde data using this method compare somewhat well (MB = −0.076 km) with an independent method using a mini micropulse aerosol lidar (Fig. 6a ). The boundary layer height is obtained from lidar measurements using a gradient technique, since aerosols often are trapped near the surface by the capping inversion layer (Davis et al. 2000; Brooks 2003). Figure 6b shows the difference in boundary layer heights over EW and BL as a function of time-of-day. A 4th order polynomial was fit to each dataset between 7 and 17 EST. At both sites, the boundary layer height increases throughout the day, as expected. However, boundary layer heights over BV are often higher and the variability of the boundary layer heights in the afternoon is less compared to over EW. The lower boundary layer heights at EW are the result of thermal internal boundary layers (TIBLS) induced from marine flow (Stauffer et al. 2013). TIBLs act to decouple surface air pollutant concentrations from those aloft, leading generally to the differences in the correlation coefficent profiles between marine (EW) and terrestrial (BV) observation sites (Fig. 5).

Fig. 6
figure 6

(a) Comparison of independent methods of calculating the boundary layer height at Edgewood, MD presented with the 1:1 line and (b) boundary layer height as a function of time of day over Beltsville, MD (green dots) and Edgewood, MD (black dots). A fourth-order polynomial was fit to the data over each site to show average diurnal patterns

Correlations between every 100 m layer were considered for EW and BV (Fig. 7). The boundary layer is more decoupled from the lower free troposphere over EW compared to BV, as expected from TIBL influences over EW. The correlations show three distinct layers over both sites where adjacent laminae are coupled, shown by the high correlation coefficient “triangles” adjacent to the diagonal centerline. These high correlation coefficients in these sub-regions suggest vertical mixing between those layers. The layer correlations were expected within two of the sub-regions: the boundary layer (0–3 km) and the upper troposphere (10–14 km). These layers are frequently perturbed by convective mixing and tropopause folding, respectively. The correlations within the mid-troposphere (7–10 km) is more unexpected since vertical motions within this layer are relatively weaker and occur over larger spatial and temporal scales, and the variability of O3 in the mid-troposphere is small compared to the boundary layer. Without more detailed analyses of case studies leading to the high correlations within 7–10 km, we can only hypothesize based on previous work that these high correlations are due to either: 1) strong convection during extreme events, 2) large-scale vertical mixing associated with the passage of synoptic fronts or 3) long-range transport (Cooper et al. 2006, 2007; Gettelman et al. 2011).

Fig. 7
figure 7

(a,c) Correlation coefficients between 100 m layers as measured by ozonesondes over Edgewood and Beltsville, MD. (b,d) P-values of the correlations at the respective sites with the significant correlation coefficients (pval < 0.05) shown in red

The upper boundary layer (1–3 km) O3 mixing ratios show significant and positive correlations with the mid-troposphere (7–10 km) O3 mixing ratios over both sites, with average correlations of 0.42 ± 0.09 (1σ) at EW and 0.43 ± 0.09 (1σ) at BV within the sub-region. Median p-values within the region are 0.01 (interquartile range = 0.00–0.03) and 0.03 (interquartile range = 0.01–0.07) at EW and BV, respectively. An analysis of the 95 % confidence intervals of the correlation coefficients calculated using a Fisher Transformation and associated z-statistical test show that most of the correlation coefficients between the mid-troposphere and near-surface are significant (Table 2). The significant correlations between the mid-troposphere and the near-surface are surprising especially at EW since the correlations between the near-surface and the lower free troposphere are not significant. However, this decoupling caused by TIBL influence show the complexity of the dynamics and the multiple dynamic and chemical processes occurring in the tropospheric column over a site. This discontinuity suggests that vertical transport is not a contributing factor for the correlations observed between the mid-troposphere and the near-surface. We speculate that synoptic phenomena such as available sunlight for photochemical processes coupled with the availability of precursors may be responsible for the significant correlations observed. Given the location of these two sites downwind of major urban areas (Baltimore and Washington D.C.) and the fact that the mid-Atlantic region acts as an exhaust pipe for much of the contiguous United States, we speculate the occurrence of ozone precursors such as peroxyacetyl nitrate (PAN), carbon monoxide and volatile organic compounds (VOCs). These ideas are beyond the scope of this manuscript and are the subject of ongoing investigations. Further analyses below attempt to exploit this correlation by deriving near-surface O3 information using partial column O3 retrievals from satellites in the upper troposphere.

Table 2 Confidence intervals of correlations observed at each site at specific quantiles between within the mid-tropospheric (7–10 km) and near-surface (1–3 km) vertical regions

6 Near-surface ozone prediction from partial column measurements

Analogous to the O3 mixing ratio correlation analyses described above, the partial columns of O3 in the mid-troposphere (7–10 km; MTPC) and the near-surface partial columns of O3 (1–3 km; NSPC) were compared using ozonesonde measurements (Fig. 8). Correlation coefficients between these sub-regions were 0.56 and 0.49 for EW and BV, respectively. Having considered the near-surface range between 1 and 3 km above the surface for both EW and BV rather than between 0 and 3 km above the surface increased the EW correlation coefficient by 17 % with no change in the BV correlation coefficient. Thus, the impact of TIBLs at the EW site dramatically impacts partial column correlations. The linear regression statistics are similar at both sites (Table 3), giving confidence that these methods can be extended to other sites in the mid-Atlantic.

Fig. 8
figure 8

Scatter plots of near-surface (1–3 km) partial column ozone and upper troposphere (7–10 km) partial column ozone using ozonesondes over Edgewood, MD (black triangles) and Beltsville, MD (green dots). A least-squares regression line was fit to the data over each site

Table 3 Regression statistics from linear regression of upper tropospheric (7–10 km) partial column ozone amounts on near-surface (1–3 km) partial column ozone amounts using ozonesonde observations

Further analyses were conducted to derive NSPC using MTPC as retrieved from TES using a regression model formulated from independent ozonesonde observations. The sensitivity and vertical resolution of TES is the largest in the 3–6 km layer as indicated by the averaging kernel profiles (Fig. 9). However, due to the dynamics at EW, no significant O3 correlations were observed between the near-surface and the 3–6 km layer, thus prediction of NSPC using satellite partial columns from 3 to 6 km was not considered for this study and mid-tropospheric (7–10 km) retrievals of TES were used. The averaging kernels binned into height layers also highlight the sensitivity limitations that satellites have within the boundary layer, further giving merit to the methods presented here. The NSPC derivations from TES were compared to coincident NSPC measurements from ozonesondes for validation. MTPCs (in Dobson units) were calculated from TES O3 and pressure profile retrievals using:

Fig. 9
figure 9

TES averaging kernel profiles from select altitudes, color-coded by altitude bin, during the global survey overpass on 7 July 2011

$$ p{c_i}=\frac{{\overline{{{N_i}{C_i}}}\varDelta z}}{{2.69\times {10^{20 }}}} $$
(1)

where N i is the number density in layer i (in molecules m−3), C i is the mixing ratio in layer i and Δz is the height of layer i (in m). In this case, layer i is defined between 7 and 10 km. Prior to the calculation of the MTPCs, the TES O3 and pressure profiles were first linearly interpolated every 500 m and pressures at each height were calculated using the hypsometric equation from the nearest TES retrieved pressure level below. The MB and RMSE cv of TES MTPC calculations referenced to ozonesonde measurements is −8.4 % and 9.9 %, respectively (Fig. 10). In general, TES partial columns in the mid troposphere underestimate the observations with a range between −13 % and 2 %. NSPCs derived from TES MTPC calculations using the linear regression coefficients presented in Table 3 for the respective sites differ from observed NSPCs from ozonesondes with a range of −1.3 to 0.9 DU (Fig. 10). The relative difference of the TES NSPCs ranges between −10 and 10 %, less than the observed variability in the near-surface during the campaign. All the derived NSPC values from TES are within 11 % of the ozonesonde observations. The MB and RMSE cv for the TES NSPC comparisons to ozonesondes are 1.5 % and 8.7 %, respectively. The uncertainty of the derived NSPC from satellites is less than the observed variability of partial column O3 amounts in that sub-region. These results show that by utilizing satellite information from the mid-troposphere, a region of the atmosphere in which satellites are more sensitive to changes in radiances and thus atmospheric composition, quantitative information can potentially be obtained with respect to atmospheric pollutants near the surface. The results obtained from this simple linear regression model and associated analyses are encouraging when considering the challenging task of retrieving near-surface O3 information from satellites.

Fig. 10
figure 10

Comparison of measured MTPC O3 from TES (blue squares) and ozonesondes (blue circles) during a total of 5 coincident TES overpass/ozonesonde launches. Additionally, the calculated NSPC from TES (red squares) and measured NSPC from ozonesondes (red circles) are compared. Error bars represent the propagated uncertainty caused by random errors

7 Summary and conclusion

NASA’s DISCOVER-AQ campaign in the Baltimore, MD/Washington D.C. region in July 2011 provided a unique opportunity to inter-compare ozone profiling instrumentation as well as investigate correlations between upper tropospheric and near-surface O3 partial columns and concentrations. Comparisons between in situ aircraft O3 profiles and ozonesondes within the boundary layer and the lower free troposphere have confirmed systematic errors in the ozonesonde measurement previously observed in controlled simulations. These systematic errors were corrected using independent methods, reducing the mean bias between the ozonesondes and the in situ aircraft to less than 1 %.

Variability within O3 profiles is largest within the boundary layer and in the upper troposphere and the O3 profiles at two mid-Atlantic sites (Edgewood, MD and Beltsville, MD) are similar. The day-to-day variability of O3 within the total column is small (6 %), but is quite large (21 %) when only the tropospheric column, making up ~21 % of the total column, is considered. The variability of NSPCs from ozonesondes is large (27–36 %).

Upper air O3 concentrations are significantly positively correlated to surface (0–100 m) O3 up to 10 km at Beltsville, MD, but only up to 3 km at Edgewood, MD. The influences of marine layers at Edgewood, MD decouple the surface layer from those above, thus complicating retrieval algorithms over this site and presumably other coastal sites. However, significant positive correlations between upper troposphere (7–10 km) and near-surface (1–3 km) O3 exist over both sites, potentially providing solutions through regression analyses for retrieving near-surface O3.

The results presented in this manuscript were tested using current polar-orbiting O3 profile retrievals from NASA’s TES instrument on board the Aura satellite, and validated against coincident ozonesonde profiles. Mid-troposphere partial columns calculated using ozonesonde and TES O3 profile measurements on average agree to within 20 %. Derived NSPCs, calculated from a linear regression model formulated from an independent ozonesonde dataset, agree with the ozonesonde observations within 11 %. Although the dataset used in these analyses are spatially and temporally limited, the results in this focused study show connections between the mid-troposphere and the near surface, which may not have otherwise been observed. These results are encouraging for obtaining near-surface O3 information using satellite measurements. The benefit of the techniques presented here is that they can be applied using existing satellite hardware and retrieval algorithms, as well as future geostationary satellite missions. The use of mid-tropospheric retrievals rather than retrievals from below 3 km has benefits: 1) TES is more sensitive within the mid-troposphere compared to the boundary layer and 2) the retrievals are less likely to be impacted by clouds. The temporal resolution of the TES measurements at a particular ground site and the focused temporal range of this study limited the number of data samples in which to compare to ozonesondes. These analyses are limited in that they include data from only 64 ozonesondes at two Mid-Atlantic sites during one month in the summer. However, the intensive nature of these campaigns provides a thorough understanding of the atmospheric composition and the processes involved. Work to extend these analyses to other locations and other seasons using long-term ozonesonde observation data from Wallops Is., VA, Boulder, CO and more recently completed DISCOVER-AQ campaigns is in progress.

This research identifies the importance of the mid-tropospheric region (7–10 km) in terms of resolving the near-surface O3 partial column amounts. Research to develop algorithms that better quantify the O3 in the mid-troposphere, which utilize multiple wavebands and independent statistical correlations with atmospheric variables, is essential (Moody et al. 2012). The continuation of these studies will reduce satellite retrieval uncertainties in the troposphere and allow future geostationary satellite missions to be better utilized.