The search for life on other worlds looms large in NASA’s 30-year strategic vision.1 Already, several mission concept studies are either completed or underway that would use a larger than 8-m aperture UV-optical-IR space telescope equipped with a coronagraph or starshade to characterize potentially habitable exoEarths (e.g., ATLAST, HDST, LUVOIR; the acronyms stand for: LUVOIR = Large UV-Optical-IR Surveyor,1 ATLAST = Advanced Technology Large Aperture Space Telescope,2–4 and HDST = High Definition Space Telescope.5). Alternatively, smaller starshade-based Habitable-Exoplanet Imaging Mission concepts exist.6 All would benefit from better visible and near-IR (VISIR; to ) detectors than exist today. Moreover, because of different overall system design considerations, different solutions may turn out to be optimal depending on whether a mission is coronagraph or starshade based. Our aim in this article is to discuss a short list of technologies that we believe to be potentially capable of biosignature characterization for either coronagraph or starshade missions.
Once a rocky exoplanet in the habitable zone has been found, biosignature characterization will be the primary tool for determining whether we think it harbors life. Biosignature characterization uses moderate resolution spectroscopy, , to study atmospheric spectral features that are thought to be necessary for life, or that can be created by it (e.g., , , , , ). We discuss these biosignatures in more detail in Sec. 2.1 and the spectral resolution requirements for observing them in Sec. 2.2. Even using a very large space telescope, biosignature characterization is extremely photon starved. Ultra low noise detectors are needed, and true energy resolving single-photon detectors would be preferred if they could be had without the vibration that is associated with conventional cryocoolers.
Our aim in this article is to provide an introduction to the detector needs for biosignature characterization, and some of the emerging technologies that we believe hold promise for meeting them within the next decade. The technologies fall into two broad categories: (1) low noise detectors (including “photon counting”) that are compatible with passive cooling and (2) true energy resolving single-photon detectors that require active cooling.
We draw a clear distinction between photon counting low noise detectors and single-photon detectors. A photon counting detector is able to resolve individual photons, although the detection process still adds significant noise. For example, many kinds of photon counting detectors have significant dark current and spurious charge generation at the ultra low flux levels that are encountered during biosignature characterization. A single-photon detector, on the other hand, provides essentially noiseless detection of light. Noise in the single-photon detectors discussed here manifests as an uncertainty in the energy of a detected photon rather than an uncertainty in the number of photons.
The low noise detectors include electron multiplying charge coupled devices (EMCCD) for the visible and HgCdTe photodiode and avalanche photodiode (APD) arrays for the near-IR. With targeted investment, we believe that all can be improved beyond today’s state of the art. Section 3 describes a low risk but evolutionary payoff route to improving these existing noncryogenic detectors for use with conventional spectrographs. One advantage of this approach is that it completely retires the risks, cost, and complexities associated with a cryocooler. The disadvantages include increased noise and the need for dispersive spectrograph optics.
The single-photon detectors that we discuss are based on thin superconducting films and operate at . Cryogenic cooling is required to achieve these temperatures. In return for cryogenic cooling, single-photon detectors promise noiseless (in the conventional astronomy sense), nearly quantum-limited photon detection with built in energy resolution. The built in energy resolution offers the tantalizing prospect of nondispersive imaging spectrometry, thereby eliminating most spectrograph optics. In this article, we focus on two single-photon detectors that have already been used for astronomy and that offer the potential for multiplexing up to sufficiently large formats. These are microwave kinetic inductance device (MKID) arrays and transition-edge sensor (TES) microcalorimeter arrays. Section 4 discusses a path forward using single-photon detectors that offers the potential for nearly quantum-limited detector performance and nondispersive imaging spectrometry if the cooling challenges can be met.
Cryogenic cooling in the context of LUVOIR brings its own challenges. High performance space coronagraphs require tens of picometers wavefront error (WFE) stability. This extreme stability is incompatible with the vibration from existing cryocoolers. Since ultra-low vibration cooling is a necessary prerequisite to using cryogenic single-photon detectors on a LUVOIR, we briefly describe a few preliminary concepts for achieving it in Sec. 5. Although vibration will undoubtedly present challenges in starshade missions, too, we believe that the coronagraphic LUVOIR represents a challenging “worst case” for cooler design studies.
In the interest of brevity, we have limited discussion to a fairly short list of detector technologies that are either already under development, or that we view as particularly promising. One could easily add other technologies to those that are discussed. For example, scientific CMOS arrays have a wide consumer base and potentially provide subelectron read noise with better radiation tolerance than CCDs because no charge transfer is required. Superconducting nanowire single-photon detectors may provide another route to cryogenic single-photon detectors that, while not energy resolving, would still promise essentially noiseless detection. As the field matures, it may be desirable to revisit these and other technologies. Although the need for essentially noiseless detection is clear, no existing technology currently fulfills all of the needs.
Why Better Detectors Are Needed
Spectroscopic biosignature characterization places some particularly challenging demands on VISIR detector systems. Many of these derive from the extraordinarily low flux levels (Sec. 2.3). In the case of superconducting detectors, achieving sufficient energy resolution and photon coupling efficiency are also challenges. (This paper’s focus is on VISIR detectors for biosignature characterization. For other science programs, a general purpose LUVOIR would benefit from better detectors across its full 90 nm to “stretch” wavelength range, including the UV. We refer the interested reader to Bolcar et al.,7 for a discussion of some of these other detector needs.)
For the most likely potential exoEarths, biosignature characterization will be used to study spectral features that are thought to correlate with biological activity. Figure 1 shows how the earth would appear if it were to be seen as an exoplanet. To make this spectrum, Turnbull et al.8 observed the night side of the moon and solved for the earth’s contribution as it would appear to a distant observer. We define a likely life “detection” as consisting of (1) a rocky planet, (2) with water vapor, (3) a primary biosignature, and (4) a confirming biosignature to rule out false positives.
Lacking a confirming biosignature, one could attempt to increase the statistical significance of a result by resolving the temporal dependence of a feature. Arguments for a biological source could be strengthened by placing a detection in a more comprehensive geological and astrophysical context by measuring other atmospheric gases including CO, , , and characterizing the host star’s energy distribution.
Among confirming biosignatures, is particularly important because it is difficult to simultaneously maintain significant concentrations of , , and . Nonequilibrium concentrations are most straightforwardly explained by biological processes. The feature at is unfortunately blended with . There is stronger feature between 3 and and a still stronger feature at about . [These longer wavelength lines would present other challenges, including reduced angular resolution (for a fixed aperture) and potentially increased thermal background.] The spectrum shows a few other features, notably and . Although these features do not provide as much information as the primary and secondary biosignatures, they can still be useful, especially when no confirming biosignature is available.10 For a thorough discussion of several false positive mechanisms and their spectral signatures, the interested reader is referred to Schwieterman et al.10
Figure 1 omits one important near-UV biosignature. There is a strong bandhead at 260 to 350 nm. This bandhead is so strong and wide that it can potentially be characterized by imaging in a pair of filters. Because our focus here is on spectroscopic biosignature characterization, we defer discussion of these (potentially imaging) near-UV detectors to a future publication. Bolcar et al.7 discuss detectors for this application in slightly more detail (see especially his Table 6).
Finally, the earth’s atmosphere has not always been as it is today, and it is conceivable that other atmospheres may harbor life.11 For these reasons, we should be open to the possibility of having to characterize several spectral features in order to understand how likely an exoplanet is to harbor life. Having the best detectors possible will maximize our chances of success.
Required Spectral Resolution
Several authors have studied the spectral resolution requirements for biosignature characterization.9,12,13 Their recommendations vary depending upon the spectral features of interest and the model assumptions. As can be seen from Fig. 1, important features include , , , and . All have absorption features in the VISIR and are important to terrestrial life. Consistent with Brandt and Spiegel,9 we have adopted as a challenging but probably still achievable biosignature upon which to base our VISIR detector requirements because it is the narrowest feature on this list. An instrument that can characterize can also characterize and , and potentially other features including and under the right conditions (see, e.g., Ref. 13).
With regard to the spectral resolution value, Des Marais et al.12 reported that to 72 was well matched to at 760 nm and (see their Table 1). This was based on a theoretical model that included the Earth’s current atmospheric temperature structure, but that allowed for different chemical abundances. More recent studies have recommended that somewhat higher resolution is desirable for . For example, when Kaltenegger, Traub, and Jucks (KTJ)13 modeled the evolution of the expected spectra of the Earth and its biosignatures over geological timescales and found that to 136 was optimal for observing in the visible. They furthermore concluded that higher resolution, to 244, would be desirable to observe in the near-IR. More recently, Brandt and Spiegel9 recommended that was probably adequate for in the VISIR. This is also consistent with the line width that we measured from Turnbull et al.’s “earthshine” spectrum (Fig. 1). Our requirement represents a working compromise between the still evolving scientific understanding and the practicalities of developing flight hardware. As the field matures, it may be necessary to revisit this requirement.
Strawman ATLAST detector needs.
|Bandwidth||0.4 to (need) 0.4 to (goal)a,b|
|Spurious count rate||Small compared to dark current|
|QE||over bandpass (conventional)|
|over bandpass (energy resolving)|
|Spectral resolution||at (energy resolving only)|
|Other||Rad-hard, minimum 5-year lifetime at L2. Noncryogenic operation strongly preferred by ATLAST.|
If were not required, then lower resolution could be tolerable depending on the scientific objectives. For example, KTJ found that to 11 would be sufficient to characterize in the VISIR throughout Earth’s evolution and that to 5 would be sufficient for during specific epochs. However, KTJ’s recommended range was wide, and they concluded by recommending to 325 to detect , , , and in the VISIR. To better understand the full biosignature trade space with regard to spectral resolution, we refer the interested reader to Refs. 9, 12, and 13.
Photon Starved Science
Once the light from the host star has been suppressed, the remaining light from the exoplanet and its zodiacal cloud will be feeble at best. To put the photon arrival rate into better perspective, consider a simple toy model consisting of: (1) a perfect coronagraph, (2) a 25% efficient integral field unit (IFU) spectrograph, (3) a observing wavelength, (4) pixel size , (5) , and (6) a background that is the earth’s zodiacal light. With these assumptions, the background count rate is . More sophisticated models that include the effects of imperfect coronagraphs and simulated exoEarths reach the same conclusion: biosignature characterization is extremely photon starved.14
The preceding calculation assumed a nonenergy resolving detector behind a conventional IFU spectrograph. Use of an energy-resolving single-photon detector would eliminate the need for spectrograph optics and increase the count rate per pixel by about a factor of (see Appendix for the derivation). Under these conditions, the count rate would be about per energy resolving pixel.
Strawman Detector Needs
Table 1 shows the detector requirements that were used for NASA’s recently completed ATLAST study.7 We adopt these as the basis for further discussion. We are aware of additional desirable characteristics. For example, a starshade-based habitable exoplanet mission might benefit from response further into the IR than the ATLAST team considered. We have tried to note these other needs as they come up.
Taken collectively, the “requirements” of Table 1 enable characterization of a few dozen exoEarth candidates during an approximately five-year LUVOIR-like mission. Stark et al.14 provide a good overview of the mission yield modeling.
The quantum efficiency (QE) requirements merit further discussion. To achieve reasonable exoEarth yields, all mission concepts that we are aware of assume no QE penalty compared to today’s best EMCCDs and HgCdTe IR arrays.2,5,14 They also assume megapixel class detector arrays paired with conventional spectrographs. If an energy resolving detector were to be used, then the spectrograph optics could be greatly simplified and the optical throughput would go up. For this reason, lower QE can be tolerated with a single-photon detector than with a conventional detector.
Improving Today’s State of the Art
The most mature VISIR detector candidates are semiconductor based. These include silicon EMCCDs for the visible and HgCdTe photodiode. APD arrays for the VISIR. EMCCDs, HgCdTe hybrids, and HgCdTe APD arrays are attractive because of their comparative maturity, low risk, and the possibility that their performance might be “good enough” for biosignature characterization, even if they do not function as single-photon detectors.
For use in space, radiation tolerance is a major consideration. Existing e2v EMCCDs may not be sufficiently radiation tolerant (“rad-hard”) for future biosignature characterization missions. Current generation e2v EMCCDs were designed for use on the ground and are based on an n-channel CCD architecture for which phosphorus is the dopant. We discuss how the radiation tolerance of EMCCDs could be enhanced in Sec. 3.1.
Teledyne’s HxRG photodiode arrays, like JWST’s H2RGs and the closely related WFIRST H4RG-10s (WFIRST’s H4RG-10s still require radiation testing. However, based upon our knowledge of the components, we expect the radiation tolerance to be similar to that of JWST’s H2RGs although the degradation rate in pixels per year may differ on account of the smaller pixel size.), are radiation tolerant. JWST testing has shown that H2RGs experience graceful degradation in pixel operability, whereby to 3% of the pixels per year will degrade to the extent that they no longer exhibit full science performance. Although the affected pixels no longer meet full flight specification, they can still be useful for many things, and in any case, only a small percentage of pixels are affected at the end of JWST’s nominal five year mission. The radiation tolerance of HgCdTe APD arrays is to be determined, but in any case, the failure modes that are seen in n-channel CCDs do not apply.
Better Electron-Multiplying Charge-Coupled Devices
e2v EMCCDs are widely regarded as the most mature detector technology for visible wavelength biosignature characterization today. For this reason, pixel e2v CCD201s have been selected for the WFIRST coronagraph’s imaging camera and integral field spectrograph. Harding and Demers et al.15 describe the extensive trade study that led to this selection. When new and not degraded by radiation, EMCCDs are close to meeting the needs for biosignature characterization.
Unfortunately, a major concern with the current e2v EMCCD design for biosignature characterization on LUVOIR or a Habitable Exoplanet Imaging Mission is radiation-induced performance degradation. This may include decreased charge transfer efficiency, increased clock-induced charge (CIC), and decreased pixel operability. Moreover, the subelectron read noise that EMCCDs enable has the potential to reveal other damage that is ordinarily buried in the 2 to read noise of conventional CCD systems. Although ongoing work at JPL should retire these concerns for WFIRST,15 the demands of future biosignature characterization missions will be more challenging. For future missions, it would be wise to apply e2v’s known radiation hardening fabrication processes to EMCCDs and to explore other rad-hard detector concepts.
Existing e2v EMCCDs use gate oxide designs that were intended to maximize manufacturing yields for less demanding ground-based applications. They trade radiation tolerance in exchange for lower manufacturing cost. The oxides are thicker and of a different composition to those that are used in radiation hardened CCDs. Radiation hardened oxides can reduce the flat-band voltage shift from to (Si) in standard devices to (Si) in devices fabricated using radiation-hardened oxides.16
It would also be desirable to explore design enhancements for reducing CIC. CIC is strongly dependent upon clock amplitude. In the CCD201, there are boron implants beneath two of the phases that build in the electric fields that are needed for inverted operation. These inherently represent a compromise. Making them stronger increases well depth, but it comes at the expense of CIC when reading the CCD out on account of the higher voltages that are required. For biosignature characterization, it could be worthwhile to explore implant designs that aim to trade well depth for improved CIC.
Thinner and different oxide designs may also be beneficial for reducing CIC, which Janesick attributes to hole detrapping near the silicon/oxide interface.17 Burt et al.16 attribute some of the performance degradation that is seen with radiation dose to depassivation of the silicon surface under the oxides, and moreover suggests that thinner oxide layers should result in less depassivation. If true, one might reasonably expect to see less CIC degradation in parts that use thin versus thick oxide layers.
Ultimate Limits of HgCdTe Photodiode Arrays
is today’s most mature material for astronomical near-IR instruments. By adjusting the mole fraction of cadmium, , it is possible to tune the cutoff wavelength from about out to 5 to while still achieving a performance that enables low background space astronomy. HgCdTe arrays have substantial heritage for NASA astronomy. The Hubble Space Telescope has operated both NICMOS and H1R HgCdTe arrays. Teledyne H2RGs are used by all of JWST’s near-IR instruments and by Euclid. Teledyne H4RG-10s are planned for WFIRST. The read noise floor of existing HgCdTe photodiode arrays is a few electrons rms per pixel. When cooled sufficiently, the dark current of today’s cutoff flight grade HgCdTe arrays already achieves the that is needed for biosignature characterization.
The source-follower-per-detector architecture of Teledyne’s HxRG arrays has been used since the late 1980s. To achieve significant improvement in noise, it is necessary to understand exactly where in HxRG arrays read noise and dark current originate and why. Studies that aim to separate the contributions of the photodiodes, resistive interconnects, ROIC source-followers, and SIDECAR controllers would be beneficial.
For example, if it were to be found that noise in the resistive interconnects were to be important, than further work aimed at reducing the interconnect resistance and/or lower operating temperature could be beneficial. On the other hand, if noise from the pixel source-follower were to be important, than further refinement of this circuit might be justified. The first step is careful characterization of existing HxRG detectors (i.e., JWST and WFIRST spares) aimed at building an itemized noise budget and understanding how environmental parameters like operating temperature affect performance.
Another area where improvement might be possible is persistence. Persistence, or latent charge, is charge that accumulates and is trapped during an exposure only to be released as an undesirable ghost signal in a subsequent exposure. Persistence is modulated by charge traps or electrically active defect states in the HgCdTe. Design and process improvements that aim to reduce the defect density, or that aim to build in electric fields that repel charges from areas of high defect density, could be beneficial for reducing persistence.
HgCdTe Avalanche Photodiode Arrays
HgCdTe APD arrays are a promising technology that initially entered astronomy for comparatively high background applications including adaptive optics and interferometry18 and wavefront sensing and fringe tracking.19 More recently, they have been used at the telescope to provide diffraction-limited imaging via the “lucky imaging” technique.20 Although HgCdTe APD arrays have been made by DRS, Raytheon, and Teledyne, those made by Selex in the UK are the focus of most attention in astronomy now.
A group at the University of Hawaii has been evaluating Selex SAPHIRA for applications including low background astronomy.20 With an appropriately optimized process, the HgCdTe itself is potentially capable of the same QE performance as the JWST arrays. (JWST’s H2RG’s achieve from 0.6 to and from 1 to 21) Moreover, because gain is built into the pixels before the first amplifier, they promise photon counting and potentially even single photon detection if “dark current” can be reduced to acceptable levels.
“Dark current” is the most significant obstacle to using Selex APD arrays for ultra-low background astronomy today. The to gain corrected “dark current” that has been reported20 is almost certainly dominated by glow from the ROIC. The ROIC that is used in current devices was not optimized for ultra-low background or even low background astronomy. Work continues at the University of Hawaii to try to disentangle ROIC glow from more fundamental leakage currents in current generation APD arrays. On the longer term, work is also underway aimed at optimizing the ROIC design.
Although HgCdTe APD arrays hold out the promise of read noise below that which can be achieved using conventional photodiode, like conventional photodiodes, there will ultimately be a leakage current noise floor that is determined by thermally activated defect states in the HgCdTe. However, it is likely that today’s performance is still far from that floor, and more work is needed to better understand the full potential of HgCdTe APD arrays for ultra-low background astronomy in the context of missions like LUVOIR.
Maturing Energy Resolving Single-Photon Detectors
Today’s EMCCDs, HgCdTe hybrids, and HgCdTe APD arrays are not single-photon detectors in the context of biosignature characterization. All would add significant noise and thereby reduce mission exoEarth yields below what could be achieved with a noiseless detector. On the other hand, superconducting MKID and TES arrays already function as single-photon detectors today.
The use of these superconducting detectors by LUVOIR is contingent upon the development of ultra-low vibration cooling (Sec. 5). However, even if superconducting detectors are found to be impractical for LUVOIR, their nearly quantum-limited performance could still be very attractive for a starshade-based Habitable Exoplanet Imaging Mission.
Introduction to Superconducting Proportional Detectors
Although proportional detection of photons is not widely used for VISIR astronomy today, it has a long history in x-ray astronomy, where gas proportional counters and Ge and Si diodes have long been standard detectors. These charge-based detectors are proportional in the sense that their response to light is proportional to photon energy. Although they provide an easy to measure signal, they suffer from noise sources that make high resolution spectroscopy impossible. In both cases, only part of the signal goes into ionization, so there is an unavoidable partition noise. This problem can be addressed in two ways, either to move to a much smaller gap in the detection system or to collect the energy into a gapless system, such as a thermal distribution of phonons and/or electrons.
The small gap solution leads us to superconducting detectors, superconducting tunnel junctions, or kinetic inductance detectors, which measure the quasiparticle excitations produced by the absorption of a photon. The superconducting gap is about a factor of 1000 smaller than typical semiconductor bandgaps, so the expected energy resolution at 6 keV improves from about 130 eV for Si diodes to 2.7 eV for Nb superconducting detectors.22 An alternative approach is to let all of the deposited energy thermalize and then measure the temperature of the system. This is a quasiequilibrium system, and it is a robust measurement technique. For x-ray detection, the uncertainty of energy measurements easily reaches limits set by the thermodynamics of the system, and with proper design, is never limited by the ability to thermalize the photon energy.
Thermal sensors simultaneously detect individual photons and use the thermal signal to measure photon energy. For the designs discussed here, the minimum photon energy is well separated from the system noise, so the probability of dark events is near zero, and the read noise manifests itself as the limit to the energy resolution of the system. In the following sections, we will discuss the performance of cryogenic proportional photon detectors, TES microcalorimeters (Sec. 4.2), and MKIDs (Sec. 4.3) as VISIR spectrometers.
Transition-Edge Sensor Microcalorimeter Arrays
In a microcalorimeter, the energy of an absorbed photon is determined from the temperature rise of the detector. The energy resolution of such a detector is set by thermodynamic noise sources in the detector and amplifier noise. The microcalorimeter concept and performance limits are presented by Moseley, Mather, and McCammon (1984)23 and Irwin et al. (1995).24 Tutorial articles on the principles of operation and optimization of microcalorimeters and superconducting TESs provide a detailed discussion of these devices in the linear regime.25,26 Since these devices work near equilibrium, it is generally possible to design detectors that operate very near these fundamental limits.26–28
When a photon is absorbed by the detector, the temperature rises on a short-time scale, of order the sound crossing time. The output signal will rise on a time scale set by the electronic time constant of the detector/amplifier combination. The detector will rise to a maximum temperature , where is the lumped heat capacity and is the deposited energy. A simple microcalorimeter, modeled as a lumped heat capacity and thermal conductance, , will have a single-pole response, with a time constant . Under bias, the response is sped up by electrothermal feedback to , where is the Joule power in the detector and is the detector temperature.
The energy resolution of a microcalorimeter scales as , where is the temperature, is the Boltzmann constant, and is a unitless measure of the sensor’s resistance sensitivity to temperature.23 Elsewhere in this article, represents spectral resolution. Here, represents resistance. This resolution limit assumes that we remain in the linear response range and uses an optimal detection filter based on the system noise and signal shape. The thermodynamic performance does not depend on the choice of ; it can be used to minimize other nonoptimal effects in the detector, such as slow thermalization.
A typical design for such a detector would set the heat capacity for a given value of to allow saturation to begin just above the highest energy of interest. Having chosen this value of , the resolving power of the linear system can be improved only by lowering the operating temperature.
The first demonstration of a VISIR microcalorimeter was presented in 1998 by Cabrera et al.29 In this work, they built a tungsten TES microcalorimeter square with a transition temperature of 100 mK. The thermal conductance of these detectors was set by their internal electron-phonon coupling, so they were deposited on a substrate, requiring no additional thermal isolation. These devices were operated in the VISIR spectral region and provided an energy resolution of 0.15 eV. This was significantly in excess of the naïve prediction, but a more complete analysis that we have done, including the current dependence of the resistance and athermal phonon loss to the substrate, can account for most of the excess noise. Both of these terms can be significantly reduced by design optimization.
In considering these detectors as candidates for the characterization of exoplanet atmospheres, we need to explore the paths to achieving the resolving power that is required. This optimization must include the efficient coupling of the detectors to the optical photons as well as providing the required energy resolution. The coupling design may be as simple as absorption by a matched film, as in the case of Cabrera et al.,29 or may require an antenna structure to couple to optically small detectors. We explore three optimizations (Table 2) and compare the system resources required in each case. For the natural time constant row in Table 2, we have assumed a tungsten TES with and determined by its electron system and electron–phonon coupling, respectively.
Three optimizations for TES microcalorimeters.a
|Parameter||Case I: Fully linear; C/α chosen to saturate at highest in-band photon energy, T0 lowered to allow required resolution.||Case II: This saturates at lowest photon energy and uses the Fixsen et al. algorithm to extract energies of saturated events.||Case III: This saturates at 0.2× the lowest photon energy and uses the Fixsen et al. algorithm to extract energies of saturated events.|
|5.1 mK||22.8 mK||114.1 mK|
|1.3 s||14.3 ms|
These designs provide possible paths to practical detectors for biosignature characterization. More specifically, they potentially provide the noiseless detection of photons with intrinsic resolving power . By operating in the nonlinear regime, we should be able to reach the required performance at , temperatures already demonstrated by ADRs designed for space (see Sec. 5). We furthermore believe that the near equilibrium operation should allow us to closely approach the predicted performance.
Microwave Kinetic Inductance Devices
This section briefly reviews MKIDs in general, and in particular, describes what makes a VISIR MKID. We consider the state of the art in the context of MKID device physics, and paths for improving energy resolution, QE, and pixel count to achieve the performance goals specified in Table 1.
An MKID detects absorption of photons in a superconductor by a change in kinetic inductance. A current in a superconductor stores energy both in the magnetic field and in the kinetic energy of the charge carriers (Cooper pairs). The former corresponds to the geometric inductance of normal conductors, while the latter represents additional, kinetic inductance. For a photon to be absorbed, its energy must exceed twice the superconducting gap energy (the binding energy per electron in Cooper pairs) in order to create unpaired electrons (quasiparticles). A reduced number of Cooper pairs requires the remaining Cooper pairs to move faster to transport the same current, thereby increasing kinetic inductance. While typical MKID material contains on the order of 1 to 10 million Cooper pairs per cubic micron in its superconducting state, the temporary destruction of even one Cooper pair is possible to detect if a small volume inductor is combined with a (distributed or lumped) capacitor to form a low-loss superconducting resonator with resonance frequency . An exceedingly small change in inductance can be sensed as a shift in a resonance frequency, measured with a commercially available microwave amplifier.
In typical MKID materials, the superconducting transition temperature is of order 1 K, for which the energy gap is 0.15 meV, and the minimum frequency for photon absorption is 74 GHz (). Absorption of a single VISIR photon creates a large number of quasiparticles ( to ) in proportion to the photon energy. The MKID operating temperature and inductor volume can be made small enough so that the number of thermally generated quasiparticles is nil. MKIDs are then energy resolving detectors with zero dark count rate.
In addition to high sensitivity, MKIDs have a natural means of multiplexing: large numbers of superconducting resonators, tuned to slightly difference resonance frequencies, can be connected in parallel by injecting a comb of microwave carrier waves on one transmission line, and read out by one microwave amplifier. Systems have been demonstrated for simultaneous readout of up to 4000 MKIDs.31
Owing to sensitivity at long wavelengths, and the ability to multiplex tens of thousands of detectors, there has been a wealth of work directed at the potential of MKIDs for sensitive detection of FIR to mm-wave radiation. In pushing toward single-photon sensitivity in this spectral region, the fundamental noise contributions in MKIDs have been studied extensively, and there has been much innovation in optical coupling schemes (see e.g., Refs. 32 and 33). Here, we discuss a few of the differences in approach relevant at the higher photon energies in VISIR MKIDs.
The MKID signal is a change in the amplitude and phase of a microwave carrier tone propagating on a transmission line weakly coupled to the MKID resonator via some auxiliary coupling capacitance or inductance. The amplitude and phase shift is proportional to the number of quasiparticles produced by absorption of a photon, , where is the photon frequency and is the efficiency. MKIDs have two fundamental sources of noise: (1) quasiparticle generation-recombination (G-R) noise and (2) microwave amplifier noise. Quasiparticles are generated not only by optical absorption, but also by thermal fluctuations (e.g., thermal phonons). The quasiparticles fluctuate in number as they are constantly being generated and then recombining into Cooper pairs at a rate, specific to the material, which increases in proportion to the density of quasiparticles. As the temperature or optical load increases, the increasing population of quasiparticles both adds noise34 and increases microwave dissipation (lowering the quality factor ). Maintaining a sufficiently high is required both because it lessens the importance of amplifier white noise, but also because the number of detectors that can be multiplexed within the amplifier bandwidth is proportional to (or, more precisely, to the effective , , where is the microwave carrier frequency and is the optimally-filtered pulse detection bandwidth determined by the and the noise sources).
Other nonfundamental sources of MKID noise exist [e.g., fluctuations in resonator capacitance from two-level-systems (TLS) in disordered native oxides or other dielectrics present] and are the subject of active research. However, G-R and amplifier noise, and the associated effects on , determine some general aspects of MKID optimization. For photon-counting capability, there is a maximum inductor volume that gives the desired energy resolution and a detector speed faster than the photon arrival rate. There is also a minimum volume that keeps the under optical loading high enough for practical multiplexing. Figure 2 illustrates these design constraints applied to MKIDs, made from TiN, with parameters similar to those of the state-of-the-art ARCONS VISIR MKID detectors.35
In Fig. 2, the -axis is chosen to be the mean optical power absorbed. At high powers, the quasiparticles produced from one photon are still present in significant numbers when the next photon arrives. At low powers, the MKID recovers to nearly zero quasiparticle number between photons; in Fig. 2, this corresponds to the curves of fixed energy resolution flattening and becoming independent of optical power (photon rate) at sufficiently low power levels. The maximum photon rate for this regime is not as fast as one would expect given that the initial time constant for decay of the quasiparticle number is typically quite fast () because the decay is not exponential. The rate of recombination events is proportional to the square of the number of quasiparticles present, yielding a mean number of quasiparticles at time given by , where is the recombination rate constant for the material and is the detector volume. While we have carried out Monte Carlo simulations of the time-domain MKID waveforms for random photon arrival streams, the simpler approximate treatment represented by Fig. 2 gives energy resolution and bandwidth results sufficiently accurately for this discussion.
For long-wavelength applications, one goal is generally for MKID sensitivity to reach the photon-noise background-limited noise equivalent power, or to count millimeter-wave photons with resolving power . However, the goal for VISIR MKIDS is much higher . ARCONS MKIDs achieve for . While Fig. 2 indicates the observed resolving power is about what is expected, more detailed detector models than used here in making Fig. 2 lead the ARCONS team to conclude that their resolving power is a factor of two less than expected due to a peculiarity of the microstructure of TiN films that gives a photon-absorption-position dependent responsivity. Additionally, resolution is expected to improve when using parametric amplifiers (currently under development) to gain a large () improvement in amplifier noise temperature compared with HEMT amplifiers. However, improving VISIR MKIDs to faces a significant challenge in that there is statistical noise on the quasiparticle creation process. Absorption of a photon well above the gap frequency initially forms a pair of high energy quasiparticles, which decay in energy by emitting phonons sufficiently energetic to break additional Cooper pairs. In the cascade of quasiparticles and phonons produced, the fraction of the original photon energy lost to the substrate by phonons is variable. Consequently, the energy resolution is subject to the Fano statistics limit: , where and is the Fano factor.22 The energy resolution will be additionally degraded if hot phonons escape to the substrate prior to completion of the cascade process to convert the photon energy into low energy photons and quasiparticles.36
Using , and say, , one finds the Fano limit is at , and at . Figure 3 shows the change in inductor volume to reach the Fano limit at , assuming the switch to a parametric amplifier. For a LUVOIR optical power (0.05 aW for energy resolving detectors at 0.1 cps, instead of 20 aW for ARCONS ground based background at 75 cps), the Fano limit goal requires the detector volume be reduced from 70 to . The volume reduction will start to push the MKID response into a nonlinear regime but yields the necessary sensitivity. In this regime, the instantaneous shift in resonance frequency and during the pulse are both large but evolve in tandem to give an approximately constant phase shift in the microwave carrier for some time. The energy of the photon is then not just encoded in the pulse amplitude but in the pulse duration. (This saturation of the pulse amplitude is similar to the mode of TES operation advocated in the previous section.) Keeping the ARCONS TiN thickness value of 60 nm, the required area for the inductor needs to shrink from to . Optical coupling schemes are discussed below; however, while it may be possible to still couple optically to this inductor size, the Fano limit seems to preclude attaining the LUVOIR resolving power goal at the long wavelength end at . However, depending upon the spectral features of interest, this may be acceptable since Fig. 1 appears to suggest that high spectral resolution is most important for and features in the visible. It would be helpful if atmospheric models could be used to better define the required spectral resolution as a function of wavelength throughout the VISIR.
One path to circumventing the Fano limit and hot phonon escape is to fabricate the MKID on a suspended membrane structure, potentially changing it from a pair-breaking detector to an equilibrium thermal detector in which the MKID serves only as a thermometer sensing the temperature rise of the membrane. Such thermal MKID (TKID) devices have already been made and tested for x-ray microcalorimetry.37 Arrays of MKIDs on one common silicon membrane, rather than individual membranes, have also been demonstrated.38 The membrane design reduces the stochastic variation in detected energy at the cost of slowing the response time, but, for the low photon count rates in biosignature characterization, that may be acceptable.
Optical coupling to VISIR MKIDs efficiently over a wide wavelength range presents a greater challenge than typical for long-wavelength MKIDs. The large variety of coupling schemes developed for long-wavelength MKIDS fall into two categories: (1) transmission line coupled, and (2) direct absorption coupled. In the first category, radiation is collected by an antenna and guided to the MKID by a superconducting transmission line (made of a higher gap superconductor than the MKID). The MKID is designed to act as a resistive termination (at frequencies above its gap) that matches the characteristic impedance of the optical input line. For VISIR MKIDs, the optical frequencies are above the gap of any superconductor, so this approach cannot be used. In the second category, the MKID material is directly illuminated by means of lenses or placement inside a waveguide. In the long-wavelength case, the optical frequency is well above the superconducting gap frequency, but far below the inverse of the Drude scattering time. The thin MKID film then acts as a sheet resistor with a real-surface impedance equal to the DC value (typically tens of ohms/square) seen in the normal (nonsuperconducting) state. By appropriate choice of the index of refraction of the (transparent) substrate and use of a back-short, highly efficient optical coupling to the MKID can be achieved over a fractional bandwidth of 30% or more. At the much higher frequencies in the VISIR case, MKID materials exhibit a more complex dielectric function. Figure 4 shows the surface impedance at VISIR frequencies for two examples of MKID films, molybdenum nitride and thin aluminum, which we have used at NASA Goddard. The real part of the impedance is not frequency independent, and the imaginary part is not small. This is typical of all MKID materials, including TiN, NbTiN, PtSi, and WSi. An optical efficiency near 100% can be designed in some narrow frequency range by forming an optical cavity involving the MKID layer, its substrate, and auxiliary metal or dielectric films; however, it seems a complex task to achieve efficiency simultaneously over 0.4 to . Additional complications are (1) the MKID films are not necessarily thin compared to the optical wavelength, (2) one of the favored substrates, single crystal silicon, has its semiconducting gap in the frequency range of interest, and (3) amorphous dielectrics associated with TLS may add noise. The TiN MKIDs in ARCONS absorb 70% of the light at , but only 30% at , and microlens arrays are used to focus the light onto the small inductors.35 More than one MKID design may be needed in biosignature characterization focal planes to efficiently couple photons from 400 nm to . For LUVOIR, this may not be a significant penalty because the coronagraph itself will have limited bandpass, perhaps 10%, as a consequence of needing to achieve a starlight suppression ratio. Nevertheless, decreasing MKID inductor size (to increase spectral resolution) and improving absorption efficiency are important challenges for VISIR MKID development.
Ultra-Low Vibration Cooling
Overview of Coolers for
Stored cryogen systems have been used in the past to provide cooling to observatories and instruments with near zero vibration, but they are impractically massive for missions with lifetimes greater than 5 years and have largely been replaced by mechanical cryocoolers. Cryocoolers are far lighter and have lifetimes limited primarily by their control electronics.
While there are many types of closed cycle cryocoolers, they generally share several common elements. All use a working fluid, typically helium, and have a compressor at the high temperature end, followed by a heat exchanger, where the heat of compression is rejected to a radiator. All have a heat exchanger where the heat from the coldward flowing gas is rejected to the warm-ward flowing gas. In the case of alternating flow (ac) systems, such as Stirling cycle or pulse tube coolers, this is called a regenerator; in the case of continuous flow coolers, such as turbo-Brayton or Joule–Thomson (JT) coolers, this is called a recuperator or counterflow heat exchanger. Finally, in all systems, gas is expanded by various means and then enters a heat exchanger, where heat is absorbed at the operating temperature.
Linear Compressor Cryocoolers
Almost all flight cryocoolers launched to date are based on linear motor driven piston compressors with noncontact clearance seals. These devices, originally developed in the 1970s and 80s at Oxford University, have virtually unlimited lifetimes. They also have inherently high vibration at their operating frequency, typically 20 to 70 Hz, which unfortunately is in a range that often contains important telescope and instrument structural mode frequencies. Many flight cryocoolers use a second, coaligned piston and control electronics to provide active vibration cancellation along the axis of motion, but cancellation is imperfect, partially because the piston force couples into other degrees of freedom. In most of these coolers, the regenerator and the expansion piston (or pulse tube) are mounted together in a single unit with the compressor, which must be mounted directly to the instrument. In linear piston driven JT coolers, the alternating flow of a compressor is rectified with a set of reed valves. This scheme is used on the JWST/MIRI instrument and the Astro-H/SXS instrument. The resulting flow, after being cooled by a separate cryocooler, can then be piped many meters to a remote expansion valve. Although the inherent noise of the flow in the line and in the expansion valve is low, the lines must be directly coupled to the circulating compressor and the cryocooler, and transmit their vibration to sensitive parts of the observatory.
Low Vibration Cryocoolers
Because of the known problems with the vibration from linear-piston cryocoolers, alternative coolers with much lower exported vibration force in the critical 0 to 200 Hz band have been developed. Two examples are JT expansion coolers with sorption-based compressors (called sorption coolers here) and reverse Brayton cycle coolers using miniature turbine compressors and expanders (called turbo-Brayton coolers here). In both cases, the flow is continuous rather than oscillating, and the compressors can be mounted meters away from the instruments.
In the turbo-Brayton cooler [Fig. 5(a)], a motor-driven turbine works on the gas at the warm end, compressing it, and a turbine-driven generator extracts work from the gas as it expands at the cold end. Because expansion in the turbine ideally approaches an isentropic process, the reverse-Brayton cycle has inherently high efficiency. The turbines are very small devices, of order several millimeters, that operate at very high rotational frequency, typically 10 kHz for the compressor and 3 kHz for the expander, which is far above the critical structural mode frequencies for a large telescope, such as JWST. The turbines float on self-actuated gas bearings, and are thus noncontact devices, so the lifetime of the cooler is typically limited only by the rad-hardness of its electronics. A single-stage turbine can produce only a relatively modest compression ratio, especially in helium. This can be offset to some degree by connecting multiple compressors in series, but the compression ratio is typically modest compared to other coolers and requires a rather large, sophisticated, very high efficiency recuperator. The turbo-Brayton system can have multiple turbo-expander stages that can absorb heat at multiple temperatures. While low temperature radiators (in addition to the main warm radiator following the compressor) will improve system efficiency, the cooler can be made to operate without them, and so can have a relatively modest impact on the spacecraft configuration.
Sorption coolers [Fig. 5(b)] are driven by sorption compressors, which are simply beds of material which absorb gas at low temperature and emit it at high temperature. To reach the temperatures of interest here, at least two stages will be needed. For a hydrogen upper stage, the process is typically chemisorption, where the gas reacts with a metal such as LaNiSn to form metal hydrides. For a helium low temperature stage, this process is typically adsorption onto a highly porous material such as carbon. While chemisorption compressors operate around ambient temperature, carbon sorbents must operate at low temperature (), and so require a radiator on the cold side of the spacecraft. At least two beds are required. At any time, one is cold and absorbing gas at low pressure, while another is warm and emitting gas at high pressure. Although the process is inherently cyclical, buffer volumes and careful control of cool down and warm-up rates can smooth out pressure fluctuations. As the sorbent beds switch from cooling to warming and vice-versa, check valves keep the gas moving in one direction. These are the only moving parts, and they open and close only with the frequency of the heating and cooling of the sorbent beds, which is well below any structural mode frequency for a large telescope, although valve actuation does produce a small impulse with broad frequency content. Compressors can be staged, and relatively high compression ratios can be achieved, so a relatively simple recuperator can be a used. At the low temperature end, gas expands isenthalpically through a simple JT valve. Isenthalpic expansion provides no cooling in an ideal gas, so prior to reaching the expansion valve, it must be cooled well below its region of ideal behavior. For a system capable of absorbing heat below , a helium cooler would be needed, which will require precooling with a hydrogen stage. The hydrogen stage would need precooling with a set of staged low temperature radiators, with the coldest radiator at . Thus, such a cooler would have a significant impact on the spacecraft configuration.
Low vibration coolers have been used in at least two important astrophysics missions. A turbo-Brayton cooler was installed on the HST/NICMOS instrument during servicing mission 3B to replace a solid nitrogen dewar that had failed. The cooler was a single-stage device that used neon as a working fluid and provided cooling at 73 K to the NICMOS detectors.39 Once the cooler reached steady state, it had no detectable effect on HST image quality. A hydrogen sorption cooler was used on Planck to provide cooling at to a linear compressor-driven helium JT cooler.40 Its compressor operated between 270 and 460 K, and a three-stage V-groove radiator was used to provide precooling. Operation of the sorption cooler caused no detectable noise, although any signal from the sorption cooler would have been minuscule compared to that of the linear compressor. Since these missions, there have been additional advances in low vibration cooling systems. Notably, Breedlove et al.41 demonstrated a two-stage turbo-Brayton system that provides 236 mW of cooling at 10 K, and Burger et al.42 demonstrated a hydrogen/helium sorption cooler that provides 5 mW of cooling at 4.5 K.
The effectiveness of cooling by the expansion of helium gas drops off rapidly below 1 K, and other physical phenomena must be used to reach deep subKelvin temperatures. In terrestrial laboratories, where power is effectively free, and gravity provides a natural separation of the -rich and -poor phases of a liquid mixture, dilution refrigerators are most commonly used to reach temperatures as low as 0.002 K. Dilution coolers are based on the entropy of mixing of in . An open-cycle dilution refrigerator was developed by the Grenoble group for cooling the HFI detectors on Planck. The device used and stored at room temperature in four large high pressure tanks. The gas was precooled by the three-stage radiator, the sorption cooler, and the helium JT cooler before reaching the dilution refrigerator, where it provided several hundred nanowatts of cooling at 0.1 K. The gas lasted 29 months. The same group is working on a closed cycle dilution refrigerator that relies on surface tension to separate the phases, but so far, they have not demonstrated a complete system that will operate without gravity.
The Goddard Space Flight Center has developed magnetic coolers or adiabatic demagnetization refrigerators (ADRs) for lifting heat from milliKelvin temperatures in a 0-g environment. Magnetic cooling is based on manipulation of the entropy of paramagnetic compounds with a magnetic field. Because of their unfilled d and f subshells, many rare earth and period 4 transition metal ions have magnetic moments and have states, where is the total angular momentum quantum number. In the limit of small interaction between ions, these states are degenerate, so the associated entropy is , which at temperatures below 10 to 15 K for most materials is far larger than other entropy terms. Applying a magnetic field breaks this degeneracy and suppresses the entropy. At sufficiently low temperature, the interaction between ions also acts to align the moments and suppress entropy. As the material approaches its ordering temperature, magnetic entropy drops sharply. Figure 6(a) illustrates this behavior; the solid curves are the entropy as a function of temperature at several values of applied field.
The rectangle labeled A-B-C-D shows the ideal ADR cycle. In a single-stage ADR, the paramagnetic compound sits in the bore of a superconducting solenoid, thermally connected to a thermal reservoir at the warm end through a heat switch. In process A, the field is ramped to maximum with the switch closed, so the heat of magnetization is dumped to the reservoir. In process B, the heat switch is open, so the paramagnetic material is isolated (or adiabatic), and the temperature drops isentropically as the field is reduced until the desired operating temperature is reached. In C, the field is reduced slowly at a rate that generates cooling only sufficient to cancel the heat input from the low temperature load, until the field reaches 0. Finally in D, the field is ramped rapidly up to the reservoir temperature, at which point the heat switch is closed and the cycle repeats. Note that this is a Carnot cycle, so that in the limit of ideal operation, ADRs have maximum possible thermodynamic efficiency.
NASA Goddard Space Flight Center has built three flight ADR systems. Two were nearly identical single-stage coolers for the XRS instrument on Astro-E and E2. They lifted heat from the detector array at 0.060 K to a liquid helium tank at 1.3 K. The ADRs had a hold time of 33 h at the detector operating temperature, and had a 1-h recycle time. Astro-E2 successfully reached orbit, and the ADR worked flawlessly until the liquid helium ran out. The third device is a three-stage ADR for the SXS instrument on Astro-H. It has multiple operating modes. In nominal mode, it lifts heat from the detector array at 0.050 K to a liquid helium tank at 1.3 K. In this mode, the hold time is 49 h, and the recycle time is only 0.75 h. Once the helium runs out, the system provides continuous cooling to the empty tank at 1.5 K and also cools the detectors to 0.050 K, although with reduced hold time.
As array sizes of low temperature detectors scale up, so does the low temperature heat load. For standard ADRs, maintaining long hold times requires scaling up the ADR system proportionally. The continuous ADR (CADR) circumvents this limitation.43 A CADR is a multistage ADR adapted so that the first (coldest) stage stays at the detector operating temperature. For half of its cycle, this stage operates normally, absorbing heat from the detectors through a controlled ramp-down of its field. However, as its field approaches zero, the second stage is brought down to a temperature below the operating temperature, and the heat switch is closed. The first stage must then magnetize to maintain the operating temperature, and in this way transfers the heat it has absorbed to the second stage. As the field approaches maximum, the heat switch is open, the first stage starts demagnetizing, while the second stage magnetizes up to a higher temperature and transfers its heat to the third stage. The process can be cascaded, with heat transferred to higher temperature stages, and finally to the heat sink, presumably a cryocooler. The most obvious benefit of the CADR is that operation is continuous, so there is no interruption of science data taking. Perhaps, more importantly, because operation is continuous, detector operation and ADR operation are decoupled, and the stages can be cycled much more rapidly. Since the same heat is lifted with each cycle, increasing frequency increases cooling power per unit mass.
While a four-stage CADR lifting heat from 0.035 K to has been demonstrated, raising the heat sink temperature will enable its use with turbo-Brayton coolers, and greatly ease integration with sorption coolers. Although magnetocaloric materials will operate effectively above 10 K, compact superconducting magnets made from standard NbTi/copper composite wire cannot reach sufficiently high fields when operating above . Compact, low current magnets based on composite wire can provide sufficient field while operating above 10 K.44 Recently, a simple ADR stage based on such a magnet has demonstrated heat lift from 4 K to 10 K. With some additional effort, such a stage could be integrated into a CADR that provides heat lift from to greater than 10 K. It is also possible to design a CADR to lift heat from temperatures significantly below 35 mK with proper choice of materials.
Suitability of Coolers for a LUVOIR Mission
One well developed concept for a LUVOIR mission was ATLAST.45 ATLAST was based largely on extensions of JWST, and because of the similarity, results of detailed structural and optical modeling for JWST provided useful estimates of ATLAST parameters. To meet its science goals, ATLAST required wavefront stability of 0.01 nm over 10 min. Feinberg et al.46 considered the sensitivity of WFE to disturbances. Sensitivity is worst in the 20 to 65 Hz band containing the tip-tilt modes of the primary mirror segments. Using results from JWST deployed dynamics modeling, they showed that substantially better isolation from the momentum wheel assembly disturbances would be required and argued that this could be achieved using a noncontact linkage between the spacecraft bus, including the sunshield, and the telescope. For both the turbo-Brayton and sorption coolers, the compressors could be mounted on the spacecraft side. In both cases, exported disturbances would be far less than those of the momentum wheel assemblies. However, flow lines, heat exchangers, and expansion valves (for the sorption cooler) or expansion turbines (for the turbo-Brayton cooler) would need to be mounted on the telescope structure.
The MIRI JT cooler has similar flow lines, heat exchangers, and an expansion valve. Using the same deployed dynamics model, the JWST team examined the sensitivity of WFE to disturbance caused by turbulent flow in the MIRI cooler. Using computational fluid dynamics, they derived the power spectral density (PSD) of force inputs at the various mounting points to the structure. These are bounded by . The resulting WFE, integrated up to 200 Hz, is . Thus, to be a small part of the ATLAST WFE budget, these disturbances would need to be reduced by at least three orders of magnitude. Similar computational fluid dynamics calculations were recently done to determine the noise generated by various components of a turbo-Brayton cooler. While the worst noise generators, such as step changes in cross-sectional area, would be avoided in an ultra-low vibration cooler, even a relatively minor obstacle, an over-penetrated weld in a straight pipe, produced a force PSD of .
One potential way to achieve extremely low vibration levels during exoplanet observations would be to use a thermal storage device to provide cooling, and simply switch off the cooler during this period. For example, a reservoir of evaporating liquid helium could absorb heat from the instrument during observations, and the gas could be collected in a tank. Between observations, the cooler could be turned on to reliquefy the gas. However, heat loads on the instruments could be high. For example, in the ATLAST concept, the telescope structure which surrounded and supported the instrument was controlled at ambient temperature. This, combined with the expected long observation period (up to days), means that such a thermal storage unit would need to be large and heavy. Since it operates with a limited duty cycle, the cooler would also need to be larger and heavier, and its required power is correspondingly larger. Furthermore, at the 10 pm WFE level, it may be difficult to mitigate the impact on dimensional stability of switching between two modes, one in which the cooler lines are drifting up in temperature, and one in which they are cold. A better approach, and one in line with the overall architecture of ATLAST, would be to develop the technology to allow the cooling system to be maintained in a steady state.
Advancing turbo-Brayton and sorption coolers so that the exported vibrations are in the single-digit will require a significant technology investment. Careful design and fabrication of the entire flow path to eliminate any sharp changes in curvature and including the line stiffness in the computation of forces will likely lead to more than an order of magnitude reduction. However, to reach the desired levels, it may be necessary to ensure laminar flow with no regions of flow separation throughout the flow path. Laminar flow without separation is a completely steady state, so in principle should produce no vibration. However, flow lines will need to be substantially larger to keep the Reynolds number below the critical value. Other modifications may also be necessary. At the expansion valve outlet in hydrogen sorption coolers, the fluid is typically in two-phase flow, which is generally noisy. It may be necessary to avoid this, although it will impact performance. For turbo-Brayton coolers, imbalance in the turbo expander rotor causes a disturbance at the rotational frequency of the turbine. With current rotor balancing technology, the disturbance amplitude is typically hundreds of millinewtons. Although the impact of a force input at these high frequencies is less well understood, clearly a very large isolation factor is required. One possibility may be to follow the approach of Aldcroft et al.,47 who designed and built a six stage, six degree-of-freedom isolator with at least 250 dB of attenuation in the desired frequency band.
ADRs have no moving parts and are generally considered to be zero-vibration devices. However, the stresses in the magnets cycle up to fairly high levels as the fields ramp up and down, and it will be necessary to determine if this generates any disturbances at the relevant level. This points out an important technology need: experimental techniques for detecting extremely low disturbance forces. Such techniques will be necessary for other telescope components.
We have discussed a broad suite of detector and cooling technologies for biosignature characterization using future space observatories, such as LUVOIR and the Habitable-Exoplanet Imaging Mission. For easy reference, Table 3 summarizes some of these technologies and their challenges with reference to the state-of-the-art.
Summary of where further work is desirable.
|EMCCD||(i)||Radiation tolerance||Acceptable degradation after 5 years at L2||Space radiation tolerance not a design consideration for existing EMCCDs||Radiation tolerance can be improved by applying known design techniques. Rad-hard alternatives should be considered for risk mitigation.|
|(ii)||Spurious count rate||(1) May be affected by radiation hardening|
|Ultra-low vibration cooler||(i)||Reduce vibration PSD||(1) Enables use of MKID and TES detectors|
|(2) Laminar flow system studies desirable as first step toward cooling without this operational constraint|
|HgCdTe photodiode array||(i)||Dark current||Better enables more science||0.001||(1) LUVOIR would be strongly detector limited with existing HxRGs|
|(2) Incremental improvement is possible. Detailed characterization of existing H2RGs and H4RGs for LUVOIR is desirable as a first step|
|(ii)||Total noise||Better enables more science||…|
|(iii)||Persistence||Better enables more science||Varies. A typical requirement that is often met is in the first exposure following saturation||Persistence is highly dependent upon detector design, detector implementation, operating environment, and observing strategy. Any improvement will be beneficial.|
|HgCdTe APD array||(i)||Dark count rate||(gain corrected)||(1) The state-of-the-art is almost certainly ROIC glow|
|(2) Further characterization of existing APD arrays for LUVOIR is desirable as a first step|
|MKID array||(i)||Improve energy resolution||at||at 400 nm|
|(ii)||Improve photon absorption||from 400 nm||70% at 400 nm 30% at 1000 nm||Meeting need may require MKIDs tuned to specific bandpasses|
|TES array||(i)||Improve energy resolution||at||at 400 nm at||Characterization of existing VISIR TESs is a desirable first step|
|(ii)||Improve photon absorption||from 400 nm||N/A||Existing VISIR TESs not optimized for high absorption efficiency|
For EMCCDs, improving radiation tolerance is arguably the greatest need. As is discussed in Sec. 3.1, radiation tolerance was not a design consideration for current generation EMCCDs. One should not be surprised to see the radiation-induced performance degradation that is typical for n-channel CCDs in space (e.g., charge transfer efficiency degradation), and other artifacts that may be revealed at subelectron noise levels (CIC is one example, but surprises are also possible). For LUVOIR and/or a Habitable Exoplanet Imaging Mission, we believe that it would be wise to apply known CCD radiation hardening design features and fabrication processes to EMCCDs.16 For risk mitigation, it may also make sense to explore similar detector architectures that promise greater radiation tolerance.
It would also be desirable to improve CIC in EMCCDs, for which the current state-of-the-art is already close to “good enough” when new. For CIC, we believe that incremental improvements in operation and design hold good promise for meeting the need on the relevant timescale.
There is still some room from improvement in near-IR photodiode arrays similar to the HxRGs that are being used for JWST, Euclid, and WFIRST. Although the current architecture seems unlikely to function as a single-photon detector, significant incremental improvement (perhaps factors of two to three reduction in read noise) may be possible. A reasonable first step would be detailed characterization of existing HxRGs aimed at separating out the different contributors to the noise (photodiode, resistive interconnect, pixel source follower, other amplifiers, etc.). Near-IR APD arrays like those made by Selex may also be promising if the “dark current” can be reduced to . The gain corrected “dark current” of current devices is almost certainly dominated by ROIC glow, but there may still be significant work required to go from the to-be-determined leakage current of these devices to the that is needed.
Superconducting MKID and TES arrays already function as single-photon detectors and both have already been used for VISIR astronomy. Use of these technologies by LUVOIR is contingent upon developing ultra-low vibration cooling. If ultra-low vibration cooling is available, then the challenges for both MKID arrays and TES microcalorimeter arrays are similar. Higher energy resolution and better photon coupling efficiency are needed. If ultra-low vibration cooling is not available, then we believe that MKID and TES microcalorimeter arrays may still be attractive for a starshade-based Habitable Exoplanet Imaging Mission because they would offer nearly quantum-limited performance.
With specific regard to MKID arrays, further work should include the development of VISIR MKID arrays with designs targeting the energy resolution and optical efficiency required for biosignature characterization. Several areas of investment will be needed. One expects significant resolution improvements over the state-of-the-art in the near-term from the development of broadband parametric amplifiers with nearly quantum-limited sensitivity, and from switching to MKID materials with greater uniformity in thin-film properties that will eliminate position-dependent broadening of the measured photon energy. In addition, reaching Fano-limited energy resolution will likely require designs that reduce VISIR MKID inductor volume by a factor on the order of 30 from current devices designed for the optical background in ground-based instruments, while at the same time managing to improve optical efficiency. Achieving high enough optical efficiency over the broad LUVOIR bandwidth will be challenging given the nonconstant, reactive complex resistivity of MKID materials at VISIR frequencies. However, even achieving the Fano-limit with currently favored MKID materials (transition temperature ) will not be sufficient to reach biosignature characterization goals. Either VISIR MKIDs (and cooling systems) will need to be developed with lower (operating ) in order to give a better Fano-limit, or else effort will be needed to optimize the TKID (membrane) style of detector in order to circumvent the Fano limit for VISIR MKIDs.
Both MKIDs and TESs require ultra-low vibration cooling for use in a LUVOIR. For a Habitable Exoplanet Imaging Mission, the vibration requirements may be less stringent. For these technologies to be viable in all biosignature characterization mission architectures, we recommend the development of prototype technology for ultra-low vibration coolers. As a first step, studies are needed to examine the feasibility of a laminar-flow system, including a detailed computational effort to determine whether flow separation can be avoided. Once feasibility has been established, the most immediate need will be for techniques that can be used to verify the computational models in prototype components at the required nN levels.
Count Rate of Energy Resolving Versus Conventional Pixels
In Sec. 2.3, we assert that if an energy resolving detector was to be used for nondispersive imaging spectrometry, then the count rate per pixel would be about the count rate per pixel of a conventional IFU spectrograph. The order of magnitude derivation is as follows.
Stark et al.14 studied space observatory exoEarth yields to suggest lower limits on telescope aperture size. Their study required them to model the performance of both a conventional IFU spectrograph and a nondispersive imaging spectrograph. Table 4 lists their key assumptions.
|10 m||Telescope diameter|
|steradians||Solid angle subtended by photometry aperture|
|OWA||213 mas||Outer working angle|
|Central wavelength for spectral characterization|
Following Stark, the photometer aperture, , maps onto 4 energy resolving pixels in the nondispersive imaging spectrometer. The required number of energy resolving pixels is therefore,
In the IFU implementation, the photometer aperture maps onto four lenslets. Stark furthermore maps each lenslet onto six conventional pixels, three in the spatial dimension by two in the spectral dimension, yielding 24 pixels per spectral resolution element. He assumed a 20% bandpass and per spectral “channel,” yielding 240 conventional pixels per photometric aperture. In this article, we have adopted as being better matched to characterizing . Following Stark, but requiring , yields 480 conventional pixels per photometric aperture. With these assumptions, Eq. (1) becomes
If we assume that the overall throughput is about the same in the two implementations, then the same light is being spread over more pixels in the conventional IFU spectrograph than in the nondispersive imaging spectrometer. To within the uncertainties, this implies that the count rate per pixel will be about higher in the energy-resolving detector than in the conventional detector.
We wish to thank Brendan Crill and Warren Holmes of NASA Jet Propulsion Laboratory and Matthew Greenhouse of NASA Goddard Space Flight Center for carefully reading the entire manuscript and providing invaluable comments. We wish to thank the referee for several helpful comments and suggestions that have improved the manuscript. This work was supported by a NASA Goddard Space Flight Center Internal Research and Development (IRAD) award to develop space coronagraph detector technology and a NASA Goddard Space Flight Center Science and Exploration Directorate Science Task Group award entitled, “Life Finder Detectors.”
Bernard J. Rauscher is an experimental astrophysicist at NASA Goddard Space Flight Center. His research interests include astronomy instrumentation, space detector systems, extragalactic astronomy and cosmology, and most recently, the search for life on other worlds. His work developing detector systems for the James Webb Space Telescope has been recognized by a shared Congressional Space Act award and NASA’s Exceptional Achievement Medal.
Edgar R. Canavan is an aerospace engineer at NASA Goddard Space Flight Center. His research interests include magnetic cooling and the properties of materials and devices at low temperatures.
Samuel H. Moseley is a senior astrophysicist at NASA Goddard Space Flight Center. He has received many awards for his work, including SPIE’s 2013 George W. Goddard Award. The citation reads in part, “in recognition of his extraordinary inventions of superconducting imaging arrays for astronomy, ranging from submillimeter bolometers to energy sensitive X-ray microcalorimeters, and even dark matter detectors.”
John E. Sadleir is a condensed matter physicist in the Detector Systems Branch at NASA Goddard Space Flight Center. His research focuses on cryogenic detectors for particle physics, cosmology, and astrophysics applications.
Thomas Stevenson is an electronics engineer at NASA Goddard Space Flight Center. His research interests include superconducting properties of materials and devices, and development of photon detectors for astrophysics applications in spectral regions ranging from microwave, submillimeter, and far infrared, to x-rays.