Static Contributors to the MTF: Detector
The topic of sharpness and contrast performance is continued from here.
Detector MTF Models
Detector Sampling MTF
Each detector pixel performs spatial averaging of the incident irradiance over its active area. More precisely, the scene irradiance is integrated with the detector’s spatial responsivity function (which, for a uniform pixel, is a rectangle function of width equal to the pixel pitch). In the spatial domain this is a convolution, and in the frequency domain it becomes a multiplication with the Fourier transform of the rectangle — the sinc function[1] (See also the information here):
where \(p\) is the pixel pitch and \(f\) is the spatial frequency. Note the normalised sinc function notation used.
This is the most fundamental detector MTF contributor — it applies universally to all detector types (CCD, CMOS, infrared) and is determined entirely by the pixel geometry. The key features of this MTF contributor are:
At zero frequency (\(f = 0\)): MTF = 1 (the full signal is captured).
At Nyquist (\(f = f_{ny} = 1/2p\)): \(\text{MTF} = \text{sinc}(0.5) = 2/\pi \approx 0.637\). This means that even for a perfectly sampled signal at the Nyquist limit, the detector averages away about 36% of the modulation.
At \(f = 1/p\): MTF = 0. The spatial period of the input equals the pixel pitch, so the signal is entirely averaged out within a single pixel.
Beyond \(f = 1/p\): the sinc function goes negative, corresponding to contrast reversal — bright and dark regions swap. This is related to aliasing but is distinct from it; aliasing is a sampling effect while the sinc rolloff is an averaging (integration) effect.
For detectors with a fill factor less than unity (i.e., the photosensitive area is smaller than the pixel pitch), the effective averaging width is reduced and the MTF rolls off more slowly. In this case, \(p\) in the formula should be replaced by the active detector width \(d\) rather than the pixel pitch. Most modern sensors have fill factors close to 1 (aided by microlenses in CMOS sensors), so \(p\) is commonly used directly.
When pixel binning is applied (combining \(n \times n\) pixels), the effective pixel pitch increases to \(n \times p\), and the sampling MTF degrades correspondingly — the sinc function becomes narrower in the frequency domain.
Detector Diffusion MTF
When photons are absorbed in the detector substrate, they generate charge carriers (electron-hole pairs). These carriers do not stay exactly where they are generated — they diffuse laterally through the semiconductor material before being collected by the pixel’s depletion region. This lateral spread means that a photon absorbed under one pixel can contribute signal to a neighbouring pixel, blurring the image and reducing the MTF.
The extent of this effect depends on the detector material and architecture:
CCD detectors: Moderate diffusion effect. In front-illuminated CCDs, carriers generated deep in the substrate (from longer wavelength photons, particularly in the red and near-IR) have to diffuse further to reach the depletion region, leading to a wavelength-dependent MTF loss[2]. Back-illuminated CCDs can have worse diffusion MTF unless thinned, as carriers generated near the back surface must traverse the full substrate thickness.
CMOS detectors: Generally smaller diffusion effect than CCDs, and modern designs incorporate several architectural features that further reduce it[2]:
Thin epitaxial layers (often <10 µm) limit the lateral diffusion distance.
Deep Trench Isolation (DTI): Physical SiO2 barriers etched between pixels (typically 3–4 µm deep) that block lateral carrier diffusion at the pixel boundary. DTI effectively eliminates electrical crosstalk in well-isolated designs and is standard in modern small-pitch sensors.
Pinned Photodiodes (PPD): The depletion structure extends deeper into the substrate and can be over-depleted, increasing carrier drift velocity and reducing the time (and distance) carriers diffuse before collection.
Back-Side Illumination (BSI): The substrate is thinned (to ~3–5 µm) so that photons enter from the back and are absorbed close to the depletion region. This shortens the diffusion path but reverses the geometry compared to front-side illumination.
The combination of these features means that diffusion MTF is often a secondary concern for visible-wavelength CMOS imagers. However, at longer wavelengths (NIR, >800 nm), photons penetrate deeper and the diffusion effect increases — CMOS MTF at 850 nm can be significantly lower than at visible wavelengths.
Infrared detectors (HgCdTe, InSb): Diffusion is a dominant MTF contributor. These detectors typically have thick absorber layers (to achieve sufficient quantum efficiency at long wavelengths) and relatively large diffusion lengths. In HgCdTe detectors, the minority carrier diffusion length can be tens of micrometres — comparable to or larger than the pixel pitch — making lateral diffusion the primary MTF-limiting mechanism in many IR focal plane arrays[3]. This gets more important as the pixel pitch gets smaller. InSb detectors (covering 1–5.5 µm) face similar challenges. The effect worsens at longer wavelengths where thicker absorbers are required.
Diffusion MTF Model
General Model — Crowell & Labuda (1969)
The most general analytical diffusion MTF model was derived by Crowell & Labuda[4] for the silicon diode array camera tube. Their Eq. (5) gives the quantum efficiency as a function of spatial frequency \(k\) (here written as \(f\) in cycles/length), accounting for surface recombination velocity \(S\), back-surface reflectivity \(R\), and a finite substrate.
In the C&L geometry, light enters the field-free region first (from the illuminated surface at \(x = 0\)), passes through the field-free region of thickness \(L_a\), and is collected at the depletion region boundary at \(x = L_a\). The substrate back surface is at \(x = L_b\), so the depletion region width is \(L_b - L_a\).
The frequency-dependent quantum efficiency \(\eta(f)\) is:
where:
and the frequency-dependent auxiliary length is:
The diffusion MTF is the normalised form:
At \(f = 0\), \(L_f = L_{Diff}\), so \(\eta(0)\) is obtained by substituting \(L_f = L_{Diff}\) in the expression above.
The parameters are:
\(f\) : spatial frequency
\(\alpha_{abs}(\lambda)\) : photon absorption coefficient of the substrate material (wavelength-dependent)
\(L_a\) : field-free region thickness (distance from illuminated surface to the depletion region boundary)
\(L_b\) : total substrate thickness (distance from illuminated surface to the back surface); depletion width is \(L_b - L_a\)
\(L_{Diff}\) : minority carrier diffusion length (\(L_{Diff} = \sqrt{D\tau}\), where \(D\) is the diffusion coefficient and \(\tau\) is the carrier lifetime)
\(S\) : surface recombination velocity at the illuminated surface (cm/s)
\(D\) : minority carrier diffusion coefficient (cm²/s); note \(S/D\) has units of 1/length
\(R\) : reflectivity of the back surface (dimensionless, \(0 \leq R \leq 1\))
The \(\beta^{\pm}\) terms encode the boundary condition at the illuminated surface: \(S\) controls how quickly carriers recombine there. The \((1-R)\) term accounts for carriers that reach the back surface and are either reflected back into the substrate (fraction \(R\)) or lost (fraction \(1-R\)).
Simplified Models
The general C&L equation is simplified for five practically important cases by applying boundary condition limits and geometry choices. Full derivations are in the derivation notebook; numerical examples with real detector parameters are in the numerical examples notebook. In all cases \(R = 0\) (no back-surface reflection).
Model |
Geometry |
Detector category |
Conditions |
Physical meaning |
Example detector |
|---|---|---|---|---|---|
BSI-1 |
BSI |
Scientific CCD |
\(S \to \infty\), finite \(L_b\) |
Dead back surface; all carriers reaching the illuminated surface recombine |
Teledyne e2v CCD47-10 |
BSI-2 |
BSI |
Scientific sCMOS |
\(S = 0\), finite \(L_b\) |
Perfectly passivated back surface; carriers reflect back toward junction |
Gpixel GSENSE400BSI |
BSI-3 |
BSI |
IR FPA |
\(L_b \to \infty\), general \(S\) |
Thick substrate; finite surface recombination |
Teledyne H2RG (MCT) |
FSI-1 |
FSI |
Consumer CMOS |
Dead far surface, finite \(L_a\) |
Light enters depletion first; finite field-free bulk |
Sony IMX174 |
FSI-2 |
FSI |
Bulk CCD |
Dead far surface, \(L_a \to \infty\) |
Semi-infinite bulk; equivalent to the Seib[5] / Fiete[6] model |
Generic FSI CCD |
BSI geometry (\(L_a\) = field-free depth, \(L_b\) = depletion depth): light enters the field-free region first (from the back surface), diffuses to the depletion edge at \(x = L_a\), and the depletion layer extends from \(L_a\) to \(L_b\).
FSI geometry (reversed): light enters the depletion region first (\(L_D\) = depletion width), and the field-free bulk of thickness \(L_a\) is behind it. The BSI-1 through BSI-3 equations follow directly from C&L Eq. (5); the FSI cases require an independent derivation from the diffusion equation with reversed boundary conditions.
The strong wavelength dependence of \(\alpha_{abs}(\lambda)\) is physically significant in all cases: longer wavelengths have smaller \(\alpha_{abs}\), so photons are absorbed deeper, carriers travel further before collection, and the diffusion MTF degrades accordingly.
Applicability
Detector type |
Model |
\(L_a\) |
\(L_b\) / \(L_D\) |
Example |
|---|---|---|---|---|
Sci-CCD (BSI, thinned) |
BSI-1 |
15–20 µm |
10–20 µm |
e2v CCD47-10 |
sCMOS (BSI, thin epi) |
BSI-2 |
3–8 µm |
1–2 µm |
Gpixel GSENSE400BSI |
MCT IR array (BSI) |
BSI-3 |
5–15 µm |
\(L_b \to \infty\) |
Teledyne H2RG |
Consumer CMOS (FSI) |
FSI-1 |
3–10 µm |
1–3 µm |
Sony IMX174 |
Bulk CCD (FSI) |
FSI-2 |
\(L_a \to \infty\) |
1–5 µm |
Generic FSI CCD |
The model assumes an uninterrupted substrate with no lateral barriers to diffusion. For modern CMOS sensors with Deep Trench Isolation (DTI), the physical trenches between pixels block lateral carrier movement, and the actual diffusion MTF will be better than the model predicts. In such cases the model provides a conservative (pessimistic) estimate. For CCDs without pixel-level isolation barriers, the model is directly applicable.
Note that the model is isotropic — the diffusion MTF is the same in the ALT and ACT directions for a given spatial frequency.
Typical Diffusion Length Values
Detector Type |
Typical \(L_{Diff}\) |
Notes |
|---|---|---|
Front-illuminated CCD (visible) |
1–5 µm |
Wavelength dependent; worse in red/NIR |
BSI CMOS (visible) |
<2 µm |
Thin substrates limit diffusion |
HgCdTe (MWIR, 3–5 µm) |
10–30 µm |
Can exceed pixel pitch |
HgCdTe (LWIR, 8–12 µm) |
15–50 µm |
Thick absorbers, long diffusion lengths |
InSb (MWIR) |
10–40 µm |
Similar magnitude to HgCdTe MWIR |
Detector Crosstalk MTF
Crosstalk occurs when signal generated by a photon in one pixel leaks into adjacent pixels. There are two distinct mechanisms[2][6]:
Electrical crosstalk: Photogenerated charge carriers diffuse laterally in the substrate and are collected by a neighbouring pixel’s depletion region. This is closely related to the diffusion effect discussed above, but treated here as an inter-pixel coupling rather than a continuous spread. It is the dominant crosstalk mechanism in most detectors.
Optical crosstalk: Photons scatter, reflect, or diffract within the detector structure (e.g., off metal interconnect layers, microlens edges, or colour filter boundaries) and are absorbed in a neighbouring pixel. This is more significant in small-pitch sensors with tall pixel structures.
The net effect is that a point source illuminating a single pixel produces a response in its neighbours, reducing modulation at high spatial frequencies where the pattern alternates between pixels.
Nearest-Neighbour Crosstalk Model
The crosstalk model uses the Discrete Impulse Response method[6]. A unit signal (1) is generated in the target pixel, and a fraction of that charge leaks to its available neighbours. We define separate coefficients for the two types of neighbour:
\(X_s\): crosstalk to a side (edge-adjacent) neighbour
\(X_d\): crosstalk to a diagonal (corner-adjacent) neighbour
For a centre pixel with 8 neighbours (4 side, 4 diagonal), the discrete kernel is:
The kernel sums to unity (signal is conserved). Its 2D Discrete Fourier Transform gives the transfer function:
where \(p\) is the pixel pitch, \(f_x\) and \(f_y\) are the spatial frequencies in the two detector axes, and \(X_s\) and \(X_d\) are dimensionless fractions. Because the kernel is symmetric, \(H\) is real-valued and equals the MTF directly.
Taking a 1D slice along one axis (\(f_y = 0\)) yields:
This is equivalently written in terms of the Nyquist frequency \(f_\text{nyq} = 1/(2p)\):
since \(2\pi f p = \pi f / f_\text{nyq}\). The Nyquist form is numerically preferable when \(f_\text{nyq}\) is known directly, as the ratio \(f/f_\text{nyq}\) is dimensionless.
When diagonal crosstalk is negligible (\(X_d = 0\)), this reduces to the classical formula \(\text{MTF}_\text{xtalk}(f) = 1 - 2X_s(1 - \cos(2\pi f p))\).
This model has the property that:
At zero frequency (\(f = 0\)): MTF = 1 (signal is conserved, just redistributed)
At Nyquist (\(f = f_{ny} = 1/2p\)): \(\text{MTF} = 1 - 4(X_s + 2X_d)\) (maximum degradation, where the pattern alternates every pixel)
The constraint \(4X_s + 4X_d < 1\) is required for the kernel centre weight to remain positive. In practice, typical values for a high-performance detector are \(X_s\) = 1%–4% and \(X_d\) = 0.2%–1%.
Crosstalk Across Detector Types
Detector Type |
Typical \(X_s\) |
Primary Mechanism |
Notes |
|---|---|---|---|
CCD (visible) |
0.01–0.03 |
Electrical |
Well-controlled in modern designs |
BSI CMOS (visible) |
0.01–0.05 |
Optical + Electrical |
Optical crosstalk increases at small pitch (<2 µm)[2] |
HgCdTe (IR) |
0.02–0.10 |
Electrical |
Lateral diffusion in thick absorber; can be dominant MTF limiter[3] |
InSb (IR) |
0.02–0.08 |
Electrical |
Similar to HgCdTe; depends on pixel pitch vs. diffusion length |
For HgCdTe detectors, crosstalk can reduce the total system MTF by over 30% at the 50% MTF frequency[3]. Ion implantation guard-ring structures around each pixel can suppress electrical crosstalk, and this is a common design mitigation in modern IR focal plane arrays.
Relationship Between Diffusion and Crosstalk
The diffusion MTF and the electrical component of crosstalk MTF describe the same underlying physics — lateral movement of charge carriers in the substrate — but model it at different levels of abstraction[2][3]:
Diffusion MTF is a continuous, physics-based model. It uses the material diffusion length \(L_d\) as its parameter and does not reference pixel boundaries. It models the full spatial spread of the charge cloud, including long-range tails that can extend over multiple pixels.
Crosstalk MTF is a discrete, measurement-based model. It uses the inter-pixel coupling coefficients \(X_s\) and \(X_d\) as its parameters, typically obtained from detector characterisation data. It captures only the nearest-neighbour signal sharing and does not model the shape of the underlying spread.
Because of this shared physical origin, the two should not be blindly multiplied in an MTF budget when the crosstalk is predominantly electrical. Doing so would double-count the same effect. The recommended approach is:
If material properties are known (diffusion length), use diffusion MTF alone for the electrical component.
If only measured inter-pixel coupling data is available, use crosstalk MTF alone.
If both optical and electrical crosstalk are significant (e.g., small-pitch BSI CMOS), the diffusion MTF can be used for the electrical component and the crosstalk MTF for the optical component only, with \(X_s\) representing only the optical coupling fraction.
CTE (Charge Transfer Efficiency) MTF
This section applies only to CCD detectors and is not relevant for CMOS or infrared focal plane arrays, which read out pixels individually rather than shifting charge through a register.
In a CCD, the accumulated charge in each pixel is read out by sequentially transferring it through neighbouring pixels towards the output amplifier. Each transfer is not perfectly efficient — a small fraction of the charge is left behind in traps within the silicon (and released later, into the wrong pixel).
This charge transfer inefficiency (CTI = \(1 - \varepsilon\), where \(\varepsilon\) is the CTE) causes a directional smearing of the image along the transfer direction, which is similar to motion blur in the transfer direction. The effect is cumulative: pixels further from the readout register undergo more transfers and suffer more degradation. For a CCD with \(n\) columns, the last pixel to be read out experiences \(n\) transfers (or \(n \times n_\text{phases}\) for a multi-phase CCD). The standard model[7] for the resulting MTF is:
where:
\(N\) is the number of charge transfers (depends on pixel position and CCD architecture)
\(\varepsilon\) is the CTE per transfer (dimensionless)
\(p\) is pixel pitch
\(f\) is the spatial frequency
It is also possible to write the equation in terms of Nyquist frequency \(f_\text{nyq} = 1/(2p)\):
This form is numerically preferable because the ratio \(f / f_\text{nyq}\) is dimensionless and avoids unit-conversion ambiguity when the Nyquist frequency is known directly.
Note
Boreman [2] uses a reversed terminology, where \(\varepsilon\) is the fractional charge left behind at each transfer (i.e., CTI = \(1 - \text{CTE}\)). In this document, \(\varepsilon\) is the CTE — the fraction successfully transferred. The MTF_Model_1D.detector_cte factory uses the same convention: pass cte=0.99999, not cte=0.00001.
At zero frequency (\(f = 0\)), the cosine term equals 1 and the MTF is unity — charge is conserved overall. The worst degradation occurs at the Nyquist frequency, where the cosine term equals \(-1\), giving \(\text{MTF} = \exp(-2N(1-\varepsilon))\).
One-directional smear. CTE blur is strictly directional — opposite to the charge transfer direction. Unlike diffusion (isotropic) or aperture (symmetric), CTE produces an asymmetric tail on every point source pointing away from the readout register. In a pushbroom imager this contributes only along the clocking axis: parallel-register clocking drives the ALT CTE MTF; serial-register clocking drives the ACT CTE MTF.
Position-dependence. The number of transfers \(N\) depends on the pixel’s position — pixels close to the readout amplifier undergo few transfers and are sharp, while pixels at the far end undergo \(N_\text{max}\) transfers and are most degraded. The CTE MTF therefore varies roughly linearly across the image strip. In opticks this is exposed by passing num_pixels at runtime, not via SensorParams.
TDI as a transfer multiplier. For on-chip analog TDI CCDs, charge accumulates through extra parallel-register transfers before reaching the readout register:
This applies only to on-chip analog TDI (where each stage is a real charge transfer in silicon). Digital (“shift-and-add”) TDI — the CMOS approach where each row is read out independently and summed in firmware after the ADC — performs no charge transfer between stages, so no extra transfers contribute to CTE degradation.
Connection to crosstalk MTF. The CTE MTF and crosstalk MTF share the same \((1 - \cos)\) kernel. This is not a coincidence: CTE can be viewed as temporal crosstalk — instead of charge leaking into a spatial neighbour, a fraction leaks into the next read cycle (the temporal neighbour). The compounding over \(N\) transfers turns the linear \((1 - \cos)\) factor into the exponential of the closed-form CTE expression.
CTE Values and Impact
Modern scientific CCDs achieve very high CTE values[7]:
Application |
Typical CTE |
CTI (\(1-\varepsilon\)) |
|---|---|---|
Consumer / industrial CCDs |
0.99999 |
\(10^{-5}\) |
Scientific CCDs (new) |
0.999999 |
\(10^{-6}\) |
Space-qualified CCDs (new) |
0.9999995 |
\(5 \times 10^{-7}\) |
Radiation-damaged CCDs (space, after years) |
0.99999–0.9999 |
\(10^{-5}\)–\(10^{-4}\) |
Three-phase CCDs have significantly higher CTE than two-phase CCDs.
While CTI values appear tiny, the cumulative effect over thousands of transfers is significant. For example, a 4096-pixel CCD with 3-phase architecture and \(\varepsilon = 0.99999\) undergoes \(N = 4096 \times 3 = 12288\) transfers for the furthest pixel. At the Nyquist frequency: \(\text{MTF} = \exp(-2 \times 12288 \times 10^{-5}) = \exp(-0.246) \approx 0.78\). Also note that the CTI values are different in horizontal and vertical directions.
Radiation damage in space progressively degrades CTE by creating charge traps in the silicon lattice[8]. This is a major concern for long-duration space missions (e.g., the Hubble Space Telescope ACS/WFC suffered from degradation and required dedicated CTE correction algorithms after years in orbit).
CMOS detectors do not suffer from MTF degradation due to CTE because each pixel has its own amplifier and is read out individually — there is no charge transfer between pixels.
Directionality: ALT vs ACT for Pushbroom Detectors
For pushbroom (line scanner) detectors, the along-track (ALT) and across-track (ACT) directions have fundamentally different characteristics, and the detector MTF contributors are not necessarily the same in both directions:
Sampling MTF: In the ACT direction, the sampling MTF is determined by the physical pixel pitch as usual. In the ALT direction, the effective “pixel size” is determined by the integration time multiplied by the image velocity on the focal plane — this is often different from the physical pixel pitch. The sinc MTF must therefore be computed separately for each direction, using the appropriate effective pitch.
Diffusion MTF: The lateral diffusion of charge carriers is an isotropic process in the semiconductor — carriers spread equally in all directions. The diffusion MTF is therefore the same in both ALT and ACT for a given spatial frequency. However, because the Nyquist frequencies may differ between ALT and ACT (due to different effective pixel sizes), the relative impact of diffusion at each direction’s Nyquist will differ.
Crosstalk MTF: Like diffusion, the nearest-neighbour coupling is generally isotropic for square pixels. However, for rectangular pixels or detectors with asymmetric structures (e.g., transfer gates or bus lines running in one direction), the crosstalk coefficients \(X_s\) and \(X_d\) can differ between ALT and ACT.
CTE MTF: This is inherently directional. Charge is transferred along one specific axis — typically along the column (ALT direction for a pushbroom) for the parallel register, and along the row (ACT direction) for the serial register. Each direction has a different number of transfers and potentially different CTE values. For a TDI (Time Delay Integration) CCD, the parallel CTE applies in the ALT direction with \(N\) equal to the number of TDI stages times the number of phases.
As a consequence, for pushbroom and TDI systems, the full detector MTF budget should be computed separately for the ALT and ACT directions.
Summary: Detector MTF Sources by Detector Type
The following table summarises the applicability and typical impact of each detector-level MTF contributor:
MTF Source |
CCD (Visible) |
CMOS (Visible) |
HgCdTe / InSb (IR) |
|---|---|---|---|
Sampling (sinc) |
Always applies |
Always applies |
Always applies |
Diffusion |
Moderate (1–5 µm \(L_d\)) |
Small (<2 µm \(L_d\), thin substrates) |
Large / dominant (10–50 µm \(L_d\)) |
Crosstalk |
Small (well-controlled) |
Small–Moderate (optical at small pitch) |
Large (electrical, thick absorber) |
CTE |
Applies (degrades with radiation) |
Not applicable |
Not applicable |
For visible-wavelength systems, the detector sampling MTF typically dominates, with diffusion and crosstalk as secondary contributors. For infrared systems, diffusion and crosstalk can be the primary MTF-limiting mechanisms, often exceeding the degradation from optical aberrations.
The topic of sharpness and contrast performance is continued here.