1 Introduction

Supersymmetry (SUSY) [1,2,3,4,5,6,7] is one of the most studied extensions of the Standard Model (SM). In its minimal realization (the Minimal Supersymmetric Standard Model, or MSSM) [8, 9], it predicts new fermionic (bosonic) partners of the fundamental SM bosons (fermions) and an additional Higgs doublet. These new SUSY particles, or sparticles, can provide an elegant solution to the gauge hierarchy problem [10,11,12,13]. In R-parity-conserving models [14], sparticles can only be produced in pairs and the lightest supersymmetric particle (LSP) is stable. This is typically assumed to be the \(\displaystyle \tilde{\chi }^0_1\) neutralino,Footnote 1 which can then provide a natural candidate for dark matter [15, 16]. If produced in proton–proton collisions, a neutralino LSP would escape detection and lead to an excess of events with large missing transverse momentum above the expectations from SM processes, a characteristic that is exploited to search for SUSY signals in analyses presented in this paper.

The production cross-sections of SUSY particles at the Large Hadron Collider (LHC) [17] depend both on the type of interaction involved and on the masses of the sparticles. The coloured sparticles (squarks and gluinos) are produced in strong interactions with significantly larger production cross-sections than non-coloured sparticles of equal masses, such as the charginos (\(\tilde{\chi }^{\pm }_{i}\), \(i = 1, 2\)) and neutralinos (\(\tilde{\chi }^{0}_{j}\), \(j = 1, 2, 3, 4\)) and the sleptons (\(\tilde{\ell } \) and \(\tilde{\nu } \)). The direct production of charginos and neutralinos or slepton pairs can dominate SUSY production at the LHC if the masses of the gluinos and the squarks are significantly larger. With searches performed by the ATLAS [18] and CMS [19] experiments during LHC Run 2, the exclusion limits on coloured sparticle masses extend up to approximately \(2\,\)TeV [20,21,22], making electroweak production an increasingly important probe for SUSY signals at the LHC.

This paper presents a set of searches for the electroweak production of charginos, neutralinos and sleptons decaying into final states with two or three electrons or muons using 36.1 fb\(^{-1}\) of proton–proton collision data delivered by the LHC at a centre-of-mass energy of \(\sqrt{s}=13\) TeV. The results build on studies performed during LHC Run 1 at \(\sqrt{s}=7\) TeV and 8 TeV by the ATLAS Collaboration [23,24,25]. Analogous studies by the CMS Collaboration are presented in Refs. [26,27,28,29].

After descriptions of the SUSY scenarios considered (Sect. 2), the experimental apparatus (Sect. 3), the simulated samples (Sect. 4) and the event reconstruction (Sect. 5), the analysis search strategy is discussed in Sect. 6. This is followed by Sect. 7, which describes the estimation of SM contributions to the measured yields in the signal regions, and by Sect. 8, which discusses systematic uncertainties affecting the searches. Results are presented in Sect. 9, together with the statistical tests used to interpret them in the context of relevant SUSY benchmark scenarios. Section 10 summarizes the main conclusions.

2 SUSY scenarios and search strategy

This paper presents searches for the direct pair-production of \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \), \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) and \(\tilde{\ell } \tilde{\ell } \) particles, in final states with exactly two or three electrons and muons, two \(\displaystyle \tilde{\chi }^0_1 \) particles, and possibly additional jets or neutrinos. Simplified models [30], in which the masses of the relevant sparticles are the only free parameters, are used for interpretation and to guide the design of the searches. The pure wino \(\displaystyle \tilde{\chi }^\pm _1 \) and \(\displaystyle \tilde{\chi }^0_2 \) are taken to be mass-degenerate, and so are the scalar partners of the left-handed charged leptons and neutrinos (\(\tilde{e}_{L }\), \(\tilde{\mu }_{L }\), \(\tilde{\tau }_{L }\) and \(\tilde{\nu }\)). Intermediate slepton masses, when relevant, are chosen to be midway between the mass of the heavier chargino and neutralino and that of the lightest neutralino, which is pure bino, and equal branching ratios for the three slepton flavours are assumed. The analysis sensitivity is not expected to depend strongly on the slepton mass hypothesis for a broad range of slepton masses, while it degrades as the slepton mass approaches that of the heavier chargino and neutralino, leading to lower \(p_T\) values for the leptons produced in the heavy chargino and neutralino decays [25]. Lepton flavour is conserved in all models. Diagrams of processes considered are shown in Fig. 1. For models exploring \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) production, it is assumed that the sleptons are also light and thus accessible in the sparticle decay chains, as illustrated in Fig. 1a. Two different classes of models are considered for \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production: in one case, the \(\displaystyle \tilde{\chi }^\pm _1 \) chargino and \(\displaystyle \tilde{\chi }^0_2 \) neutralino can decay into final-state SM particles and a \(\displaystyle \tilde{\chi }^0_1 \) neutralino via an intermediate \(\tilde{\ell } _{L }\) or \(\tilde{\nu }_{L }\), with a branching ratio of 50% to each (Fig. 1b); in the other case the \(\displaystyle \tilde{\chi }^\pm _1 \) chargino and \(\displaystyle \tilde{\chi }^0_2 \) neutralino decays proceed via SM gauge bosons (W or Z). For the gauge-boson-mediated decays, two distinct final states are considered: three-lepton (where lepton refers to an electron or muon) events where both the W and Z bosons decay leptonically (Fig. 1c) or events with two opposite-sign leptons and two jets where the W boson decays hadronically and the Z boson decays leptonically (Fig. 1d). In models with direct \(\tilde{\ell } \tilde{\ell } \) production, each slepton decays into a lepton and a \(\displaystyle \tilde{\chi }^0_1 \) with a 100% branching ratio (Fig. 1e), and \(\tilde{e}_{L }\), \(\tilde{e}_{R }\), \(\tilde{\mu }_{L }\), \(\tilde{\mu }_{R }\), \(\tilde{\tau }_{L }\) and \(\tilde{\tau }_{R }\) are assumed to be mass-degenerate.

Fig. 1
figure 1

Diagrams of physics scenarios studied in this paper: a \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) production with \(\tilde{\ell } \)-mediated decays into final states with two leptons, b \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with \(\tilde{\ell } \)-mediated decays into final states with three leptons, c \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with decays via leptonically decaying W and Z bosons into final states with three leptons, d \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with decays via a hadronically decaying W boson and a leptonically decaying Z boson into final states with two leptons and two jets, and e slepton pair production with decays into final states with two leptons

Events are recorded using triggers requiring the presence of at least two leptons and assigned to one of three mutually exclusive analysis channels depending on the lepton and jet multiplicity. The 2\(\ell \) + 0 jets channel targets chargino- and slepton-pair production, the 2\(\ell \) + jets channel targets chargino-neutralino production with gauge-boson-mediated decays, and the 3\(\ell \) channel targets chargino-neutralino production with slepton- or gauge-boson-mediated decays. For each channel, a set of signal regions (SR), defined in Section 6, use requirements on \(E_{\text {T}}^{\text {miss}}\) and other kinematic quantities, which are optimized for different SUSY models and sparticle masses. The analyses employ “inclusive” SRs to quantify significance without assuming a particular signal model and to exclude regions of SUSY model parameter space, as well as sets of orthogonal “exclusive” SRs that are considered simultaneously during limit-setting to improve the exclusion sensitivity.

3 ATLAS detector

The ATLAS experiment [18] is a multi-purpose particle detector with a forward-backward symmetric cylindrical geometry and nearly \(4\pi \) coverage in solid angle.Footnote 2 The interaction point is surrounded by an inner detector (ID), a calorimeter system, and a muon spectrometer.

The ID provides precision tracking of charged particles for pseudorapidities \(|\eta | < 2.5\) and is surrounded by a superconducting solenoid providing a 2 T axial magnetic field. The ID consists of silicon pixel and microstrip detectors inside a transition radiation tracker. One significant upgrade for the \(\sqrt{s}=13\) TeV running period is the installation of the insertable B-layer [31], an additional pixel layer close to the interaction point which provides high-resolution hits at small radius to improve the tracking performance.

In the pseudorapidity region \(|\eta | < 3.2\), high-granularity lead/liquid-argon (LAr) electromagnetic (EM) sampling calorimeters are used. A steel/scintillator tile calorimeter measures hadron energies for \(|\eta | < 1.7\). The endcap and forward regions, spanning \(1.5<|\eta | <4.9\), are instrumented with LAr calorimeters, for both the EM and hadronic measurements.

The muon spectrometer consists of three large superconducting toroids with eight coils each, and a system of trigger and precision-tracking chambers, which provide triggering and tracking capabilities in the ranges \(|\eta | < 2.4\) and \(|\eta | < 2.7\), respectively.

A two-level trigger system is used to select events [32]. The first-level trigger is implemented in hardware and uses a subset of the detector information. This is followed by the software-based high-level trigger, which runs offline reconstruction and calibration software, reducing the event rate to about 1 kHz.

4 Data and simulated event samples

This analysis uses proton–proton collision data delivered by the LHC at \(\sqrt{s}=13 \text { TeV} \) in 2015 and 2016. After fulfilling data-quality requirements, the data sample amounts to an integrated luminosity of 36.1 fb\(^{-1}\). This value is derived using a methodology similar to that detailed in Refs. [33], from a calibration of the luminosity scale using xy beam-separation scans performed in August 2015 and May 2016.

Various samples of Monte Carlo (MC) simulated events are used to model the SUSY signal and help in the estimation of the SM backgrounds. The samples include an ATLAS detector simulation [34], based on Geant4 [35], or a fast simulation [34] that uses a parameterization of the calorimeter response [36] and Geant4 for the other parts of the detector. The simulated events are reconstructed in the same manner as the data.

Diboson processes were simulated with the Sherpa  v2.2.1 event generator [37, 38] and normalized using next-to-leading-order (NLO) cross-sections [39, 40]. The matrix elements containing all diagrams with four electroweak vertices with additional hard parton emissions were calculated with Comix [41] and virtual QCD corrections were calculated with OpenLoops [42]. Matrix element calculations were merged with the Sherpa parton shower [43] using the ME+PS@NLO prescription [44]. The NNPDF3.0 NNLO parton distribution function (PDF) set [45] was used in conjunction with dedicated parton shower tuning developed by the Sherpa authors. The fully leptonic channels were calculated at NLO in the strong coupling constant with up to one additional parton for \(4\ell \) and \(2\ell +2\nu \), at NLO with no additional parton for \(3\ell +\nu \), and at leading order (LO) with up to three additional partons. Processes with one of the bosons decaying hadronically and the other leptonically were calculated with up to one additional parton at NLO and up to three additional partons at LO.

Diboson processes with six electroweak vertices, such as same-sign W boson production in association with two jets, \(W^\pm W^\pm jj\), and triboson processes were simulated as above with Sherpa  v2.2.1 using the NNPDF3.0 PDF set. Diboson processes with six vertices were calculated at LO with up to one additional parton. Fully leptonic triboson processes (WWW, WWZ, WZZ and ZZZ) were calculated at LO with up to two additional partons and at NLO for the inclusive processes and normalized using NLO cross-sections.

Events containing \(Z\) bosons and associated jets (\(Z/\gamma ^*\) + jets, also referred to as Z + jets in the following) were also produced using the Sherpa v2.2.1 generator with massive b / c-quarks to improve the treatment of the associated production of \(Z \) bosons with jets containing b- and c-hadrons [46]. Matrix elements were calculated with up to two additional partons at NLO and up to four additional partons at LO, using Comix, OpenLoops, and Sherpa parton shower with ME+PS@NLO in a way similar to that described above. A global K-factor was used to normalize the Z + jets events to the next-to-next-to-leading-order (NNLO) QCD cross-sections [47].

For the production of \(t\bar{t}\) and single top quarks in the Wt channel, the Powheg-Box v2 [48, 49] generator with the CT10 PDF set [50] was used, as discussed in Ref. [51]. The top quark mass was set at 172.5 GeV for all MC samples involving top quark production. The \(t\bar{t}\) events were normalized using the NNLO+next-to-next-to-leading-logarithm (NNLL) QCD [52] cross-section, while the cross-section for single-top-quark events was calculated at NLO+NNLL [53].

Samples of \(t\bar{t} V\) (with \(V=W\) and Z, including non-resonant \(Z/\gamma ^*\) contributions) and \(t\bar{t}WW\) production were generated at LO with MadGraph5_aMC@NLO v2.2.2 [54] interfaced to Pythia 8.186 [55] for parton showering, hadronisation and the description of the underlying event, with up to two (\(t\bar{t} W\)), one (\(t\bar{t} Z\)) or no (\(t\bar{t} WW\)) extra partons included in the matrix element, as described in Ref. [56]. MadGraph was also used to simulate the tZ, \(t\bar{t} t\bar{t} \) and \(t\bar{t} t\) processes. A set of tuned parameters called the A14 tune [57] was used together with the NNPDF2.3LO PDF set [58]. The \(t\bar{t} W\), \(t\bar{t} Z\), \(t\bar{t} WW\) and \(t\bar{t} t\bar{t} \) events were normalized using their NLO cross-section [56] while the generator cross-section was used for tZ and \(t\bar{t} t\).

Higgs boson production processes (including gluon–gluon fusion, associated VH production and vector-boson fusion) were generated using Powheg-Box v2 [59] and Pythia 8.186 and normalized using cross-sections calculated at NNLO with soft gluon emission effects added at NNLL accuracy [60], whilst \(t\bar{t} H\) events were produced using MadGraph5_aMC@NLO 2.3.2 + Herwig++  [61] and normalized using the NLO cross-section [56]. All samples assume a Higgs boson mass of 125 GeV.

The SUSY signal processes were generated from LO matrix elements with up to two extra partons, using the MadGraph v2.2.3 generator interfaced to Pythia 8.186 with the A14 tune for the modelling of the SUSY decay chain, parton showering, hadronization and the description of the underlying event. Parton luminosities were provided by the NNPDF2.3LO PDF set. Jet–parton matching was realized following the CKKW-L prescription [62], with a matching scale set to one quarter of the pair-produced superpartner mass. Signal cross-sections were calculated at NLO, with soft gluon emission effects added at next-to-leading-logarithm (NLL) accuracy [63,64,65,66,67]. The nominal cross-section and its uncertainty were taken from an envelope of cross-section predictions using different PDF sets and factorization and renormalization scales, as described in Ref. [68]. The cross-section for \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) production, each with a mass of 600 GeV, is \(9.50\pm 0.91\) fb, while the cross-section for \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production, each with a mass of 800 GeV, is \(4.76\pm 0.56\) fb.

In all MC samples, except those produced by Sherpa, the EvtGen v1.2.0 program [69] was used to model the properties of b- and c-hadron decays. To simulate the effects of additional pp collisions per bunch crossing (pile-up), additional interactions were generated using the soft QCD processes of Pythia 8.186 with the A2 tune [70] and the MSTW2008LO PDF set [71], and overlaid onto the simulated hard-scatter event. The Monte Carlo samples were reweighted so that the distribution of the number of pile-up interactions matches the distribution in data.

5 Event reconstruction and preselection

Events used in the analysis were recorded during stable data-taking conditions and must have a reconstructed primary vertex [72] with at least two associated tracks with \(p_{\text {T}} >400~\hbox {MeV}\). The primary vertex of an event is identified as the vertex with the highest \(\Sigma p_{\text {T}} ^2\) of associated tracks.

Two identification criteria are defined for the objects used in these analyses, referred to as “baseline” and “signal” (with the signal objects being a subset of the baseline ones). The former are defined to disambiguate between overlapping physics objects and to perform data-driven estimations of non-prompt leptonic backgrounds (discussed in Sect. 7) while the latter are used to construct kinematic and multiplicity discriminating variables needed for the event selection.

Baseline electrons are reconstructed from isolated electromagnetic calorimeter energy deposits matched to ID tracks and are required to have \(|\eta |<2.47\), \(p_{\text {T}} >{10}\,{\text {GeV}}\), and to pass a loose likelihood-based identification requirement [73, 74]. The likelihood input variables include measurements of calorimeter shower shapes and track properties from the ID.

Baseline muons are reconstructed in the region \(|\eta |<2.7\) from muon spectrometer tracks matching ID tracks. All muons must have \(p_{\text {T}} >{10}\,{\text {GeV}}\) and must pass the “medium identification” requirements defined in Ref. [75], based on selection of the number of hits and curvature measurements in the ID and muon spectrometer systems.

Jets are reconstructed with the anti-\(k_t\) algorithm [76] as implemented in the FastJet package [77], with radius parameter \(R=0.4\), using three-dimensional energy clusters in the calorimeter [78] as input. Baseline jets must have \(p_{\text {T}} >{20}\,{\text {GeV}}\) and \(|\eta |<4.5\) and signal jets have the tighter requirement of \(|\eta |<2.4\). Jet energies are calibrated as described in Refs. [79, 80]. In order to reduce the effects of pile-up, jets with \(p_{\text {T}} <{60}\,{\hbox {GeV}}\) and \(|\eta |<2.4\) must have a significant fraction of their associated tracks compatible with originating from the primary vertex, as defined by the jet vertex tagger [81]. Furthermore, for all jets the expected average energy contribution from pile-up is subtracted according to the jet area [81, 82]. Events are discarded if they contain any jet that is judged by basic quality criteria to be detector noise or non-collision background.

Identification of jets containing b-hadrons (b-jets), so called b-tagging, is performed with the MV2c10 algorithm, a multivariate discriminant making use of track impact parameters and reconstructed secondary vertices [83, 84]. A requirement is chosen corresponding to a 77% average efficiency obtained for b-jets in simulated \(t\bar{t} \) events. The corresponding rejection factors against jets originating from c-quarks, from \(\tau \)-leptons, and from light quarks and gluons in the same sample at this working point are 6, 22 and 134, respectively.

Baseline photon candidates are required to meet the “tight” selection criteria of Ref. [85] and satisfy \(p_{\text {T}} >25\) GeV and \(|\eta |<2.37\), but excluding the transition region \(1.37<|\eta |<1.52\), where the calorimeter performance is degraded.

After object identification, an “object-removal procedure” is performed on all baseline objects to remove possible double-counting in the reconstruction:

  1. 1.

    Any electron sharing an ID track with a muon is removed.

  2. 2.

    If a b-tagged jet (identified using the 85% efficiency working point of the MV2c10 algorithm) is within \(\Delta R = 0.2\) of an electron candidate, the electron is rejected, as it is likely to be from a semileptonic b-hadron decay; if the jet within \(\Delta R = 0.2\) of the electron is not b-tagged, the jet itself is discarded, as it likely originates from an electron-induced shower.

  3. 3.

    Electrons within \(\Delta R=0.4\) of a remaining jet candidate are discarded, to further suppress electrons from semileptonic decays of b- and c-hadrons.

  4. 4.

    Jets with a nearby muon that carries a significant fraction of the transverse momentum of the jet (\(p_{\text {T}} ^\mu > 0.7 \sum p_{\text {T}} ^{\text {jet tracks}}\), where \(p_{\text {T}} ^\mu \) and \(p_{\text {T}} ^{\text {jet tracks}}\) are the transverse momenta of the muon and the tracks associated with the jet, respectively) are discarded either if the candidate muon is within \(\Delta R=0.2\) of the jet or if the muon is matched to a track associated with the jet. Only jets with fewer than three associated tracks can be discarded in this step.

  5. 5.

    Muons within \(\Delta R=0.4\) of a remaining jet candidate are discarded to suppress muons from semileptonic decays of b- and c-hadrons.

Signal electrons must satisfy a “medium” likelihood-based identification requirement [73] and the track associated with the electron must have a significance of the transverse impact parameter relative to the reconstructed primary vertex, \(d_0\), of \(\vert d_0\vert /\sigma (d_0) < 5\), with \(\sigma (d_0)\) being the uncertainty in \(d_0\). In addition, the longitudinal impact parameter (again relative to the reconstructed primary vertex), \(z_0\), must satisfy \(\vert z_0 \sin \theta \vert < 0.5\) mm. Similarly, signal muons must satisfy the requirements of \(\vert d_0\vert /\sigma (d_0) < 3\), \(\vert z_0 \sin \theta \vert < 0.5\) mm, and additionally have \(|\eta |<2.4\). Isolation requirements are also applied to both the signal electrons and muons to reduce the contributions of “fake” or non-prompt leptons, which originate from misidentified hadrons, photons conversions, and hadron decays. These \(p_{\text {T}} \)- and \(\eta \)-dependent requirements use track- and calorimeter-based information and have efficiencies in \(Z\rightarrow e^{+} e^{-} \) and \(Z\rightarrow \mu ^{+} \mu ^{-} \) events that rise from 95% at 25 GeV to 99% at 60 GeV.

The missing transverse momentum \(\mathbf {p}_\mathrm {T}^\mathrm {miss}\), with magnitude \(E_{\text {T}}^{\text {miss}}\), is the negative vector sum of the transverse momenta of all identified physics objects (electrons, photons, muons, jets) and an additional soft term. The soft term is constructed from all tracks that are not associated with any physics object and that are associated with the primary vertex, to suppress contributions from pile-up interactions. The \(E_{\text {T}}^{\text {miss}}\) value is adjusted for the calibration of the jets and the other identified physics objects above [86].

Events considered in the analysis must pass a trigger selection requiring either two electrons, two muons or an electron plus a muon. The trigger-level thresholds on the \(p_{\text {T}}\) value of the leptons involved in the trigger decision are in the range 8–22 GeV and are looser than those applied offline to ensure that trigger efficiencies are constant in the relevant phase space.

Events containing a photon and jets are used to estimate the Z + jets background in events with two leptons and jets. These events are selected with a set of prescaled single-photon triggers with \(p_{\text {T}} \) thresholds in the range 35–100 GeV and an unprescaled single-photon trigger with threshold \(p_{\text {T}} =140\) GeV. Signal photons in this control sample must have \(p_{\text {T}} >37\) GeV to be in the efficiency plateau of the lowest-threshold single-photon trigger, fall outside the barrel-endcap transition region defined by \(1.37< |\eta | < 1.52\), and pass “tight” selection criteria described in Ref. [87], as well as \(p_{\text {T}}\)- and \(\eta \)-dependent requirements on both track- and calorimeter-based isolation.

Simulated events are corrected to account for small differences in the signal lepton trigger, reconstruction, identification, isolation, as well as b-tagging efficiencies between data and MC simulation.

6 Signal regions

In order to search for the electroweak production of supersymmetric particles, three different search channels that target different SUSY processes are defined:

  • 2\(\ell \)+ 0 jets channel: targets \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) and \(\tilde{\ell } \tilde{\ell } \) production (shown in Fig. 1a, e) in signal regions with a jet veto and defined using the “stransverse mass” variable, \(m_{\mathrm{T2}}\) [88, 89], and the dilepton invariant mass \(m_{\ell \ell }\);

  • 2\(\ell \)+ jets channel: targets \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with decays via gauge bosons (shown in Fig. 1d) into two same-flavour opposite-sign (SFOS) leptons (from the Z boson) and at least two jets (from the W boson);

  • 3\(\ell \) channel: targets \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with decays via intermediate \(\tilde{\ell } \) or gauge bosons into three-lepton final states (shown in Fig. 1b, c).

In each channel, inclusive and/or exclusive signal regions (SRs) are defined that require exactly two or three signal leptons, with vetos on any additional baseline leptons. In the 2\(\ell \) + 0 jets channel only, this additional baseline lepton veto is applied before considering overlap-removal. The leading and sub-leading leptons are required to have \(p_{\text {T}}\) \(>25\) GeV and 20 GeV respectively; however, in the 2\(\ell \) + jets and 3\(\ell \) channels, tighter lepton \(p_{\text {T}}\) requirements are applied to the sub-leading leptons.

6.1 Signal regions for 2\(\ell \) + 0 jets channel

In the 2\(\ell \) + 0 jets channel the leptons are required to be of opposite sign and events are separated into “same flavour” (SF) events (corresponding to dielectron, \(e^{+}e^{-}\), and dimuon, \(\mu ^{+}\mu ^{-}\), events) and “different flavour” (DF) events (electron–muon, \(e^{\pm }\mu ^{\mp }\)). This division is driven by the different background compositions in the two classes of events. All events used in the SRs are required to have a dilepton invariant mass \(m_{\ell \ell }>40\) GeV and not contain any b-tagged jets with \(p_{\text {T}}\) \(>20\) GeV or non-b-tagged jets with \(p_{\text {T}}\) \(>60\) GeV.

After this preselection, exclusive signal regions are used to maximize exclusion sensitivity across the simplified model parameter space for \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) and \(\tilde{\ell } \tilde{\ell } \) production. In the SF regions a two-dimensional binning in \(m_\mathrm {T2}\) and \(m_{\ell \ell }\) is used as high-\(m_{\ell \ell }\) requirements provide strong suppression of the Z + jets background, whereas in the DF regions, where the Z + jets background is negligible, a one-dimensional binning in \(m_\mathrm {T2}\) is sufficient. The stransverse mass \(m_\mathrm {T2}\) is defined as:

$$\begin{aligned} m_\mathrm {T2}= \min _{\mathbf {q}_\mathrm {T}}\left[ \max \left( m_\mathrm {T}(\mathbf {p}_\mathrm {T}^{\ell 1},\mathbf {q}_\mathrm {T}),m_\mathrm {T}(\mathbf {p}_\mathrm {T}^{\ell 2}, \mathbf {p}_\mathrm {T}^\mathrm {miss}-\mathbf {q}_\mathrm {T})\right) \right] , \end{aligned}$$

where \(\mathbf {p}_\mathrm {T}^{\ell 1}\) and \(\mathbf {p}_\mathrm {T}^{\ell 2}\) are the transverse momentum vectors of the two leptons, and \(\mathbf {q}_\mathrm {T}\) is a transverse momentum vector that minimizes the larger of \(m_\mathrm {T}(\mathbf {p}_\mathrm {T}^{\ell 1},\mathbf {q}_\mathrm {T}) \) and \(m_\mathrm {T}(\mathbf {p}_\mathrm {T}^{\ell 2},\mathbf {p}_\mathrm {T}^\mathrm {miss}-\mathbf {q}_\mathrm {T})\), where:

$$\begin{aligned} m_\mathrm {T}(\mathbf {p}_\mathrm {T},\mathbf {q}_\mathrm {T}) = \sqrt{2(p_{\text {T}} q_\mathrm {T}-\mathbf {p}_\mathrm {T}\cdot \mathbf {q}_\mathrm {T})}. \end{aligned}$$

For SM backgrounds of \(t\bar{t} \) and WW production in which the missing transverse momentum and the pair of selected leptons originate from two \(W\rightarrow \ell \nu \) decays and all momenta are accurately measured, the \(m_\mathrm {T2}\) value must be less than the W boson mass \(m_W\), and requiring the \(m_\mathrm {T2}\) value to significantly exceed \(m_W\) thus strongly suppresses these backgrounds while retaining high efficiency for many SUSY signals.

When producing model-dependent exclusion limits in the \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) simplified models, all signal regions are statistically combined, whereas only the same-flavour regions are used when probing \(\tilde{\ell } \tilde{\ell } \) production. In addition, a set of inclusive signal regions are also defined, and these are used to provide a more model-independent test for an excess of events. The definitions of both the exclusive and inclusive signal regions are provided in Table 1.

Table 1 The definitions of the exclusive and inclusive signal regions for the 2\(\ell \) + 0 jets channel. Relevant kinematic variables are defined in the text. The bins labelled “DF”or “SF” refer to signal regions with different-flavour or same-flavour lepton pair combinations, respectively

6.2 Signal regions for 2\(\ell \) + jets channel

In the 2\(\ell \) + jets channel, two inclusive signal regions differing only in the \(E_{\text {T}}^{\text {miss}}\) requirement, denoted SR2-int and SR2-high, are used to target intermediate and large mass splittings between the \(\displaystyle \tilde{\chi }^\pm _1/\displaystyle \tilde{\chi }^0_2 \) chargino/neutralino and the \(\displaystyle \tilde{\chi }^0_1 \) neutralino. In addition to the preselection used in the 2\(\ell \) + 0 jets channel, with the exception of the veto requirement on non-b-tagged jets, the sub-leading lepton is also required to have \(p_{\text {T}}\) \(>25\) GeV and events must have at least two jets, with the leading two jets satisfying \(p_{\text {T}}\) \(>30\) GeV. The b-jet veto is applied in the same way as in the 2\(\ell \) + 0 jets channel. Several kinematic requirements are applied to select two leptons consistent with an on-shell Z boson and two jets consistent with a W boson. A tight requirement of \(m_\mathrm {T2}>100\) GeV is used to suppress the \(t\bar{t}\) and WW backgrounds and \(E_{\text {T}}^{\text {miss}} >150~(250)\) GeV is required for SR2-int (SR2-high).

An additional region in the 2\(\ell \) + jets channel, denoted SR2-low, is optimized for the region of parameter space where the mass splitting between the \(\displaystyle \tilde{\chi }^\pm _1/\displaystyle \tilde{\chi }^0_2 \) and the \(\displaystyle \tilde{\chi }^0_1 \) is similar to the Z boson mass and the signal becomes kinematically similar to the diboson (VV) backgrounds. It is split into two orthogonal subregions for performing background estimation and validation, and these are merged when presenting the results in Sect. 9. SR2-low-2J requires exactly two jets, with \(p_{\text {T}} >30\) GeV, that are both assumed to originate from the W boson, while SR2-low-3J requires 3–5 signal jets (with the leading two jets satisfying \(p_{\text {T}}\) \(>30\) GeV) and assumes the \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) system recoils against initial-state-radiation (ISR) jet(s). In the latter case, the two jets originating from the W boson are selected to be those closest in \(\Delta \phi \) to the \(Z(\rightarrow \ell \ell ) + E_{\text {T}}^{\text {miss}} \) system. This is different from SR2-int and SR2-high, where the two jets with the highest \(p_{\text {T}}\) in the event are used to define the W boson candidate. The rest of the jets that are not associated with the W boson are collectively defined as ISR jets. All regions use variables, including angular distances and the W and Z boson transverse momenta, to select the signal topologies of interest. The definitions of the signal regions in the 2\(\ell \) + jets channel are summarized in Table 2.

Table 2 Signal region definitions used for the 2\(\ell \) + jets channel. Relevant kinematic variables are defined in the text. The symbols W and Z correspond to the reconstructed W and Z bosons in the final state. The Z boson is always reconstructed from the two leptons, whereas the W boson is reconstructed from the two jets leading in \(p_{\text {T}}\) for SR2-int, SR2-high and the 2-jets channel of SR2-low, whilst for the 3–5 jets channel of SR2-low it is reconstructed from the two jets which are closest in \(\Delta \phi \) to the Z (\(\rightarrow \ell \ell \)) + \(E_{\text {T}}^{\text {miss}}\) system. The \(\Delta R_{(jj)}\) and \(m_{jj}\) variables are calculated using the two jets assigned to the W boson. ISR refers to the vectorial sum of the initial-state-radiation jets in the event (i.e. those not used in the reconstruction of the W boson) and jet1 and jet3 refer to the leading and third leading jet respectively. The variable \(n_{\text {non-} b \text {-tagged jets}}\) refers to the number of jets with \(p_{\text {T}} >30\) GeV that do not satisfy the b-tagging criteria

6.3 Signal regions for 3\(\ell \) channel

The 3\(\ell \) channel targets \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production and uses kinematic variables such as \(E_{\text {T}}^{\text {miss}}\) and the transverse mass \(m_\mathrm {T}\), which were used in the Run 1 analysis [24]. Events are required to have exactly three signal leptons and no additional baseline leptons, as well as zero b-tagged jets with \(p_{\text {T}} >20\) GeV. In addition, two of the leptons must form an SFOS pair (as expected in \(\displaystyle \tilde{\chi }^0_2 \rightarrow \ell ^{+}\ell ^{-}\displaystyle \tilde{\chi }^0_1 \) decays). To resolve ambiguities when multiple SFOS pairings are present, the transverse mass is calculated using the unpaired lepton and \(\mathbf {p}_\mathrm {T}^\mathrm {miss}\) for each possible SFOS pairing, and the lepton that yields the minimum transverse mass is assigned to the W boson. This transverse mass value is denoted by \(m_{\mathrm{T}}^{\mathrm{min}}\), and is used alongside \(E_{\text {T}}^{\text {miss}}\), jet multiplicity (in the gauge-boson-mediated scenario) and other relevant kinematic variables to define exclusive signal regions that have sensitivity to \(\tilde{\ell } \)-mediated and gauge-boson-mediated decays. The definitions of these exclusive regions are provided in Table 3. The bins denoted “slep-a,b,c,d,e” target \(\tilde{\ell } \)-mediated decays and consequently have a veto on SFOS pairs with an invariant mass consistent with the Z boson (this suppresses the WZ background). The invariant mass of the SFOS pair, \(m_{\ell \ell }\), the magnitude of the missing transverse momentum, \(E_{\text {T}}^{\text {miss}}\), and the \(p_{\text {T}}\) value of the third leading lepton, \(p_{\text {T}}^{\ell _{3}}\), are used to define the SR bins. Conversely, the bins denoted “WZ-0Ja,b,c” and ”WZ-1Ja,b,c” target gauge-boson-mediated decays and thus require the SFOS pair to have an invariant mass consistent with an on-shell Z boson. The 0-jet and \(\ge \) 1-jet channels are considered separately and the regions are binned in \(m_\mathrm {T}^{\mathrm{min}}\) and \(E_{\text {T}}^{\text {miss}}\).

Table 3 Summary of the exclusive signal regions used in the 3\(\ell \) channel. Relevant kinematic variables are defined in the text. The bins labelled “slep” target slepton-mediated decays whereas those labelled “WZ” target gauge-boson-mediated decays. The variable \(n_{\text {non-} b \text {-tagged jets}}\) refers to the number of jets with \(p_{\text {T}} >20\) GeV that do not satisfy the b-tagging criteria. Values of \(p^{\ell _{3}}_\mathrm{T}\) refer to the \(p_{\text {T}}\) of the third leading lepton and \(p_{\text {T}} ^{\mathrm{jet1}}\) denotes the \(p_{\text {T}}\) of the leading jet

7 Background estimation and validation

The SM backgrounds can be classified into irreducible backgrounds with prompt leptons and genuine \(E_{\text {T}}^{\text {miss}}\) from neutrinos, and reducible backgrounds that contain one or more “fake” or non-prompt (FNP) leptons or where experimental effects (e.g., detector mismeasurement of jets or leptons or imperfect removal of object double-counting) lead to significant “fake” \(E_{\text {T}}^{\text {miss}}\). A summary of the background estimation techniques used in each channel is provided in Table 4. In the 2\(\ell \) + 0 jets and 3\(\ell \) channels only, the dominant backgrounds are estimated from MC simulation and normalized in dedicated control regions (CRs) that are included, together with the SRs, in simultaneous likelihood fits to data, as described further in Sect. 9. In addition, all channels employ validation regions (VRs) with kinematic requirements that are similar to the SRs but with smaller expected signal-to-background ratios, which are used to validate the background estimation methodology. In the 2\(\ell \) + jets channel, the MC modelling of diboson processes is studied in dedicated VRs and found to accurately reproduce data.

Table 4 Summary of the estimation methods used in each search channel. Backgrounds denoted CR have a dedicated control region that is included in a simultaneous likelihood fit to data to extract a data-driven normalization factor that is used to scale the MC prediction. The \(\gamma \) + jet template method is used in the 2\(\ell \) + jets channel to provide a data-driven estimate of the Z + jets background. Finally, MC stands for pure Monte Carlo estimation

For the 2\(\ell \) + 0 jets channel the dominant backgrounds are irreducible processes from SM diboson production (WW, WZ, and ZZ) and dileptonic \(t\bar{t}\) and Wt events. MC simulation is used to predict kinematic distributions for these backgrounds, but the \(t\bar{t}\) and diboson backgrounds are then normalized to data in dedicated control regions. For the diboson backgrounds, SF and DF events are treated separately and two control regions are defined. The first one (CR2-VV-SF) selects SFOS lepton pairs with an invariant mass consistent with the Z boson mass and has a tight requirement of \(m_\mathrm {T2}\) \(>130\) GeV to reduce the Z + jets contamination. This region is dominated by ZZ events, with subdominant contributions from WZ and WW events. The DF diboson control region (CR2-VV-DF) selects events with a different flavour opposite sign pair and further requires \(50<m_{\mathrm{T2}}<75\) GeV. This region is dominated by WW events, with a subdominant contribution from WZ events. The \(t\bar{t}\) control region (CR2-Top) uses DF events with at least one b-tagged jet to obtain a high-purity sample of \(t\bar{t}\) events. The control region definitions are summarized in Table 5. The Z + jets and Higgs boson contributions are expected to be small in the 2\(\ell \) + 0 jets channel and are estimated directly from MC simulation.

The three control regions are included in a simultaneous profile likelihood fit to the observed data which provides data-driven normalization factors for these backgrounds, as described in Sect. 9. The results are propagated to the signal regions, and to dedicated VRs that are defined in Table 5. The normalization factors returned by the fit for the \(t\bar{t}\), VV-DF and VV-SF backgrounds are \(0.95\pm 0.03\), \(1.06\pm 0.18\) and \(0.96\pm 0.11\), respectively. Figure 2a, b show the \(E_{\text {T}}^{\text {miss}}\) and \(m_{\mathrm {T2}}\) distributions, respectively, for data and the estimated backgrounds in VR2-VV-SF with these normalization factors applied.

Table 5 Control region and validation region definitions for the 2\(\ell \) + 0 jets channel. The DF and SF labels refer to different-flavour or same-flavour lepton pair combinations, respectively. The \(p_{\text {T}}\) thresholds placed on the requirements for b-tagged and non-b-tagged jets correspond to 20 GeV and 60 GeV, respectively

In the 2\(\ell \) + jets channel, the largest background contribution is also from SM diboson production. In addition, Z + jets events can enter the SRs due to fake \(E_{\text {T}}^{\text {miss}}\) from jet or lepton mismeasurements or genuine \(E_{\text {T}}^{\text {miss}}\) from neutrinos in semileptonic decays of b- or c-hadrons. These effects are difficult to model in MC simulation, so instead \(\gamma {+}\text {jets}\) events in data are used to extract the \(E_{\text {T}}^{\text {miss}}\) shape in \(Z{+}\text {jets}\) events, which have a similar topology and \(E_{\text {T}}^{\text {miss}}\) resolution. Similar methods have been employed in searches for SUSY in events with two leptons, jets, and large \(E_{\text {T}}^{\text {miss}}\) in ATLAS [90] and CMS [91, 92]. The \(E_{\text {T}}^{\text {miss}}\) shape is extracted from a data control sample of \(\gamma {+}\text {jets}\) events using a set of single-photon triggers and weighting each event by the trigger prescale factor. Corrections to account for differences in the \(\gamma \) and Z boson \(p_{\text {T}} \) distributions, as well as different momentum resolutions for electrons, muons and photons, are applied. Backgrounds of \(W\gamma \) and \(Z\gamma \) production, which contain a photon and genuine \(E_{\text {T}}^{\text {miss}}\) from neutrinos, are subtracted using MC samples that are normalized to data in a \(V\gamma \) control region containing a selected lepton and photon. For each SR separately, the \(E_{\text {T}}^{\text {miss}}\) shape is then normalized to data in a corresponding control region with \(E_{\text {T}}^{\text {miss}} <100\) GeV but all other requirements the same as in the SR. To model quantities that depend on the individual lepton momenta, an \(m_{\ell \ell }\) value is assigned to each \(\gamma {+}\text {jets}\) event by sampling from \(m_{\ell \ell }\) distributions (parameterized as functions of boson \(p_{\text {T}}\) and \(E_\mathrm {T,\parallel }^{\mathrm {miss}}\), the component of \(E_{\text {T}}^{\text {miss}}\) that is parallel to the boson’s transverse momentum vector) extracted from a \(Z{+}\text {jets}\) MC sample. With this \(m_{\ell \ell }\) value assigned to the photon, each \(\gamma {+}\text {jets}\) event is boosted to the rest frame of the hypothetical Z boson and the photon is split into two pseudo-leptons, assuming isotropic decays in the rest frame.

To validate the method, two sets of validation regions, “tight” and “loose”, are defined for each SR. The definitions of these regions are provided in Table 6. The selections in the “tight” regions are identical to the SR selections with the exception of the dijet mass \(m_{jj}\) requirement, which is replaced by the requirement (\(m_{jj}<60\) GeV or \(m_{jj}>100\) GeV) to suppress signal. These “tight” regions are used to verify the expectation from the \(\gamma {+}\text {jets}\) method that the residual \(Z{+}\text {jets}\) background after applying the SR selections is very small. The “loose” validation regions are instead defined by removing several other kinematic requirements used in the SR definition (\(m_\mathrm {T2}\), all \(\Delta \phi \) and \(\Delta R\) quantities, and the ratios of \(E_{\text {T}}^{\text {miss}}\) to W \(p_{\text {T}}\), Z \(p_{\text {T}}\), and \(p_{\text {T}}\) of the system of ISR jets). These samples have enough \(Z{+}\text {jets}\) events to perform comparisons of kinematic distributions, which validate the normalization and kinematic modelling of the \(Z{+}\text {jets}\) background. The data distributions are consistent with the expected background in these validation regions, as shown in Fig. 2c for the \(E_{\text {T}}^{\text {miss}}\) distribution in VR2-int-loose.

Once the signal region requirements are applied, the dominant background in the 2\(\ell \) + jets channel is the diboson background. This is taken from MC simulation, but the modelling is verified in two dedicated validation regions, one for signal regions with low mass-splitting (VR2-VV-low) and one for the intermediate and high-mass signal regions (VR2-VV-int). Requiring high \(E_{\text {T}}^{\text {miss}}\) and exactly one signal jet (compared to at least two jets in the SRs) suppresses the \(t\bar{t}\) background and enhances the purity of diboson events containing an ISR jet, in which each boson decays leptonically. Figure 2d shows the \(m_\mathrm {T2}\) distribution in VR2-VV-int for data and the expected backgrounds.

Table 6 Validation region definitions used for the 2\(\ell \) + jets channel. Symbols and abbreviations are analogous to those in Table 2

For both the 2\(\ell \) + 0 jets and 2\(\ell \) + jets channels, reducible backgrounds with one or two FNP leptons arise from multijet, W + jets and single-top-quark production events. For both analyses, the FNP lepton background is estimated from data using the matrix method (MM) [93]. This method uses two types of lepton identification criteria: “signal”, corresponding to leptons passing the full analysis selection, and “baseline”, corresponding to candidate electrons and muons as defined in Sect. 5. Probabilities for real leptons satisfying the baseline selection to also satisfy the signal selection are measured as a function of \(p_{\text {T}}\) and \(\eta \) in dedicated regions enriched in Z boson processes; similar probabilities for FNP leptons are measured in events dominated by leptons from heavy flavour decays and photon conversions. The method uses the number of observed events containing baseline–baseline, baseline–signal, signal–baseline and signal–signal lepton pairs in a given SR to extract data-driven estimates for the FNP lepton background in the CRs, VRs, and SRs for each analysis.

For the 3\(\ell \) channel, the irreducible background is dominated by SM WZ diboson processes. As in the 2\(\ell \) + 0 jets channel, the shape of this background is taken from MC simulation but normalized to data in a dedicated control region. The signal regions shown in Table 3 include a set of exclusive regions inclusive in jet multiplicity which target \(\tilde{\ell } \)-mediated decays, and a set of exclusive regions separated into 0-jet and \(\ge 1\) jet categories which target gauge-boson-mediated decays. To reflect this, three control regions are defined in order to extract the normalization of the WZ background: an inclusive region (CR3-WZ-inc) and two exclusive control regions (CR3-WZ-0j and CR3-WZ-1j). The results of the background estimations are validated in a set of dedicated validation regions. This includes two validation regions that are binned in jet multiplicity (VR3-Za-0J and VR3-Za-1J), and a set of inclusive validation regions (VR3-Za, VR3-Zb, VR3-offZa and VR3-offZb) targeting different regions of phase space considered in the analysis (i.e. within and outside the Z boson mass window, high and low \(E_{\text {T}}^{\text {miss}}\), and vetoing events with a trilepton invariant mass within the Z boson mass window). The definitions of the control and validation regions used in the 3\(\ell \) analysis are shown in Table 7. The normalization factors extracted from the fit for inclusive WZ events, WZ events with zero jets, and WZ events with at least one jet are \(0.97\pm 0.06\), \(1.08\pm 0.06\) and \(0.94\pm 0.07\), respectively. Other small background sources such as VVV, tV and Higgs boson production processes contributing to the irreducible background are taken from MC simulation.

Table 7 Control and validation region definitions used in the 3\(\ell \) channel. The \(m_{\mathrm {SFOS}}\) quantity is the mass of the same-flavour opposite-sign lepton pair and \(m_{{\ell \ell \ell }}\) is the trilepton invariant mass. Other symbols and abbreviations are analogous to those in Table 3

In addition to processes contributing to the reducible backgrounds in the 2\(\ell \) channels, the reducible backgrounds in the 3\(\ell \) channel also include \(Z+\)jets, \(t\bar{t}\), WW and in general any physics process leading to less than three prompt and isolated leptons. The reducible backgrounds in the 3\(\ell \) channel are estimated using a data-driven fake-factor (FF) method [94]. This method uses two sets of lepton identification criteria: the tight, or “ID”, criteria corresponding to the signal lepton selection used in the analysis and the orthogonal loose, or “anti-ID”, criteria which are designed to yield an enrichment in FNP leptons. In particular, for the anti-ID leptons the isolation and identification requirements applied to signal leptons are reversed. The Z + jets background events in the signal, control and validation regions are estimated using lepton \(p_{\text {T}}\)-dependent fake factors, defined as the ratio of the numbers of ID to anti-ID leptons in an FNP-dominated region. These fake factors are then applied to events passing selection requirements identical to those in the signal, control or validation region in question but where one of the ID leptons is replaced by an anti-ID lepton. The “top-like” contamination, which includes \(t\bar{t}\), Wt, and WW, is subtracted from these anti-ID regions along with contributions from any remaining MC processes, to avoid double-counting. The top-like reducible background contributions are then estimated differently: data-to-MC scale factors derived with DF opposite-sign events are applied to simulated SF events. Figure 2e, f show the \(E_{\text {T}}^{\text {miss}}\) distribution in VR3-Zb and the \(m_{T}^{\mathrm {min}}\) distribution in VR3-Za, respectively.

Fig. 2
figure 2

Distributions of \(E_{\text {T}}^{\text {miss}}\), \(m_{T}^{\mathrm {min}}\), and \(m_\mathrm {T2}\) for data and the estimated SM backgrounds in the (top) 2\(\ell \) + 0 jets channel, (middle) 2\(\ell \) + jets channel, and (bottom) 3\(\ell \) channel. Simulated signal models are overlaid for comparison. For the 2\(\ell \) + 0 jets (3\(\ell \)) channel, the normalization factors extracted from the corresponding CRs are used to rescale the \(t\bar{t}\) and VV (WZ) backgrounds. For the 2\(\ell \) + 0 jets channel the “top” background includes \(t\bar{t}\) and Wt, the “other” backgrounds include Higgs bosons, \(t\bar{t}V\) and VVV and the “reducible” category corresponds to the data-driven matrix method estimate. For the 2\(\ell \) + jets channel, the “top” background includes \(t\bar{t}\), Wt and \(t\bar{t}V\), the “other” backgrounds include Higgs bosons and VVV, the “reducible” category corresponds to the data-driven matrix method estimate, and the Z + jets contribution is evaluated with the data-driven \(\gamma \) + jet template method. For the 3\(\ell \) channel, the “reducible” category corresponds to the data-driven fake-factor estimate. The uncertainty band includes all systematic and statistical sources and the final bin in each histogram also contains the events in the overflow bin

8 Systematic uncertainties

Several sources of experimental and theoretical systematic uncertainty are considered in the SM background estimates and signal predictions. These uncertainties are included in the profile likelihood fit described in Sect. 9. The primary sources of systematic uncertainty are related to the jet energy scale (JES) and resolution (JER), theory uncertainties in the MC modelling, the reweighting procedure applied to simulation to match the distribution of the number of reconstructed vertices observed in data, the systematic uncertainty considered in the non-prompt background estimation and the theoretical cross-section uncertainties. The statistical uncertainty of the simulated event samples is taken into account as well. The effects of these uncertainties were evaluated for all signal samples and background processes. In the 2\(\ell \) + 0jets and 3\(\ell \) channels the normalizations of the MC predictions for the dominant background processes are extracted in dedicated control regions and the systematic uncertainties thus only affect the extrapolation to the signal regions in these cases.

The JES and JER uncertainties are derived as a function of jet \(p_{\text {T}}\) and \(\eta \), as well as of the pile-up conditions and the jet flavour composition of the selected jet sample. They are determined using a combination of data and simulation, through measurements of the jet response balance in dijet, Z + jets and \(\gamma +\)jets events [79, 80].

The systematic uncertainties related to the \(E_{\text {T}}^{\text {miss}}\) modelling in the simulation are estimated by propagating the uncertainties in the energy or momentum scale of each of the physics objects, as well as the uncertainties in the soft term’s resolution and scale [95].

The remaining detector-related systematic uncertainties, such as those in the lepton reconstruction efficiency, energy scale and energy resolution, in the b-tagging efficiency and in the modelling of the trigger [73, 75], are included but were found to be negligible in all channels.

The uncertainties coming from the modelling of diboson events in MC simulation are estimated by varying the renormalization, factorization and merging scales used to generate the samples, and the PDFs. In the 2\(\ell \) + 0 jets channel the impact of these uncertainties in the modelling of Z + jets events is also considered, as well as uncertainties in the modelling of \(t\bar{t}\) events due to parton shower simulation (by comparing samples generated with Powheg + Pythia to Powheg + Herwig++  [61]), ISR/FSR modelling (by comparing the predictions from an event sample generated by Powheg + Pythia with those from two samples where the radiation settings are varied), and the PDF set.

In the 2\(\ell \) + jets channel, uncertainties in the data-driven \(Z{+}\text {jets}\) estimate are calculated following the methodology used in Ref. [90]. An additional uncertainty is based on the difference between the expected background yield from the nominal method and a second method implemented as a cross-check, which extracts the dijet mass shape from data validation regions, normalizes the shape to the sideband regions of the SRs, and extrapolates the background into the W mass region.

For the matrix-method and fake-factor estimates of the FNP background, systematic uncertainties are assigned to account for differences in FNP lepton composition between the SR and the CR used to derive the fake rates and fake factors. An additional uncertainty is assigned to the MC subtraction of prompt leptons from this CR.

The exclusive SRs in the 2\(\ell \) + 0 jets and 3\(\ell \) channels are dominated by statistical uncertainties in the background estimates (which range from 10 to 70% in the higher mass regions in the 2\(\ell \) + 0 jets channel and from 5 to 30% in the 3\(\ell \) channel). The largest systematic uncertainties are those related to diboson modelling, the JES and JER uncertainties and those associated with the \(E_{\text {T}}^{\text {miss}}\) modelling. In the 2\(\ell \) + jets channel the dominant uncertainties are those associated with the data-driven estimate of the \(Z{+}\text {jets}\) background, which range from approximately 45 to 75%.

9 Results

The HistFitter framework [96] is used for the statistical interpretation of the results, with the CRs (for the 2\(\ell \) + 0 jets and 3\(\ell \) channels) and SRs both participating in a simultaneous likelihood fit. The likelihood is built as the product of a Poisson probability density function describing the observed number of events in each CR/SR and Gaussian distributions that constrain the nuisance parameters associated with the systematic uncertainties and whose widths correspond to the sizes of these uncertainties; Poisson distributions are used instead for MC statistical uncertainties. Correlations of a given nuisance parameter among the different background sources and the signal are taken into account when relevant.

In the 2\(\ell \) + 0 jets and 3\(\ell \) channels, a background-only fit which uses data in the CRs is performed to constrain the nuisance parameters of the likelihood function (these include the normalization factors for dominant backgrounds and the parameters associated with the systematic uncertainties). In all channels the background estimates are also used to evaluate how well the expected and observed numbers of events agree in the validation regions, and good agreement is found. In the 2\(\ell \) + 0 jets, 2\(\ell \) + jets, and 3\(\ell \) channels, the number of considered VRs is 3, 8, and 6, respectively, and the most significant deviations observed are 0.4\(\sigma \), 1.4\(\sigma \), and 0.8\(\sigma \), respectively. The precision of the expected background yields in the VRs is significantly better than in the corresponding SRs and the dominant sources of systematic uncertainty in the VRs and corresponding SRs are similar. For the 2\(\ell \) + 0 jets channel, the results for the exclusive signal regions are shown in Tables 8, 9 and 10 for SR2-SF-a to SR2-SF-g, SR2-SF-h to SR2-SF-m and SR2-DF-a to SR2-DF-d, respectively. The results for the 2\(\ell \) + 0 jets inclusive signal regions are shown in Table 11, while Table 12 summarizes the expected SM background and observed events in the 2\(\ell \) + jets SRs. For the 3\(\ell \) channel, the results are shown in Table 13 for SR3-WZ-0Ja to SR3-WZ-0Jc and SR3-WZ-1Ja to SR3-WZ-1Jc (which target gauge-boson-mediated decays) and Table 14 for SR3-slep-a to SR3-slep-e. A summary of the observed and expected yields in all of the signal regions considered in this paper is provided in Fig. 3. No significant excess above the SM expectation is observed in any SR.

Table 8 Background-only fit results for SR2-SF-a to SR2-SF-g in the 2\(\ell \) + 0 jets channel. All systematic and statistical uncertainties are included in the fit. The “other” backgrounds include all processes producing a Higgs boson, VVV or \(t\bar{t}V\). A “–” symbol indicates that the background contribution is negligible
Table 9 Background-only fit results for SR2-SF-h to SR2-SF-m in the 2\(\ell \) + 0 jets channel. All systematic and statistical uncertainties are included in the fit. The “other” backgrounds include all processes producing a Higgs boson, VVV and \(t\bar{t}V\). A “–” symbol indicates that the background contribution is negligible
Table 10 Background-only fit results for SR2-DF-a to SR2-DF-d in the 2\(\ell \) + 0 jets channel. All systematic and statistical uncertainties are included in the fit. The “other” backgrounds include all processes producing a Higgs boson, VVV or \(t\bar{t}V\). A “–” symbol indicates that the background contribution is negligible
Table 11 Background-only fit results for the inclusive signal regions in the 2\(\ell \) + 0 jets channel. All systematic and statistical uncertainties are included in the fit. The “other” backgrounds include all processes producing a Higgs boson, VVV and \(t\bar{t}V\). A “–” symbol indicates that the background contribution is negligible
Table 12 SM background results in the 2\(\ell \) + jets SRs. All systematic and statistical uncertainties are included. The “top” background includes all processes producing one or more top quarks and the “other” backgrounds include all processes producing a Higgs boson or VVV. A “–” symbol indicates that the background contribution is negligible
Table 13 Background-only fits for SR3-WZ-0Ja to SR3-WZ-0Jc and SR3-WZ-1Ja to SR3-WZ-1Jc in the 3\(\ell \) channel. All systematic and statistical uncertainties are included in the fit
Table 14 Background-only fits for SR3-slep-a to SR3-slep-e in the 3\(\ell \) channel. All systematic and statistical uncertainties are included in the fit
Fig. 3
figure 3

The observed and expected SM background yields in the signal regions considered in the 2\(\ell \) + 0 jets, 2\(\ell \) + jets and 3\(\ell \) channels. The statistical uncertainties in the background prediction are included in the uncertainty band, together with the experimental and theoretical uncertainties. The bottom plot shows the difference in standard deviations between the observed and expected yields. Here \(n_{\mathrm {obs}}\) and \(n_{\mathrm {exp}}\) are the observed data and expected background yields, respectively, \(\sigma _{\mathrm {tot}}=\sqrt{n_{\mathrm {bkg}}+\sigma ^{2}_{\mathrm {bkg}}}\), and \(\sigma _{\mathrm {exp}}\) is the total background uncertainty

Figure 4 shows a selection of kinematic distributions for data and the estimated SM backgrounds with their associated statistical and systematic uncertainties for the loosest inclusive SRs in the 2\(\ell \) + 0 jets channel: SR2-SF-loose and SR2-DF-100. The normalization factors extracted from the corresponding CRs are propagated to the VV and \(t\bar{t}\) contributions. Figure 5 shows the \(E_{\text {T}}^{\text {miss}}\) distribution in SR2-int and SR2-high, which differ only in the \(E_{\text {T}}^{\text {miss}}\) requirement, and in SR2-low of the 2\(\ell \) + jets channel. In the 3\(\ell \) channel, distributions of \(E_{\text {T}}^{\text {miss}}\) and the third leading lepton \(p_{\text {T}}\) are shown for the SR bins targeting \(\tilde{\ell } \)-mediated decays in Fig. 6 while Fig. 7 shows distributions of \(E_{\text {T}}^{\text {miss}}\) in the bins targeting gauge-boson-mediated decays. Good agreement between data and expectations is observed in all distributions within the uncertainties.

Fig. 4
figure 4

The a \(m_{\ell \ell }\) and b \(m_\mathrm {T2}\) distributions for data and the estimated SM backgrounds in the 2\(\ell \) + 0 jets channel for SR2-SF-loose and c the \(m_\mathrm {T2}\) distribution for the SR2-DF-100 selection. The normalization factors extracted from the corresponding CRs are used to rescale the \(t\bar{t}\) and VV contributions. The “top” background includes \(t\bar{t}\) and Wt, and the “other” backgrounds include Higgs bosons, \(t\bar{t}V\) and VVV. The “reducible” category corresponds to the data-driven matrix method’s estimate. The uncertainty bands include all systematic and statistical contributions. Simulated signal models for sleptons (a, b) or charginos (c) pair production are overlayed for comparison. The final bin in each histogram also contains the events in the overflow bin. The vertical red arrows indicate bins where the ratio of data to SM background, minus the uncertainty on this quantity, is larger than the y-axis maximum

Fig. 5
figure 5

Distributions of \(E_{\text {T}}^{\text {miss}}\) for data and the expected SM backgrounds in the 2\(\ell \) + jets channel for a SR2-int/high and b SR2-low, without the final \(E_{\text {T}}^{\text {miss}} \) requirement applied. The “top” background includes \(t\bar{t}\) , Wt and \(t\bar{t}V\), and the “other” backgrounds include Higgs bosons and VVV. The Z + jets contribution is evaluated using the data-driven \(\gamma \) + jet template method and the “reducible” category corresponds to the data-driven matrix method’s estimate. The uncertainty bands include all systematic and statistical contributions. Simulated signal models for charginos/neutralinos production are overlayed for comparison. The final bin in each histogram also contains the events in the overflow bin. The vertical red arrows indicate bins where the ratio of data to SM background, minus the uncertainty on this quantity, is larger than the y-axis maximum

Fig. 6
figure 6

Distributions of \(E_{\text {T}}^{\text {miss}}\) for data and the estimated SM backgrounds in the 3\(\ell \) channel for a SR3-slep-a and b SR3-slep-b and c distributions of the third leading lepton \(p_{\text {T}}\) in SR3-slep-c,d,e. The normalization factors extracted from the corresponding CRs are used to rescale the WZ background. The “reducible” category corresponds to the data-driven fake-factor estimate. The uncertainty bands include all systematic and statistical contributions. Simulated signal models for charginos/neutralinos production are overlayed for comparison. The final bin in each histogram also contains the events in the overflow bin

Fig. 7
figure 7

Distributions of \(E_{\text {T}}^{\text {miss}}\) for data and the estimated SM backgrounds in the 3\(\ell \) channel for a SR3-WZ-0Ja,b,c, b SR3-WZ-1Ja, c SR3-WZ-1Jb and d SR3-WZ-1Jc. The normalization factors extracted from the corresponding CRs are used to rescale the 0-jet and \(\ge \) 1-jet WZ background components. The “reducible” category corresponds to the data-driven fake-factor estimate. The uncertainty bands include all systematic and statistical contributions. Simulated signal models for charginos/neutralinos production are overlayed for comparison. The final bin in each histogram also contains the events in the overflow bin. The vertical red arrows indicate bins where the ratio of data to SM background, minus the uncertainty on this quantity, is larger than the y-axis maximum

In the absence of any significant excess, two types of exclusion limits for new physics scenarios are calculated using the CL\(_\mathrm {s}\) prescription [97]. First, exclusion limits are set on the masses of the charginos, neutralinos, and sleptons for the simplified models in Fig. 1, as shown in Fig. 8. Figure 8a, b show the limits in the 2\(\ell \) + 0 jets channel in the models of direct chargino pair production with decays via sleptons and direct slepton pair production, respectively. Limits are calculated by statistically combining the mutually orthogonal exclusive SRs. For the chargino pair model, all SF and DF bins are used and chargino masses up to 750 GeV are excluded at 95% confidence level for a massless \(\displaystyle \tilde{\chi }^0_1 \) neutralino. In the region with large chargino mass, the observed limit is weaker than expected because the data exceeds the expected backgrounds in SF-e, SF-f, and SF-g. For the slepton pair model, which assumes mass-degenerate \(\tilde{\ell } _{L }\) and \(\tilde{\ell } _{R }\) states (where \(\tilde{\ell } =\tilde{e},\tilde{\mu },\tilde{\tau }\)), only SF bins are used and slepton masses up to 500 GeV are excluded for a massless \(\displaystyle \tilde{\chi }^0_1 \) neutralino.

Figure 8c shows the limits from the 3\(\ell \) channel in the model of mass-degenerate chargino–neutralino pair production with decays via sleptons, calculated using a statistical combination of the five SR3-slep regions. In this model, chargino and neutralino masses up to 1100 GeV are excluded for \(\displaystyle \tilde{\chi }^0_1 \) neutralino masses less than 550 GeV.

Figure 8d shows the limits from the 3\(\ell \) and 2\(\ell \) + jets channels in the model of mass-degenerate chargino–neutralino pair production with decays via W / Z bosons. The 3\(\ell \) limits are calculated using a statistical combination of the six SR3-WZ regions. Since the SRs in the 2\(\ell \) + jets channel are not mutually exclusive, the observed CL\(_\mathrm {s}\) value is taken from the signal region with the best expected CL\(_\mathrm {s}\) value. The 3\(\ell \) and 2\(\ell \) + jets channels are then combined, using the channel with the best expected CL\(_\mathrm {s}\) value for each point in the model parameter space. In this model, chargino and neutralino masses up to 580 GeV are excluded for a massless \(\displaystyle \tilde{\chi }^0_1 \) neutralino.

Fig. 8
figure 8

Observed and expected exclusion limits on SUSY simplified models for a chargino-pair production, b slepton-pair production, c chargino–neutralino production with slepton-mediated decays, and d chargino–neutralino production with decays via W / Z bosons. The observed (solid thick red line) and expected (thin dashed blue line) exclusion contours are indicated. The shaded band corresponds to the ±1\(\sigma \) variations in the expected limit, including all uncertainties except theoretical uncertainties in the signal cross-section. The dotted lines around the observed limit illustrate the change in the observed limit as the nominal signal cross-section is scaled up and down by the theoretical uncertainty. All limits are computed at 95% confidence level. The observed limits obtained from ATLAS in Run 1 are also shown [23]

Second, model-independent upper limits are set on the visible signal cross-section (\(\langle \epsilon \mathrm{\sigma }\rangle _{\mathrm{obs}}^{95}\)) as well as on the observed (\(S^{95}_{\mathrm{obs}}\)) and expected (\(S^{95}_{\mathrm{exp}}\)) number of events from processes beyond-the-SM in the signal regions considered in this analysis. The p-value and the corresponding significance for the background-only hypothesis are also evaluated. For the 2\(\ell \) + 0 jets channel the inclusive signal regions defined in Table 1 are considered whereas for the 3\(\ell \) channel the calculation is performed for each bin separately. All the limits are at 95% confidence level. The results can be found in Table 15.

Table 15 Summary of results and model-independent limits in the inclusive 2\(\ell \) + 0 jets, 2\(\ell \) + jets, and 3\(\ell \) SRs. The observed (\(N_{\mathrm{obs}}\)) and expected background (\(N_{\mathrm{exp}}\)) yields in the signal regions are indicated. Signal model-independent upper limits at 95% confidence level on the visible signal cross-section (\(\langle \epsilon \mathrm{\sigma }\rangle _{\mathrm{obs}}^{95}\)), and the observed and expected upper limit on the number of BSM events (\(S^{95}_{\mathrm{obs}}\) and \(S^{95}_{\mathrm{exp}}\), respectively) are also shown. The ±1\(\sigma \) variations of the expected limit originate from the statistical and systematic uncertainties in the background prediction. The last two columns show the p value and the corresponding significance for the background-only hypothesis. For SRs where the data yield is smaller than expected, the p value is truncated at 0.5 and the significance is set to 0

10 Conclusion

Searches for the electroweak production of neutralinos, charginos and sleptons decaying into final states with exactly two or three electrons or muons and missing transverse momentum are performed using 36.1 fb\(^{-1}\) of \(\sqrt{s}=13\) TeV proton–proton collisions recorded by the ATLAS detector at the Large Hadron Collider. Three different search channels are considered. The 2\(\ell \) + 0 jets channel targets direct \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) production where each \(\displaystyle \tilde{\chi }^\pm _1 \) decays via an intermediate \(\tilde{\ell } \), and direct \(\tilde{\ell } \tilde{\ell } \) production. The 2\(\ell \) + jets channel targets associated \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production where each sparticle decays via an SM gauge boson giving a final state with two leptons consistent with a Z boson and two jets consistent with a W boson. Finally, the 3\(\ell \) channel targets associated \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with decays via either intermediate \(\tilde{\ell } \) or gauge bosons.

No significant excess above the SM expectation is observed in any of the signal regions considered across the three channels, and the results are used to calculate exclusion limits at 95% confidence level in several simplified model scenarios. For associated \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with \(\tilde{\ell } \)-mediated decays, masses up to 1100 GeV are excluded for \(\displaystyle \tilde{\chi }^0_1 \) neutralino masses less than 550 GeV. Both the 2\(\ell \) + jets and 3\(\ell \) channels place exclusion limits on associated \(\displaystyle \tilde{\chi }^\pm _1 \displaystyle \tilde{\chi }^0_2 \) production with gauge-boson-mediated decays. For a massless \(\displaystyle \tilde{\chi }^0_1 \) neutralino, \(\displaystyle \tilde{\chi }^\pm _1 \)/\(\displaystyle \tilde{\chi }^0_2 \) masses up to approximately 580 GeV are excluded. In the 2\(\ell \) + 0 jets channel, for direct \(\displaystyle \tilde{\chi }^+_1 \displaystyle \tilde{\chi }^-_1 \) production with decays via an intermediate \(\tilde{\ell } \), masses up to 750 GeV are excluded for a massless \(\displaystyle \tilde{\chi }^0_1 \) neutralino. For \(\tilde{\ell } \tilde{\ell } \) production, masses up to 500 GeV are excluded for a massless \(\displaystyle \tilde{\chi }^0_1 \) neutralino, assuming mass-degenerate \(\tilde{\ell } _{L }\) and \(\tilde{\ell } _{R }\) (where \(\tilde{\ell } =\tilde{e},\tilde{\mu },\tilde{\tau }\)). These results significantly improve upon previous exclusion limits based on Run 1 data.