# Rare decays of b hadrons

Post-publication activity

Curator: Patrick Koppenburg

# Introduction

Physics studies fundamental interactions and their effects. At the most basic level, particle physics aims to describe the fundamental blocks of matter and their interactions. A century of research has led to the Standard Model of Particle Physics. It relies on firm theoretical grounds unifying quantum mechanics, special relativity and field theory, and is successful at describing all phenomena measured in particle interactions, whether at low or very high energies. Yet, it has an empirical character with many parameters that need to be determined by experiment, and it is incomplete as it does not account for gravity, does not explain the baryon asymmetry in the Universe and does not provide a candidate for dark matter. The Standard Model is therefore believed to be an approximation of a more complete theory that is currently unknown (just like Newton’s laws are an approximation of General Relativity). The primary goal of research in particle physics is to find this more complete theory. In the following “New Physics” is used as a catch-all for any contribution, usually associated with a new particle, not included in the Standard Model.

Rare decays of hadrons containing a heavy “beauty” (also called “bottom”) quark, denoted $b$ hadrons, provide a powerful way of exploring yet unknown physics. Small contributions from virtual new particles that are too heavy to be produced at colliders may lead to measurable deviations from the expected properties in the Standard Model. See the box for an example of a virtual particle.

 Virtual Particles Figure 1: Feynman diagram of the beta decay of the neutron. Fundamental particle interactions are mediated by force carriers: The photon for the electromagnetic interaction, the gluon for the strong interaction and the $W^\pm$ and $Z$ bosons for the weak interaction. In the nuclear beta decay a neutron decays to a proton, an electron and a neutrino (Figure 1). This weak interaction is mediated by a virtual $W^-$ boson, sometimes denoted $W^{*-}$. Here virtual means that the process violates energy and momentum conservation for a very short time, as allowed by Heisenberg’s Uncertainty Principle. The mass of the $W^-$ boson is about 80 GeV/$c^2$, while the neutron-proton mass difference is considerably less: 1.3 MeV/$c^2$. It is said that the $W^-$ boson is “off-shell”.

The study of rare decays is an active field within flavour physics, the field of research studying transitions of quarks or leptons from one species (or “flavour”) to another. This article focuses on rare decays of hadrons containing $b$ quarks. The most prevalently produced $b$ hadrons are the $B^{0}$ meson composed of a $\overline b$ anti-quark and a $d$ quark, the $B^{+}$ ($\overline b$ $u$) and $B_{s}^{0}$ ($\overline b$ $s$) mesons, as well as the $\Lambda_{b}^{0}$ ($bud$) baryon. Their masses are in the range 5 to 6 GeV/$c^{2}$, which is about six times that of the proton, but well below the mass of the $W$ boson of 80 GeV/$c^{2}$. The corresponding antiparticles $\overline B^0$, $B^{-}$, $\overline B^0_s$ and $\overline \Lambda_b^0$ are obtained by replacing all quarks by anti-quarks and vice-versa. The study of CP violation involves investigation of differences in the behaviour of particles and antiparticles, and is the subject of a dedicated review. The inclusion of charge conjugate processes is implied throughout this document.

Hadrons with $b$ quarks decay most of the time via a $b \to cW^{-*}$ transition, where the asterisk indicates the $W$ boson is virtual. The transitions $b \to uW^{-*}$ also occur, but are less likely. These two transitions are called “tree decays” as the process involves a single mediator, the $W^{-}$ boson. An example of a tree $d \to uW^{-*}$ transition is shown in Figure 1.

This article describes transitions involving more complicated processes. The quark transitions $b \to d$ and $b \to s$ do not happen at tree level in the Standard Model as the $Z$ boson does not couple to quarks of different flavour.

Figure 2: Feynman diagram of the dominating Standard Model contribution to the decay $B_s^0 \to \mu^+ \mu^-$.

Processes like the rare decay $B_{s}^{0} \to \mu^{+}\mu^{-}$ proceed via loops as shown in Figure 2 (sometimes referred to as penguins, a word coined by John Ellis - see Izlar, Kelly (2013)). Such processes are rare as the probability of a transition rapidly decreases with the number of electroweak vertices: two in the case of a tree decay, three or four for a loop. Also, the heavier the virtual particles involved, the more suppressed the decay. In the following, decays with probabilities in the range $10^{-4}$ to $10^{-10}$ are discussed. Some of the most interesting decays are described in the Section Key players. They all have in common the following features:

1. Suppressed decay amplitudes, as predicted by the Standard Model, which may potentially be of the same size as New Physics amplitudes.

2. Sufficiently precise Standard Model predictions for their decay rate, or any other observable of interest.

3. Experimental precision which potentially allows disentangling the Standard Model contribution from other contributions.

A historical example, the decay $K_{\rm L}^{0} \to \mu^{+}\mu^{-}$, is described in this box.

 A Historical Example: $K_{\rm L}^{0} \to \mu^{+}\mu^{-}$ Figure 3: Feynman diagrams of the two loops contributing to the decay $K^0_L \to \mu^+ \mu^-$. The $K_{\rm L}^{0} \to \mu^{+}\mu^{-}$ decay is forbidden at tree level and had an important role in opening the field of rare decays in the 1960s. Its unexpected non-observation allowed the prediction of the then unknown charm quark by Glashow, Iliopoulos and Maiani (GIM mechanism) in 1970. The idea of the GIM mechanism is that this decay only occurs via loops, one involving the $u$ quark and the other the $c$ quark (Figure 3). The amplitudes of the two loops are of opposite sign, causing complete cancellation in the limit of equal up and charm quark masses. The non-observation of this decay could be explained by adding a new particle to the theory, the $c$ quark, which was eventually discovered in 1974. This is an example of an observation of New Physics mediated by a new virtual particle. The $K_{\rm L}^{0} \to \mu^{+}\mu^{-}$ branching fraction is now measured to be $(6.84 \pm 0.11)×10^{-9}$. Nowadays there is a great deal of interest in the $B_{s}^{0}$-counterpart of this decay: $B_{s}^{0} \to \mu^{+}\mu^{-}$, discussed in Section The decays $B_{s}^{0} \to \mu^{+}\mu^{-}$ and $B^{0} \to \mu^{+}\mu^{-}$. A special category of rare decays is those forbidden in the Standard Model, like lepton- or baryon-number violating decays. In their case the Standard Model prediction is effectively zero, but other models may predict non-zero rates. Any observation would be a sign of New Physics. Given the impressive success of the Standard Model, New Physics amplitudes are known to be small. Therefore, any search for potentially observable deviations from Standard Model predictions will be facilitated if the Standard Model amplitudes are also suppressed, which is the case in rare decays. Studies of such decay modes require large data samples to produce enough of the relevant particles. This is referred to as the intensity frontier, as opposed to the energy frontier aiming at producing and studying heavy particles on-shell. The main experiments are briefly described in Experiments.

## Key players

The description of the process of the formation of hadrons out of quarks and gluons, called hadronisation, is difficult and leads to large theoretical uncertainties. Theoretically favoured are thus decays to purely leptonic final states, such as the decay $B_{s}^{0} \to \mu^{+}\mu^{-}$ (Section The decays $B_{s}^{0} \to \mu^{+}\mu^{-}$ and $B^{0} \to \mu^{+}\mu^{-}$). There is also interest in the charged counterparts of these decays, notably $B^{+} \to \ell^{+}\nu$, where $\ell^{+}$ is any lepton, $e^{+}$, $\mu^{+}$, $\tau^{+}$ (Section Other leptonic decays). They are generated by a charged $W^{+}$ current, but have interesting theoretical connections to decays that are induced by loops.

Radiative $b \to s\gamma$ and electroweak penguin $b \to s\ell^{+}\ell^{-}$, and $b\to s \nu \overline \nu$ decays are also of great interest. These are quark-level transitions, which cannot be measured directly as the quarks form immediately hadrons. In experiments exclusive decays are detected, and the inclusive decay is the sum of all contributions. For instance the decay $b \to s\gamma$ was first observed by its exclusive contribution $B \to K^{*}\gamma$ (Section The decay $b \to s\gamma$).

Exclusive decays are experimentally favoured, but come with larger theoreticl uncertainties. The decay $B^{0} \to K^{*0}\ell^{+}\ell^{-}$ is a well-known example. While the decay rate is hard to compute precisely, observables describing angular distributions of the decay products can be more precisely predicted (Section The decays $b \to s\ell^{+}\ell^{-}$).

Among forbidden decays, lepton flavour violating decays of $b$ and $c$ hadrons, like $B_{(s)}^{0} \to e^{\pm}\mu^{\mp}$ or $B^{+} \to K^{+}e^{\pm}\mu^{\mp}$, or of leptons, like $\mu^{+} \to e^{+}\gamma$, $\tau^{+} \to \mu^{+}\mu^{+}\mu^{-}$ or $\tau^{+} \to \mu^{+}\gamma$ are actively being searched for. Rare charm hadron decays are also being studied, but the experimental sensitivity is presently not sufficient to reach the very low rates predicted in the Standard Model. Finally, research in rare kaon decays is ongoing, though mainly at different experiments than those studying rare charm or beauty hadron decays. These channels are not further discussed in this article.

# Theory

This section describes briefly the theoretical framework that is commonly used to study rare decays. Its main goal is to define some vocabulary which is commonly used in publications on rare decays. It may be skipped by readers mostly interested in experimental results.

The common theoretical approach to rare decays is model independent. In flavour physics and in particular in rare decays studies, the underlying physics is parametrised in terms of an effective Hamiltonian describing the transition amplitude of an initial state $I$ to a final state $F$ following Fermi’s Golden Rule (Dirac, Paul A.M. (1927), Fermi, Enrico (1949)). The partial decay width is written as
$\Gamma(I \to F) = \frac{2\pi}{\hbar}\left|\left<F|{\cal H}_\text{eff}|I\right>\right|^2\times\text{phase-space}.$
Experimentally, the branching fraction ${\cal B}$ is measured rather than the decay width. They are related by
${{\cal B}(I \to F) = \frac{\Gamma(I \to F)}{\Gamma(I,\text{total})}},\quad \Gamma(I,\text{total}) = \frac{1}{\tau_I},$
where $\tau_{I}$ is the lifetime of particle $I$ (and natural units with $c = \bar{h}= 1$ are used).

The Standard Model prediction for any particular transition can be inferred from a calculation of the effective Hamiltonian derived from the Standard Model Lagrangian. This Hamiltonian is parametrised in terms of a sum of operators ${\cal O}_i$ and Wilson coefficients $C_{i}$
${\cal H}_\text{eff} = -\frac{G_F}{\sqrt{2}}\sum\limits_i V_\text{CKM}C_i{\cal O}_i,$
where $V_{\rm CKM}$ stands for some product of Cabibbo-Kobayashi-Maskawa matrix elements that describe the probability of given transitions between different quark flavours. The operators encompass the information about the Lorentz structure and the Wilson coefficients encode the effects of higher energy scales. In the case of the Standard Model these are the effects of the $W$, $Z$ bosons and top quarks, which are effectively removed from the theory and incorporated in the coefficients.

Any $I \to F$ decay can be described by this effective Hamiltonian, usually with many terms being irrelevant, with $<F|{\cal O}_i|I>=0$. Thus studying a set of decays will give various constraints on the effective Hamiltonian, permitting global fits to Wilson coefficients. This is briefly discussed in Section Wilson coefficient fits. This procedure does not simplify the computation of the amplitudes, as the matrix elements $\left<F|{\cal O}_i|I\right>$ contain the most difficult parts of the calculation. It provides however a common language that is not dependent on the considered New Physics model.

In particular, calculations of decay rates of exclusive decays with hadrons in the final state ($B^{0} \to K^{*0}\mu^{+}\mu^{-}$ for example) are difficult. Our lack of knowledge needs to be parametrised in heuristic quantities that describe the hadronisation, like form-factors and decay constants. They can be calculated in lattice QCD and, in many cases, can also be determined experimentally. Their discussion is beyond the scope of this document.

Figure 4: Effective process $n \to p e \overline \nu$ for the nuclear beta decay.

The operators ${\cal O}_{1,2}$ describe the $V - A$ structure of weak decays and first-order corrections. For example, the $W$ boson having been absorbed into the $C_{1}$ and $C_{2}$ coefficients, the nuclear beta decay $n \to p e \overline \nu$ is represented by a four-fermion operator as shown in Figure 4. This is how Enrico Fermi first described the process in 1934 (Fermi, Enrico (1934)). The operators ${\cal O}_{3-6,8}$ describe loops involving gluons. They are not of interest for this article.

Of most interest in rare decays are the suppressed operators ${\cal O}_{7}$, ${\cal O}_{9}$ and ${\cal O}_{10}$. The operator ${\cal O}_{7}$ dominates the radiative decay $b \to s\gamma$ giving a decay width
$\Gamma(b \to s \gamma) = \frac{G_F^2\alpha_{\rm EM}m_b^5}{32\pi^4}|V_{ts}^\ast V_{tb}|^2 |C_{7}|^2 + \text{corrections},$
where $\alpha_{\rm EM}$ is the electromagnetic constant, $m_{b}$ the $b$ quark mass, and $V_{ij}$ are parameters of the CKM matrix. A measurement of the $b \to s\gamma$ branching fraction thus provides a direct constraint on $C_{7}$.

The operators ${\cal O}_{9}$ and ${\cal O}_{10}$ dominate $b \to qll$ transitions, with ${\cal O}_{9}$ corresponding to a vector and ${\cal O}_{10}$ to an axial current. Finally the decays $B \to \ell^{+}\ell^{-}$ are, in the Standard Model, dominated by operator ${\cal O}_{10}$, with a decay rate which can be written as
${\Gamma}(B \to \ell^+\ell^-) = \frac{G_F^2M_W^2m_B^3f_B^2}{8\pi^5}|V_{tb}^\ast V_{tq}|^2\frac{4m_\ell^2}{m_B^2}\sqrt{1-\frac{4m_\ell^2}{m_B^2}} |C_{10}|^2 + \text{corrections},$
for $B = B^{0}, B_{s}^{0}$ (and $q = d, s$) with $f_{B}$ the $B$ decay constant and $V_{ij}$ CKM matrix elements. The $B_{s}^{0} \to \mu^{+}\mu^{-}$ branching fraction thus provides a constraint on $C_{10}$. Other operators, labelled ${\cal O}_\text{P}$ and ${\cal O}_\text{S}$, which are negligible in the Standard Model, could also contribute to this decay.

If the $V - A$ structure of weak interactions is not assumed, new primed operators with flipped helicities appear, most notably ${\cal O}_7'$, and its Wilson coefficient $C_{7}'$ which generate a right-handed photon in $b \to s\gamma$ decays.

For a comprehensive review of the effective Hamiltonian used to study rare decays, see Buchalla, Gerhard; Buras, Andrzej J and Lautenbacher , Markus E (1996) and Buras, Andrzej J and Fleischer, R (1998). A more pedagogical introduction can be found in Chapter 20 of Branco, G C; Lavoura, L and Silva, J P (1999). Standard Model expectations of Wilson coefficients and operators have been calculated at next-to-leading order or better.

There exist many theories beyond the Standard Model providing predictions for Wilson coefficients. Often these values depend on unknown parameters of the theory, as masses of yet unseen new particles. This is particularly the case for supersymmetry, a well-motivated extension of the Standard Model.

# Experiments

There are essentially two families of experiments studying $b$ hadrons:

$B$ factories

are experiments based at $e^{+}e^{-}$ colliders operating most of the time at a collision energy near $10.6$ GeV, corresponding to the mass of the $\Upsilon(4S)$ resonance, the lightest meson decaying to two $B$ mesons. ARGUS at DESY (Hamburg, Germany), CLEO at Cornell (Ithaka, USA), BaBar at SLAC (Stanford, USA), Belle and its successor Belle II at KEK (Tsukuba, Japan) are notable examples of such experiments.

operate at a $pp$ or $p \overline p$ collider with centre-of-mass energies of several TeV. CDF and D0 were located at Fermilab’s Tevatron (Batavia, USA). ATLAS, CMS and LHCb presently operate at CERN’s LHC (Geneva, Switzerland).

Hadron colliders have the advantage of much larger production rates: the production cross-section of $b$ quarks is a factor 500000 larger at the LHC than at a $B$ factory. The advantage of the $B$ factories is cleanliness. Collision events with a produced $\Upsilon(4S)$ resonance are easy to identify, allowing for high efficiencies and low background levels. In such events only two $B$ mesons are produced, making the reconstruction of the full collision event possible. In a typical LHC collision only one in hundred collisions produce a $b$ quark pair and the two $b$ hadrons are surrounded by hundreds of other particles. Efficient background fighting techniques are thus essential but have a cost in terms of efficiencies. Excellent background rejection is achieved by precise vertexing and exploiting the $b$-hadron flight distance. The physics programme is also somewhat different: $B$ factories have only access to $B^{0}$ and $B^{+}$ mesons (and $B_{s}^{0}$ mesons when operating at the $\Upsilon(5S)$ resonance), while hadronic collisions produce all $b$ hadrons, including the $B_{s}^{0}$ meson, the $B_{c}^{+}$ meson (composed of a $c$ quark and an $\overline b$ anti-quark), as well as the $\Lambda_{b}^{0}$ and $\Xi_{b}$ baryons.

## Short history of $b$-quark physics

After the discovery of the $b$ quark at Fermilab through the observation of mesons formed by a $b$ and an $\overline b$ anti-quark in 1977 and of the $B$ meson at Cornell , searches for rare decays of $b$ hadrons rapidly took pace. The first limit on the decay $B^{0} \to \mu^{+}\mu^{-}$ was set by the CLEO collaboration in 1985, the start of a long quest during which the sensitivity was improved by six orders of magnitude, as illustrated in Figure 5 (See Section The decays $B_{s}^{0} \to \mu^{+}\mu^{-}$ and $B^{0} \to \mu^{+}\mu^{-}$).

Figure 5: History of $B_s^0 \to \mu^+ \mu^-$ and $B^0 \to \mu^+ \mu^-$ limits. Figure courtesy of F. Dettori.

The CLEO and ARGUS experiments were located at $e^{+}e^{-}$ colliders operating at the $\Upsilon(4S)$ resonance. The same concept was employed and improved by the BaBar and Belle experiments in the first decade of the 21st century. If Cornell was initially able to produce few tens of $B\bar{B}$ pairs per day, the PEP-II and KEKB accelerators at SLAC and KEK achieved a daily rate of one million $B\bar{B}$ pairs. In the meantime, experiments at CERN’s LEP $e^{+}e^{-}$ collider and at Fermilab’s Tevatron proton-anti-proton collider used higher energy collisions to produce and study all $b$-hadron species. All the above-mentioned experiments have terminated their programme but most still exploit their data set to produce new results. Belle and the associated accelerator complex has been undergoing a major upgrade and has recently come back as the Belle II experiment.

## Present

Until Belle II reaches its design luminosity, the leadership in $b$ physics is taken by the LHCb experiment. Important contributions also come from the ATLAS and CMS experiments. These three experiments exploit LHC data collected in proton-proton collisions at centre-of-mass energies of 7 (2010–11), 8 (2012) and 13 TeV (2015–18).

ATLAS and CMS are detectors optimised for high-energy processes, such as the discovery of the Higgs boson. They also perform $b$-physics research, most effectively in decays of $b$ hadrons to pairs of muons. This distinct signature allows for efficient selection of these decays during the online filtering phase where a large reduction of the recorded collision rate is required, which is difficult to achieve for decays to electrons or hadrons.

The LHCb experiment on the contrary is optimised for the physics of hadrons containing $b$ and $c$ quarks. It is a single-arm forward detector designed to exploit the relatively large $b\bar{b}$ production in LHC proton-proton collisions in the forward direction. It includes a tracking system surrounding a dipole magnet whose polarity can be reversed, silicon sensors coming as close as 8 mm to the proton beam and a particle identification system based on Cherenkov radiation. The high-resolution silicon system exploits the typical $b$-hadron flight distances of a few millimetres before their decay to select them. This sets requirements on the number of $pp$ collisions per bunch crossing, defining an upper limit to the total collision rate at which the experiment can operate. Consequently, the luminosity is decreased compared to ATLAS and CMS.

# Main experimental results

This section presents the main recent experimental results and their interpretation. It starts with a more historical section on the decay $b \to s\gamma$ which had (and still has) an important role in the development of the field.

## The decay $b \to s\gamma$

Figure 6: Feynman diagram of the dominating Standard Model contributionto the decay $b \to s \gamma$.

In the Standard Model the decay $b \to s\gamma$ occurs dominantly via a loop involving the top quark and the $W$ boson (Figure 6). It has played a very important role in flavour physics from the 1980s. At the time it was the dependency of the branching fraction on the then unknown top quark mass that was the driving force behind the theoretical calculations and the experimental searches (the top quark mass affects the value of the $C_7$ coeffcient). When $B^0-\overline B^0$ mixing was (at the time surprisingly) observed in 1987, it became clear that the top quark was very heavy. The top quark was eventually discovered at the Tevatron in 1995 and its mass measured, which determined the Standard Model decay rate of $b \to s\gamma$ to be a few $10^{-4}$.

The first observation of the $b \to s\gamma$ decay actually preceded the top quark observation. In 1993 the CLEO collaboration reported a signal of the exclusive decay $B \to K^{*0}\gamma$ with a branching fraction of $(4.5 \pm 1.5 \pm 0.9)×10^{-5}$ (Ammar, R et al. (1993), Figure 7), where the first uncertainty is statistical and the second systematic. This opened the quest for the inclusive decay $b \to s\gamma$, i.e. the sum of all exclusive contributions. The branching fraction of this decay is more precisely calculable than its individual exclusive components, like $B^{0} \to K^{*0}\gamma$, allowing for more precise comparisons of experimental and theoretical results. The experimental challenges are discussed in below.

Figure 7: First observation of the decay $B^0 \to K^{*0} \gamma$ by the CLEO collaboration (see the peak at 5.28 GeV) (Ammar, R et al. (1993)).
 The inclusive $b \to s \gamma$ spectrum Figure 8: Background-subtracted photon energy spectrum of $b \to s \gamma$ decays at Belle (Limosani, A et al. (2009)). Many experiments located at $e^{+}e^{-}$ colliders have performed measurements using different methods. The total rate of any $B$ meson to a photon plus anything (where the photon is not caused by an electromagnetic decay, e.g. $\pi^{0} \to \gamma\gamma$ or $\eta \to \gamma\gamma$) can be measured by a sum of exclusive decay modes. A fully inclusive approach is also possible but more challenging. Only the photon and properties of the rest of the collision event are used to separate signal and backgrounds, mostly originating from non-$B$ decays. At the $B$ factories they are determined from data taken at centre-of-mass energies below the $\Upsilon(4S)$ resonance mass, decays to $\pi^{0}$ and $η$ which are vetoed and modelled from data, and mis-identified photons which are modelled from simulation. This method has the disadvantage of larger backgrounds, but has the advantage not to rely on any modelling of the composition of the hadronic state. It thus comes with smaller theoretical uncertainties. A typical result for the measured photon energy spectrum in inclusive $b \to s\gamma$ decays is shown in Figure 8. The integral of the spectrum gives the decay rate. The width is related to the momentum of the $b$ quark in the $B$ meson, which can be seen as a two-body system of a $b$ and a light quark, similar to a hydrogen atom. The world-average measured branching fraction for a photon with an energy above 1.6 GeV in the B-meson rest-frame is $(3.49 \pm 0.19)×10^{-4}$, which can be compared with the latest theoretical calculation of $(3.36 \pm 0.23)×10^{-4}$. The very good agreement of these two values sets very strong constraints on New Physics models, in particular supersymmetry. Although the total branching fraction seems to indicate no discrepancy with the Standard Model prediction, small contributions from New Physics may still occur. The $b \to s\gamma$ decay rate is essentially a measurement of the Wilson coefficient $C_{7}$ (see Section Theory).

In the Standard Model, the left-handed chirality structure of the weak interactions makes the photon emitted in $b \to s\gamma$ decays mainly left-handed. It is interesting to also probe right-handed contributions (sensitive to the Wilson coefficient $C_{7}'$), which requires determination of the polarisation of the photon. This is challenging as the helicity (or chirality) of the photon cannot be measured directly in the detector. Several methods have been proposed, none of which provides a strong constraint so far. The first and so far only measurement of a non-zero photon polarisation uses the decay $B \to K^{0}\pi^{-}\pi^{+}\gamma$ , but the interpretation in terms of the photon chirality is still unclear. The most stringent constraints come from global fits to Wilson coefficients (see Section Wilson coefficient fits).

A promising way of searching for right-handed contributions to $b\to s\gamma$ decays is the study of time-dependent $CP$ violation in transitions to flavour eigenstates. For instance in the decay $B^0\to K^{*0}\gamma$ with $K^{*0}\to K_{\rm S}^0\pi^0$, the amplitude of $B^0\to K^{*0}\gamma$ may interfere with that of the same decay following $B^0$ mixing, $\overline{B}^0\to \overline{K}^{*0}\gamma$. But this is only the case if the photons have the same helicities, and so the left-handed component of the $B^0$ decay will interfere with the right-handed component of the $\overline{B}^0$ decay. This process has been studied at the BaBar and Belle experiments, and the $B^0_s$ counterpart decay $B_s^0\to \phi\gamma$ at LHCb. No significant $CP$ asymmetry is observed to date.

## The decays $B_{s}^{0} \to \mu^{+}\mu^{-}$ and $B^{0} \to \mu^{+}\mu^{-}$

Figure 9: Feynman diagram of the (left and middle) dominating Standard Model contributions to $B_s^0 \to \mu^+ \mu^-$ and (right) a potential contribution in the context of supersymmetry.

The rare decay $B_{s}^{0} \to \mu^{+}\mu^{-}$ proceeds in the Standard Model by a box-type diagram involving the $W$ and $Z$ bosons and the $t$ quark (Figure 9). The most recent Standard-Model determination of its branching fraction is $(3.57 \pm 0.17)×10^{-9}$, where the uncertainty is dominated about equally by CKM matrix elements and the $B_{s}^{0}$ decay constant. In Standard Model extensions, the branching fraction of $B_{s}^{0} \to \mu^{+}\mu^{-}$ could be enhanced, in particular in models containing additional Higgs bosons (Figure 9, right). The decay $B_{s}^{0} \to \mu^{+}\mu^{-}$ and the even more suppressed decay $B^{0} \to \mu^{+}\mu^{-}$ have been searched for over three decades, with most recent results from the Tevatron and the LHC (Figure 5).

Figure 10: Two muon invariant mass distribution of $B \to \mu^+\mu^-$ candidates at LHCb and CMS. The result of the simultaneous fit is overlaid. Figure from Khachatryan, Vardan et al. (2015).

The first observation was reported in 2014 jointly by the CMS and LHCb collaborations. The CMS and LHCb LHC Run 1 data sets, which had been already published separately, were combined in a joint fit to the data of both experiments. The result was published in Nature (Khachatryan, Vardan et al. (2015)), which is unusual in high energy physics. The fit to the invariant mass distribution of the two-muon system is shown in Figure 10. The result of the combination is an observation of the $B_{s}^{0} \to \mu^{+}\mu^{-}$ decay and a small excess over the background for $B^{0} \to \mu^{+}\mu^{-}$. The first measured branching fraction ${\cal B}(B_{s}^{0} \to \mu^{+}\mu^{-})=(2.8{{\:}^{+{\:}0.7}_{-{\:}0.6}}) \times 10^{-9}$ came out consistent with the Standard Model prediction.

Figure 11: Combination of $B_{(s)}^0 \to \mu^+\mu^-$ results. Image courtesy of D. Straub, updated from J. Aebischer et al. (2019))

For the even more suppressed $B^{0}$ decay, the result is ${\cal B}(B^{0} \to \mu^{+}\mu^{-})=(3.9{{\:}^{+{\:}1.6}_{-{\:}1.4}}) \times 10^{-10}$, which is slightly, but not significantly, larger than the Standard Model prediction of $(1.06 \pm 0.09)×10^{-10}$. LHCb, ATLAS and CMS updated their Run 1 result, adding data obtained in 2015 and 2016. The obtained $B^0_s\to\mu^+\mu^-$ branching fractions and $B^0\to\mu^+\mu^-$ limits are all consistent. A graphical representation of all results is provided in Figure 11, with the correlation of the two branching fractions taken into account. The decays $B\to\mu^+\mu^-$ are the flagship $B\to\ell^+\ell^-$ modes because of the clean signature provided by the muon pairs. However, $B\to e^+e^-$ and $B\to\tau^+\tau^-$ also exist. According to its Standard Model prediction, the decay $B_s^0\to e^+e^-$ is out of reach in the foreseeable future. Due to the low electron mass, it is even more helicity suppressed than $B_s^0\to\mu^+\mu^-$. Moreover the study of final states involving electrons at hadron colliders is difficult due to the lower reconstruction efficiency and the poorer mass resolution (see for instance Fig. Figure 17). When passing through matter, electrons radiate a significant amount of energy by bremsstrahlung. This affects the reconstructed momentum and thus smears all derived quantities, like the invariant mass of the two-electron system. The best limits are currently ${\cal B}(B_s^0\to e^+e^-<9.4\times10^{-9}$ and ${\cal B}(B^0\to e^+e^-<2.5\times10^{-9}$ at 90\% C.L. Either limit is obtained assuming the absence of the other mode, as the signals would not be distinguishable due to the poor mass resolution. These limits are fair form the Standard Model, but start to set constraints on models allowing for different couplings to leptons, see Lepton Universality Tests.

The expected rate of $B_s^0\rightarrow\tau^+\tau^-$ is considerably larger, but the decay is experimentally challenging due to the difficult tau lepton reconstruction and associated large backgrounds. LHCb published a search for the decay $B_s^0\rightarrow\tau^+\tau^-$, but with a sensitivity still far from the Standard Model expectation. Belle II are likely to perform improved searches of such decays in the near future.

## Other leptonic decays

Just as $B_{s}^{0} \to \mu^{+}\mu^{-}$ and $B^{0} \to \mu^{+}\mu^{-}$ are theoretically clean decays, so are their counterparts with neutrinos. The challenge is on the experimental side. The decay $B^0 \to \nu \overline \nu$ is traditionally labelled as "$B^{0} \to$ invisible" as there is no way to experimentally determine the number of neutrinos (or if there were any at all). In the Standard Model the branching fraction is vanishing as it is helicity-suppressed by a factor $(m_{\nu}/m_{B}^{0})^{3}$. Helicity suppression occurs because of the $B^{0}$ meson is spinless, so the two spin-1/2 neutrinos must have opposite spins. For massless neutrinos this would be impossible as neutrinos are always left-handed and antineutrinos right-handed. Only the minute mass of neutrinos (rarely) allows opposite-spin neutrinos to be emitted.

Searches have been performed by the $B$ factory experiments using the full reconstruction technique (also referred to as “on the recoil”). One $B$ meson from the $B \overline B$ pair is fully reconstructed and the other is required to leave no trace in the detector. The branching fraction is limited to be less than $2.4 × 10^{-5}$ at 90% confidence level, by the BaBar experiment.

Decays to one charged lepton, $B^{+} \to \ell^{+}\nu$, are similarly helicity suppressed, with the strength of this suppression depending on the mass of the charged lepton. These are tree decays where the $\overline b$ and $u$ quarks in the $B^{+}$ meson annihilate, but rare because of the helicity suppression. Contributions with the $W^{+}$ mediator replaced by a charged Higgs boson could enhance or suppress the branching fraction. These decays have all been searched for by the $B$ factories using the full reconstruction technique describe above.

The $B^{+} \to \tau^{+}\nu_{\tau}$ decay, where the suppression is the weakest, has a predicted branching fraction of ${\cal B}^\text{SM}=(0.76{{\:}^{+{\:}0.08}_{-{\:}0.06}})\times 10^{-4}$ in the Standard Model and a measured rate of $(1.14 \pm 0.27)×10^{-4}$, which are in agreement. The more suppressed decays $B^{+} \to e^{+}\nu$ and $B^{+} \to \mu^{+}\nu$ have not been observed yet, with limits on their branching fraction around $10^{-6}$.

## The decays $b \to s\ell^{+}\ell^{-}$

The family of decays $b \to s\ell^{+}\ell^{-}$ ($\ell = e, \mu$) is a laboratory of New Physics studies on its own. In the Standard Model these decays are induced by a loop diagram similar to that of $b \to s\gamma$ (but with a $Z$ component) and a box diagram (Figure 12). The amplitudes corresponding to these diagrams interfere, which causes complex phenomenology.

Figure 12: Feynman diagrams of the dominant Standard Model contributions to $b \to s \ell^+ \ell^-$: (left) electroweak loop, (centre) box, (right) $c \overline c$ loop diagram.

The exclusive decay $B^{0} \to K^{*0}\ell^{+}\ell^{-}$, with $K^{*0} →K^{+}\pi^{-}$, provides a rich set of observables with different sensitivities to New Physics, and for which theoretical predictions are available. These observables are affected by varying levels of uncertainties related to the calculation of quantum chromodynamical effects. Yet, selected ratios of observables benefit from cancellations of uncertainties, thus providing a cleaner test of the Standard Model. The best known example is the lepton forward-backward asymmetry, explained in more details in the box.

Figure 13: Sketch of the $b \to s \ell^+ \ell^-$ dilepton mass squared distribution, courtesy of U. Haisch. The yellow band indicates the theoretically favoured region $1<q^2<6$ GeV$^2/c^4$. The green line is the Standard Model contribution from operators ${\cal O}_{7,9,10}$ while the red line shows the effect of taking lepton masses and $c \overline c$ contributions into account.

This interesting picture is complicated by a dependence on $q^{2}$, the dilepton mass squared (Figure 13). At very low $q^{2}$, $B^{0} \to K^{*0}\ell^{+}\ell^{-}$ behaves like $B^{0} \to K^{*}\gamma$, with a slightly off-shell photon decaying to two leptons. The physics is dominated by the ${\cal O}_7$ operator, as discussed in Section The decay $b \to s\gamma$.

At higher $q^2$ values, there is an interference of the amplitudes controlled by the ${\cal O}_{9}$ and ${\cal O}_{10}$ operators, related to the $Z$ loop and $W$ box diagrams, respectively. This “low-$q^{2}$” region between 1 and 6 GeV$^{2}/c^{4}$ is the most interesting and theoretically cleanest. Beyond this, non-suppressed $c \overline c$ contributions (Figure 12, right) make the picture more complicated and theoretical estimates are less reliable. The observation of high mass resonances above the $ψ(2S)$ meson by the LHCb collaboration ( Aaij, R et al., (2013) ) is an indication that a lot of care is needed when interpreting the high-$q^{2}$ region.

 The $A_{\mathrm{\rm FB}}$ and $P_5'$ asymmetries Figure 14: The angles $\theta_\ell$, $\theta_K$ and $\phi$ in the decay $B^0\rightarrow K^{*0}\mu^+\mu^-$. Figure by T. Blake. Figure 15: The $P_5'$ Asymmetry in $B^0 \to K^{*0}\mu^+\mu^-$ In the decay $B^{0} \to K^{*0}\mu^{+}\mu^{-}$, followed by $K^{*0} \to K^{+}\pi^{-}$, the direction of the four outgoing particles can be described by three angles, shown in . The forward-backward asymmetry $A_{\rm FB}$ is defined as the relative difference between the number of positive and negative leptons going along the direction of the $B^{0}$ meson in the rest frame of the two-lepton system. This corresponds to an asymmetry in the distribution of the $θ_{\ell}$ angle. Similarly, the $K^{*0}$ polarisation fraction $F_{\rm L}$ depends on the angle $θ_{K}$, defined analogously to $θ_{\ell}$. Other asymmetries can be constructed from the other angles or combinations of them. The $P_{5}′$ asymmetry is based on the angles $θ_{K}$ and $ϕ$. It is defined as the relative difference between the number of decays in the regions in red and blue in Figure 15, divided by $\sqrt{F_\text{L}(1-F_\text{L})}$. Quantities based on several angles are more difficult to measure than single-angle ones as they require a better understanding of the reconstruction efficiencies depending on the kinematics of the outgoing particles.
Figure 16: Compilation of experimental results for (left) $A_{\rm FB}$ and (right) $P_5'$ with $B^0\to K^{*0}\ell^+\ell^-$ and comparison with theoretical predictions (bands).

The differential decay width with respect to the dilepton mass squared $q^{2}$, the forward-backward asymmetry $A_{\rm FB}$, and the longitudinal polarisation fraction $F_{\rm L}$ of the $K^{*0}$ resonance have been measured by many experiments with no significant sign of deviations from the Standard Model expectation. The most recent measurement of $A_{\rm FB}$ by the LHCb experiment is shown in (left). LHCb also studied other angular asymmetries. In particular in 2013 a local deviation of the $P_{5}′$ observable (see the box) from the Standard Model expectation was observed around $q^{2} ∼ 5$ GeV$^{2}/c^{4}$ and then confirmed with larger data sets. Belle, ATLAS and CMS have subsequently presented data that are consistent with the LHCb results, see Figure 16 (right). Belle, ATLAS and CMS subsequently presented data that are consistent with the LHCb results.

This deviation triggered a lot of interest among theorists. It is not clear yet if the discrepancy in $P_{5}'$ is a statistical fluctuation, is due to under-estimated theoretical uncertainties, or is the manifestation of a new vector current beyond the Standard Model.

Similar measurements have been made in the decays $B \to K\ell^+\ell^-$, $B_{s}^{0} \to \phi\mu^{+}\mu^{-}$, $\Lambda_{b}^{0} \to \Lambda\mu^{+}\mu^{-}$ and $\Lambda_{b}^{0} \to pK^-\mu^{+}\mu^{-}$. The angular observables are consistent with the Standard Model, but there is some tension in the branching fraction measurements, which are on the low side compared to the expectation.

The decay family $b\to s \nu \overline \nu$ is theoretically cleaner than its charged-lepton counterpart $b \to s\ell^{+}\ell^{-}$. There are no interferences from $c \overline c$ loops as those do not annihilate to neutrino pairs. The main difficulty is on the experimental side and only $B$ factory experiments have attempted looking at such decays using the full reconstruction technique. None have been found and the most stringent limits on the decay rates of $B^0 \to K^{0*} \nu \overline \nu$ and $B^+ \to K^+ \nu \overline \nu$ are at the $10^{-5}$ level.

## Lepton universality tests

The above-mentioned $b \to s \ell^+ \ell^-$ measurements have been reported assuming that muons and electrons behave the same way. This assumption, called lepton universality, is built into the Standard Model and has been extensively tested, most notably at LEP experiments. The only Standard Model particle that has different couplings to leptons is the Higgs boson, which couples proportionally to mass. The presence of new particles that would couple differently to leptons can be tested by measuring the ratio

 $R_H \equiv \frac{{\cal B}(B\to H\mu^+\mu^-)_{q^2}}{{\cal B}(B\to He^+e^-)_{q^2}}$,


where $H$ is any hadron and the $q^2$ index indicates that this ratio is to be measured in a well-suited range of dilepton masses.

Surprisingly, the lepton universality ratio in $B^+\to K^+\ell^+\ell^-$ decays, $R_K$, was measured to be lower than unity with LHCb Run 1 data in the $1<q^2<6\:\gevcc$ range, which is most sensitive to New Physics contributions. The Standard Model prediction for this ratio is unity within $10^{-3}$ as all hadronic uncertainties cancel in the ratio. The experimental value was then updated with 2016 data to $R_K=0.846{\:}^{0.060}_{0.054}{\:}^{0.016}_{0.014}$, which corresponds to a $2.5\sigma$ tension with unity. shows the mass peaks of $B^+\to K^+\mu^+\mu^-$ and $B^+\to K^+e^+e^-$, highlighting the effect of bremsstrahlung affecting electron reconstruction. A similar deviation is seen by BaBar and Belle although with larger uncertainties.

Figure 17: Mass plots of (left) $B^+ \to K^+ \mu^+ \mu^-$ and (right) $B^+ \to K^+ e^+e^-$. Figures from Aaij, R et al. (2019). The dark shaded area shows the combinatorial background, and the light shaded the background from partially reconstructed $B \to K^+ \ell^+ \ell^- X$ decays where $X$ is undetected.

The ratio $R_{K^*}$ is defined in analogy. LHCb published two measurements in bins of $q^2$, which each differ by about $2\sigma$ from the Standard Model expectation of approximately unity. These results are more precise than those previously determined by the

Belle and BaBar collaborations,as shown in Figure 18.

Similar decays can be used to perform tests of lepton universality in $B^0_s \to \phi \ell^+ \ell^-$, $\Lambda^0_b \to \Lambda \ell^+ \ell^-$, $\Lambda^0_b \to p K^- \ell^+ \ell^-$, which are all accessible by the LHCb experiment, but some will have very limited yields. These measurements are complementary, as the different spins of the hadronic component probe different New Physics couplings. Also, the angular distributions described in Section The decays $b \to s\ell^{+}\ell^{-}$ should be investigated separately for decays to electrons and to muons. Belle reported separately the values of the $P_5'$ asymmetry, but no discrepancies were observed given the small available data sample.

Figure 18: . Compared results for the (left) $R_{K}$ and (right) $R_{K^*}$ ratios as determined by Belle, BaBar and LHCb.

Given the hints of lepton flavour universality violation between muons and electrons, it seems natural to wonder if such an effect can be seen in processes involving the third generation ($\tau$) lepton. This has been tested in $B \to D^{(\ast)}\tau \overline \nu_\tau$ decays, comparing to the same decay with muons or electrons instead of tau leptons. Unlike the decays described above, the Standard Model contribution to this decay is not suppressed (and does not match the definition of a rare decay). It proceeds via a tree-level $b \to cW^{-}$ transition, with the $W^{-}$ decaying to a lepton and a neutrino. The expectation is that the rates for the decays involving electrons, muons and tau leptons differ only due to phase-space effects (plus small effects due to form factors). The ratios of the rates of $b\rightarrow c\tau\bar{\nu}$ to $b\rightarrow c\ell\bar{\nu}$ ($\ell=\mu,e$) measured by BaBar, Belle and LHCb come out larger than expected, with an average deviating by approximately $3\sigma$ from the Standard Model predictions. This could indicate the presence of new couplings preferring tau leptons.

# Wilson coefficient fits

This section briefly describes some constraints on Wilson coefficients. It relies on Section Theory.

Figure 19: Allowed regions in the $C_{9e}^\text{NP}$ versus $C_{9\mu}^\text{NP}$ plane. Figure from Algueró, M. et al. (2019).

Several groups have performed model-independent fits of Wilson coefficients, using most of the experimental results presented above. The fits differ by the set of experimental results used, the statistical treatment of uncertainties and choices of form factors. Another major difference is the level of trust of computations of quark loops (most notably $c \overline c$ loops) incorporated in the fit. Depending on these choices, the determined tension with the Standard Model ranges from one to several standard deviations.

In all cases, the New Physics scenario which is preferred changes the value of the $C_{9}$ coefficient (adding a non-zero term $C_{9}^{\rm NP}$). This term could then be different depending on the flavour of the involved leptons (introducing $C_{9e}^{\rm NP}$ and $C_{9\mu}^{\rm NP}$), thus breaking lepton universality, see . The data are not conclusive yet, but a tension with the Standard Model point at $(0, 0)$ is visible. The significance of this tension depends on the assumed theory uncertainties.

Another popular model is to assume that the weak interaction $V - A$ structure holds in New Physics and thus to impose $C_{9}^{\rm NP} = -C_{10}^{\rm NP}$. The data are consistent with such a hypothesis, but again it is too early to draw conclusions.

Right-handed components are also added in the fits, in particular using asymmetries in $b \to s\ell^{+}\ell^{-}$ decays that are sensitive to such effects. Presently there is no evidence for any significant need for right-handed currents.

There is a plethora of model-dependent interpretations of these findings. The deviations can be accommodated by supersymmetry, models with new vector bosons, two Higgs doublets, scalar interactions or leptoquarks.

# Prospects

At the risk of stating the obvious, rare decays have the advantage of being rare. This ensures that the experimental precision will stay dominated by statistical uncertainties, and thus will not run into a limit imposed by irreducible systematic uncertainties. The theoretically cleanest measurements, like the lepton-universality ratios $R_{X_{s}}$ and the ratio of $B^{0} \to \mu^{+}\mu^{-}$ to $B_{s}^{0} \to \mu^{+}\mu^{-}$ branching fractions will continue to be of interest as more data are acquired at the LHC and by Belle II. The future will tell us if the deviations from expectations hold and tell us something new about Nature.

Other measurements, like branching fractions (for instance $b \to s\gamma$) have already reached the theoretical precision and more work is needed on this side to allow more precise comparisons of experimental values and Standard Model predictions. Finally, asymmetries in $B^{0} \to K^{*0}\ell^{+}\ell^{-}$ are in between. If the presently measured central values stay while the uncertainties reduce, we may soon be in the situation of having to understand a very significant deviation with predictions based on the Standard Model. More investigations of theory uncertainties are needed before any conclusion can be reached.

The Run 2 of the Large Hadron Collider is now ongoing and the amount of rare $b$-hadron decays collected by the LHC experiments will increase rapidly. Also, the Belle II experiment will start in 2018. Improved measurements of the processes described in this article, and also new complementary measurements, will become available and will lead to improved precision which will be useful in global fits.

# References

• Aaij, R et al. (2016). Angular analysis of the $B^0\to K^{\ast0}\mu^+\mu^-$ decay using $3$~fb$^{-1}$ of integrated luminosity, JHEP 02: 104. doi:10.1007/JHEP02(2016)104.
• Aaboud, M et al. (2015). Study of the rare decays of $B^0_s$ and $B^0$ into muon pairs from data collected during the LHC Run 1 with the ATLAS detector,  : . arXiv:1604.04263
• Ammar, R et al. (1993). Evidence for penguins: First observation of $B \to K^*(892)\gamma$, Phys. Rev. Lett. 71(5): 674-678. doi:10.1103/PhysRevLett.71.674.
• Branco, G C; Lavoura, L and Silva, J P (1999). CP Violation, Int. Ser. Monogr. Phys., Volume 103. Clarendon Press, Oxford. ISBN 978-0198716754
• Buchalla, Gerhard; Buras, Andrzej J and Lautenbacher , Markus E (1996). Weak decays beyond leading logarithms, Rev. Mod. Phys. 68: 1125-1144. doi:10.1103/RevModPhys.68.1125.
• Buras, Andrzej J and Fleischer, R (1998). Quark mixing, CP violation and rare decays after the top quark discovery, Adv. Ser. Direct. High Energy Phys. 15: 65-238.
• Dirac, Paul A.M. (1927). The quantum theory of the emission and absorption of radiation, Proceedings of the Royal Society of London A: Mathematical,Physical and Engineering Sciences 114(767): 243-265. doi:10.1098/rspa.1927.0039.
• Fermi, Enrico (1934). Versuch einer Theorie der $\beta$-Strahlen. I, Z. Phys. 88(3-4): 161-177. doi:10.1007/BF01351864.
• Fermi, Enrico (1949). Nuclear Physics - A Course Given by Enrico Fermi at the University of Chicago, Int. Ser. Monogr. Phys., Volume 103. University of Chicago Press, Chicago. ISBN 978-0226243658
• Khachatryan, Vardan et al. (2015). Observation of the rare $B_s^0 \to \mu^+\mu^-$ decay from the combined analysis of CMS and LHCb data, Nature 522:(68): 68-72. doi:10.1038/nature14474.
• Limosani, A et al. (2009). Measurement of Inclusive radiative $B$-meson decays with a photon energy threshold of 1.7 GeV, Phys. Rev. Lett. 103(241801): . doi:10.1103/PhysRevLett.103.241801.
• Aebischer, Jason et al. (2019). B-decay discrepancies after Moriond 2019, EPJC 80 252 (2020): . arXiv:1903.10434,doi=10.1140/epjc/s10052-020-7817-x