When analysing a mass spectrum, there are two available deconvolution modes:
Manual deconvolutions:
Isotopic cluster-based deconvolutions (Section 5.2, “Deconvolution based on isotopic cluster peaks” );
Charge state envelope-based deconvolutions (Section 5.1, “Deconvolution based on charge state” );
Automated deconvolutions (Section 5.3, “Automated deconvolution based on isotopic cluster peaks” )
Deconvolutions are performed in order to get back to the Mr mass of the analyte while reading m/z values. In the following sections, all the avaiable deconvolution processes are described.
Before delving into the deconvolutions, it is necessary to present two menu options that are found in the plot widgets contained in the Mass spectra window: the menu items under the Centroidation and deconvolutions menu (Figure 5.1, “Mass spectrum plot widget-specific deconvolution menu” ).
These two menus allow one to set parameters for the manual deconvolutions (see text for details).
In this kind of deconvolution, at the present time, the software assumes that the ionization agent is the proton and that the ionization is positive.
The deconvolution is based on the determination of the distance
between two peaks —consecutive or not— of a given charge
state envelope. When the user -click-drags the cursor from one
peak to another, the program tries to calculate if the distance
between
two peaks matches one or more charge difference(s). If so, it
computes the molecular (Mr [12C]-relative molecular mass) mass
of the analyte.
The Mr value that is computed is for the analyte below the mass peak at which the mouse drag moment operation started.
Top panel of Figure 5.2, “Charge state-based mass deconvolution” shows the charge state envelope-based deconvolution process for a protein of Mr≍8566 Da. In the top panel, the deconvolution has involved two consecutive peaks ( span> is 1). The mouse drag movement occurred from left to right. Thus, the m/z value chosen for the computation is that under the left peak (start point of the movement). The status line at the bottom of the plot indicates the selection range, the delta movement on the x-axis, the computed charge, the Mr value chosen for the calculation (the Mr value at the start of drag movement) and finally the calculated Mr value.
This “two consecutive mass peaks” method is the default method. However, it might happen that no clearly visible mass peak is available around one nice peak that might be chosen as the start of the mouse drag operation. In this case it is possible to define a different span> between two peaks elected for the deconvolution (see Figure 5.1, “Mass spectrum plot widget-specific deconvolution menu” ). In the figure, than span has been set to 2, which means that the mouse drag movement encompasses three mass peaks: the movement start peak, one peak in the middle and finally the movement stop peak (the span is thus of two intervals between the extreme peaks).
The bottom panel of the figure now displays the same Mr value for the protein even if the span is now of two intervals.

Deconvolution approach using two peaks belonging to the same charge state envelope. The top deconvolution involves two consecutive mass peaks (peak span value is 1). The bottom deconvolution involves two non-consecutive peaks (peak span value is 2). The Mr value, expectedly, did not change whatever the configured span.
The charge calculation, which is at the heart of the deconvolution, almost never produces an integer value with no fractional part (say, charge z=15.0) because it is almost impossible to drag the mouse cursor the exact number of pixels that would match a m/z range leading to such an integral charge value. Almost always, the charge that is calculated looks like 14.995 or 15.001, for example. This is due to the fact that the mouse moves at discrete positions on the screen and these positions might be more or less far apart, depending on the mouse capabilities and on the current zoom factor over the mass spectrum region of interest.
It is advised to zoom-in as much as possible over the peaks at hand so as to minimize the difficulties above. It may happen, however, that even zoomed-in peaks are not sufficiently distant to allow a charge calculation. In this case, reduce the stringency over the fractional part that is allowed in the charge (see menu item Set charge minimal fractional part at Figure 5.1, “Mass spectrum plot widget-specific deconvolution menu” ). By default, the stringency is set at 0.99, that is, any calculated value that has a fractional part either superior or equal to 0.99 or inferior or equal to 0.01 would lead to a successful round-up/round-down to the nearest integer value. Outside of the [0.01-0.99] interval, no charge calculation is performed and thus no deconvolution is performed. When the stringency is too high, reducing it will allow the deconvolution to be carried-over. General experience is that setting that value to 0.997 is fine for most situations and provides very reliable results.
In this kind of deconvolution, the user -click-drags the cursor
between the first two peaks (when possible) of the isotopic cluster.
The
charge state of the ion is the inverse of the distance between the
two consecutive peaks (that is, the m/z delta value). Figure 5.3, “Isotopic cluster-based mass deconvolution”
shows that deconvolution process at work.
The -click-dragging direction (left→right or
right→left) determines the final Mr that is computed
because that value is calculated for the peak under the
mouse when the mouse
drag movement is initiated. This is visible in the two panels of Figure 5.3, “Isotopic cluster-based mass deconvolution”
, where the
top panel shows the Mr computed for the left peak and the bottom
panel shows the Mr computed for the right peak. Since the ion is
monocharged, the difference is 1 Da.
This is a significant departure from the previous versions, where the postulate was that the single real peak of interest in an isotopic cluster was the left-most monoisotopic peak. Since this software has been used by scientists in research projects using almost 100 % labelled bacteria (with [13C] and [15N]), that concept has become moot. Indeed, analytes from these bacteria have their monoisotopic peak at the far right end of the isotoopic cluster.
The new behaviour allows the scientists to compute the Mr value of the peak of interest in an isotopic cluster, be that for a non-labelled or for a labelled analyte. See the following articles as examples of heavy isotope almost full labelling of bacteria.
Heavy isotope labeling and mass spectrometry reveal unexpected remodeling of bacterial cell wall expansion in response to drugs. Atze H, Liang Y, Hugonnet JE, Gutierrez A, Rusconi F, Arthur M. Elife, 2022, doi: 10.7554/eLife.72863, PMID: 35678393 .
Peptidoglycan-tethered and free forms of the Braun lipoprotein are in dynamic equilibrium in Escherichia coli. Liang Y, Hugonnet JE, Rusconi F, Arthur M. Elife, 2024, doi: 10.7554/eLife.91598, PMID:39360705 .
(p)ppGpp modifies RNAP function to confer β-lactam resistance in a peptidoglycan-independent manner. Voedts H, Anoyatis-Pelé C, Langella O, Rusconi F, Hugonnet JE, Arthur M. Nat Microbiol, 2024, PMID:38443580 .
The automated deconvolution that is based on the isotopic cluster peaks is designed to be reliable for low-mass analytes (typically below 5 kDa). The process involves two separate steps: first, the centroids are extracted from the mass spectrum (see Section 4.1.7, “ Centroid Extraction from Mass Spectra ”); second, the extracted centroids are used for the deconvolution proper. The deconvolution is a highly complex process that would require extensive knowldedge to understand. The main concepts are the following:
Iterate in each observed centroid (from smallest (m/z,i) values to greatest) of the input centroided mass spectrum and, for each centroid, assume that is has a given charge (in the charges range provided in the configuration; see below). With that assumed charge, compute a neutral analyte mass.
The neutral analyte mass is then used to construct a theoretical neutral averagine-based isotopic cluster. Each centroid of the isotopic is then taken in sequence and subject to a matching process.
The matching operation takes place by first converting back the theoretical neutral analyte cluster centroids to a charged analyte m/z value (same charge as that initially used to craft the neutral mass); second, the m/z value is searched for in the input mass spectrum centroids. When an input spectrum m/z value matches that of the theoretical isotopic cluster, a match is stored.
When all the centroids of the theoretical isotopic cluster have been iterated into, the set of found matches is checked for reliability (typically, there must have been at least three matches in the cluster, albeit not necessarily contiguous). Other checks are performed also. If the checks are all positive, then the match is stored for the next step.
At then end of the processing of all the input centroids, all the found matches are processed in order to consolidate them to a neutral mass. Each neutral mass is listed along with all the m/z value that was actually matched or the set of m/z values corresponding to different charges of the same analyte molecule.
The low mass deconvolution algorithm can be tightly configured using a large set of parameter settings that are detailed below and that are accessible via the application preferences window shown in Figure 5.4, “” and detailed below.
Min. charge (Max. charge): the charge range that is explored.
Mass tolerance (ppm): the tolerance with which all the mass matches are performed. A typical value is 20.
Min. mass (Max. mass): the neutral mass range that is explored.
Min. intensity: the intensity value below which input mass spectrum centroids are discarded.
Explained intensity threshold: Once the matches between the centroids in the theoretical isotopic cluster and those in the input centroided mass spectrum have been performed, the reliability of these matches is checked like so: select the monoisotopic centroid and its contiguous centroid in both the theoretical isotopic cluster and in the matched centroided mass spectrum. Compute the ratio between the intensities of these centroids. That ratio is used to normalize the theoretical centroids intensities so they can be compared to the matched input mass spectrum centroid intensities. Compute the sum of the matched centroids intensities for all the matches (input mass spectrum vs theoretical). Compute the [input / theoretical] summed intensities ratio, which reflects the 'explained intensity'. The greater the ratio, the better the intensity correspondence between the theoretical and experimental isotopic clusters. That ratio value is required to be greater than the value set for this parameter for the matches to be considered reliable. Typical values: 0.5-0.7.
Relative cluster shape tolerance: Once the matches between the centroids in the theoretical isotopic cluster and those in the input centroided mass spectrum have been performed, the reliability of these matches is checked like so: get the monoisotopic (highest probability centroid, isotopologue 0) match. Get the isotopologue 1 match. For each one of these two matches compute the ratio of the intensities for the theoretical isotopic cluster data and for the input centroided mass spectrum (observed data) This yields the following intensity ratios: [ theor_ratio = theor. intensity iso 1 / theor. intensity iso 0 ] and [ obser_ratio -> obser. intensity iso 1 / obser. intensity iso 0 ]. Compute the ratio [ abs(obser_ratio - theor_ratio) / theor_ratio ] which amounts to a 'normalized difference'. If the ratio above is smaller than this parameter setting, then the matches are considered to be reliable. Typical values: 0.6-0.7.
Min Pearson corr. score: Once the matches between the centroids in the theoretical isotopic cluster and those in the input centroided mass spectrum have been performed, the reliability of these matches is checked like so: A Pearson correlation factor is computed to check that the observed and theoretical isotopic cluster centroids are correlated by their intensity. If the obtained correlation factor is greater or equal to this setting, then the matches are considered to be reliable. Typical values: 0.7-0.90.
Neutral averagine formula: This the averagine formula to be used to model the isotopic clusters starting from a neutral analyte mass.
Isotopic data file: The path name of the file holding the isotopic data. This is a file having the following format (removed data for clarity):
1,hydrogen,H,1,1.007825,1,0,0.999884,-0.000115,0
1,hydrogen,H,1,2.014101,2,1,0.0001157,-9.0644,0
2,helium,He,2,3.0160293,3,0,0.00000,-13.5206046 0 [
... ] 6,carbon,C,6,12.00000,12,0,0.9892119,-0.0108466,0
6,carbon,C,6,13.003354,13,1,0.0107,-4.529315,0
7,nitrogen,N,7,14.0030,-0.003648,0
7,nitrogen,N,7,15.00010,15,1,0.0036419,-5.615226,0
8,oxygen,O,8,15.99491,16,0,0.9975676 -0.00243,0
8,oxygen,O,8,16.999131,17,1,0.00038093,-7.872715,0
8,oxygen,O,8,17.999159,18,2,0.0020513,-6.189236,0It is possible to use an isotopic data file that lists heavy isotopes as the most probable isotopes for any chemical element. In this manual author's research, bacteria are fully labelled with the heavy stable isotopes of carbon and nitrogen, which produces peptidoglycan structural elements that display as inverted isotopic clusters (with respect to natural isotopic abundance analytes). The deconvolver works also in this case by only detecting the input mass spectrum isotopic cluster having the proper inverted shape, and reports the analytes with the diagnostic ions at monoisotopic m/z values corresponding to the highest value in the isotopic cluster that was correctly recapitulated by the averagine-based theoretical isotopic cluster. This means that whatever the isotopic abundance configuration, the reported ions are correct: their monoisotopic m/z value is reported.
The last field may be used to indicate the path name to a file where all the processing details are stored. Perusing these data might be interesting when the deconvolution results do not correspond to expected results.
When the deconvolution process has ended, the results are shown in the console window.
When -click-dragging the mouse cursor between two mass spectrum
locations of interest, the program computes the apparent resolving
power. This process is shown on Figure 5.5, “Calculation of the resolving power”
, where
the resolving power is calculated by dragging the mouse cursor from
one edge of a peak to the other at half maximum height (this is
called full width at half maximum [FWHM]
resolution).