Planta Med 2021; 87(12/13): 1061-1068
DOI: 10.1055/a-1470-0446
Natural Product Chemistry and Analytical Studies
Original Papers

13C NMR Dereplication Using MixONat Software: A Practical Guide to Decipher Natural Products Mixtures[ # ]

Antoine Bruguière
1   Univ Angers, SONAS, SFR QUASAV, Faculty of Health Sciences, Dpt Pharmacy, Angers, France
,
Séverine Derbré
1   Univ Angers, SONAS, SFR QUASAV, Faculty of Health Sciences, Dpt Pharmacy, Angers, France
,
Dimitri Bréard
1   Univ Angers, SONAS, SFR QUASAV, Faculty of Health Sciences, Dpt Pharmacy, Angers, France
,
Félix Tomi
2   Université de Corse-CNRS, UMR 6134 SPE, Equipe Chimie et Biomasse, Ajaccio, France
,
Jean-Marc Nuzillard
3   Université de Reims Champagne Ardenne, CNRS, ICMR UMR 7312, Reims, France
,
Pascal Richomme
1   Univ Angers, SONAS, SFR QUASAV, Faculty of Health Sciences, Dpt Pharmacy, Angers, France
› Author Affiliations
 

Abstract

The growing use of herbal medicines worldwide requires ensuring their quality, safety, and efficiency to consumers and patients. Quality controls of vegetal extracts are usually undertaken according to pharmacopeial monographs. Analyses may range from simple chemical experiments to more sophisticated but more accurate methods. Nowadays, metabolomic analyses allow a fast characterization of complex mixtures. In the field, besides mass spectrometry (MS), nuclear magnetic resonance spectroscopy (NMR) has gained importance in the direct identification of natural products in complex herbal extracts. For a decade, automated dereplication processes based on 13C-NMR have been emerging to efficiently identify known major compounds in mixtures. Though less sensitive than MS, 13C-NMR has the advantage of being appropriate to discriminate stereoisomers. Since NMR spectrometers nowadays provide useful datasets in a reasonable time frame, we have recently made available MixONat, a software that processes 13C as well as distortionless enhancement by polarization transfer (DEPT)-135 and -90 data, allowing carbon multiplicity (i.e., CH3, CH2, CH, and C) filtering as a critical step. MixONat requires experimental or predicted chemical shifts (δ C) databases and displays interactive results that can be refined based on the userʼs phytochemical knowledge. The present article provides step-by-step instructions to use MixONat starting from database creation with freely available and/or marketed δ C datasets. Then, for training purposes, the reader is led through a 30 – 60 min procedure consisting of the 13C-NMR based dereplication of a peppermint essential oil.


#

Abbreviations

DB: database
DEPT: distortionless enhancement by polarization transfer
EO: essential oil
NPs: natural products
δ C : recorded carbon 13 chemical shifts
δ C-SDF : carbon 13 chemical shifts in the DB
 

Introduction

Currently, among alternative medicines, the use of herbal drugs and dietary supplements continues to expand worldwide with many people resorting to them to either prevent or cure various minor illnesses or in a quest for well-being [1], [2], [3]. In this framework, the quality of raw materials and plant extracts contributes to their safety and effectiveness. Indeed, for a given species, the chemical content of a medicinal plant, and thus its biological effects, may vary depending on chemotypes, growing conditions (e.g., soils, weather), and harvest time, as well as post-harvest handling (e.g., drying, storage) [4], [5]. Quality controls of herbal drugs are usually based on scientific benchmarks such as pharmacopeias aiming at controlling their identity (botanical and chemical criteria), purity (i.e., assays such as ash values, loss on drying or TLC to detect contaminant), and, finally, the content of active constituents or markers (i.e., assays using analytical methods) to check quantitatively their composition. Addressing these issues, besides spectrophotometric analyses, different chromatographic methods such as HPLC-UV or GC with flame ionization detector (GC-FID) are routinely used to rapidly determine complex mixtures.

Metabolomic analyses allow the rapid deciphering of complex mixtures of organic compounds, including vegetal extracts. In this field, beside LC-MS and GC-MS [6], [7], NMR spectroscopy tends to gain importance in the direct identification of NPs in complex herbal matrices. For a decade, automated dereplication processes based on 13C-NMR have been emerging to efficiently identify major compounds in mixtures using moderate field instruments (400 MHz), freely available automation procedures, and dedicated software [8], [9], [10], [11]. While a higher sensitivity is a major advantage of MS, 13C-NMR is indeed highly suitable for the discrimination of diastereomers. Additionally, MS analysis usually requires a separation step of the mixture using appropriate columns and chromatographic conditions as well as standards or benchmarks to do so. It should be noted that commercial (e.g., ACD Labs [12]) and open source (e.g., CSEARCH [13]) solutions allow efficient computer-assisted peer-reviewing of pure NPs based on their NMR spectra. However, such tools were not developed to perform dereplication analyses of crude extracts.

In this context, we recently proposed a freely distributed algorithm, namely MixONat, for dereplication analyses of major NPs in crude extracts or less complex fractions. MixONat analyses a single {1H}-13C NMR spectrum that may be optionally combined with DEPT-135 and DEPT-90 data to discriminate between CH3, CH2, CH, and C, as well as with molecular weight filtering. The software requires predicted or experimental carbon chemical shifts (δ C-SDF) DBs, and displays results that can be refined interactively [11], [14] ([Fig. 1]). MixONat has demonstrated its effectiveness in elucidating the composition of various mixtures containing alkaloids, di- and triterpenes, or xanthones [11].

Zoom Image
Fig. 1 Schematic representation of the 13C-NMR-based dereplication process. MixONat (orange, middle) requires appropriate DBs including either experimental δ C or freely available and/or commercial predicted δ C-SDF datasets (green, right) as well as peak lists (.csv files) exported from experimental data (blue, left). *Optional

EOs are complex mixtures of volatile and odorous principles, including monoterpenes, a significant proportion of them consisting of chiral compounds [4]. GC-FID and GC-MS are usually used to identify and/or quantify such volatiles in EOs [15]. They require comparisons with standards or spectral libraries. Moreover, several analytical problems may occur: the absence of elution or co-elution of volatiles and sometimes their thermosensitivity may impair identifications [16], [17], [18]. One drawback of MS detection is also its reduced ability to distinguish between diastereomers and positional isomers. Due to these limitations, we think that other spectroscopic tools such as {1H}-13C-NMR associated with dereplication software may be useful to accurately characterize major volatiles in EOs [19].

Thus, for training purposes, the present paper presents a workflow for using MixONat software to analyze peppermint EO using 13C-NMR data as well as DEPT-135 and DEPT-90 experiments. The process starts from the building of appropriate DBs consisting of either experimental δ C or free and/or commercial predicted δ C-SDF datasets whereas practical information for exporting input files from NMR spectra is given. Finally, it helps the user to correctly interpret the results suggested by MixONat. The ability of the process to rapidly characterize major monoterpenes from EOs, including diastereomers, is eventually demonstrated.


#

Results and Discussion

As a reference, peppermint EO was analyzed using both GC-FID and GC-MS following the conditions described by the European Pharmacopoeia. As expected, menthol (1), menthone (2), 1,8-cineole (3), menthyl acetate (4), isomenthone (5), limonene (6), and menthofurane (7) ([Fig. 2]) were identified as major monoterpenes by comparison of their retention times with those of authentic samples and by computer matching of their fragmentation patterns against the NIST mass spectral library ([Table 1], Fig. 1S, Supporting Information). In the present work, we evaluated 13C NMR and MixONat software as an alternative tool to identify monoterpenes 17 in peppermint EO through a step-by-step procedure.

Zoom Image
Fig. 2 Structures and atom numbering of major monoterpenes from peppermint EO identified using GC-MS.

Table 1 Major monoterpenes in peppermint essential oil determined by GC-MS as well as their ranks using 13C-NMR-based dereplication, the MixONat software, and various DBs of experimental or predicted δ C-SDF.

Compound name

Amount (%)
GC-MS

Peppermint EO DB1
Experimental δ C
30 NPs
(High | Low threshold values)

Lamiaceae DB2
SciFinder/ACD
952 NPs
(High threshold value)

Lamiaceae DB3
Knapsack/ACD
958 NPs
(High threshold value)

Lamiaceae DB4
Knapsack/nmrshiftDB
958 NPs
(High threshold value)

a Not detected; b Not in Lamiaceae DB2 | Neomenthol acetate suggested in rank 7 (score 1.0).

1

Menthol

41.6

3 | 1

2 and 6

4

11

2

Menthone

27.0

6 | 4

11

7 and 8

2 and 3

3

1,8-Cineole

6.1

1 | 2

4

12

1

4

Menthyl acetate

6.1

2 | 3

1 and 8

2 and 3

37

5

Isomenthone

5.7

4 | 5

3

5 and 6

4 and 5

6

Limonene

1.9

7 | 7 (Score 0.9)

10

10 and 11

7 and 8

7

Menthofurane

2.9

12 | 8 (score 0.9)

207

40

58

8

Neomenthol

a

5 | 6

b

1

10

The first step consists of creating DBs. As far as experimental data are concerned, when DBs containing NPs of interest and their experimental δ C are available in the selected deuterated solvent, their use increases the odds for better matches [20]. However, such comprehensive DBs, already shaped to be used with MixONat, are not yet available although researchers usually keep spectral data including δ C in their laboratories and share them through academic publishing. Thus, even if the task is tedious, for given botanical genera or families of interest, using dedicated software or even a simple free text editor (Chart 1), small DBs of NPs associated with their δ C may be manually built [9], [10] and ultimately easily shared with the scientific community [21]. As far as volatile compounds from EOs are concerned, such an approach was initiated years ago [22]. Thus, Mentha DB1 (30 NPs) including the δ C-SDF was manually built from the experimental δ C of the monoterpenes usually described in peppermint oil [23] using ACD NMR predictors (C, H) software.

Chart 1

Useful software and algorithms

  • KnapsackSearch [24]. This program allows exporting of NPs previously isolated from a set of genera as an SDF together with associated data (e.g., name, molecular weight, sources, predicted δ C-SDF using nmrshiftdb2).

  • MarvinView [25]. This free advanced chemical viewer allows visualizing compounds from a DB (SDF) as well as their associated data (e.g., molecular weight, NMR predicted shifts).

  • Notepad++ [26]. This is a free source code editor that supports several languages. It is useful to create DBs suitable for MixONat software.

  • MixONat [14]. It allows the dereplication of NP mixtures using 13C-NMR and DEPT data as well as experimental or predicted δ C DB. It displays interactive results that can be refined based on the userʼs phytochemical knowledge.

Alternatively, in a first approach, δ C-SDF may be predicted when data are not easily available or when the number of NPs of interest is too large to achieve the task in a reasonable time. The first step to achieving this goal consists in finding the best way to collect NPs of interest as a structure-data file (SDF), comprising, for each compound, the MDL molfile with associated data (e.g., name, CAS number, molecular formula, molecular weight, source, etc.). As aforementioned, considering dereplication of plant or microorganism extracts, a chemotaxonomic-based selection is relevant [27]: all NPs previously isolated from a genus or a family are easily exported as SDF using various DBs accessible through subscription. For the dereplication of peppermint EO, a search on SciFinder [28] using the keyword Lamiaceae allowed us to select more than 10 000 references. They were further reduced to 3499 referring to “natural products”, “pharmaceutical natural products”, “essential oil”, etc. using an analysis of the references with the filter “CA concept heading” proposed by SciFinder. After additional refining by “categories” (i.e., analytes and matrices) and filtering based on the molecular weight, all relevant NPs were subsequently exported as SDF to obtain the NPs from Lamiaceae DB2 (982 NPs) (Fig. 2S, Supporting Information). A similar selection is also possible using the upgraded version of Scifinder-n or Reaxys [29].

Alternatively, the freely available KnapsackSearch program (KS) was used [30]. KS is closely related to the KNApSAcK project, which contains 129 662 species-metabolite relationships encompassing 23 911 species and 53 032 metabolites. [24], [31]. KS associates a set of NPs related to a list of genera with their sources. Running KS with the set of genera described in the Lamiaceae family as a keyword input results in an SDF of 958 NPs, which was used to obtain easily Lamiaceae DB3 and DB4 during the following step.

The second step consists of predicting the necessary δ C-SDF values. Useful prediction tools are either commercial or freely available. Therefore, as an example, we used both ACD NMR predictors (C, H) [10], [12] and nmrshiftdb2 [32], [33] to predict δ C-SDF and obtain Lamiaceae DB2 – 3 and DB4 respectively.

Finally, whether starting from a DB of experimental or predicted δ C-SDF, the use of the “CtypeGen” tab in MixONat software (Fig. 3S, Supporting Information) is mandatory to sort all δ C-SDF by carbon type (i.e., Cq, CH, CH2, CH3) so that the DB will be readable by MixONat (see Supporting Information: Create a DB Readable by MixONat).

In a second step, NMR spectra are recorded, processed, and δ C exported in the required format. The {1H}-13C NMR spectrum (1024 scans) of peppermint EO (90 mg) was recorded in CDCl3 (0.6 mL) using a routine 400 MHz NMR spectrometer. Though not mandatory for the software, DEPT-135 and DEPT-90 spectra were also recorded since carbon multiplicity was previously shown as a powerful discriminant filter [11]. From there, the present 13C NMR-based dereplication process requires 30 to 60 min only until completion.

As usual, 13C NMR, DEPT-135, and DEPT-90 spectra are to be phased and baseline corrected using dedicated software. Among critical features, the user should obviously reference the 13C-NMR spectrum on the central resonance of the deuterated solvent with caution; concerning DEPT spectra, their careful alignment on a selected δ C, (i.e., a chosen CH) is essential for MixONat proper use. Whatever the reason, if it is not the case, the default values of “DEPT alignment” at 0.02 ppm (MixONat, tab2: Parameters, Fig. 4S, Supporting Information) should be slightly increased accordingly. Of course, the peak picking process is also critical. It may be manual but the use of a minimum intensity threshold is preferred over automatically-collected positive 13C NMR and DEPT-90 signals and positive and negative DEPT-135 signals while avoiding potential noise artifacts. This step is sometimes difficult to implement as, for example, when the intensities of δ C from major NPsʼ quaternary carbons are in the same range as the ones from minor NPsʼ methyl groups. Back to peppermint EO, a high threshold value was first chosen to pick peaks (Fig. 5S6S, Supporting Information). However, the minor signal at δ C 8.20 ppm is obviously arising from a methyl group, which means that, if the corresponding NPs include quaternary carbons, their intensities will be close to the noise level. Alternatively, a second peak picking was carried out using a lower threshold value (Fig. 7S8S, Supporting Information). After eliminating deuterated solvent signals, the lists of chemical shifts and intensities from 13C NMR, DEPT-135, and DEPT-90 were exported and/or saved as separated Comma Separated Values (CSV) files, which were then used as input files by the MixONat software (Fig. 9S, Supporting Information). Thus, in the present practical exercise, 2 batches of data were used as input files. The first batch consisted of those selected based on a high threshold value, namely Peppermint EO 13C.csv, Peppermint EO DEPT 135.csv, Peppermint EO DEPT 90.csv (Fig. 5S, files available in Supporting Information) corresponding to 13C-NMR, DEPT-135, and DEPT-90 data respectively. The second batch was made of chemical shifts picked using a lower threshold value, namely Peppermint EO Minor 13C.csv, Peppermint EO Minor DEPT 135.csv, Peppermint EO Minor DEPT 90.csv (Fig. 7S, files available in Supporting Information).

In a third and last step, MixONat basic and advanced exploitation can be detailed as follows: a first dereplication analysis was undertaken using the small experimental Mentha DB1 (30 monoterpenes) as well as δ C and associated DEPT data picked using the high threshold value as input files in tab 1 “Inputs” of the graphical user interface of MixONat (Fig. 10S, Supporting Information). Before running the matching process with MixONat, a verification step of each file is recommended using the “Check” button. A description of the SDF and each. csv file is thus expected (Fig. 11S, Supporting Information) to be displayed. If not, a careful examination of the defective file and a formatting step using Notepad++ usually fixes the bug. In the “Parameters” tab, the tolerance value ε reflects the accuracy of the used DB. It was set at 0.5 ppm since an experimental DB was used [34]. The “equivalent carbons” were authorized, meaning that the same δ C might be matched multiple times if several identical δ C-SDF were found. As a result, the software sorted out compounds of the DB by decreasing score and increasing error. The score is defined as the number of carbon chemical shifts in the 13C-NMR spectrum (δ C) matched with δ C-SDF out of the number of carbons of the compound ([Fig. 1]). The error is the cumulated absolute difference between matched signals (i.e., Σ |δ C-SDF − δ C|). MixONat sorted out 5 major monoterpenes (15: ranks 1 – 4 and 6) as well as neomenthol (8: rank 5), a diastereomer of menthol with a perfect match (i.e., score 1.0). The presence of 8 in peppermint EO was confirmed by a careful examination of δ C (Table 1S, Supporting Information). It should be noted that neither GC-FID nor GC-MS using the method described in the European Pharmacopeia was able to distinguish neomenthol (8) from menthol (3) because of their co-elution and similar MS fragmentation, pointing to the complementarity of 13C NMR for quality control. Minor monoterpenes such as limonene (6: rank 7, score 0.9) and menthofuran (7: rank 12, score 0.7) were also identified ([Table 1]). An automatic peak picking using a lower threshold value identified additional quaternary carbons for these minor NPs. Finally, using these δ C lists, the first 7 guess compounds were 6 monoterpenes (16) expected in peppermint EO according to the European Pharmacopeia as well as neomenthol (8). Menthofuran (7) was ranked 8th (score 0.9) as the quaternary C-2 is in the background noise ([Table 1], Fig. 7S, 12S, Supporting Information).

Alternatively, MixONat can work with DBs of predicted δ C-SDF. As MixONat software ranks all compounds of the DB by decreasing score and increasing error, the result of the dereplication process depends on 2 crucial parameters (i.e., the selected NPs and the accuracy of the δ C-SDF). Thus, one can wonder about the relevance of such predicted DBs. In the present study, δ C-SDF from Lamiaceae DB2 – 3 and DB4 were predicted using either the commercial ACD NMR predictors (C, H) or the open NMR web DB nmrshiftdb2.

Using Lamiaceae DB2 (952 NPs | ACD NMR predictors [C, H]), δ C obtained with a high threshold value and a tolerance kept at 1.3 ppm [10], [11], 8 monoterpenes reached a perfect score. Among them, menthyl acetate (4), menthol (1), isomenthone (5), and 1,8-cineole (3) were ranked at positions 1 to 4 respectively. NPs suggested in positions 4 to 8 were either the same monoterpenes without any stereochemistry or isomers of menthyl acetate ([Table 1], Figs. 13S, 15S, Supporting Information). To conclude on the actual presence of these monoterpenes first ranked in the studied EO, a comparison of δ C picked in the 13C NMR spectrum should be done with experimental data in the same deuterated solvent. This may be easily achieved using SpectraBase [35], a free spectral DB including NMR data of various NPs. Alternatively, SciFinder through subscription allows fast access to previous publications describing δ C of a given NP [28]. On this basis, menthol (1), isomenthone (5), 1,8-cineole (3), and menthyl acetate (4) could be rapidly identified with certainty in peppermint EO using 13C NMR (Table S1, Supporting Information). Moreover, on 13C NMR spectra, for a given compound, the individual intensities of C, CH, CH2, and CH3 δ C are approximately proportional. As a consequence, a way to rapidly select NPs to be checked is to consult the reconstructed 13C NMR spectrum by clicking on the button “Details” for monoterpenes suggested in the first positions (Fig. 13S, Supporting Information): in the present example, considering the relative intensities of the signals in each reconstructed spectrum, experimental chemical shifts of isomenthone (2: rank 3) were searched among δ C while this was considered useless for oplopanone (rank 9) (Fig. 14S, Supporting Information). Then, limonene (6) and menthone (2) were ranked 10th and 11th with a score of 0.9 (Fig. 15S, Supporting Information). For limonene (6), a minor monoterpene from EO, the quaternary C-8 (δ C 150.1 ppm in CDCl3), was not matched because it was not picked when using the high threshold value (Fig. 5S, Supporting Information). But the corresponding δ C were identified as a minor signal after a careful examination of the 13C NMR spectrum. Regarding menthone (2), ketone C-1 (δ C 212.6 ppm in CDCl3) was inaccurately predicted by ACD NMR predictors (C, H) in Lamiaceae DB2 (δ C-SDF 210.8 ppm, Δδ > 1.3 ppm). However, the MixONat software offers an interactive interface (Fig. 14S, Supporting Information) that allows the user to manually add or remove mismatched signals. After adding the missing signals for limonene (6) and menthone (2), they moved to positions 2 and 3 respectively, while menthyl acetate (4) remained at rank 1 and menthol (1), isomenthone (5), and 1,8-cineole (3) shifted from 2 – 4 to 4 – 6 positions respectively.

Using the Lamiaceae DB3 (958 NPs, KnapsackSearch, ACD NMR predictors [C, H]), the same approach first suggested neomenthol (8) (Fig. 16S, Supporting Information), a diastereomer of menthol (1), which was not identified by GC in the condition described by the European Pharmacopeia. It was not suggested while using Lamiaceae DB2, as unfortunately the approach to collect NPs from Lamiaceae using SciFinder (see above) did not select this monoterpene. Nonetheless, it should be noted that neomenthyl acetate was ranked 7th with Lamiaceae DB2. This exemplifies the relevance of using various DBs in such a 13C NMR-based dereplication approach. As with DB2, the use of Lamiaceae DB3 suggested menthyl acetate (4), menthol (1), isomenthone (5), menthone (2), limonene (6), and 1,8-cineole (3) among the 12 first hypotheses ([Table 1], Fig. 16S, Supporting Information).

Finally, Lamiaceae DB4 was constituted of the same 958 NPs as Lamiaceae DB3 but with predicted δ C-SDF using a combination of KnapsackSearch program and nmrshiftdb2. As a result, even if the δ C-SDF prediction is less accurate (Table S1, Supporting Information), this free solution succeeded in suggesting the presence of all major monoterpenes (13, 56, 8) in peppermint EO among the 11 first suggestions, except for menthyl acetate (4: rank 37). It should be noted that both enantiomers were usually suggested (i.e., rank 2: (−)-menthone and rank 3: (+)-menthone) ([Table 1], Fig. 17S, Supporting Information).

Through this practical case, we exemplified the simplicity and efficacy of 13C-NMR-based dereplication using the freely available MixONat and KnapsackSearch software to identify the major products in complex extracts such as EOs. The process requires 13C-NMR and DEPT data recorded thanks to a routine NMR spectrometer and DBs inventorying structures of interest associated with their δ C. A chemotaxonomic approach is proposed to build reasonably sized libraries from selected NPs. When available, experimental δ C lead to the best outcomes, but suggestions using predicted values calculated by commercial and free programs are precise enough to rapidly identify the major NPs. The chosen example, peppermint EO, also showed that dereplication by 13C-NMR distinguishes menthol diastereomers, whereas GC-FID or GC-MS using the method described in the European Pharmacopeia fail to do so. 13C-NMR-based dereplication processes may thus be considered for the study as well as for the quality control of EOs and medicinal plants.


#

Materials and Methods

Chemicals

Mentha × piperita L. EO (16 020 152/K) was purchased from Laboratoire Cooper.


#

Apparatus and operation conditions

GC-FID of peppermint oil was performed as described in the European Pharmacopeia [15] with a 6890 GC system (Agilent Technologies) equipped with a Phenomenex Zebron ZB-5 column (30 m × 0.25 mm × 0.25 µm film thickness). The temperature program started with a 10 min period at 60 °C, then the temperature was increased to 180 °C, at a rate of 2 °C/min, and finally stabilized at 180 °C for 5 min before returning to the initial value. The carrier gas was helium (1.5 ml/min); 1 µL of the sample (2% in methanol) was injected; the split ratio was 10 : 1. Identification of the monoterpenes was based on the comparison of the retention times with those of authentic samples.

GC-MS analysis of peppermint EO was performed with a GCMS-QP2010 apparatus (Shimadzu) in the same conditions as those described for GC-FID analyses. The ionic source and interface temperatures were 220 and 200 °C respectively, operating in the electron impact (EI) ionization mode (ionization energy at − 70 eV). Identification of the monoterpenes was based on computer matching against the commercial NIST 11 and 11S mass spectral libraries.

Peppermint EO (90 mg) was dissolved in 600 µL of CDCl3. NMR analyses were performed at 298 K on a JEOL 400 MHz YH spectrometer (JEOL Europe) equipped with an inverse 5 mm probe (ROYAL RO5). For 13C NMR (100 MHz) spectra, a WALTZ-16 decoupling sequence was used with an acquisition time of 1.04 s (32,768 complex data points) and a relaxation delay of 2 s; 1024 scans were collected for 90 mg of EO to obtain a satisfactory S/N ratio. A 1 Hz exponential line broadening filter was applied to each FID before the Fourier transformation. Spectra were manually phased and baseline corrected using the MestReNova software (Mestrelab Research) and referenced on the central resonance of the deuterated solvent [36] at δ C 77.16. For DEPT experiments, 512 scans were required for 90 mg of EO, and alignments with the 13C spectrum were made using a given δ C. A minimum intensity threshold was then used to automatically collect positive 13C NMR and DEPT-90 signals and positive and negative DEPT-135 signals while avoiding potential noise artefacts.


#

Procedure

Building a DB of predicted δ C-SDF

To create a DB of molecules and their δ C that can be used by MixONat, the first step is to gather the structures of the compounds of interest (e.g., NPs previously identified in a genus or a botanical family). The easiest way consists of downloading them from various DBs accessible through subscription (e.g., SciFinder [28], Dictionary of Natural Products [20]) or from freely available ones (e.g., KNApSAcK [31], Universal Natural Products Database [37], LOTUS [38], [39]). Once the individual files of each molecule (.mol,. cdx,. sk2) are collected in a structure data file (.sdf), their δ C-SDF are predicted using a NMR prediction software under license (e.g., ACD NMR predictors [C, H]) [10], [12] or not (e.g., nmrshiftdb2) [32], [33]. From such DBs containing NPs together with their δ C, the CTypeGen routine included in MixONat (Fig. 3S, Supporting Information) creates a suitable DB: it reads the SDF and sorts chemical shifts by carbon type. A new SDF is then created. The latter contains, for each compound of the DB, the predicted δ C values organized as methyl, methylene, methine, or quaternary carbons. The creation of such a DB is required for the MixONat algorithm to work properly.[*]


#

Specific DBs

Lamiaceae DB2 was built by searching for compounds described in the Lamiaceae family on SciFinder, resulting in a DB of 982 NPs. δ C were predicted using ACD NMR predictors (C, H). Lamiaceae DB3 contains the 958 NPs from Lamiaceae according to KNApSAcK. δ C were also predicted using ACD NMR predictors (C, H). Finally, Lamiaceae DB4 contains the same 958 NPs but δ C were predicted using nmrshiftdb2 and was automatically assembled using the KnapsackSearch (KS) program.


#

The KS program

KS, available for free from https://github.com/nuzillard/KnapsackSearch/, is a tool for the construction of focused NP libraries that relate together structure, biological taxonomy, and predicted 13C NMR data. In this context, a focused library is defined by a user-supplied list of organism genera, possibly related to a taxonomic family. As clearly stated by its name, KnapsackSearch is related to the KNApSAcK project [31]. Searching in KNApSAcK for an organism according to its genus returns a list of pairs constituted by the organismʼs binomial name and by the KNApSAcK compound identifier. Searching in KNApSAcK for a compound identifier returns structural descriptors of this compound. These 2 types of searches are combined by KS to associate a set of compounds related to a list of genera with the organisms in which they have been reported. Running KS with a set of genera as input results in an SDF in which 2D stereo-aware structures are derived from the SMILES chains stored in KNApSAcK using the cheminformatics toolkit RDKit [40]. The final SDF contains tags that define the molecular properties such as the compoundʼs name, its molecular formula, molecular weight, CAS registry number, InChI key, InChI code, SMILES chain, KNApSAcK identifier, the associated list of organism binomial names, and the calculated NMR data. The latter associate each carbon atom index with the 13C NMR chemical shift predicted by nmrshiftdb2 [32]. KS is written as a collection of python scripts and is run from the command line interface. Assuming that the list of the genera from the Lamiaceae family is stored in a file named lamiaceae_genera.txt, the command “python process lamiaceae” automatically produces an SDF named lamiaceae_knapsack.sdf. It should also be noted that the PNMRNP DB is now available and can be used to create DBs based on chemotaxonomic or phytochemical criteria. It consists of an SDF file that reports to date the structure, properties, and classification of 211,280 NPs as well as their predicted δ C-SDF using nmrshiftdb2 (version 0.0.2) or ACD NMR predictors (C, H) (version 0.0.3) [41], [42].


#

NMR data export

The peak list and intensity data obtained from each spectrum were exported as a. csv file using Microsoft Excel (Microsoft) software and used as an input file in MixONat software. The file consists of a list of δ C ordered in decreasing order associated with their intensities on the same line, separated by a comma.


#
#
#

Contributorsʼ Statement

This article is based on the results obtained by A. B., supervised by S. D. and P. R. during his PhD thesis. A. B., S. D., D. B., J. M. N. and P. R. performed algorithms writing and/or analyses. F. T. provided the experimental spectral data of the peppermint monoterpenes. A. B., S. D., J. M. N. and P. R. prepared the figures and tables and wrote the manuscript together. All authors discussed the results from the experiments and commented on the manuscript.


#
#

Conflict of Interest

The authors declare that they have no conflict of interest.

Acknowledgements

A. B. gratefully acknowledges the Ministère de lʼEnseignement Supérieur, de la Recherche et de lʼInnovation (MESRI) for his Ph.D. grant.

# Dedicated to Professor Arnold Vlietinck on the occasion of his 80th birthday.


* Please note that the program has been optimized for DBs created with ACD Labs and hence may not work properly with a different type of DB.


Supporting Information

  • References

  • 1 Tilburt JC, Kaptchuk TJ. Herbal medicine research and global health: an ethical analysis. Bull World Health Organ 2008; 86: 577-656
  • 2 Eisenberg DM, Davis RB, Ettner SL, Appel S, Wilkey S, Van Rompay M, Kessler RC. Trends in alternative medicine use in the United States, 1990–1997. Results of a follow-up national survey. JAMA 1998; 280: 1569-1575
  • 3 Welz AN, Emberger-Klein A, Menrad K. The importance of herbal medicine use in the German health-care system: prevalence, usage pattern, and influencing factors. BMC Health Serv Res 2019; 19: 952
  • 4 Bruneton J. Pharmacognosy: Phytochemistry, medicinal Plants. 2nd ed.. ed. Londres: Tec & Doc Lavoisier; 2008
  • 5 World Health Organization. Quality Control Methods for herbal Materials. Geneva: World Health Organization; 2011
  • 6 Wang M, Carver JJ, Phelan VV, Sanchez LM, Garg N, Peng Y, Nguyen DD, Watrous J, Kapono CA, Luzzatto-Knaan T, Porto C, Bouslimani A, Melnik AV, Meehan MJ, Liu WT, Crüsemann M, Boudreau PD, Esquenazi E, Sandoval-Calderón M, Kersten RD, Pace LA, Quinn RA, Duncan KR, Hsu CC, Floros DJ, Gavilan RG, Kleigrewe K, Northen T, Dutton RJ, Parrot D, Carlson EE, Aigle B, Michelsen CF, Jelsbak L, Sohlenkamp C, Pevzner P, Edlund A, McLean J, Piel J, Murphy BT, Gerwick L, Liaw CC, Yang YL, Humpf HU, Maansson M, Keyzers RA, Sims AC, Johnson AR, Sidebottom AM, Sedio BE, Klitgaard A, Larson CB, Boya PCA, Torres-Mendoza D, Gonzalez DJ, Silva DB, Marques LM, Demarque DP, Pociute E, OʼNeill EC, Briand E, Helfrich EJN, Granatosky EA, Glukhov E, Ryffel F, Houson H, Mohimani H, Kharbush JJ, Zeng Y, Vorholt JA, Kurita KL, Charusanti P, McPhail KL, Nielsen KF, Vuong L, Elfeki M, Traxler MF, Engene N, Koyama N, Vining OB, Baric R, Silva RR, Mascuch SJ, Tomasi S, Jenkins S, Macherla V, Hoffman T, Agarwal V, Williams PG, Dai J, Neupane R, Gurr J, Rodríguez AMC, Lamsa A, Zhang C, Dorrestein K, Duggan BM, Almaliti J, Allard PM, Phapale P, Nothias LF, Alexandrov T, Litaudon M, Wolfender JL, Kyle JE, Metz TO, Peryea T, Nguyen DT, VanLeer D, Shinn P, Jadhav A, Müller R, Waters KM, Shi W, Liu X, Zhang L, Knight R, Jensen PR, Palsson BØ, Pogliano K, Linington RG, Gutiérrez M, Lopes NP, Gerwick WH, Moore BS, Dorrestein PC, Bandeira N. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol 2016; 34: 828
  • 7 Mimica-Dukić N, Božin B, Soković M, Mihajlović B, Matavulj M. Antimicrobial and antioxidant activities of three Mentha species essential oils. Planta Med 2003; 69: 413-419
  • 8 Bakiri A, Hubert J, Reynaud R, Lanthony S, Harakat D, Renault JH, Nuzillard JM. Computer-aided 13C NMR chemical profiling of crude natural extracts without fractionation. J Nat Prod 2017; 80: 1387-1396
  • 9 Hubert J, Nuzillard JM, Purson S, Hamzaoui M, Borie N, Reynaud R, Renault JH. Identification of natural metabolites in mixture: a pattern recognition strategy based on 13C NMR. Anal Chem 2014; 86: 2955-2962
  • 10 Bruguière A, Derbré S, Coste C, Le Bot M, Siegler B, Leong ST, Sulaiman SN, Awang K, Richomme P. 13C-NMR dereplication of Garcinia extracts: predicted chemical shifts as reliable databases. Fitoterapia 2018; 131: 59-64
  • 11 Bruguière A, Derbré S, Dietsch J, Leguy J, Rahier V, Pottier Q, Bréard D, Suor-Cherer S, Viault G, Le Ray AM, Saubion F, Richomme P. MixONat, a software for mixtures dereplication based on 13C NMR experiments. Anal Chem 2020; 92: 8793-8801
  • 12 ACD Labs. NMR Spectroscopy Software. Accessed March 15, 2019 at: https://www.acdlabs.com/products/adh/nmr/
  • 13 Robien W. CSEARCH Spectral Similarity Search with Ranking. Accessed March 11, 2021 at: http://c13nmr.at/planta_medica/eval.php
  • 14 Bruguière A, Derbré S. MixONat. 13C-NMR based dereplication software. Accessed November 18, 2020 at: https://sourceforge.net/projects/mixonat/
  • 15 Council of Europe, European Directorate for the Quality of Medicines and Healthcare. European Pharmacopoeia. 10th ed. Strasbourg: Council of Europe; 2019
  • 16 Xu T, Gherib M, Bekhechi C, Atik-Bekkara F, Casabianca H, Tomi F, Casanova J, Bighelli A. Thymyl esters derivatives and a new natural product modhephanone from Pulicaria mauritanica Coss. (Asteraceae) root oil. Flavour Fragr J 2015; 30: 83-90
  • 17 Ouattara ZA, Boti JB, Ahibo AC, Sutour S, Casanova J, Tomi F, Bighelli A. The key role of 13C NMR analysis in the identification of individual components of Polyalthia longifolia leaf oil. Flavour Fragr J 2014; 29: 371-379
  • 18 Baldovini N, Tomi F, Casanova J. Identification and quantitative determination of furanodiene, a heat-sensitive compound, in essential oil by 13C-NMR. Phytochem Anal 2001; 12: 58-63
  • 19 Zavahir JS, Smith JSP, Blundell S, Waktola HD, Nolvachai Y, Wood BR, Marriott PJ. Relationships in gas chromatography-Fourier transform infrared spectroscopy. Comprehensive and multilinear analysis. Separations 2020; 7: 27
  • 20 ChemNetBase. Dictionary of Natural Products 29.1. Accessed November 18, 2020 at: http://dnp.chemnetbase.com
  • 21 Pupier M, Nuzillard JM, Wist J, Schlörer NE, Kuhn S, Erdelyi M, Steinbeck C, Williams AJ, Butts C, Claridge TDW, Mikhova B, Robien W, Dashti H, Eghbalnia HR, Farès C, Adam C, Kessler P, Moriaud F, Elyashberg M, Argyropoulos D, Pérez M, Giraudeau P, Gil RR, Trevorrow P, Jeannerat D. NMReDATA, a standard to report the NMR assignment and parameters of organic compounds. Magn Reson Chem 2018; 56: 703-715
  • 22 Tomi F, Bradesi P, Bighelli A, Casanova J. Computer aided identification of individual components of essential oils using carbon 13 NMR spectroscopy. Journal of Magnetic Resonance Analysis 1995; 1: 25-34
  • 23 Lanfranchi DA, Blanc MC, Vellutini M, Bradesi P, Casanova J, Tomi F. Enantiomeric differentiation of oxygenated p-menthane derivatives by 13C NMR using Yb(hfc)3. Magn Reson Chem 2008; 46: 1188-1194
  • 24 KNApSAcK. KNApSAcK core System. Accessed November 18, 2020 at: http://www.knapsackfamily.com/knapsack_core/top.php
  • 25 ChemAxon. Marvin. MarvinView–View your molecules. Accessed March 15, 2019 at: https://chemaxon.com/products/marvin
  • 26 Ho D. Notepad++. What is Notepad++. Accessed March 15, 2019 at: https://notepad-plus-plus.org/
  • 27 Rutz A, Dounoue-Kubo M, Ollivier S, Bisson J, Bagheri M, Saesong T, Ebrahimi SN, Ingkaninan K, Wolfender JL, Allard PM. Taxonomically informed scoring enhances confidence in natural products annotation. Front Plant Sci 2019; 10: 1-15
  • 28 CAS. SciFinder. Accessed March 15, 2019 at: https://www.cas.org/products/scifinder
  • 29 Elsevier. Reaxys. Accessed November 18, 2020 at: https://www.elsevier.com/solutions/reaxys
  • 30 Nuzillard JM. KnapsackSearch. Automated data search in the KNApSAcK database. Accessed June 22, 2020 at: https://github.com/nuzillard/KnapsackSearch
  • 31 Afendi FM, Okada T, Yamazaki M, Hirai-Morita A, Nakamura Y, Nakamura K, Ikeda S, Takahashi H, Altaf-Ul-Amin M, Darusman LK, Saito K, Kanaya S. KNApSAcK family databases: Integrated metabolite-plant species databases for multifaceted plant research. Plant Cell Physiol 2011; 53: e1
  • 32 Kuhn S. NMRShiftDB2. Accessed November 18, 2020 at: https://nmrshiftdb.nmr.uni-koeln.de/
  • 33 Kuhn S, Schlörer NE. Facilitating quality control for spectra assignments of small organic molecules: nmrshiftdb2–a free in-house NMR database with integrated LIMS for academic service laboratories. Magn Reson Chem 2015; 53: 582-589
  • 34 Dona AC, Kyriakides M, Scott F, Shephard EA, Varshavi D, Veselkov K, Everett JR. A guide to the identification of metabolites in NMR-based metabonomics/metabolomics experiments. Comput Struct Biotechnol J 2016; 14: 135-153
  • 35 John Wiley & Sons Inc.. SpectraBase. Accessed January 11, 2021 at: https://spectrabase.com/
  • 36 Gottlieb HE, Kotlyar V, Nudelman A. NMR chemical shifts of common laboratory solvents as trace impurities. J Org Chem 1997; 62: 7512-7515
  • 37 Gu J, Gui Y, Chen L, Yuan G, Lu HZ, Xu X. Use of natural products as chemical library for drug discovery and network pharmacology. PLoS One 2013; 8: e62839
  • 38 LOTUS. The naturaL prOducTs occUrrences databaSe. Accessed March 8, 2021 at: https://lotus.naturalproducts.net/
  • 39 Rutz A, Sorokina M, Galgonek J, Mietchen D, Willighagen E, Graham J, Stephan R, Page R, Vondrášek J, Steinbeck C, Pauli GF, Wolfender JL, Bisson J, Allard PM. Open natural products research: curation and dissemination of biological occurrences of chemical structures through wikidata. bioRxiv 2021; DOI: 10.1101/2021.02.28.433265.
  • 40 Landrum G. An overview of the RDKit. Accessed November 18, 2020 at: https://www.rdkit.org/docs/Overview.html
  • 41 Lianza M, Leroy R, Machado Rodrigues C, Borie N, Sayagh C, Remy S, Kuhn S, Renault JH, Nuzillard JM. The three pillars of natural product dereplication. Alkaloids from the bulbs of Urceolina peruviana (C. Presl) J.F. Macbr. as a preliminary test case. Molecules 2021; 26: 637
  • 42 Nuzillard JM, Leroy R, Kuhn S. Predicted carbon-13 NMR data of natural products (PNMRNP). Accessed January 13, 2021 at: https://zenodo.org/record/4420849%23.YFNMy9zjJPY

Correspondence

Dr Séverine Derbré, Pharm.D., Ph.D.
Univ Angers
SONAS, SFR QUASAV
Faculty of Health Sciences
Dpt Pharmacy
16 Bd Daviers
49000 Angers
France   
Phone: + 33 2 49 18 04 40   
Fax: + 33 2 41 22 66 34   

 

Prof Pascal Richomme, Ph.D.
Univ Angers
SONAS
SFR QUASAV
Faculty of Health Sciences
Dpt Pharmacy
16 Bd Daviers
49000 Angers
France   
Phone: + 33 2 49 18 04 37   
Fax: + 33 2 41 22 66 34   

Publication History

Received: 29 January 2021

Accepted after revision: 25 March 2021

Article published online:
06 May 2021

© 2021. Thieme. All rights reserved.

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany

  • References

  • 1 Tilburt JC, Kaptchuk TJ. Herbal medicine research and global health: an ethical analysis. Bull World Health Organ 2008; 86: 577-656
  • 2 Eisenberg DM, Davis RB, Ettner SL, Appel S, Wilkey S, Van Rompay M, Kessler RC. Trends in alternative medicine use in the United States, 1990–1997. Results of a follow-up national survey. JAMA 1998; 280: 1569-1575
  • 3 Welz AN, Emberger-Klein A, Menrad K. The importance of herbal medicine use in the German health-care system: prevalence, usage pattern, and influencing factors. BMC Health Serv Res 2019; 19: 952
  • 4 Bruneton J. Pharmacognosy: Phytochemistry, medicinal Plants. 2nd ed.. ed. Londres: Tec & Doc Lavoisier; 2008
  • 5 World Health Organization. Quality Control Methods for herbal Materials. Geneva: World Health Organization; 2011
  • 6 Wang M, Carver JJ, Phelan VV, Sanchez LM, Garg N, Peng Y, Nguyen DD, Watrous J, Kapono CA, Luzzatto-Knaan T, Porto C, Bouslimani A, Melnik AV, Meehan MJ, Liu WT, Crüsemann M, Boudreau PD, Esquenazi E, Sandoval-Calderón M, Kersten RD, Pace LA, Quinn RA, Duncan KR, Hsu CC, Floros DJ, Gavilan RG, Kleigrewe K, Northen T, Dutton RJ, Parrot D, Carlson EE, Aigle B, Michelsen CF, Jelsbak L, Sohlenkamp C, Pevzner P, Edlund A, McLean J, Piel J, Murphy BT, Gerwick L, Liaw CC, Yang YL, Humpf HU, Maansson M, Keyzers RA, Sims AC, Johnson AR, Sidebottom AM, Sedio BE, Klitgaard A, Larson CB, Boya PCA, Torres-Mendoza D, Gonzalez DJ, Silva DB, Marques LM, Demarque DP, Pociute E, OʼNeill EC, Briand E, Helfrich EJN, Granatosky EA, Glukhov E, Ryffel F, Houson H, Mohimani H, Kharbush JJ, Zeng Y, Vorholt JA, Kurita KL, Charusanti P, McPhail KL, Nielsen KF, Vuong L, Elfeki M, Traxler MF, Engene N, Koyama N, Vining OB, Baric R, Silva RR, Mascuch SJ, Tomasi S, Jenkins S, Macherla V, Hoffman T, Agarwal V, Williams PG, Dai J, Neupane R, Gurr J, Rodríguez AMC, Lamsa A, Zhang C, Dorrestein K, Duggan BM, Almaliti J, Allard PM, Phapale P, Nothias LF, Alexandrov T, Litaudon M, Wolfender JL, Kyle JE, Metz TO, Peryea T, Nguyen DT, VanLeer D, Shinn P, Jadhav A, Müller R, Waters KM, Shi W, Liu X, Zhang L, Knight R, Jensen PR, Palsson BØ, Pogliano K, Linington RG, Gutiérrez M, Lopes NP, Gerwick WH, Moore BS, Dorrestein PC, Bandeira N. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol 2016; 34: 828
  • 7 Mimica-Dukić N, Božin B, Soković M, Mihajlović B, Matavulj M. Antimicrobial and antioxidant activities of three Mentha species essential oils. Planta Med 2003; 69: 413-419
  • 8 Bakiri A, Hubert J, Reynaud R, Lanthony S, Harakat D, Renault JH, Nuzillard JM. Computer-aided 13C NMR chemical profiling of crude natural extracts without fractionation. J Nat Prod 2017; 80: 1387-1396
  • 9 Hubert J, Nuzillard JM, Purson S, Hamzaoui M, Borie N, Reynaud R, Renault JH. Identification of natural metabolites in mixture: a pattern recognition strategy based on 13C NMR. Anal Chem 2014; 86: 2955-2962
  • 10 Bruguière A, Derbré S, Coste C, Le Bot M, Siegler B, Leong ST, Sulaiman SN, Awang K, Richomme P. 13C-NMR dereplication of Garcinia extracts: predicted chemical shifts as reliable databases. Fitoterapia 2018; 131: 59-64
  • 11 Bruguière A, Derbré S, Dietsch J, Leguy J, Rahier V, Pottier Q, Bréard D, Suor-Cherer S, Viault G, Le Ray AM, Saubion F, Richomme P. MixONat, a software for mixtures dereplication based on 13C NMR experiments. Anal Chem 2020; 92: 8793-8801
  • 12 ACD Labs. NMR Spectroscopy Software. Accessed March 15, 2019 at: https://www.acdlabs.com/products/adh/nmr/
  • 13 Robien W. CSEARCH Spectral Similarity Search with Ranking. Accessed March 11, 2021 at: http://c13nmr.at/planta_medica/eval.php
  • 14 Bruguière A, Derbré S. MixONat. 13C-NMR based dereplication software. Accessed November 18, 2020 at: https://sourceforge.net/projects/mixonat/
  • 15 Council of Europe, European Directorate for the Quality of Medicines and Healthcare. European Pharmacopoeia. 10th ed. Strasbourg: Council of Europe; 2019
  • 16 Xu T, Gherib M, Bekhechi C, Atik-Bekkara F, Casabianca H, Tomi F, Casanova J, Bighelli A. Thymyl esters derivatives and a new natural product modhephanone from Pulicaria mauritanica Coss. (Asteraceae) root oil. Flavour Fragr J 2015; 30: 83-90
  • 17 Ouattara ZA, Boti JB, Ahibo AC, Sutour S, Casanova J, Tomi F, Bighelli A. The key role of 13C NMR analysis in the identification of individual components of Polyalthia longifolia leaf oil. Flavour Fragr J 2014; 29: 371-379
  • 18 Baldovini N, Tomi F, Casanova J. Identification and quantitative determination of furanodiene, a heat-sensitive compound, in essential oil by 13C-NMR. Phytochem Anal 2001; 12: 58-63
  • 19 Zavahir JS, Smith JSP, Blundell S, Waktola HD, Nolvachai Y, Wood BR, Marriott PJ. Relationships in gas chromatography-Fourier transform infrared spectroscopy. Comprehensive and multilinear analysis. Separations 2020; 7: 27
  • 20 ChemNetBase. Dictionary of Natural Products 29.1. Accessed November 18, 2020 at: http://dnp.chemnetbase.com
  • 21 Pupier M, Nuzillard JM, Wist J, Schlörer NE, Kuhn S, Erdelyi M, Steinbeck C, Williams AJ, Butts C, Claridge TDW, Mikhova B, Robien W, Dashti H, Eghbalnia HR, Farès C, Adam C, Kessler P, Moriaud F, Elyashberg M, Argyropoulos D, Pérez M, Giraudeau P, Gil RR, Trevorrow P, Jeannerat D. NMReDATA, a standard to report the NMR assignment and parameters of organic compounds. Magn Reson Chem 2018; 56: 703-715
  • 22 Tomi F, Bradesi P, Bighelli A, Casanova J. Computer aided identification of individual components of essential oils using carbon 13 NMR spectroscopy. Journal of Magnetic Resonance Analysis 1995; 1: 25-34
  • 23 Lanfranchi DA, Blanc MC, Vellutini M, Bradesi P, Casanova J, Tomi F. Enantiomeric differentiation of oxygenated p-menthane derivatives by 13C NMR using Yb(hfc)3. Magn Reson Chem 2008; 46: 1188-1194
  • 24 KNApSAcK. KNApSAcK core System. Accessed November 18, 2020 at: http://www.knapsackfamily.com/knapsack_core/top.php
  • 25 ChemAxon. Marvin. MarvinView–View your molecules. Accessed March 15, 2019 at: https://chemaxon.com/products/marvin
  • 26 Ho D. Notepad++. What is Notepad++. Accessed March 15, 2019 at: https://notepad-plus-plus.org/
  • 27 Rutz A, Dounoue-Kubo M, Ollivier S, Bisson J, Bagheri M, Saesong T, Ebrahimi SN, Ingkaninan K, Wolfender JL, Allard PM. Taxonomically informed scoring enhances confidence in natural products annotation. Front Plant Sci 2019; 10: 1-15
  • 28 CAS. SciFinder. Accessed March 15, 2019 at: https://www.cas.org/products/scifinder
  • 29 Elsevier. Reaxys. Accessed November 18, 2020 at: https://www.elsevier.com/solutions/reaxys
  • 30 Nuzillard JM. KnapsackSearch. Automated data search in the KNApSAcK database. Accessed June 22, 2020 at: https://github.com/nuzillard/KnapsackSearch
  • 31 Afendi FM, Okada T, Yamazaki M, Hirai-Morita A, Nakamura Y, Nakamura K, Ikeda S, Takahashi H, Altaf-Ul-Amin M, Darusman LK, Saito K, Kanaya S. KNApSAcK family databases: Integrated metabolite-plant species databases for multifaceted plant research. Plant Cell Physiol 2011; 53: e1
  • 32 Kuhn S. NMRShiftDB2. Accessed November 18, 2020 at: https://nmrshiftdb.nmr.uni-koeln.de/
  • 33 Kuhn S, Schlörer NE. Facilitating quality control for spectra assignments of small organic molecules: nmrshiftdb2–a free in-house NMR database with integrated LIMS for academic service laboratories. Magn Reson Chem 2015; 53: 582-589
  • 34 Dona AC, Kyriakides M, Scott F, Shephard EA, Varshavi D, Veselkov K, Everett JR. A guide to the identification of metabolites in NMR-based metabonomics/metabolomics experiments. Comput Struct Biotechnol J 2016; 14: 135-153
  • 35 John Wiley & Sons Inc.. SpectraBase. Accessed January 11, 2021 at: https://spectrabase.com/
  • 36 Gottlieb HE, Kotlyar V, Nudelman A. NMR chemical shifts of common laboratory solvents as trace impurities. J Org Chem 1997; 62: 7512-7515
  • 37 Gu J, Gui Y, Chen L, Yuan G, Lu HZ, Xu X. Use of natural products as chemical library for drug discovery and network pharmacology. PLoS One 2013; 8: e62839
  • 38 LOTUS. The naturaL prOducTs occUrrences databaSe. Accessed March 8, 2021 at: https://lotus.naturalproducts.net/
  • 39 Rutz A, Sorokina M, Galgonek J, Mietchen D, Willighagen E, Graham J, Stephan R, Page R, Vondrášek J, Steinbeck C, Pauli GF, Wolfender JL, Bisson J, Allard PM. Open natural products research: curation and dissemination of biological occurrences of chemical structures through wikidata. bioRxiv 2021; DOI: 10.1101/2021.02.28.433265.
  • 40 Landrum G. An overview of the RDKit. Accessed November 18, 2020 at: https://www.rdkit.org/docs/Overview.html
  • 41 Lianza M, Leroy R, Machado Rodrigues C, Borie N, Sayagh C, Remy S, Kuhn S, Renault JH, Nuzillard JM. The three pillars of natural product dereplication. Alkaloids from the bulbs of Urceolina peruviana (C. Presl) J.F. Macbr. as a preliminary test case. Molecules 2021; 26: 637
  • 42 Nuzillard JM, Leroy R, Kuhn S. Predicted carbon-13 NMR data of natural products (PNMRNP). Accessed January 13, 2021 at: https://zenodo.org/record/4420849%23.YFNMy9zjJPY

Zoom Image
Fig. 1 Schematic representation of the 13C-NMR-based dereplication process. MixONat (orange, middle) requires appropriate DBs including either experimental δ C or freely available and/or commercial predicted δ C-SDF datasets (green, right) as well as peak lists (.csv files) exported from experimental data (blue, left). *Optional
Zoom Image
Fig. 2 Structures and atom numbering of major monoterpenes from peppermint EO identified using GC-MS.