AmrPredictor

Diana Lab Logo

Help & Documentation

How to Make a Prediction

1

Navigate to the Prediction Page

From the main dashboard, click on "AMR Prediction" in the navigation menu on the left sidebar. This will take you to the AMR prediction interface.

2

Select the Bacterial Genus and Upload Your Genome File

Use the dropdown menu to select the genus of your bacterial isolate. The available genera are:

  • Escherichia (e.g., E. coli)
  • Klebsiella
  • Acinetobacter
  • Pseudomonas
  • Enterobacter
  • Staphylococcus

Click the "Click to upload" button or drag and drop your genome file into the upload area. The system accepts FASTA format files (.fna, .fasta, .fa).

Genus Selection Dropdown

Figure 1: Select the bacterial genus from the dropdown menu and upload your genome file using the file selector or drag-and-drop

Important: Make sure to select the correct genus for your sample. Selecting the wrong genus will result in inaccurate predictions.
Your genome file should contain assembled contigs or a complete genome sequence in FASTA format.
3

Configure SHAP Analysis (Optional)

By default, SHAP (SHapley Additive exPlanations) analysis is enabled. This provides interpretability for the predictions by showing which genomic features contributed most to each prediction.

If you want faster results and don't need feature importance analysis, you can check the "Disable SHAP Analysis" checkbox.

SHAP Configuration Option

Figure 2: Toggle SHAP analysis on or off based on your needs

Tip: Keep SHAP enabled if you want to understand why the model made specific predictions. Disable it for quicker results when processing multiple samples.
4

Start the Analysis

Once you've selected the genus and uploaded your file, click the "Start Analysis" button. The system will begin processing your genome.

Start Analysis Button

Figure 3: The job status page, until the analysis is complete

You will be redirected to a job status page where you can monitor the progress of your analysis in real-time. The job page displays a real-time log of the analysis pipeline. You'll see status updates as the system:

  • Extracts k-mers from your genome
  • Runs predictions for each antibiotic
  • Generates SHAP values (if enabled)
5

View Results

Once the analysis is complete, the results table will appear showing predictions for each antibiotic. Each row displays:

  • Antibiotic name
  • Prediction (Resistant or Susceptible)
  • Confidence score (shown as a percentage bar)
  • SHAP analysis link (if enabled)
Job Progress Log

Figure 4: View the AMR prediction results for all tested antibiotics

Understanding Your Results

Prediction Categories

ResistantThe isolate is predicted to be resistant to this antibiotic
SusceptibleThe isolate is predicted to be susceptible to this antibiotic

Confidence Scores

The confidence bar indicates how certain the model is about its prediction:

>90% - High confidence
70-90% - Moderate confidence
<70% - Low confidence (interpret with caution)

File Requirements

RequirementDetails
File FormatFASTA (.fna, .fasta, .fa)
ContentAssembled genome contigs or complete genome
QualityHigh-quality assembly recommended for best results
Maximum Size100 MB per file
Note: Raw sequencing reads (FASTQ) are not supported. Please assemble your reads before uploading.

Supported Genera

Escherichia

AMR prediction models trained on clinical isolates

Klebsiella

AMR prediction models trained on clinical isolates

Acinetobacter

AMR prediction models trained on clinical isolates

Pseudomonas

AMR prediction models trained on clinical isolates

Enterobacter

AMR prediction models trained on clinical isolates

Staphylococcus

AMR prediction models trained on clinical isolates

SHAP Analysis Explained

SHAP (SHapley Additive exPlanations) is a method to explain individual predictions by computing the contribution of each feature to the prediction. In the context of AMR prediction, SHAP values show which k-mers (short DNA sequences) contributed most to classifying an isolate as resistant or susceptible.

Results Table

Figure 5: Example SHAP waterfall plot showing feature contributions

How to Interpret SHAP Values

  • +Positive values (red): Features that push the prediction toward "Resistant"
  • Negative values (blue): Features that push the prediction toward "Susceptible"
  • |Bar length: The magnitude of the feature's contribution to the prediction
Tip: SHAP analysis can help identify potential resistance genes or mutations by highlighting which genomic regions are most important for the prediction.
🧬 In Silico Mutagenesis

What is In Silico Mutagenesis?

In Silico Mutagenesis is a computational feature analysis tool that allows you to explore how the presence or absence of specific genomic features (k-mers) affects antimicrobial resistance predictions without performing actual laboratory experiments. By modifying which features are considered "present" in your genome, you can observe how the model's predictions change for different antibiotics.

🔬

Extract Features

Identify important k-mers and genes in your genome

🔄

Toggle Presence

Simulate adding or removing genes/k-mers

📊

Compare Results

See how changes affect resistance predictions

Why use In Silico Mutagenesis? This approach is valuable for understanding which genomic features drive resistance predictions, exploring "what if" scenarios (e.g., "What if this gene were absent?"), and prioritizing targets for experimental validation.

How to Run Mutagenesis Analysis

1

Navigate to In Silico Mutagenesis

From the sidebar, click on "In Silico Mutagenesis" (microscope icon). This opens the mutagenesis feature analysis interface.

Navigate to Mutagenesis Page

Figure 6: Access In Silico Mutagenesis from the sidebar navigation

2

Upload Genome and Select Parameters

Configure your analysis by:

  • Upload a genome file - FASTA format (.fasta, .fa, .fas, .fna), max 100 MB
  • Select the bacterial genus - Same options as AMR prediction (Escherichia, Klebsiella, Acinetobacter, Pseudomonas, Enterobacter, Staphylococcus)
  • Select a target antibiotic - The available antibiotics will depend on the selected genus
The antibiotic dropdown only becomes available after selecting a genus. Different genera have different sets of antibiotics with trained models.
3

Start Feature Extraction

Click "Start Analysis" to begin the feature extraction process. The system will:

  • Extract important k-mers from your genome
  • Map features to known genes and genomic regions
  • Calculate importance scores for each feature
  • Determine feature direction (Protective vs Risk)

Processing typically takes 1-5 minutes.

4

Review Extracted Features

Once analysis is complete, you'll see a table of extracted features showing:

ColumnDescription
GeneThe gene or genomic region associated with this k-mer
SequenceThe actual k-mer DNA sequence
TypeFeature type (Gene, Intergenic, promoter, rRNA, protein)
ImportanceHow strongly this feature contributes to the prediction
DirectionProtective (toward susceptible) or Risk (toward resistant)
PresentWhether this feature is currently present in your genome

You can download this data as a CSV file using the "Download CSV" button.

5

Modify Features and Run Prediction

Click the "In Silico Mutagenesis" button to open the modification modal. Here you can:

  • Gene Types tab - Modify all k-mers for a gene at once using "All Present" or "All Absent" buttons
  • Individual K-mers tab - Toggle presence for specific k-mers using the dropdown
  • Search - Filter features by gene name or sequence
  • Sort - Click column headers to sort by importance, presence, etc.
Tip: Start by modifying high-importance "Risk" features to "Absent" to see if removing them would make the isolate more susceptible.

When ready, click "Run Prediction" to see how your modifications affect the AMR predictions.

The first mutagenesis results page

Figure 7: The first results page. You can select to modify features and run again the predictions

Modal to modify features and run predictions

Figure 8: Modal to modify features and run predictions

The new mutagenesis results page showing the comparison table

Figure 9: The new mutagenesis results page showing the comparison table. In this example, the resistance prediction changed to susceptible after modifications.

The cummulative modifications section

Figure 10: The cummulative modifications section showing which features were changed. We can make further modifications and click the initial modifications to see the previous state.

Interpreting Mutagenesis Results

After running a prediction with modified features, you'll see a comparison table showing how your changes affected the predictions for all antibiotics available for that genus.

Comparison Table Columns

AntibioticName of the antibiotic (your selected antibiotic appears at the top, highlighted)
OriginalPrediction before modifications (Resistant/Susceptible)
ModifiedPrediction after your feature modifications
Original/Modified ProbResistance probability (0-100%) shown as a bar chart
ChangeHow much the probability shifted (with ↑↓ arrows indicating direction)
Status"Changed" if the prediction flipped, "Unchanged" otherwise

Understanding Probability Changes

↑ +15.2%
Increased Resistance

Your changes made the isolate more likely to be resistant

↓ -23.8%
Decreased Resistance

Your changes made the isolate more likely to be susceptible

Applied Modifications

Below the comparison table, you'll see an "Applied Modifications" section showing exactly which features were changed:

  • Index - Feature index number
  • Feature - The k-mer or gene name
  • Old Count → New Count - Shows 0 (absent) or 5 (present)
Iterate on Your Analysis: Click the "Modify & Predict Again" button to make additional changes. Modifications are cumulative—you can explore different scenarios by building on previous results.
Results can be downloaded as CSV using the "Download CSV" button for further analysis or inclusion in publications.

Use Cases & Applications

🔍

Gene Importance Analysis

Determine which genes are most important for resistance by simulating their removal. High-importance genes that flip predictions when removed are strong candidates for targeted interventions.

🧬

Resistance Mechanism Exploration

Explore "what if" scenarios: What if a known resistance gene were absent? What if a protective gene were present? Understand the model's reasoning and identify key genomic drivers.

🎯

Target Prioritization

Prioritize genes for experimental validation by identifying which features have the highest impact on predictions. Focus wet-lab efforts on the most promising targets.

📚

Educational Tool

Demonstrate the relationship between genotype and phenotype interactively. Show students and trainees how specific genetic features contribute to antimicrobial resistance predictions.

Important: In silico predictions are computational estimates based on machine learning models. Results should be interpreted as hypotheses to guide further investigation, not as definitive biological conclusions. Always validate important findings experimentally.
General

Troubleshooting

AMR Prediction Issues

❌ File upload fails

Ensure your file is in FASTA format and is under 100 MB. Check that the file extension is .fna, .fasta, or .fa.

❌ Analysis takes too long

Large genomes may take longer to process. Try disabling SHAP analysis for faster results. If the job seems stuck, check the log for error messages.

❌ Low confidence predictions

Low confidence may indicate that the isolate is genetically different from the training data, or that the assembly quality is poor. Consider re-assembling your genome with stricter quality parameters.

❌ Wrong genus selected

If you accidentally selected the wrong genus, you'll need to start a new analysis. Cancel the current job and submit a new one with the correct genus.

In Silico Mutagenesis Issues

❌ Antibiotic dropdown is empty

Make sure you've selected a genus first. The antibiotic options are genus-specific and will only appear after a genus is selected.

❌ "Start Analysis" button is disabled

Ensure all three required fields are complete: (1) a FASTA file has been uploaded and shows "File ready", (2) a genus is selected, and (3) an antibiotic is selected. All three are required to start the analysis.

❌ No features extracted

This may occur if the uploaded genome is very short, corrupted, or contains non-DNA characters. Verify your FASTA file contains valid DNA sequences with standard nucleotides (A, T, G, C).

❌ Prediction shows no change after modifications

Some features may have minimal impact on predictions, especially if they have low importance scores. Try modifying features with higher importance values or modify multiple related features (e.g., all k-mers in a gene) for a more noticeable effect.

❌ "Run Prediction" shows error

Make sure you've actually modified at least one feature. The modal shows the count of modified features in the footer. If it shows "0 modified", toggle at least one feature's presence before running the prediction.

Still need help?

If you encounter any issues or have questions not covered in this guide, please contact our support team.

Contact Support