Troubleshooting Mira QC failures & sequencing artifacts
Lead Bioinformatics Scientist
US CDC – Influenza Division
Low-coverage / Incomplete segment coverage
Tip
A low Ct usually means more template — start your troubleshooting there before re-running the pipeline.
Minor variant count > 10
Warning
A pattern of high minor-variant counts across all segments is a strong red flag for contamination.
| sample_id | total_reads | reads_mapped | reference | % ref. cov. | median cov. | minor SNVs ≥ 5% | pass / fail reason |
|---|---|---|---|---|---|---|---|
| 95f48e8a | 95,828 | 10,388 | A_HA_H1 | 99.82 | 735 | 2 | Pass |
| 95f48e8a | 95,828 | 5,818 | A_MP | 100 | 686 | 148 | minor variants > 10 |
| 95f48e8a | 95,828 | 9,528 | A_NA_N1 | 99.79 | 814 | 3 | Pass |
| 95f48e8a | 95,828 | 9,146 | A_NP | 100 | 722 | 274 | minor variants > 10 |
| 95f48e8a | 95,828 | 5,704 | A_NS | 97.1 | 777 | 147 | minor variants > 10 |
| 95f48e8a | 95,828 | 11,790 | A_PA | 100 | 607 | 361 | minor variants > 10 |
| 95f48e8a | 95,828 | 13,230 | A_PB1 | 100 | 687 | 246 | minor variants > 10 |
| 95f48e8a | 95,828 | 12,175 | A_PB2 | 100 | 590 | 391 | minor variants > 10 |
| c282a097 | 16,170 | 1,992 | A_HA_H3 | 99.82 | 158 | 35 | minor variants > 10 |
| c282a097 | 16,170 | 1,382 | A_MP | 100 | 187 | 12 | minor variants > 10 |
| c282a097 | 16,170 | 1,852 | A_NA_N2 | 100 | 178 | 25 | minor variants > 10 |
| c282a097 | 16,170 | 1,720 | A_NP | 100 | 157 | 18 | minor variants > 10 |
| c282a097 | 16,170 | 1,336 | A_NS | 97.1 | 206 | 10 | Pass |
| c282a097 | 16,170 | 2,516 | A_PA | 100 | 148 | 22 | minor variants > 10 |
| c282a097 | 16,170 | 2,712 | A_PB1 | 100 | 161 | 23 | minor variants > 10 |
| c282a097 | 16,170 | 2,660 | A_PB2 | 100 | 160 | 28 | minor variants > 10 |
Two samples (95f48e8a, c282a097) — note how nearly every segment fails with Count of minor variants at or over 5% > 10. pass_qc column (= total_reads) omitted for clarity.
2+ mutations on one molecule are “phased” or “linked”
A clean bimodal signal across many reads strongly suggests mixed populations / contamination, not random sequencing error.
Premature stop-codon?
Note
Always inspect the alignment around the premature stop before discarding the segment — many premature stops are fixable artifacts, not biology.
Coverage drop-offs (red triangle) align with apparent indels — often a DI-particle artifact rather than a true mutation.
~ mutationQuality data in → quality decisions out. Every NIC matters.
Questions & Discussion
Ben Rambo-Martin, PhD
Lead Bioinformatics Scientist
US CDC — Influenza Division