Array 1 1581787-1582180 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =============================== ================== 1581787 30 100.0 31 .............................. CCGCGAACGAGATGCTCATGGGCGGCGGGAT 1581848 30 96.7 29 .............................C TCCAAGCTCTGACCCAGCTCACGCCTGAA 1581907 30 96.7 31 .............................G ACGACGGGGTCGCTGCTGCTGGCCAGGGTGG 1581968 30 93.3 31 ............................GA TCCTCCTGCCGCCGCCCTGGGACGCGATGAG 1582029 30 100.0 31 .............................. CTTCTCGGGTCATTGCACTTTCTCCTGTTCT 1582090 30 96.7 31 .............................A GGACGGAGCGCGGGTGCGCCTGGTGTGCGGG 1582151 30 93.3 0 .........................C...C | ========== ====== ====== ====== ============================== =============================== ================== 7 30 96.7 31 GTGCTCCCCGCGCACGCGGGGATGGTCCCT # Left flank : CGAGCCCGACCGGGTCGAGGACGTGGAGGCCTACCTGCGGGACTACGCGCGCCGCCAGCTCACGGTGGTGCTCACGCCGCGGTTGATGCGGTTGAGGCGGATGGTGATCGGTGAGGTCGCGCGGTTCCCGGAGCTGGCGCGGGTGCTGTACGAGTGCGGGCCGCAGAGGGCGATCGCCGGGCTCGCTGAGAGCTTCGCCCGGTTGGGGGAGAGGGGGCTGCTGGCGGTGGGGGACCCCCTGCGGGCGGCGTCCCACTTCAACTGGCTGGTGATGTCGGAGCCGGTGAACCGGGCGATGCTGCTGGGGGACGAGGCCATTCCGGCGGAGGCGGAATTGTGCCAGCATGCGGAGGAGGGAGTGCGGGTGTTCCTCGCGGCGTATGGGGTTGACGGTCGTTAGGGGAGTTGTGGGGCGTTTGGGTTTTATGGGGTCGGTGTCTGTTTTTCGGGAAGTTGGAAAAACAGGCGCTTGCCACTGGTAAGTTCGCAGGTGGTTGACT # Right flank : 
CGATCGCACCAGCGGCAGCTACCTCCTTCGTCACCCCAGGCCCGGCACACCCGCTCACATCCGGTCAGACGAAGTACCGCGCCTGGAGGCGAAACAGCTCGGCGTAGAGCCCACCGGCCCGGCTCAACGTGTCGTGGGTGCCGGTCTCCACGACCCGGCCGCCGTCCAGGACCACGATGAGATCAGCCGCCCGGGCGGTCGTGAAGCGGTGCGTGACAAGGACGGTCACACCGCCCTCCTGGCGGCCCAGGTCGGCGTACTGCTGGAACAGCCGCTCCTCAGCCCGTGCGTCGAGGCTGGCGGTGGGCTCGTCCATGATCCGCAGCAGCGGCGCCCGCCGCATCGCCGCACGGGCGTTGGCGACCGTCTGCCACTGGCCCCCGGACAGCCCCTCGCCGTCCCACGACCCGCGCCCGAGCCGGGTGTCCAGGCCCCCGGGCAGCGAGGCGAGGAGCCCGTCCCCGCCGACCGTCCGCAGCGCCTCGATGACCCGCTCAGGG # Questionable array : NO Score: 5.66 # Score Detail : 1:0, 2:3, 3:0, 4:0.84, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.56, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGTCCCT # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [50.0-31.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.55,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 2 1587461-1588161 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. 
dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =============================== ================== 1587461 30 96.7 31 .............................A AGGCCACCCACTGCTGGCAGGACCCGCGCAC 1587522 30 100.0 31 .............................. AGTCCAACAACGTCACGGACGCCCCGTACTT 1587583 30 96.7 31 .............................G CCATCAACGCCGCCGCCCGCTCCGGTGGTAT 1587644 30 96.7 31 .............................A AGAAGTTGCGCGGGCGCTGGTCCGGACTGGT 1587705 30 100.0 31 .............................. GGTGCGTGACCGGTGCGCCCAGATGGTCGAC 1587766 30 96.7 31 .............................G GTCACCCTCGGCCCGGCATTACGGGCCATCC 1587827 30 100.0 31 .............................. CAGGCCGCCGACCCTCGCGCGGCCCGCCACG 1587888 30 96.7 31 ............................T. ACCACCCGAGACCGCTTGGCCATCGGGACCG 1587949 30 96.7 31 ............................G. GGGTCCGAGGTCGGCGGGCGGGCGGTCCTCG 1588010 30 93.3 31 ..................A..........G GCACGGCACCCTGGACCGCCGCTGCGTCATC 1588071 30 96.7 31 .............................G GGAAGCCCACCGGCACCGTCCGCAAGGTCGC 1588132 30 96.7 0 .............................T | ========== ====== ====== ====== ============================== =============================== ================== 12 30 97.2 31 GTGCTCCCCGCGCACGCGGGGATGGTCCCC # Left flank : TCGGCCACACGCCGTCCGAGGAGTTCGCCCATATGACCGAGAAGCAGCGCAGGGCCATGTTCCTCGCCGAAGAGCTCCTGCCCGGTCTGCCGGACGACTTCGAGGCCCTCGTGGTGGAACAGCGGCCCCGGAAGCTGACCCGTGAGGGAACCGCCGTGGACGCCCACGACTCCACGCTGTTCGCCCGCCGTCGACGCTGAGAGCGACCGGCACGAGGCGTCTTGATGGGGGCCGACGGTCCCGGGAGCACCTTTGACGGGGCGTTTTCGGGACCTGTTGTCTGCGGGAGCTTCGGGACCGCGGTATCCCCTTCTGTTGGTTGGTGATGTCGGAACCGATGAGCCGGGTTATGTTGCTGTGGGACAAGGTGGTCTCGGGGCGTGTGGCCTGGTGGCGATTGAGGGGTTTTGTTGCTTATGGGGTTTTGGTGGTGCATGTCTGTTTTTCAGGAAGTTGGAAATTCGACTGCTTGCCATTGGTAAGTTCGCAGGTCGTTGACT # Right flank : 
TACTTCCCCACCGGCACCGCGCCGCCGCCCGCCTCCGCGCGGGCCCTGGCGGCCTCCGCGCTCCGGCTGCCCTACCAGTTCACCTTCCCCAAGGAACTGGACCGCGCCATCACCGAACTCGAAACCGACCTCGTCCTCGCCTGGCAGACCAAGGACGCCCACTGGATCGCCGAGGAACTCATCCTCTTCCTGGACGAGGACGACCGCGCCGAACTCACCGGATTCCGACTGCACTACACCCCGACCGACGGACTGGAGGTCCACCGTGCCGACTGACGCCTCCGGGCCAACACCACCCCCACCGCGACCCGCGCCCCCGCCGACGCTCCCGCCCTCCTTCGACCTGACCAGCCGACCCTGGGTTCCCGTCCAGCGGCTCGACGGGACGGAGGCCGAACTCTCCCTGACCGGGGTCTTCGAGCAGGCCGCGCGGATCCGGCGCCTGGTCGGGGACGTGCCCACCCAGGACTTCGCCCTCCTGCGGCTGCTCCTGGCGATCC # Questionable array : NO Score: 5.53 # Score Detail : 1:0, 2:3, 3:0, 4:0.86, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.41, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGTCCCC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-3] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [55.0-15.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.55,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 3 1595026-1595672 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. 
dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =================================== ================== 1595026 30 100.0 35 .............................. CCTCGGGGGCGCTCTCCAGCTTCGCCACGCGCGCG 1595091 30 96.7 32 .............................A AAACCCTTACTCAGCAGGGGTTTAGGCGTACA 1595153 30 100.0 31 .............................. AGCGTCGTCGACCCCAGGCTGACGGAGGTCT 1595214 30 100.0 31 .............................. CTGCGGTTGCGGGGTTGTCCGCGAGCGGGAG 1595275 30 100.0 31 .............................. CGGACCGCCACGTTATCCCGTGGGCCGAGCT 1595336 30 96.7 31 .............................T GGCCGTTCGAGGGTGAGCGCGTTTACCTCGG 1595397 30 93.3 31 ..............T..............G CGGACCGGCCCACCCGGCTGGCGACCGTCGC 1595458 30 96.7 31 .............................G CCGTCGCCGCCTACGCGACCGCCCTCGGGCT 1595519 30 100.0 32 .............................. ATGCGGCAATCCACCCCACAGAGAAGCCCGTG 1595581 30 96.7 31 .............................T CGATGACCGAGGTCGACGGCACCGGGTCGGA 1595642 30 96.7 0 .............................G | T [1595669] ========== ====== ====== ====== ============================== =================================== ================== 11 30 97.9 32 GTGCTCCCCGCGCACGCGGGGATGGTCCCC # Left flank : GGGGTGAGCTGCTGGAGTCGGGTTACAGCTATGCCGACGAGGTGGTCTGGTGACGGTCGTCGTGCTCACGAACTGTCCGGCCGGGCTGCGCGGGTTCCTGACCCGGTGGCTCATGGAGATATCGGCGGGGGTGTTCATCGGCAACCCTTCGCGCCGGATCCGGGAGGCGCTGTGGGCGGAGGTGAAGGAGTACGCGGGGAACGGGAGGGCGTTGCTGGCGTACAGCGACGACTCCGAGCAGGGCTTCACCTTCCAGACCTTCGAGCGTCACTGGGAGCCGGTGGACCATGAAGGGCTCACCTTGATGCATCGCCCCAAGAAGGTGCAGGAGGAGAACAGAAGGCCTCCGAAGAGCGGGTGGAGCAAGGCGTCGAAGAGGCGGCGCTTTGGTGGGAGGTGAGTTGTTTTGGGTGCTTTGATGGATTGTGGTGCGGGTGTCCGTCTCTGAAGAAGTTGGAAAAACGGCCGCTTGCCACTGGTAAGTTCGCAGGTCGTTGACT # Right flank : 
GAGCGCGAACGGGCAGCCCATGGTCAGGTGATGGCGTCGATCGGTAACAGGAACGCTTGTTCCCGGGTTCGAGCAACACCATCAGCCTTCTCGTCACCGAAATGTCGATGAGGTTCAGCAACGGCGCCACGGGCGAGAGTGTTGGCGGCTTCGGAAATGCCAGTGGATCGGCCACCAACTCGTCAGTGAGCATCGGCCGCGGTCAGCGGGCACACGGTGAGCGTCACCGCCAACGCCGCGGGCAAGGGGCGCCAACGGTGCTGGCCCTGCCATGGCGTCTGGTCGAAGGCCGACGAGGGCCGTTTCAAGGCCGAGGGGTGCCGCGGCACTCGGCTTCGCCGGTCGGGCCCGCAGGTGTGACGCCGATCAGCGACGCACCATCCGGACCAGGGCGGCGGTGTCGTGTCGGCGGAACCAGGCCGCGACACAGACCAGGACCAGGGTGACCGCCGGGATGGCGACCAGGTCCGGGGTCACGATGGCGCTCAGGGCCGTCGCGC # Questionable array : NO Score: 5.95 # Score Detail : 1:0, 2:3, 3:0, 4:0.90, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.79, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGTCCCC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [48.3-36.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.55,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 4 1602745-1603445 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. 
dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =============================== ================== 1602745 30 100.0 31 .............................. GAGAAACCGCTCATGAACTGGAGAAACACAT 1602806 30 96.7 31 .............................C GGGGCCGTGGTGTTGGACGCCGCGGATGTGG 1602867 30 100.0 31 .............................. CGGACGCGGGCGATGACTTCTTCTGCGTCTT 1602928 30 100.0 31 .............................. CGTCGGCCTTGACCATCATCTGCACCCCGCG 1602989 30 96.7 31 .............................C GGCCGCCGGGCCGCCGCGGTGGCCAGCTCCT 1603050 30 96.7 31 .............................G AGGCGGCGCGCGCGGACCCGTCGTCGCTGTC 1603111 30 96.7 31 .............................A TCCCGCGTGGCGCTGCCCTCTACCTGCGCGG 1603172 30 96.7 31 .......................A...... ACTGGCGAACCTGCTGGAGGACCTGTCCGAC 1603233 30 100.0 31 .............................. GCGCGGCCGCCGCCTTGTTTGCCCACGTCCC 1603294 30 96.7 31 .............................G GCAGTGGTGAGGGGCTCGCCCTTCAGGCCAA 1603355 30 96.7 31 .............................C CTCCGTCCGTAGGTGGCCAACCACAGGAGCA 1603416 30 90.0 0 ...............A.......A.G.... 
| ========== ====== ====== ====== ============================== =============================== ================== 12 30 97.2 31 GTGCTCCCCGCGCACGCGGGGATGGTCCCT # Left flank : CAGGATCAGTGACGCATCCTATCCCTGTGCCCCGGCCTTCCAGCGACCTGCTGCACCGACCGCTTCCGCCGCTGCCTCGTCGTCGCCCGCTGGTCGGCCCGTTCTGCCCGGCCTGCGAGCACCCTTCGTGCCGTCGGCGTCGGGCCGCGCGCCTGCCCCGCCTGGGCGGCCACCGCTCTGAGTTCACTCGCGAGCACGCCCGGGCCGCAGTTCTCCAGCGGCATCACCCGCACCTGCTCATCTGGTTCGGTGAGCAGACTCTGTCCTACTGGGTGGCCTCGCCTGTCGGACTGACTGAGGTTTCCGAATCCGAGGTGCTGTTGCTCCTCGCGGAGCCGGTTTCCGTGAACTGAGGGGTTACCGGCAGGGGCGTGATGCGCTCCTGCCGGTATTCTGCGGACATGGTCTTCCCAGGGCTGAGTGCGTCGTCCGTTTTGTCTGCTTCTGAAGAAGTTGGAAAAACGGCTGCTTGCCACTGGTAAGTTCGCAGGTCGTTGACT # Right flank : TGGGAGTACGGCTGATGCCCTCCCAAGGCCTTCAGGGACAGGAGTCGTAATCCACGGACGCGCTTCCGGCTGCGGCGTCGTAGCACTGGCTGACCGGCGGTATCCCGGAGACACCCGGCACGAAGTCGGGGTCTCCGAAGGTGAGCGGGCCCCGCTCGGGAGCACCCGGGAAGGTGAGGACGAGCCTGTCGCCGACCGAGACCGGTCCGGGGGCCTCCTCCGGCTCGATCGGGCCCTCAGGAGTGGGCACCGTGAACAGCTCAGGCACGGAGGCGCTGTGGCCCTCGACCAGGTTCCCGACGAGCACGGTGAAGCGCACCCCCTCGGGGACCTCCTCCACGGAGCTCACGGAGAGGTGGAAGAGCGTCTCGTACGAGTCAGGGGTGGCCTGGAAGGTCGGCATGCCGTTCGGCGAGGTGATCACCTCCGCCAGCCATTCGGCGTTGGTGCCGAAGGGCGTCGGTTCCGGCATCACCCATTCCGTGTCGGGTCCCTCCGTA # Questionable array : NO Score: 5.66 # Score Detail : 1:0, 2:3, 3:0, 4:0.86, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.54, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGTCCCT # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis 
prediction: F [0-4] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [50.0-38.3]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.55,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 5 1638344-1638677 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =============================== ================== 1638344 30 96.7 31 .............................T ACGCATCCACGGCAAGCCGCCGCTGCGCGAA 1638405 30 96.7 31 .............................C GCCGTCGGTCTTCTCAGTCAGGTCCGCGTAG 1638466 30 100.0 30 .............................. GCTGTCCGTGGCCAGTGTCGGTTCCGGGCG 1638526 30 96.7 31 .............................T TCTTGGCGTTCATCGCTTCAGCTCCGGCAGG 1638587 30 100.0 31 .............................. GGGTGAGCGGGTCCACGGCGCCGGTGACGAT 1638648 30 96.7 0 .........................C.... 
| ========== ====== ====== ====== ============================== =============================== ================== 6 30 97.8 31 GTGCTCCCCGCGCACGCGGGGATGGTCCCA # Left flank : TGCTTGATGCCGCCGCCCGCGTACTTGTCGGTCCAGGGTGTGCTGCTGGCGCTCGTCAACACCCATGAGCGTGTTTTCACCCGCGAACAGGAGGAGACCAAACGGCTCCAGTTGTGTCTGGATGCGCAGGAACGGACTCAGAAACGCGCCGATCGTGCCTACTTCGGCACCGTGGACCGCCACACCAGCCCTTTTTGCCCGATTCGAAGGATGGTCCCGAGAACGGGAAGAACGGTGGCTCCTGATCACACCCGTTCCGAACCCATGCGATACCGCCCTGAGCTGTTCGAGGGATGCCTGGTTGGGGCTGGCCCGTCCCTGGAACCGTAATCCCACAAATCAAAGTTCGGTGATGGATTACCGGCAGGGGAGTGCTGTGCTCCTGCCGGTATTCTGCGGGCATGGTCTTCCTGGGGCTGAGCGCGCCGTCCGTTTTGTCTGTTTCTGAAGAAGTTGGAAAAACGGCTGCTTGCCACTGGTAAGTTCGCAGGTCGTTGACT # Right flank : AGACCAGGCACCATGCGCGACATCAGCTGAGCGTCTGCGTCCCGATGACCGACAGCACCTGGAGCTTCTCGTAGCTCTCGGTGCCGGGGACGGCGGTGTAGACCATCAGCGAGTGCGCCTGCCCCGGGTCGACCAGCCTCTGGCAGGTCAGCTCCAGCGCGCCGACCTCGGGGTGGACGAAGTGCTTGACCTCGTGGGGACGTATCCCGATCTCGTGGTCGCTCCACACCCGCCGGAACTCCTCGCTGCGGGCCAGCAGCAGCTCGGCCAGCTGGGCGGCGGGGGACTCCGGGCCGCGGAGGGTGACGAGCTCACGCAGCCCCGCGGCGTACATCCGGGTCAGGAACGGGTGCTCCTCCGGCGCGTAGATCCGCCGGGATTCGGGATCGGTGAACCACCGGTACCCCGCGCTACGGGCAGGCCCCGTGTACCGCGTCGCGTCCCCGGTCAGCGCGACGCCCATCGGGGACTGCCGCAGGGTCTCGCCGAGCTCGGTGACG # Questionable array : NO Score: 5.84 # Score Detail : 1:0, 2:3, 3:0, 4:0.89, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.69, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGTCCCA # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis 
prediction: F [0-1] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [51.7-36.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.55,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 6 1645359-1645631 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 1645359 29 100.0 32 ............................. GGGACGGAACGGGACCGCGACGGTCACCCTGG 1645420 29 100.0 32 ............................. GGGACGGAACGGGACCGCGACGGTCACCCTGG 1645481 29 100.0 32 ............................. TGGCCCTTGAACTTGACGGTGGTCACGGTGAC 1645542 29 100.0 32 ............................. GACGGCGTCACCGTGCTCCTGGTCGATGCCGC 1645603 29 79.3 0 .............G...A...G.CA..T. 
| ========== ====== ====== ====== ============================= ================================ ================== 5 29 95.9 32 GTGCTCCCCGCGCACGCGGGGATGGCCCC # Left flank : GGCGCTGGTGACCGACGAGATCTTCGGGCCCGTGCTGGTGATCCAGGTCTACGACTCGGTGGGCGAGGCCGTCGATCTGGCCAACCGCACGCCCTACGGCCTGTGCGCCGGGGTGTGGGGCGCCGACCGCGCCGAGGCCGTCGAGGTGGCGGGGCGGTTGCAGGTCGGCCAGGTCTTCGTCAACGGCGCCGGGTTCAATCCGGACGTCCCGTTCGGCGGCTTCAAGCGGTCGGGGATCGGCCGCGAGTACGGGCGCTACGGGCTGGAGGAGTTCCAGCAGACCAAGGGGCTGGTGTTCGGCGCCGACGCTGTCGGCTGTGGTGGATACCGCTGACGGAAGCGGTGCGGGCGGGTGGCCGCTACCGGGAGACGATCGAAGGGGCGCACTTCTCGGCCGGTATTCTGCGGCCATGTTGTTTCCAGGGCTGGGCGCTTTGTCTGTTTTCCAGGAAGTTGGAGAAACGGCTGCTTGCCACTGGTAAGTTCGCAGGTTGTTGACT # Right flank : CGCAGCCGCCGCCGGTGGGGAGGGAGGCGTTTCCAGTCCGGCTTCCTCCCCTCGGTCCGGACCGGGCTCGGCGGTCCCGGGGCCGGGAGTGGGGCTCGCAGTGCCGGAACGGTAGCGCTCGTCCTCGCGGAGCCAGCGCACCAGGGAGGCGGCGACCAGGCTCAGGATGAAGATCAGCGGGAAGGCGGTCACGACCGACAGGGTCTGCAACGCGTCGGTGCCTCCGGCCGACATCACCGAGACCGACACCGCGGCCAGGACGATCGCCCAGAGCACGCGGTTGGCCCGGGAGGGCTCGACCTCGTTGGGCAGGTCACGGGAGGTGGCAGAGCCCATGATGTAGGACGCGGAGTCCAGCGTGGTGGCCAGGAAGACCAGCAACAGCAGCAGGAACAGCGGGGTGATGATCCAGCTGAGCGGGAACGCCTCCAGCGTGGCGAAGATGGCCGGGACCGCGCCCTCGGCCTCCAGGGTGTCGACGATGGGCGCCTGTCCGCTCA # Questionable array : NO Score: 5.66 # Score Detail : 1:0, 2:3, 3:0, 4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.4, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGCCCC # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,3] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGGTCCCCGCGCACGCGGGGATGGCCCC with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy analysis 
prediction: NA [0-0] Score: 0/0.41 # AT richness analysis in flanks prediction: F [50.0-25.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.14,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 7 1662442-1662838 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =============================== ================== 1662442 30 96.7 31 .............................T CCACGTCCTGCCCCGGCAGCAAGAACTACTC 1662503 30 96.7 31 .............................A CCAGAACGTGATCCGCGGCGGCGGCATCGTC 1662564 30 96.7 31 .............................C GCTCCAGCCCGAAACCCTGGCTCGCGCCGAC 1662625 30 96.7 31 .............................T CACCGGTGGCCGACCCTTCCTGCACGCCATC 1662686 30 100.0 31 .............................. GCGCCGACGGCGACGACTGGTCGCTCTCCAA 1662747 30 100.0 31 .............................. TGGGTGGGGCCGGGCGGGGGAAGTGGAAGCG 1662808 30 96.7 0 .........................G.... 
| G [1662831] ========== ====== ====== ====== ============================== =============================== ================== 7 30 97.6 31 GTGCTCCCCGCGCACGCGGGGATGGTCCCG # Left flank : CCCGCCGCGCCAGCAGCGCGGTGGTCAGCCCGACGATGCCGCCTCCCACCACCGCGACGTCGACCTCCACGTCCTGGCTCAGACGGGGAAACTCCGGTCGGTTCTCCACGACCGTCCACACGCTCCTGTGCTGCATCCTCTGCCCCTCCCGATCAGGTCCAGACGCCGTCGGCCCCCGTGCGACGGACAGCGCCTACACCAGCTACCCAGCGTTCCCGGTCGCTACCGCCGGAGCCTCGGGTCGGGGCATGGACCCGCGCCGCTTCCAGAGCCGGGGGAGGGGTGCGTGGCGAACACGGGCTGGGCGAAAGGAGAGTGGTGGTGTGCGGTGGTTCCCCGGCAGAGGGGGATAGCGGTCTGTAGGTTGGGTCTTCTGGAGCTACAGCGTTTTGGTGGTTGGTGAGTTTTTGGTTCTCTTGGGGCCTGTGGTGCGTGTGTCTGTTTCTGAAGAAGTTGGAAATTCGGCTGCTTGCCGCTGGTAAGTTCGCAGGTCATTGACT # Right flank : GATCCCCACTGACTTCCGCGTTCGTCGTCCCCGGAAAACCGCTCGACGGCTTCGGGGCGACCTGACACAGTGGCGCACCATGACGGGTTCCGAACTTGTGACCGCCGCGGACGTGCGGCTCATGCAGGGGCTGGCGCAGCGCGTCACCGCGATCCGCCCCGACCTGGTGAACAGCGACGCCTCGTTCGGCGAGCTGGCCTGGAACTGGGGCCGGGGGCACGCCAGCGACGGCGCGACCTGGCCGCGTCGGCTGTGGTTCTCCGGCGGGGAACTGGTCGCGTGGGGCTGGCTCCGCCTTCCGCGCCGGGTGAGGCTGAGCGACGGCTCGGTCAGGGACGTCACCGGCGCCTACCTGATGCACCAGGTCCACCCCGACCACGCCGGGCTGGTCGACGAGGTGATCGCCTGGTACGACGCCACGGCGGCGGGCCTCGAACGCACGGTGCTGCCCAGCGCCGCCGACGGGTTCGCCCTGGAACGGTGGGCGGCGCACGGCTACG # Questionable array : NO Score: 5.70 # Score Detail : 1:0, 2:3, 3:0, 4:0.88, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.56, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGCTCCCCGCGCACGCGGGGATGGTCCCG # Alternate repeat : NA # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: R [2,4] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array degeneracy 
analysis prediction: F [0-2] Score: 0.41/0.41 # AT richness analysis in flanks prediction: F [51.7-31.7]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.55,0.37 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 8 2013511-2015306 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Forward Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================= ================================ ================== 2013511 29 100.0 32 ............................. GGCAGCAGGTCCTCCAGGGAACCGCCGCCCCG 2013572 29 100.0 31 ............................. CACTCGCCTTCACTCTGTCGAACGACCGGTC 2013632 29 100.0 31 ............................. GTCTGATCACGTCTATGGTGGTCGGTCTTCG 2013692 29 100.0 32 ............................. TTCACCCCCCGGCGTCCGGTGACCCGTGCTCC 2013753 29 100.0 32 ............................. CGCACGTCGATGCTCCCCGAGACGCGGCGGGA 2013814 29 100.0 32 ............................. CCGGGGCGGGTCACGGACGTGTACCTGCTCCA 2013875 29 100.0 32 ............................. CCGGGGCGGGTCACGGACGTGTACCTGCTCCA 2013936 29 100.0 32 ............................. GCCATGCTCGCGTCCGAGACCCGCAAGATCCA 2013997 29 100.0 32 ............................. GTGACGTCGTCGGGCGTGTAGAGCTCGCGCTT 2014058 29 100.0 32 ............................. ACTTGCCCCCCGTTGGCCCTACTGCTGTCTCT 2014119 29 100.0 32 ............................. GCGTTCATCAGCTGAGGCTCACCGGCGGCGGG 2014180 29 100.0 32 ............................. ATCCCGGCCAGCCCCCTGCTCAGGTTGCCGCT 2014241 29 100.0 32 ............................. GGATTGACGGGGTTCGTAGCCGGTGACGATTC 2014302 29 100.0 32 ............................. GCTCAGCCCACTGAGGGCTGAGAGGTCACCGC 2014363 29 100.0 32 ............................. 
AGCATCATGCCCGTCTGTGCGGGGGAGCCCAC 2014424 29 100.0 32 ............................. CCGCAGGTCGAGAGGTACAGGTCAATCTCGGT 2014485 29 100.0 32 ............................. GGGCGACAGACCACCCGGCTACCAACCGTGTC 2014546 29 100.0 32 ............................. GAGGAGTTCGCCTACGCCATCGCTGGGTCGGG 2014607 29 100.0 32 ............................. TACGGCCAGCCGGTCGGTCCCTACCGCATCGC 2014668 29 96.6 32 ........................G.... TGCGCGGCCACGTACCCGGCGAGGTCGATCAC 2014729 29 96.6 32 ........................G.... GGATGACGCAAGCCGAGATCGTCCGGGCCACG 2014790 29 96.6 32 ........................G.... CCATCGACAGGGGCACACGCCCCGCCGACCTA 2014851 29 96.6 32 ........................G.... GCCATCGCGATCGCCGCTGTCCTGGGCATGCA 2014912 29 96.6 32 ........................G.... GCGACCCCCGCTGACGCTCCCATCGTCGTCTC 2014973 29 96.6 32 ........................G.... CCCCTCCAGCGCAAGCTGGGCGTGGACGCTGA 2015034 29 96.6 32 ........................G.... ATGATGGTGGTGGCGTTGGACGCCGGGGCGGG 2015095 29 96.6 32 ........................G.... ACATCGACCTCCTCACCGCAGAATCCCCCCAC 2015156 29 96.6 32 ........................G.... GCCGTCAGCGTGGAACTCAGAGGCGTATGGGA 2015217 29 100.0 32 ............................. CGGGCCATCTGCATGGCGGCCTCTTCGGCGAC 2015278 29 86.2 0 ........T.T.T...........G.... 
| ========== ====== ====== ====== ============================= ================================ ================== 30 29 98.5 32 GTGAGCCCCGCGCACGCGGGGATGAACCG # Left flank : GAACAGGGCTGAGGCCCTTCCGGGGTGAGCCCGGACGGATCGTGTGGATGGGTGTTGGGCAGGCGTGGGCGTACGTGTGGCCGGACGGCACCGTCACCGGCGGCGGTGATCGTGCCTTGGGTGAGGAGTTGGGGCACGCCTATCGGATGCTGCGGGGCGCCGGGTTCCCGGGTCTGGAGAGCTTTGTCCTGGAAGCCGTCGCTGGTCAGGATTCCTGCCGGGTGGGGTGTGCGGCCCTGGAGCAGCACTGGGATCACTCCGTATCGCCCTGACTTTCTCGTCTTGCTCATCCTGCTCGTAGTGCTCTTCAGGGCTGTTTCTCCGGGCCGGGGGCCTGCGACCGCAGCCCAGGCATGGAAAGCCTGTGTGCCCGGCGCGGTAGTGGGGATGAGCGCCGCCGTGCACTGTGTGAGGATGTACGTCGAAGCCGGGAGTGCTCAGCATGTCCAGGGCCAAGGTTGCGGCGAGATAACGGCCCGCGTTGCCGCAGGTCGTTAAGC # Right flank : GGTGACCGTGAGTGGCCACTGTGCGTTCGTGGCGAAGCGTGGTGTTCTCGGGGCGGCGGACGCAGAGAACCCCACCTACCGAGAGGGAGGTGGGGTTTGTTCGCTGTGCCCGAGGTCAGACGGCGGGGTTCTCCGTCATGCCGAGCTGACCGAGGTCGATCCGCTCGGCGCGGCGCGCGAGCAGGTACAGGTGCCCCGAGGTCCGGTCCATCTCGCGGATGAGCGTGAACCCCTGCCCCTGGTAGTACTTCGCCAGCTCGGGGAAGAAGCACCCTTGCCGCACCCACTGGCGGTCCTCTCGCGCCGCCCGGTCCACCGCCCACAGGTCGATCAGGGTGCCCGGCTTGTGCTCCCGGTAGTCGGGGTGGGTGCAGGTGGAGAACAGGTACAGGCTCGGCTCGTCGGCCTCATCGGGGGTCCAGTCCTTCGGCGGCGCGTGTTCGGTCACCGTCGTGCACCCGACGACCCGGTCTTCGTCTTCGAGCACCCACATGAACCCG # Questionable array : NO Score: 6.19 # Score Detail : 1:0, 2:3, 3:0, 4:0.93, 5:0, 6:0.25, 7:0.01, 8:1, 9:1, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTGAGCCCCGCGCACGCGGGGATGAACCG # Alternate repeat : GTGAGCCCCGCGCACGCGGGGATGGACCG # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [5,2] Score: 0.37/0.37 # Reference repeat match prediction: F [matched GTGATCCCCGCGCCAGCGGGGATGAACCG with 90% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: F [-12.70,-12.10] Score: 0.37/0.37 # Array 
degeneracy analysis prediction: F [0-4] Score: 0.41/0.41 # AT richness analysis in flanks prediction: NA [35.0-31.7]%AT Score: 0/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: F [5.65,0 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], // Array 9 5555955-5555559 **** Predicted by CRISPRDetect 2.4 *** >NC_014210.1 Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 chromosome 1, complete sequence Array_Orientation: Reverse Position Repeat %id Spacer Repeat_Sequence Spacer_Sequence Insertion/Deletion ========== ====== ====== ====== ============================== =============================== ================== 5555954 30 100.0 31 .............................. ATGACTTGTCCGCTGAGAGTGAGCCGTTGCT 5555893 30 100.0 31 .............................. CCGACGGGTTACTGATGAGCTTGGCGCGCTG 5555832 30 96.7 31 .............................T CCGCGTCCGGCGCGGTGAGCTGGGTGGGCAG 5555771 30 96.7 31 ..G........................... CCTGCGTCTGCGCCTGGAGGGCGGCCAGCTG 5555710 30 93.3 31 ..G..........................C TCGGCGCGACCGTGGGCAACCTGGTGACGGC 5555649 30 93.3 31 ..G..........................A GCGCTGGGCGCATCCAGGTAGAGGACAAGGA 5555588 30 93.3 0 ..G.........................A. 
| ========== ====== ====== ====== ============================== =============================== ================== 7 30 96.2 31 GTTCTCCCCGCGCACGCGGGGATGGTCCCG # Left flank : AGCCGGTCGGTGTCCTCCTGGCCGTACTCCGCGATCCGCTCCCGGTCGAAGCCCTCCAGGGCCCGCCGGTAGTTCTCCCGCTTGTTGAGCACGGTGGACCAGGACAGCCCGGCCTGCGCGCCCTCCAGGACGAGCATCTCGAACAGGTGCGCGTCGTCGCGGGAGGGCCTGCCCCACTCGTGGTCGTGGTAGGCCACCATCAGCTCGGAGGACCCCCGTGCCCAGGCACAGCGCTGGTCGGACATGGCAGCTCCTCGGGGCCGGCGGCGGTGGTCGCCTCCATGGTGCCGCAGGTCGGAGACAGTCCGGACGCGACCTCCGGTCGGGGTGCGTTGCGCAGGTGTCTCGCTGGTCGCGGTCCTGATCAGTAGGGCGAGTGCGCCGGTCCTGTGTGTGAAACCATGGGATGGTTCGGACTTGGCTCTGTGGGTGAAATGCCTGTTTCTCGGGAAGTTGGAAATTCGGCTGCTTGTCGCTGGTAATTTCCCAGGTCAGTGACT # Right flank : TGAGCAGGTTTTGCAGGAACTGAGAAGGGGCCGCCAGCGGAGAACGCTGGCGGCCCCTTTACGCGTGTTACCGGGAACGAACGCTGGTGAGGAACGCTGCCCACTCGGCTGAGGGCGCCTCCAGGTGACCGAGGTGACGGTTCTGCGTGTCACGGACGGCGGCACCGGTGCCGGTGTCCGCGACCTCGACGCAGTTGCCGCCGCTGGGCTGGCTGTAGCTCGACTTACGGAAGTTCAACAGGGTGGGAACTGAGGTCATGGTCTTCTACTCGATCATCTGTCGAAGGTGGTCGATCGTCTCGCCGGGACGGAGAGCGGCCAGGCGTAGGTGGTTCATCAGGTTTCTGTAGCGGGCGACCTCGTCGGGCTCCTCCAAGTAGATTCCATCGGTGTTCGTTTCCAAATAGACCACCGTGGGGTCGACCGATTCGGGGAACTCCAAGATGACGAAGGGACCTCCGCTTCCCGCGTTGAGCTGGGACGCGCGTGTCATCTGGAGG # Questionable array : NO Score: 5.63 # Score Detail : 1:0, 2:3, 3:0, 4:0.81, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.56, # Score Legend : 1: cas, 2: likely_repeat, 3: motif_match, 4: overall_repeat_identity, 5: one_repeat_cluster, 6: exp_repeat_length, 7: exp_spacer_length, 8: spacer_identity, 9: log(total repeats) - log(total mutated repeats), # Primary repeat : GTTCTCCCCGCGCACGCGGGGATGGTCCCG # Alternate repeat : GTTCTCCCCGCGCACGCGGGGATGGTCCC # Directional analysis summary from each method: # Motif ATTGAAA(N) match prediction: NA Score: 0/4.5 # A,T distribution in repeat prediction: F [4,2] Score: 0.37/0.37 # Reference repeat match prediction: R [matched GTGCTCCCCGCGCACGCGGGGATGGTCCCG with 100% identity] Score: 4.5/4.5 # Secondary Structural analysis prediction: R [-12.10,-12.70] Score: 0.37/0.37 # 
Array degeneracy analysis prediction: R [1-0] Score: 0.41/0.41 # AT richness analysis in flanks prediction: R [35.0-50.0]%AT Score: 0.27/0.27 # Longer leader analysis prediction: NA # ---------------------------------------------------------------------------- # Final direction: R [0.37,5.55 Confidence: HIGH] # Array family : I-E [Matched known repeat from this family], //
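
In all nine records above, the reported "Score" equals the sum of the nine "Score Detail" components (for Array 1: 0 + 3 + 0 + 0.84 + 0 + 0.25 + 0.01 + 1 + 0.56 = 5.66). The sketch below checks that relationship for any record in this layout. It is a minimal illustration, not part of CRISPRDetect: the function name `check_scores`, the tolerance value, and the regular expressions are assumptions based only on the text shown here.

```python
import re

# Hypothetical helper (not part of CRISPRDetect): matches the
# "Questionable array ... Score: X # Score Detail : 1:a, 2:b, ..." layout
# exactly as it appears in the records above.
RECORD = re.compile(
    r"Questionable array\s*:\s*\S+\s+Score:\s*([\d.]+)\s*"
    r"#\s*Score Detail\s*:\s*((?:\d+:[\d.]+,\s*)+)"
)

def check_scores(text, tol=0.011):
    """Return a list of (reported, summed, ok) tuples, one per record found."""
    results = []
    for match in RECORD.finditer(text):
        reported = float(match.group(1))
        # Take the value after each "index:" prefix in the detail list.
        parts = re.findall(r"\d+:([\d.]+)", match.group(2))
        summed = round(sum(float(p) for p in parts), 2)
        # tol is an assumed allowance for two-decimal rounding in the report.
        results.append((reported, summed, abs(reported - summed) < tol))
    return results

# Score lines copied from Array 1 and Array 6 above.
sample = (
    "# Questionable array : NO Score: 5.66 # Score Detail : 1:0, 2:3, 3:0, "
    "4:0.84, 5:0, 6:0.25, 7:0.01, 8:1, 9:0.56, # Score Legend : 1: cas, "
    "# Questionable array : NO Score: 5.66 # Score Detail : 1:0, 2:3, 3:0, "
    "4:1.00, 5:0, 6:0.25, 7:0.01, 8:0.4, 9:1, # Score Legend : 1: cas, "
)

for reported, summed, ok in check_scores(sample):
    print(reported, summed, "consistent" if ok else "MISMATCH")
```

Because the pattern relies on `\s*` between fields, it should work on both the single-line form shown here and a file where each `#` field sits on its own line.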